Skip to main content

Questions tagged [variational-bayes]

Variational Bayesian methods approximate intractable integrals found in Bayesian inference and machine learning. Primarily, these methods serve one of two purposes: Approximating the posterior distribution, or bounding the marginal likelihood of observed data.

1 vote
1 answer
58 views

The Google Deepmind paper "Weight Uncertainty in Neural Networks" features the following algorithm: Note that the $\frac{∂f(w,θ)}{∂w}$ term of the gradients for the mean and standard ...
user494234's user avatar
3 votes
1 answer
129 views

I would like to perform clustering with a finite Gaussian Mixture model, however, I have missing data (some features are missing at random). I am using Variational Inference to fit my Bayesian GMM. Is ...
Tom's user avatar
  • 1,112
3 votes
1 answer
132 views

I have trouble understanding the minimization of the KL divergence. In this link https://www.ibm.com/think/topics/variational-autoencoder They say, "One obstacle to using KL divergence for ...
Link's user avatar
  • 63
2 votes
0 answers
130 views

This paper proposes a computationally efficient parametrization of joint gaussian approximate posteriors. I am trying to apply the methods in it to the state-space model below: $$ x_0 \sim \delta(x_0)\...
Uomond's user avatar
  • 51
5 votes
2 answers
329 views

Setup The variational autoencoder (VAE) loss is given by the following (see here, for example): $$L = - \sum_{j = 1}^J \frac{1}{2} \left(1 + \log (\sigma_i^2) - \sigma_i^2 - \mu_i^2 \right) - \frac{1}{...
Physics Enthusiast's user avatar
1 vote
0 answers
81 views

I’ve been studying Variational Inference, and there’s a part I’ve been struggling to understand for the past few days, so I decided to write this post. As you can see from the title, my question is: &...
doby's user avatar
  • 11
1 vote
1 answer
113 views

I am looking into the relationship between linear Variational Autoencoder (VAE) and probabilistic PCA (pPCA) presented by Lucas et al. (2019). Don't blame the elbo! paper In the official ...
user1571823's user avatar
0 votes
0 answers
87 views

I am reading a paper "Complex-Valued Variational Autoencoder: A Novel Deep Generative Model for Direct Representation of Complex Spectra" In this paper, the author calculate the KL ...
Jiatong LI's user avatar
2 votes
1 answer
129 views

I am currently reading into Variational Autoencoders, and although I kind of understand the mathematical background described in the original paper (Auto-encoding Variational Bayes), I am struggling ...
Marco Rosinus Serrano's user avatar
3 votes
1 answer
144 views

In learning about variational autoencoders (VAEs), I would like to come up with a nice little handcrafted example to help make sense of them thoroughly. For this, suppose I know that my samples are ...
elysian-peace's user avatar
2 votes
2 answers
176 views

I'm deeply failing to understand the first step in the ELBO derivation in VAEs. When asking my questions I'll also try to clearly state my assumptions and perhaps some of them are wrong to begin with: ...
DrPrItay's user avatar
  • 121
1 vote
0 answers
92 views

I have an estimation problem where I need to maximize the evidence lower bound: $$ \mathrm{ELBO} = -\frac{1}{2} \Bigg( \mathbb{E}_{q(\theta)} \left[ \mathrm{vec}(\mathbf{Z})^{\mathrm{H}} \mathbf{C}^{-...
CfourPiO's user avatar
  • 325
2 votes
1 answer
310 views

In DDPM, ${\tilde\mu}_t$ is the mean of the conditional distribution $q(x_{t-1}|x_t,x_0)$ while the neural network $\mu_\theta$ is modeling a different conditional distribution $p_\theta(x_{t-1}|x_t)$....
Daniel Mendoza's user avatar
2 votes
1 answer
150 views

I've seen in many tutorials on diffusion models refer to the distribution of the latent variables induced by the forward process as "ground truth". I wonder why. What we can actually see is ...
Daniel Mendoza's user avatar
9 votes
2 answers
640 views

Suppose we have a true but unknown distribution $p$ over some discrete set (i.e. assume no structure or domain knowledge), and a parameterized family of distributions $q_\phi$. In general it makes ...
user56834's user avatar
  • 3,077

15 30 50 per page
1
2 3 4 5
27