253

Multivariate Variational Autoencoder

Main:8 Pages
4 Figures
Bibliography:2 Pages
7 Tables
Abstract

We present the Multivariate Variational Autoencoder (MVAE), a VAE variant that preserves Gaussian tractability while lifting the diagonal posterior restriction. MVAE factorizes each posterior covariance, where a \emph{global} coupling matrix C\mathbf{C} induces dataset-wide latent correlations and \emph{per-sample} diagonal scales modulate local uncertainty. This yields a full-covariance family with analytic KL and an efficient reparameterization via L=Cdiag(σ)\mathbf{L}=\mathbf{C}\mathrm{diag}(\boldsymbol{\sigma}). Across Larochelle-style MNIST variants, Fashion-MNIST, CIFAR-10, and CIFAR-100, MVAE consistently matches or improves reconstruction (MSE~\downarrow) and delivers robust gains in calibration (NLL/Brier/ECE~\downarrow) and unsupervised structure (NMI/ARI~\uparrow) relative to diagonal-covariance VAEs with matched capacity, especially at mid-range latent sizes. Latent-plane visualizations further indicate smoother, more coherent factor traversals and sharper local detail. We release a fully reproducible implementation with training/evaluation scripts and sweep utilities to facilitate fair comparison and reuse.

View on arXiv
Comments on this paper