ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.09502
10
46

Disentangling Latent Space for VAE by Label Relevant/Irrelevant Dimensions

22 December 2018
Zhilin Zheng
Li Sun
    CML
    CoGe
    DRL
ArXivPDFHTML
Abstract

VAE requires the standard Gaussian distribution as a prior in the latent space. Since all codes tend to follow the same prior, it often suffers the so-called "posterior collapse". To avoid this, this paper introduces the class specific distribution for the latent code. But different from CVAE, we present a method for disentangling the latent space into the label relevant and irrelevant dimensions, zs\bm{\mathrm{z}}_szs​ and zu\bm{\mathrm{z}}_uzu​, for a single input. We apply two separated encoders to map the input into zs\bm{\mathrm{z}}_szs​ and zu\bm{\mathrm{z}}_uzu​ respectively, and then give the concatenated code to the decoder to reconstruct the input. The label irrelevant code zu\bm{\mathrm{z}}_uzu​ represent the common characteristics of all inputs, hence they are constrained by the standard Gaussian, and their encoder is trained in amortized variational inference way, like VAE. While zs\bm{\mathrm{z}}_szs​ is assumed to follow the Gaussian mixture distribution in which each component corresponds to a particular class. The parameters for the Gaussian components in zs\bm{\mathrm{z}}_szs​ encoder are optimized by the label supervision in a global stochastic way. In theory, we show that our method is actually equivalent to adding a KL divergence term on the joint distribution of zs\bm{\mathrm{z}}_szs​ and the class label ccc, and it can directly increase the mutual information between zs\bm{\mathrm{z}}_szs​ and the label ccc. Our model can also be extended to GAN by adding a discriminator in the pixel domain so that it produces high quality and diverse images.

View on arXiv
Comments on this paper