ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.05888
  4. Cited By
Speech Fusion to Face: Bridging the Gap Between Human's Vocal
  Characteristics and Facial Imaging

Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging

10 June 2020
Yeqi Bai
Tao Ma
Lipo Wang
Zhenjie Zhang
    CVBM
ArXiv (abs)PDFHTML

Papers citing "Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging"

20 / 20 papers shown
Title
Multimodal Deep Learning
Multimodal Deep Learning
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
...
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Matthias Aßenmacher
117
3,174
0
12 Jan 2023
StarGAN v2: Diverse Image Synthesis for Multiple Domains
StarGAN v2: Diverse Image Synthesis for Multiple Domains
Yunjey Choi
Youngjung Uh
Jaejun Yoo
Jung-Woo Ha
3DH
127
1,753
0
04 Dec 2019
PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph
PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph
Yikang Li
Tao Ma
Yeqi Bai
Nan Duan
Sining Wei
Xiaogang Wang
124
95
0
05 May 2019
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial
  Networks
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks
A. Duarte
Francisco Roldan
Miquel Tubau
Janna Escur
Santiago Pascual
Amaia Salvador
Eva Mohedano
Kevin McGuinness
Jordi Torres
Xavier Giró-i-Nieto
GANCVBM
61
79
0
25 Mar 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
593
10,561
0
12 Dec 2018
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
Samuel Albanie
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
CVBM
61
271
0
16 Aug 2018
Disjoint Mapping Network for Cross-modal Matching of Voices and Faces
Disjoint Mapping Network for Cross-modal Matching of Voices and Faces
Yandong Wen
Mahmoud Al Ismail
Weiyang Liu
Bhiksha Raj
Rita Singh
FedML
49
71
0
12 Jul 2018
Image Generation from Scene Graphs
Image Generation from Scene Graphs
Justin Johnson
Agrim Gupta
Li Fei-Fei
GNN
303
820
0
04 Apr 2018
StarGAN: Unified Generative Adversarial Networks for Multi-Domain
  Image-to-Image Translation
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
Yunjey Choi
Min-Je Choi
M. Kim
Jung-Woo Ha
Sunghun Kim
Jaegul Choo
GAN
145
3,554
0
24 Nov 2017
StackGAN++: Realistic Image Synthesis with Stacked Generative
  Adversarial Networks
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks
Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris N. Metaxas
GAN
99
1,061
0
19 Oct 2017
VoxCeleb: a large-scale speaker identification dataset
VoxCeleb: a large-scale speaker identification dataset
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
127
2,279
0
26 Jun 2017
Synthesizing Normalized Faces from Facial Identity Features
Synthesizing Normalized Faces from Facial Identity Features
Forrester Cole
David Belanger
Dilip Krishnan
Aaron Sarna
Inbar Mosseri
William T. Freeman
3DHCVBM
72
140
0
17 Jan 2017
Pyramid Scene Parsing Network
Pyramid Scene Parsing Network
Hengshuang Zhao
Jianping Shi
Xiaojuan Qi
Xiaogang Wang
Jiaya Jia
VOSSSeg
665
12,030
0
04 Dec 2016
Lip Reading Sentences in the Wild
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
261
790
0
16 Nov 2016
Improved Techniques for Training GANs
Improved Techniques for Training GANs
Tim Salimans
Ian Goodfellow
Wojciech Zaremba
Vicki Cheung
Alec Radford
Xi Chen
GAN
483
9,062
0
10 Jun 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson
Alexandre Alahi
Li Fei-Fei
SupR
237
10,262
0
27 Mar 2016
Pixel Recurrent Neural Networks
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSegGAN
479
2,573
0
25 Jan 2016
Unsupervised Representation Learning with Deep Convolutional Generative
  Adversarial Networks
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
Alec Radford
Luke Metz
Soumith Chintala
GANOOD
266
14,018
0
19 Nov 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
386
13,145
0
12 Mar 2015
Auto-Encoding Variational Bayes
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
452
16,923
0
20 Dec 2013
1