2006.05888
Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging
10 June 2020
Yeqi Bai
Tao Ma
Lipo Wang
Zhenjie Zhang
CVBM
Papers citing "Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging"
20 / 20 papers shown
Title
Multimodal Deep Learning
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
...
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Matthias Aßenmacher
117
3,174
0
12 Jan 2023
StarGAN v2: Diverse Image Synthesis for Multiple Domains
Yunjey Choi
Youngjung Uh
Jaejun Yoo
Jung-Woo Ha
3DH
127
1,753
0
04 Dec 2019
PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph
Yikang Li
Tao Ma
Yeqi Bai
Nan Duan
Sining Wei
Xiaogang Wang
124
95
0
05 May 2019
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks
A. Duarte
Francisco Roldan
Miquel Tubau
Janna Escur
Santiago Pascual
Amaia Salvador
Eva Mohedano
Kevin McGuinness
Jordi Torres
Xavier Giró-i-Nieto
GAN
CVBM
61
79
0
25 Mar 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
593
10,561
0
12 Dec 2018
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
Samuel Albanie
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
CVBM
61
271
0
16 Aug 2018
Disjoint Mapping Network for Cross-modal Matching of Voices and Faces
Yandong Wen
Mahmoud Al Ismail
Weiyang Liu
Bhiksha Raj
Rita Singh
FedML
49
71
0
12 Jul 2018
Image Generation from Scene Graphs
Justin Johnson
Agrim Gupta
Li Fei-Fei
GNN
303
820
0
04 Apr 2018
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
Yunjey Choi
Min-Je Choi
M. Kim
Jung-Woo Ha
Sunghun Kim
Jaegul Choo
GAN
145
3,554
0
24 Nov 2017
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks
Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris N. Metaxas
GAN
99
1,061
0
19 Oct 2017
VoxCeleb: a large-scale speaker identification dataset
Arsha Nagrani
Joon Son Chung
Andrew Zisserman
127
2,279
0
26 Jun 2017
Synthesizing Normalized Faces from Facial Identity Features
Forrester Cole
David Belanger
Dilip Krishnan
Aaron Sarna
Inbar Mosseri
William T. Freeman
3DH
CVBM
72
140
0
17 Jan 2017
Pyramid Scene Parsing Network
Hengshuang Zhao
Jianping Shi
Xiaojuan Qi
Xiaogang Wang
Jiaya Jia
VOS
SSeg
665
12,030
0
04 Dec 2016
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
261
790
0
16 Nov 2016
Improved Techniques for Training GANs
Tim Salimans
Ian Goodfellow
Wojciech Zaremba
Vicki Cheung
Alec Radford
Xi Chen
GAN
483
9,062
0
10 Jun 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson
Alexandre Alahi
Li Fei-Fei
SupR
237
10,262
0
27 Mar 2016
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
479
2,573
0
25 Jan 2016
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
Alec Radford
Luke Metz
Soumith Chintala
GAN
OOD
266
14,018
0
19 Nov 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
386
13,145
0
12 Mar 2015
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
452
16,923
0
20 Dec 2013