Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging

10 June 2020

Papers citing "Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging"

20 / 20 papers shown

Title
Multimodal Deep Learning Cem Akkus Jiquan Ngiam Vladana Djakovic Steffen Jauch-Walser A. Khosla ... Jann Goschenhofer Honglak Lee A. Ng Daniel Schalk Matthias Aßenmacher 117 3,174 0 12 Jan 2023
StarGAN v2: Diverse Image Synthesis for Multiple Domains Yunjey Choi Youngjung Uh Jaejun Yoo Jung-Woo Ha 3DH 127 1,753 0 04 Dec 2019
PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph Yikang Li Tao Ma Yeqi Bai Nan Duan Sining Wei Xiaogang Wang 124 95 0 05 May 2019
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks A. Duarte Francisco Roldan Miquel Tubau Janna Escur Santiago Pascual Amaia Salvador Eva Mohedano Kevin McGuinness Jordi Torres Xavier Giró-i-Nieto GAN CVBM 61 79 0 25 Mar 2019
A Style-Based Generator Architecture for Generative Adversarial Networks Tero Karras S. Laine Timo Aila 593 10,561 0 12 Dec 2018
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild Samuel Albanie Arsha Nagrani Andrea Vedaldi Andrew Zisserman CVBM 61 271 0 16 Aug 2018
Disjoint Mapping Network for Cross-modal Matching of Voices and Faces Yandong Wen Mahmoud Al Ismail Weiyang Liu Bhiksha Raj Rita Singh FedML 49 71 0 12 Jul 2018
Image Generation from Scene Graphs Justin Johnson Agrim Gupta Li Fei-Fei GNN 303 820 0 04 Apr 2018
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation Yunjey Choi Min-Je Choi M. Kim Jung-Woo Ha Sunghun Kim Jaegul Choo GAN 145 3,554 0 24 Nov 2017
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks Han Zhang Tao Xu Hongsheng Li Shaoting Zhang Xiaogang Wang Xiaolei Huang Dimitris N. Metaxas GAN 99 1,061 0 19 Oct 2017
VoxCeleb: a large-scale speaker identification dataset Arsha Nagrani Joon Son Chung Andrew Zisserman 127 2,279 0 26 Jun 2017
Synthesizing Normalized Faces from Facial Identity Features Forrester Cole David Belanger Dilip Krishnan Aaron Sarna Inbar Mosseri William T. Freeman 3DH CVBM 72 140 0 17 Jan 2017
Pyramid Scene Parsing Network Hengshuang Zhao Jianping Shi Xiaojuan Qi Xiaogang Wang Jiaya Jia VOS SSeg 665 12,030 0 04 Dec 2016
Lip Reading Sentences in the Wild Joon Son Chung A. Senior Oriol Vinyals Andrew Zisserman 261 790 0 16 Nov 2016
Improved Techniques for Training GANs Tim Salimans Ian Goodfellow Wojciech Zaremba Vicki Cheung Alec Radford Xi Chen GAN 483 9,062 0 10 Jun 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution Justin Johnson Alexandre Alahi Li Fei-Fei SupR 237 10,262 0 27 Mar 2016
Pixel Recurrent Neural Networks Aaron van den Oord Nal Kalchbrenner Koray Kavukcuoglu SSeg GAN 479 2,573 0 25 Jan 2016
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec Radford Luke Metz Soumith Chintala GAN OOD 266 14,018 0 19 Nov 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering Florian Schroff Dmitry Kalenichenko James Philbin 3DH 386 13,145 0 12 Mar 2015
Auto-Encoding Variational Bayes Diederik P. Kingma Max Welling BDL 452 16,923 0 20 Dec 2013