Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading

4 April 2022
Minsu Kim
Jeong Hun Yeo
Yong Man Ro

Papers citing "Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading"

32 / 32 papers shown
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
124
0
0
07 May 2025
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language
Jeong Hun Yeo
Chae Won Kim
Hyunjun Kim
Hyeongseop Rha
Seunghee Han
Wen-Huang Cheng
Y. Ro
118
3
0
03 Jan 2025
Lip to Speech Synthesis with Visual Context Attentional GAN
Minsu Kim
Joanna Hong
Y. Ro
100
54
0
04 Apr 2022
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video
Minsu Kim
Joanna Hong
Se Jin Park
Yong Man Ro
CVBM
54
40
0
04 Apr 2022
LiRA: Learning Visual Speech Representations from Audio through Self-supervision
Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Björn W. Schuller
Maja Pantic
SSL
48
53
0
16 Jun 2021
Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning
Sangmin Lee
Hak Gu Kim
Dae Hwi Choi
Hyungil Kim
Yong Man Ro
73
102
0
02 Apr 2021
Lip-reading with Densely Connected Temporal Convolutional Networks
Pingchuan Ma
Yujiang Wang
Jie Shen
Stavros Petridis
Maja Pantic
73
57
0
29 Sep 2020
Towards Practical Lipreading with Distilled and Efficient Models
Pingchuan Ma
Brais Martínez
Stavros Petridis
Maja Pantic
68
96
0
13 Jul 2020
Discriminative Multi-modality Speech Recognition
Bo Xu
Cheng Lu
Yandong Guo
Jacob Wang
72
99
0
12 May 2020
Mutual Information Maximization for Effective Lip Reading
Xingyuan Zhao
Shuang Yang
Shiguang Shan
Xilin Chen
51
58
0
13 Mar 2020
Deformation Flow Based Two-Stream Network for Lip Reading
Jingyun Xiao
Shuang Yang
Yuanhang Zhang
Shiguang Shan
Xilin Chen
50
64
0
12 Mar 2020
Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Mingshuang Luo
Shuang Yang
Shiguang Shan
Xilin Chen
64
41
0
09 Mar 2020
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition
Yuanhang Zhang
Shuang Yang
Jingyun Xiao
Shiguang Shan
Xilin Chen
63
64
0
06 Mar 2020
Lipreading using Temporal Convolutional Networks
Brais Martínez
Pingchuan Ma
Stavros Petridis
Maja Pantic
215
240
0
23 Jan 2020
ASR is all you need: cross-modal distillation for lip reading
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
51
135
0
28 Nov 2019
Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers
Ya Zhao
Rui Xu
Xinchao Wang
Peng Hou
Haihong Tang
Xiuming Zhang
51
90
0
26 Nov 2019
Large Memory Layers with Product Keys
Guillaume Lample
Alexandre Sablayrolles
Marc'Aurelio Ranzato
Ludovic Denoyer
Hervé Jégou
MoE
67
134
0
10 Jul 2019
Learning Spatio-Temporal Features with Two-Stream Deep 3D CNNs for Lipreading
Xinshuo Weng
Kris Kitani
70
71
0
04 May 2019
LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild
Shuang Yang
Yuanhang Zhang
Dalu Feng
Mingmin Yang
Chenhao Wang
Jingyun Xiao
Keyu Long
Shiguang Shan
Xilin Chen
54
150
0
16 Oct 2018
Deep Audio-Visual Speech Recognition
Triantafyllos Afouras
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
98
708
0
06 Sep 2018
Lip2AudSpec: Speech reconstruction from silent lip movements video
Hassan Akbari
Himani Arora
Liangliang Cao
N. Mesgarani
52
88
0
26 Oct 2017
mixup: Beyond Empirical Risk Minimization
Hongyi Zhang
Moustapha Cissé
Yann N. Dauphin
David Lopez-Paz
NoLa
289
9,803
0
25 Oct 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
786
132,363
0
12 Jun 2017
End-To-End Visual Speech Recognition With LSTMs
Stavros Petridis
Zuwei Li
Maja Pantic
VLM
49
110
0
20 Jan 2017
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
261
792
0
16 Nov 2016
LipNet: End-to-End Sentence-level Lipreading
Yannis Assael
Brendan Shillingford
Shimon Whiteson
Nando de Freitas
82
397
0
05 Nov 2016
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
426
10,531
0
21 Jul 2016
Key-Value Memory Networks for Directly Reading Documents
Alexander H. Miller
Adam Fisch
Jesse Dodge
Amir-Hossein Karimi
Antoine Bordes
Jason Weston
RALM, KELM, OffRL
107
970
0
09 Jun 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,426
0
10 Dec 2015
Distilling the Knowledge in a Neural Network
Geoffrey E. Hinton
Oriol Vinyals
J. Dean
FedML
364
19,733
0
09 Mar 2015
Memory Networks
Jason Weston
S. Chopra
Antoine Bordes
GNN, KELM
147
1,709
0
15 Oct 2014
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
Bart van Merriënboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE, AIMat
259
6,786
0
03 Sep 2014