1412.5567
Cited By
Deep Speech: Scaling up end-to-end speech recognition
17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
Papers citing
"Deep Speech: Scaling up end-to-end speech recognition"
50 / 750 papers shown
Title
Text-To-Speech Data Augmentation for Low Resource Speech Recognition
Rodolfo Zevallos
19
4
0
01 Apr 2022
Memory-Efficient Training of RNN-Transducer with Sampled Softmax
Jaesong Lee
Lukas Lee
Shinji Watanabe
25
8
0
31 Mar 2022
An Empirical Study of Language Model Integration for Transducer based Speech Recognition
Huahuan Zheng
Keyu An
Zhijian Ou
Chen Huang
Ke Ding
Guanglu Wan
27
5
0
31 Mar 2022
Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages?
Priyanshi Shah
Harveen Singh Chadha
Anirudh Gupta
Ankur Dhuriya
Neeraj Chhimwal
Rishabh Gaur
Vivek Raghavan
31
1
0
30 Mar 2022
Improving Speech Recognition for Indic Languages using Language Model
Ankur Dhuriya
Harveen Singh Chadha
Anirudh Gupta
Priyanshi Shah
Neeraj Chhimwal
Rishabh Gaur
Vivek Raghavan
17
2
0
30 Mar 2022
4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Shaojin Ding
Phoenix Meadowlark
Yanzhang He
Lukasz Lew
Shivani Agrawal
Oleg Rybakov
MQ
31
32
0
29 Mar 2022
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data
Chen Chen
Nana Hou
Yuchen Hu
Shashank Shirol
Chng Eng Siong
NoLa
14
43
0
29 Mar 2022
WaveFuzz: A Clean-Label Poisoning Attack to Protect Your Voice
Yunjie Ge
Qianqian Wang
Jingfeng Zhang
Juntao Zhou
Yunzhu Zhang
Chao Shen
AAML
20
6
0
25 Mar 2022
Learning by non-interfering feedback chemical signaling in physical networks
Vidyesh Rao Anisetti
B. Scellier
J. M. Schwarz
11
17
0
22 Mar 2022
Neural Predictor for Black-Box Adversarial Attacks on Speech Recognition
Marie Biolková
Bac Nguyen
AAML
33
2
0
18 Mar 2022
Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial Robustness
Tejas Gokhale
Swaroop Mishra
Man Luo
Bhavdeep Singh Sachdeva
Chitta Baral
52
29
0
15 Mar 2022
aaeCAPTCHA: The Design and Implementation of Audio Adversarial CAPTCHA
Md. Imran Hossen
X. Hei
31
4
0
05 Mar 2022
A Survey of Multilingual Models for Automatic Speech Recognition
Hemant Yadav
Sunayana Sitaram
24
35
0
25 Feb 2022
Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Ngoc Dung Huynh
Mohamed Reda Bouadjenek
Imran Razzak
Kevin Lee
Chetan Arora
Ali Hassani
A. Zaslavsky
AAML
29
6
0
22 Feb 2022
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Mario Esparza
24
0
0
21 Feb 2022
Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines
Alexander Isenko
R. Mayer
Jeffrey Jedele
Hans-Arno Jacobsen
19
23
0
17 Feb 2022
Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Chao-Han Huck Yang
Zeeshan Ahmed
Yile Gu
Joseph Szurley
Roger Ren
Linda Liu
A. Stolcke
I. Bulyko
AAML
21
3
0
17 Feb 2022
Vau da muntanialas: Energy-efficient multi-die scalable acceleration of RNN inference
G. Paulin
Francesco Conti
Lukas Cavigelli
Luca Benini
24
8
0
14 Feb 2022
I'm Hearing (Different) Voices: Anonymous Voices to Protect User Privacy
H.C.M. Turner
Giulio Lovisotto
Simon Eberz
Ivan Martinovic
13
1
0
13 Feb 2022
FAAG: Fast Adversarial Audio Generation through Interactive Attack Optimisation
Yuantian Miao
Chao Chen
Lei Pan
Jun Zhang
Yang Xiang
AAML
20
2
0
11 Feb 2022
Convergence of a New Learning Algorithm
Feng Lin
3DV
16
0
0
08 Feb 2022
BEA-Base: A Benchmark for ASR of Spontaneous Hungarian
P. Mihajlik
A. Balog
T. E. Gráczi
A. Kohári
Balázs Tarján
K. Mády
25
8
0
01 Feb 2022
Visualizing Automatic Speech Recognition -- Means for a Better Understanding?
Karla Markert
Romain Parracone
Mykhailo Kulakov
Philip Sperl
Ching-yu Kao
Konstantin Böttinger
19
8
0
01 Feb 2022
Language Dependencies in Adversarial Attacks on Speech Recognition Systems
Karla Markert
Donika Mirdita
Konstantin Böttinger
AAML
SILM
19
4
0
01 Feb 2022
Unicorn: Reasoning about Configurable System Performance through the lens of Causality
Md Shahriar Iqbal
R. Krishna
Mohammad Ali Javidian
Baishakhi Ray
Pooyan Jamshidi
LRM
26
28
0
20 Jan 2022
iDECODe: In-distribution Equivariance for Conformal Out-of-distribution Detection
R. Kaur
Susmit Jha
Anirban Roy
Sangdon Park
Yan Sun
O. Sokolsky
Insup Lee
OODD
19
45
0
07 Jan 2022
Discrete and continuous representations and processing in deep learning: Looking forward
Ruben Cartuyvels
Graham Spinks
Marie-Francine Moens
OCL
33
20
0
04 Jan 2022
Multi-Dialect Arabic Speech Recognition
Abbas Raza Ali
14
15
0
25 Dec 2021
Parameter identifiability of a deep feedforward ReLU neural network
Joachim Bona-Pellissier
François Bachoc
François Malgouyres
41
15
0
24 Dec 2021
Watch Those Words: Video Falsification Detection Using Word-Conditioned Facial Motion
S. Agarwal
Liwen Hu
Evonne Ng
Trevor Darrell
Hao Li
Anna Rohrbach
AAML
31
19
0
21 Dec 2021
ImportantAug: a data augmentation agent for speech
V. Trinh
Hassan Salami Kavaki
Michael I. Mandel
27
10
0
14 Dec 2021
Real-Time Neural Voice Camouflage
Mia Chiquier
Chengzhi Mao
Carl Vondrick
27
6
0
14 Dec 2021
Detecting Audio Adversarial Examples with Logit Noising
N. Park
Sangwoo Ji
Jong Kim
AAML
30
5
0
13 Dec 2021
Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications
Yongqiang Tian
Wuqi Zhang
Ming Wen
Shing-Chi Cheung
Chengnian Sun
Shiqing Ma
Yu Jiang
29
7
0
06 Dec 2021
Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation
Yingruo Fan
Zhaojiang Lin
Jun Saito
Wenping Wang
Taku Komura
31
21
0
04 Dec 2021
Catch Me If You Can: Blackbox Adversarial Attacks on Automatic Speech Recognition using Frequency Masking
Xiao-lan Wu
A. Rajan
AAML
19
4
0
03 Dec 2021
Transformer-S2A: Robust and Efficient Speech-to-Animation
Liyang Chen
Zhiyong Wu
Jun Ling
Runnan Li
Xu Tan
Sheng Zhao
29
18
0
18 Nov 2021
A Survey on Adversarial Attacks for Malware Analysis
Kshitiz Aryal
Maanak Gupta
Mahmoud Abdelsalam
AAML
34
49
0
16 Nov 2021
Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception
Joel Dapello
J. Feather
Hang Le
Tiago Marques
David D. Cox
Josh H. McDermott
J. DiCarlo
SueYeon Chung
AAML
OOD
19
25
0
12 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
35
363
0
02 Nov 2021
With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition
Evangelos Kazakos
Jaesung Huh
Arsha Nagrani
Andrew Zisserman
Dima Damen
EgoV
50
45
0
01 Nov 2021
Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis
Haozhe Wu
Jia Jia
Haoyu Wang
Yishun Dou
Chao Duan
Qingshan Deng
CVBM
11
73
0
30 Oct 2021
TorchAudio: Building Blocks for Audio and Speech Processing
Yao-Yuan Yang
Moto Hira
Zhaoheng Ni
Anjali Chourdia
Artyom Astafurov
...
Sean Narenthiran
Shinji Watanabe
Soumith Chintala
Vincent Quenneville-Bélair
Yangyang Shi
31
165
0
28 Oct 2021
Beyond L_p clipping: Equalization-based Psychoacoustic Attacks against ASRs
H. Abdullah
Muhammad Sajidur Rahman
Christian Peeters
Cassidy Gibson
Washington Garcia
Vincent Bindschaedler
T. Shrimpton
Patrick Traynor
AAML
19
9
0
25 Oct 2021
Deep Neural Networks on EEG Signals to Predict Auditory Attention Score Using Gramian Angular Difference Field
Mahak Kothari
Shreyansh Joshi
Adarsh Nandanwar
Aadetya Jaiswal
V. Baths
15
1
0
24 Oct 2021
Asynchronous Decentralized Distributed Training of Acoustic Models
Xiaodong Cui
Wei Zhang
Abdullah Kayi
Mingrui Liu
Ulrich Finkler
Brian Kingsbury
G. Saon
David S. Kung
32
3
0
21 Oct 2021
Activation Landscapes as a Topological Summary of Neural Network Performance
Matthew Wheeler
Jose J. Bouza
Peter Bubenik
34
19
0
19 Oct 2021
Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition
Haozhe Chen
Weiming Zhang
Kunlin Liu
Kejiang Chen
Han Fang
Nenghai Yu
19
4
0
19 Oct 2021
Black-box Adversarial Attacks on Commercial Speech Platforms with Minimal Information
Baolin Zheng
Peipei Jiang
Qian Wang
Qi Li
Chao Shen
Cong Wang
Yunjie Ge
Qingyang Teng
Shenyi Zhang
AAML
18
69
0
19 Oct 2021
Intent Classification Using Pre-trained Language Agnostic Embeddings For Low Resource Languages
Hemant Yadav
Akshat Gupta
Sai Krishna Rallabandi
A. Black
R. Shah
11
0
0
18 Oct 2021