Title
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection Xuanru Zhou Jiachen Lian Cheol Jun Cho Jingwen Liu Zongli Ye ... Jet M J Vonk Z. Ezzes Zachary Miller M. G. Tempini Gopala Anumanchipalli 36 5 0 20 Sep 2024
Self-supervised Speech Models for Word-Level Stuttered Speech Detection Yi-Jen Shih Zoi Gkalitsiou A. Dimakis David Harwath 59 3 0 16 Sep 2024
Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection Xuanru Zhou Cheol Jun Cho Ayati Sharma Brittany Morin D. Baquirin ... Zachary Miller B. Tee M. G. Tempini Jiachen Lian Gopala Anumanchipalli 39 5 0 15 Sep 2024
SSDM: Scalable Speech Dysfluency Modeling Jiachen Lian Xuanru Zhou Z. Ezzes Jet M J Vonk Brittany Morin D. Baquirin Zachary Mille M. G. Tempini Gopala Anumanchipalli AuLLM 43 3 0 29 Aug 2024
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection Xuanru Zhou Anshul Kashyap Steve Li Ayati Sharma Brittany Morin ... Z. Ezzes Zachary Miller M. G. Tempini Jiachen Lian Gopala Krishna Anumanchipalli 44 8 0 27 Aug 2024
Large Language Models for Dysfluency Detection in Stuttered Speech Dominik Wagner Sebastian P. Bayerl Ilja Baumann Korbinian Riedhammer Elmar Nöth Tobias Bocklet 86 6 0 16 Jun 2024
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude Maxim Enis Mark Hopkins 57 41 0 22 Apr 2024
Towards Hierarchical Spoken Language Dysfluency Modeling Jiachen Lian Gopala Anumanchipalli 37 11 0 18 Jan 2024
Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection Jiachen Lian Carly Feng Naasir Farooqi Steve Li Anshul Kashyap Cheol Jun Cho Peter Wu Robin Netzorg Tingle Li Gopala Krishna Anumanchipalli 67 15 0 20 Dec 2023
Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling Theodoros Kouzelis Georgios Paraskevopoulos Athanasios Katsamanis Vassilis Katsouros 47 9 0 30 May 2023
Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization Jiachen Lian A. Black Yijingxiu Lu Louis Goldstein Shinji Watanabe Gopala K. Anumanchipalli 72 16 0 29 Oct 2022
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition Jiachen Lian A. Black Louis Goldstein Gopala Krishna Anumanchipalli 41 18 0 01 Apr 2022
Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass Olabanji Shonibare Xiaosu Tong Venkatesh Ravichandran 39 28 0 08 Feb 2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing Sanyuan Chen Chengyi Wang Zhengyang Chen Yu-Huan Wu Shujie Liu ... Yao Qian Jian Wu Micheal Zeng Xiangzhan Yu Furu Wei SSL 164 1,794 0 26 Oct 2021
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition Qiantong Xu Alexei Baevski Michael Auli VLM 87 81 0 23 Sep 2021
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech Jaehyeon Kim Jungil Kong Juhee Son DRL 94 866 0 11 Jun 2021
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Alexei Baevski Henry Zhou Abdel-rahman Mohamed Michael Auli SSL 36 5,677 0 20 Jun 2020
Universal Phone Recognition with a Multilingual Allophone System Xinjian Li Siddharth Dalmia Juncheng Billy Li Matthew Russell Lee Patrick Littell ... Antonios Anastasopoulos David R. Mortensen Graham Neubig A. Black Florian Metze 19 128 0 26 Feb 2020
Disfluency Detection using a Bidirectional LSTM Vicky Zayats Mari Ostendorf Hannaneh Hajishirzi 39 117 0 12 Apr 2016