ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.16351
  4. Cited By
Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection

Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection

22 May 2025
Chenxu Guo
Jiachen Lian
Xuanru Zhou
Jinming Zhang
Shuhe Li
Zongli Ye
Hwi Joo Park
Anaisha Das
Z. Ezzes
Jet M J Vonk
Brittany Morin
Rian Bogley
Lisa Wauters
Zachary Miller
M. G. Tempini
Gopala Anumanchipalli
ArXivPDFHTML

Papers citing "Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection"

19 / 19 papers shown
Title
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection
Xuanru Zhou
Jiachen Lian
Cheol Jun Cho
Jingwen Liu
Zongli Ye
...
Jet M J Vonk
Z. Ezzes
Zachary Miller
M. G. Tempini
Gopala Anumanchipalli
36
5
0
20 Sep 2024
Self-supervised Speech Models for Word-Level Stuttered Speech Detection
Self-supervised Speech Models for Word-Level Stuttered Speech Detection
Yi-Jen Shih
Zoi Gkalitsiou
A. Dimakis
David Harwath
59
3
0
16 Sep 2024
Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection
Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection
Xuanru Zhou
Cheol Jun Cho
Ayati Sharma
Brittany Morin
D. Baquirin
...
Zachary Miller
B. Tee
M. G. Tempini
Jiachen Lian
Gopala Anumanchipalli
39
5
0
15 Sep 2024
SSDM: Scalable Speech Dysfluency Modeling
SSDM: Scalable Speech Dysfluency Modeling
Jiachen Lian
Xuanru Zhou
Z. Ezzes
Jet M J Vonk
Brittany Morin
D. Baquirin
Zachary Mille
M. G. Tempini
Gopala Anumanchipalli
AuLLM
43
3
0
29 Aug 2024
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
Xuanru Zhou
Anshul Kashyap
Steve Li
Ayati Sharma
Brittany Morin
...
Z. Ezzes
Zachary Miller
M. G. Tempini
Jiachen Lian
Gopala Krishna Anumanchipalli
44
8
0
27 Aug 2024
Large Language Models for Dysfluency Detection in Stuttered Speech
Large Language Models for Dysfluency Detection in Stuttered Speech
Dominik Wagner
Sebastian P. Bayerl
Ilja Baumann
Korbinian Riedhammer
Elmar Nöth
Tobias Bocklet
86
6
0
16 Jun 2024
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude
From LLM to NMT: Advancing Low-Resource Machine Translation with Claude
Maxim Enis
Mark Hopkins
57
41
0
22 Apr 2024
Towards Hierarchical Spoken Language Dysfluency Modeling
Towards Hierarchical Spoken Language Dysfluency Modeling
Jiachen Lian
Gopala Anumanchipalli
37
11
0
18 Jan 2024
Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and
  Detection
Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection
Jiachen Lian
Carly Feng
Naasir Farooqi
Steve Li
Anshul Kashyap
Cheol Jun Cho
Peter Wu
Robin Netzorg
Tingle Li
Gopala Krishna Anumanchipalli
67
15
0
20 Dec 2023
Weakly-supervised forced alignment of disfluent speech using
  phoneme-level modeling
Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling
Theodoros Kouzelis
Georgios Paraskevopoulos
Athanasios Katsamanis
Vassilis Katsouros
47
9
0
30 May 2023
Articulatory Representation Learning Via Joint Factor Analysis and
  Neural Matrix Factorization
Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization
Jiachen Lian
A. Black
Yijingxiu Lu
Louis Goldstein
Shinji Watanabe
Gopala K. Anumanchipalli
72
16
0
29 Oct 2022
Deep Neural Convolutive Matrix Factorization for Articulatory
  Representation Decomposition
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition
Jiachen Lian
A. Black
Louis Goldstein
Gopala Krishna Anumanchipalli
41
18
0
01 Apr 2022
Enhancing ASR for Stuttered Speech with Limited Data Using Detect and
  Pass
Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass
Olabanji Shonibare
Xiaosu Tong
Venkatesh Ravichandran
39
28
0
08 Feb 2022
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech
  Processing
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
164
1,794
0
26 Oct 2021
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
Qiantong Xu
Alexei Baevski
Michael Auli
VLM
87
81
0
23 Sep 2021
Conditional Variational Autoencoder with Adversarial Learning for
  End-to-End Text-to-Speech
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Jaehyeon Kim
Jungil Kong
Juhee Son
DRL
94
866
0
11 Jun 2021
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
36
5,677
0
20 Jun 2020
Universal Phone Recognition with a Multilingual Allophone System
Universal Phone Recognition with a Multilingual Allophone System
Xinjian Li
Siddharth Dalmia
Juncheng Billy Li
Matthew Russell Lee
Patrick Littell
...
Antonios Anastasopoulos
David R. Mortensen
Graham Neubig
A. Black
Florian Metze
19
128
0
26 Feb 2020
Disfluency Detection using a Bidirectional LSTM
Disfluency Detection using a Bidirectional LSTM
Vicky Zayats
Mari Ostendorf
Hannaneh Hajishirzi
39
117
0
12 Apr 2016
1