2505.13237
Cited By
SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information
19 May 2025
Chih-Kai Yang
Neo Ho
Yen-Ting Piao
Hung-yi Lee
AuLLM
LRM
Papers citing
"SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information"
12 / 12 papers shown
Title
A Preliminary Exploration with GPT-4o Voice Mode
Yu-Xiang Lin
Chih-Kai Yang
Wei-Chih Chen
Chen-An Li
Chien-yu Huang
Xuanjun Chen
Hung-yi Lee
AuLLM
81
6
0
17 Feb 2025
DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data
Ke-Han Lu
Zhehuai Chen
Szu-Wei Fu
Chao-Han Huck Yang
Jagadeesh Balam
Boris Ginsburg
Yu-Te Wang
Hung-yi Lee
AuLLM
SyDa
128
16
0
28 Jan 2025
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
Junyi Ao
Yuancheng Wang
Xiaohai Tian
Dekun Chen
Jing Zhang
Lu Lu
Yansen Wang
Haizhou Li
Zhikai Wu
AuLLM
123
23
0
17 Jan 2025
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Chien-yu Huang
Wei-Chih Chen
Shu-Wen Yang
Andy T. Liu
Chen-An Li
...
David Harwath
Shinji Watanabe
Hung-yi Lee
ELM
AuLLM
47
27
0
08 Nov 2024
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Qingkai Fang
Shoutao Guo
Yan Zhou
Zhengrui Ma
Shaolei Zhang
Yang Feng
AuLLM
67
49
0
10 Sep 2024
Distributional reasoning in LLMs: Parallel reasoning processes in multi-hop reasoning
Yuval Shalev
Amir Feder
Ariel Goldstein
LRM
90
9
0
19 Jun 2024
Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models
Chun-Yi Kuan
Wei-Ping Huang
Hung-yi Lee
AuLLM
46
11
0
12 Jun 2024
Do Large Language Models Latently Perform Multi-Hop Reasoning?
Sohee Yang
E. Gribovskaya
Nora Kassner
Mor Geva
Sebastian Riedel
ReLM
LRM
91
99
0
26 Feb 2024
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension
Qian Yang
Jin Xu
Wenrui Liu
Yunfei Chu
Ziyue Jiang
...
Yichong Leng
Yuanjun Lv
Zhou Zhao
Chang Zhou
Jingren Zhou
LM&MA
AuLLM
ALM
70
80
0
12 Feb 2024
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
Sreyan Ghosh
Ashish Seth
Sonal Kumar
Utkarsh Tyagi
Chandra Kiran Reddy Evuru
S. Ramaneswaran
S. Sakshi
Oriol Nieto
R. Duraiswami
Dinesh Manocha
AuLLM
VLM
CoGe
65
26
0
12 Oct 2023
Measuring and Narrowing the Compositionality Gap in Language Models
Ofir Press
Muru Zhang
Sewon Min
Ludwig Schmidt
Noah A. Smith
M. Lewis
ReLM
KELM
LRM
143
617
0
07 Oct 2022
Emotion Detection on TV Show Transcripts with Sequence-based Convolutional Neural Networks
Sayyed M. Zahiri
Jinho Choi
83
222
0
14 Aug 2017