ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.13289
  4. Cited By
SALMONN: Towards Generic Hearing Abilities for Large Language Models
v1v2 (latest)

SALMONN: Towards Generic Hearing Abilities for Large Language Models

20 October 2023
Changli Tang
Wenyi Yu
Guangzhi Sun
Xianzhao Chen
Tian Tan
Wei Li
Lu Lu
Zejun Ma
Chao Zhang
    LM&MAAuLLM
ArXiv (abs)PDFHTMLGithub (1235★)

Papers citing "SALMONN: Towards Generic Hearing Abilities for Large Language Models"

6 / 56 papers shown
Title
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of
  Transcribed Audio
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
Guoguo Chen
Shuzhou Chai
Guan-Bo Wang
Jiayu Du
Weiqiang Zhang
...
Xuchen Yao
Yongqing Wang
Yujun Wang
Zhao You
Zhiyong Yan
116
383
0
13 Jun 2021
GLM: General Language Model Pretraining with Autoregressive Blank
  Infilling
GLM: General Language Model Pretraining with Autoregressive Blank Infilling
Zhengxiao Du
Yujie Qian
Xiao Liu
Ming Ding
J. Qiu
Zhilin Yang
Jie Tang
BDLAI4CE
142
1,553
0
18 Mar 2021
SLURP: A Spoken Language Understanding Resource Package
SLURP: A Spoken Language Understanding Resource Package
E. Bastianelli
Andrea Vanzo
P. Swietojanski
Verena Rieser
VLM
93
231
0
26 Nov 2020
Emotion recognition by fusing time synchronous and time asynchronous
  representations
Emotion recognition by fusing time synchronous and time asynchronous representations
Wen Wu
Chao Zhang
P. Woodland
65
67
0
27 Oct 2020
Clotho: An Audio Captioning Dataset
Clotho: An Audio Captioning Dataset
Konstantinos Drossos
Samuel Lipping
Tuomas Virtanen
101
394
0
21 Oct 2019
Learning Features of Music from Scratch
Learning Features of Music from Scratch
John Thickstun
Zaïd Harchaoui
Sham Kakade
159
202
0
29 Nov 2016
Previous
12