Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.06547
Cited By
Investigations in Audio Captioning: Addressing Vocabulary Imbalance and Evaluating Suitability of Language-Centric Performance Metrics
12 November 2022
Sandeep Reddy Kothinti
Dimitra Emmanouilidou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Investigations in Audio Captioning: Addressing Vocabulary Imbalance and Evaluating Suitability of Language-Centric Performance Metrics"
5 / 5 papers shown
Title
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
Qiuqiang Kong
Yin Cao
Turab Iqbal
Yuxuan Wang
Wenwu Wang
Mark D. Plumbley
VLM
SSL
133
1,068
0
21 Dec 2019
Clotho: An Audio Captioning Dataset
Konstantinos Drossos
Samuel Lipping
Tuomas Virtanen
77
381
0
21 Oct 2019
Audio Caption: Listen and Tell
Mengyue Wu
Heinrich Dinkel
Kai Yu
41
61
0
25 Feb 2019
CNN Architectures for Large-Scale Audio Classification
Shawn Hershey
Sourish Chaudhuri
D. Ellis
J. Gemmeke
A. Jansen
...
Rif A. Saurous
Bryan Seybold
M. Slaney
Ron J. Weiss
K. Wilson
94
2,488
0
29 Sep 2016
CIDEr: Consensus-based Image Description Evaluation
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
229
4,451
0
20 Nov 2014
1