Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.09418
Cited By
Audio Retrieval with Natural Language Queries: A Benchmark Study
17 December 2021
A. Sophia Koepke
Andreea-Maria Oncescu
João F. Henriques
Zeynep Akata
Samuel Albanie
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Audio Retrieval with Natural Language Queries: A Benchmark Study"
14 / 14 papers shown
Title
Audio-Language Datasets of Scenes and Events: A Survey
Gijs Wijngaard
Elia Formisano
Michele Esposito
M. Dumontier
111
2
0
10 Jan 2025
Do Audio-Language Models Understand Linguistic Variations?
Ramaneswaran Selvakumar
Sonal Kumar
Hemant Kumar Giri
Nishit Anand
Ashish Seth
Sreyan Ghosh
Dinesh Manocha
AuLLM
VLM
101
1
0
21 Oct 2024
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
Rory Young
Nicolas Pugeault
AAML
82
0
0
14 Oct 2024
Language-based Audio Moment Retrieval
Hokuto Munakata
Taichi Nishimura
Shota Nakada
Tatsuya Komatsu
89
1
0
24 Sep 2024
Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions
Yifei Xin
Yuexian Zou
79
9
0
28 Jul 2023
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events
Xuenan Xu
Heinrich Dinkel
Mengyue Wu
Kai Yu
36
25
0
23 Feb 2021
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
504
602
0
21 Jul 2020
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
Qiuqiang Kong
Yin Cao
Turab Iqbal
Yuxuan Wang
Wenwu Wang
Mark D. Plumbley
VLM
SSL
111
1,068
0
21 Dec 2019
Clotho: An Audio Captioning Dataset
Konstantinos Drossos
Samuel Lipping
Tuomas Virtanen
70
381
0
21 Oct 2019
Audio Caption: Listen and Tell
Mengyue Wu
Heinrich Dinkel
Kai Yu
39
61
0
25 Feb 2019
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Antoine Miech
Ivan Laptev
Josef Sivic
45
234
0
07 Apr 2018
NELS -- Never-Ending Learner of Sounds
Benjamin Elizalde
Rohan Badlani
Ankit Parag Shah
Anurag Kumar
Bhiksha Raj
CLL
20
8
0
17 Jan 2018
CNN Architectures for Large-Scale Audio Classification
Shawn Hershey
Sourish Chaudhuri
D. Ellis
J. Gemmeke
A. Jansen
...
Rif A. Saurous
Bryan Seybold
M. Slaney
Ron J. Weiss
K. Wilson
92
2,488
0
29 Sep 2016
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
552
31,406
0
16 Jan 2013
1