ResearchTrend.AI

Language-based Audio Retrieval Task in DCASE 2022 Challenge

20 September 2022
Huang Xie
Samuel Lipping
Tuomas Virtanen

Papers citing "Language-based Audio Retrieval Task in DCASE 2022 Challenge"

13 / 13 papers shown
Audio-Language Datasets of Scenes and Events: A Survey
Gijs Wijngaard
Elia Formisano
Michele Esposito
M. Dumontier
81
2
0
10 Jan 2025
Language-based Audio Retrieval with Co-Attention Networks
Haoran Sun
Zehua Wang
Qiuyi Chen
Jianjun Chen
Jia Wang
Haiyang Zhang
39
0
0
31 Dec 2024
The language of sound search: Examining User Queries in Audio Search Engines
Benno Weck
Frederic Font
25
1
0
10 Oct 2024
A decade of DCASE: Achievements, practices, evaluations and future challenges
A. Mesaros
Romain Serizel
Toni Heittola
Tuomas Virtanen
Mark D. Plumbley
39
2
0
07 Oct 2024
Computer Audition: From Task-Specific Machine Learning to Foundation Models
Andreas Triantafyllopoulos
Iosif Tsangko
Alexander Gebhard
A. Mesaros
Tuomas Virtanen
Björn Schuller
45
4
0
22 Jul 2024
Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models
Florian Schmid
Khaled Koutini
Gerhard Widmer
18
11
0
24 Oct 2023
Advancing Natural-Language Based Audio Retrieval with PaSST and Large Audio-Caption Data Sets
Paul Primus
Khaled Koutini
Gerhard Widmer
24
13
0
08 Aug 2023
Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances
Huang Xie
Khazar Khorrami
Okko Rasanen
Tuomas Virtanen
16
4
0
16 Jun 2023
Audio-Text Models Do Not Yet Leverage Natural Language
Ho-Hsiang Wu
Oriol Nieto
J. P. Bello
Justin Salamon
VLM
11
28
0
19 Mar 2023
Data leakage in cross-modal retrieval training: A case study
Benno Weck
Xavier Serra
20
7
0
23 Feb 2023
On Negative Sampling for Contrastive Audio-Text Retrieval
Huang Xie
Okko Rasanen
Tuomas Virtanen
27
7
0
08 Nov 2022
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
121
264
0
02 Feb 2022
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
278
31,267
0
16 Jan 2013