2110.07313
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
14 October 2021
Sangeeta Srivastava
Yun Wang
Andros Tjandra
Anurag Kumar
Chunxi Liu
Kritika Singh
Yatharth Saraf
SSL
Papers citing "Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks"
14 / 14 papers shown
Title
Enhancing Infant Crying Detection with Gradient Boosting for Improved Emotional and Mental Health Diagnostics
Kyunghun Lee
Lauren M. Henry
Eleanor Hansen
Elizabeth Tandilashvili
Lauren S. Wakschlag
Elizabeth Norton
Daniel S. Pine
Melissa A. Brotman
Francisco Pereira
23
0
0
11 Oct 2024
A-JEPA: Joint-Embedding Predictive Architecture Can Listen
Zhengcong Fei
Mingyuan Fan
Junshi Huang
25
17
0
27 Nov 2023
Multi-Source Contrastive Learning from Musical Audio
C. Garoufis
Athanasia Zlatintsi
Petros Maragos
24
6
0
14 Feb 2023
Contrastive Audio-Visual Masked Autoencoder
Yuan Gong
Andrew Rouditchenko
Alexander H. Liu
David F. Harwath
Leonid Karlinsky
Hilde Kuehne
James R. Glass
32
120
0
02 Oct 2022
Equivariant Self-Supervision for Musical Tempo Estimation
Elio Quinton
32
9
0
03 Sep 2022
SampleMatch: Drum Sample Retrieval by Musical Context
Stefan Lattner
22
7
0
01 Aug 2022
Masked Autoencoders that Listen
Po-Yao (Bernie) Huang
Hu Xu
Juncheng Billy Li
Alexei Baevski
Michael Auli
Wojciech Galuba
Florian Metze
Christoph Feichtenhofer
13
268
0
13 Jul 2022
AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification
Juncheng Billy Li
Shuhui Qu
Po-Yao (Bernie) Huang
Florian Metze
VLM
27
9
0
25 Mar 2022
A Study on Robustness to Perturbations for Representations of Environmental Sound
Sangeeta Srivastava
Ho-Hsiang Wu
Joao Rulff
Magdalena Fuentes
M. Cartwright
Claudio Silva
Anish Arora
J. P. Bello
20
5
0
20 Mar 2022
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabaleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
35
106
0
02 Mar 2022
Multimodal Self-Supervised Learning of General Audio Representations
Luyu Wang
Pauline Luc
Adrià Recasens
Jean-Baptiste Alayrac
Aaron van den Oord
SSL
78
41
0
26 Apr 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Yin Cui
Boqing Gong
ViT
248
577
0
22 Apr 2021
Exploring wav2vec 2.0 on speaker verification and language identification
Zhiyun Fan
Meng Li
Shiyu Zhou
Bo Xu
103
202
0
11 Dec 2020
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Yu Zhang
James Qin
Daniel S. Park
Wei Han
Chung-Cheng Chiu
Ruoming Pang
Quoc V. Le
Yonghui Wu
VLM
SSL
146
308
0
20 Oct 2020