Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.03533
Cited By
Revisiting Pre-training in Audio-Visual Learning
7 February 2023
Ruoxuan Feng
Wenke Xia
Di Hu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Revisiting Pre-training in Audio-Visual Learning"
4 / 4 papers shown
Title
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
116
264
0
02 Feb 2022
Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
Golnaz Ghiasi
Yin Cui
A. Srinivas
Rui Qian
Tsung-Yi Lin
E. D. Cubuk
Quoc V. Le
Barret Zoph
ISeg
226
968
0
13 Dec 2020
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Yu Zhang
James Qin
Daniel S. Park
Wei Han
Chung-Cheng Chiu
Ruoming Pang
Quoc V. Le
Yonghui Wu
VLM
SSL
143
308
0
20 Oct 2020
Xception: Deep Learning with Depthwise Separable Convolutions
François Chollet
MDE
BDL
PINN
203
14,357
0
07 Oct 2016
1