Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.14579
Cited By
v1
v2
v3 (latest)
Real-Time Idling Vehicles Detection using Combined Audio-Visual Deep Learning
23 May 2023
Xiwen Li
Tristalee Mangin
Surojit Saha
Evan K. Blanchard
Di Tang
Henry Poppe
Nathan Searle
Ouk Choi
Kerry E Kelly
Ross T. Whitaker
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Real-Time Idling Vehicles Detection using Combined Audio-Visual Deep Learning"
16 / 16 papers shown
Title
First-place Solution for Streetscape Shop Sign Recognition Competition
Bin Wang
Li Jing
457
0
0
06 Jan 2025
Self-supervised object detection from audio-visual correspondence
Triantafyllos Afouras
Yuki M. Asano
Francois Fagan
Andrea Vedaldi
Florian Metze
SSL
84
47
0
13 Apr 2021
SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification
Alireza Nasiri
Jianjun Hu
49
18
0
02 Mar 2021
Contrastive Learning of General-Purpose Audio Representations
Aaqib Saeed
David Grangier
Neil Zeghidour
VLM
SSL
76
272
0
21 Oct 2020
3DFCNN: Real-Time Action Recognition using 3D Deep Neural Networks with Raw Depth Information
Adrián Sánchez-Caballero
Sergio de López-Diz
D. Fuentes-Jiménez
Cristina Losada-Gutiérrez
Marta Marrón-Romera
D. Casillas-Pérez
Mohammad Ibrahim Sarker
HAI
96
63
0
13 Jun 2020
Supervised Contrastive Learning
Prannay Khosla
Piotr Teterwak
Chen Wang
Aaron Sarna
Yonglong Tian
Phillip Isola
Aaron Maschinot
Ce Liu
Dilip Krishnan
SSL
165
4,572
0
23 Apr 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting-Li Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
378
18,866
0
13 Feb 2020
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
Qiuqiang Kong
Yin Cao
Turab Iqbal
Yuxuan Wang
Wenwu Wang
Mark D. Plumbley
VLM
SSL
194
1,084
0
21 Dec 2019
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization
Okan Kopuklu
Xiangyu Wei
Gerhard Rigoll
79
144
0
15 Nov 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
213
12,124
0
13 Nov 2019
SlowFast Networks for Video Recognition
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
169
3,282
0
10 Dec 2018
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events
Sanjeel Parekh
S. Essid
A. Ozerov
Ngoc Q. K. Duong
P. Pérez
G. Richard
SSL
65
19
0
19 Apr 2018
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Andrew Owens
Alexei A. Efros
SSL
98
753
0
10 Apr 2018
Object Detection in Video with Spatiotemporal Sampling Networks
Gedas Bertasius
Lorenzo Torresani
Jianbo Shi
ViT
52
220
0
15 Mar 2018
YOLO9000: Better, Faster, Stronger
Joseph Redmon
Ali Farhadi
VLM
ObjD
183
15,633
0
25 Dec 2016
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
522
10,347
0
16 Nov 2016
1