Self-supervised Contrastive Learning for Audio-Visual Action Recognition

28 April 2022
Yang Liu
Y. Tan
Haoyu Lan
    SSL

Papers citing "Self-supervised Contrastive Learning for Audio-Visual Action Recognition"

20 / 20 papers shown
Urban Regional Function Guided Traffic Flow Prediction
Kuo Wang
Lingbo Liu
Yang Liu
Guanbin Li
Fan Zhou
Liang Lin
48
27
0
17 Mar 2023
Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering
Yang Liu
Guanbin Li
Liang Lin
LRM
82
84
0
26 Jul 2022
TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning
Yang Liu
Keze Wang
Lingbo Liu
Hao Lan
Liang Lin
SSL
AI4TS
86
113
0
07 Dec 2021
Barlow Twins: Self-Supervised Learning via Redundancy Reduction
Jure Zbontar
Li Jing
Ishan Misra
Yann LeCun
Stéphane Deny
SSL
300
2,344
0
04 Mar 2021
Exploring Simple Siamese Representation Learning
Xinlei Chen
Kaiming He
SSL
253
4,052
0
20 Nov 2020
Learning Representations from Audio-Visual Spatial Alignment
Pedro Morgado
Yi Li
Nuno Vasconcelos
SSL
71
122
0
03 Nov 2020
Semantics-aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition
Yang Liu
Keze Wang
Guanbin Li
Liang Lin
79
89
0
01 Sep 2020
Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning
Ying Cheng
Ruize Wang
Zhihao Pan
Rui Feng
Yuejie Zhang
SSL
128
108
0
13 Aug 2020
Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework
Li Tao
Xueting Wang
T. Yamasaki
SSL
65
106
0
06 Aug 2020
Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill
Florian Strub
Florent Altché
Corentin Tallec
Pierre Harvey Richemond
...
M. G. Azar
Bilal Piot
Koray Kavukcuoglu
Rémi Munos
Michal Valko
SSL
360
6,797
0
13 Jun 2020
Audio-Visual Instance Discrimination with Cross-Modal Agreement
Pedro Morgado
Nuno Vasconcelos
Ishan Misra
SSL
80
273
0
27 Apr 2020
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
78
252
0
10 Dec 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
196
12,073
0
13 Nov 2019
Transferable Feature Representation for Visible-to-Infrared Cross-Dataset Human Action Recognition
Yang Liu
Zhaoyang Lu
Jing Li
Chao Yao
Yanzi Deng
34
49
0
18 Sep 2019
Global Temporal Representation based CNNs for Infrared Action Recognition
Yang Liu
Zhaoyang Lu
Jing Li
Tao Yang
Chao Yao
45
55
0
18 Sep 2019
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
Evangelos Kazakos
Arsha Nagrani
Andrew Zisserman
Dima Damen
EgoV
57
337
0
22 Aug 2019
Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition
Yang Liu
Zhaoyang Lu
Jing Li
Tao Yang
107
55
0
03 Sep 2018
Objects that Sound
Relja Arandjelović
Andrew Zisserman
ObjD
VOS
98
530
0
18 Dec 2017
R-C3D: Region Convolutional 3D Network for Temporal Activity Detection
Huijuan Xu
Abir Das
Kate Saenko
3DPC
128
718
0
22 Mar 2017
Two-Stream Convolutional Networks for Action Recognition in Videos
Karen Simonyan
Andrew Zisserman
242
7,535
0
09 Jun 2014