Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 1,516 papers shown
Title
Consistency-based Self-supervised Learning for Temporal Anomaly Localization
Aniello Panariello
Angelo Porrello
Simone Calderara
Rita Cucchiara
AI4TS
21
15
0
10 Aug 2022
BabyNet: A Lightweight Network for Infant Reaching Action Recognition in Unconstrained Environments to Support Future Pediatric Rehabilitation Applications
Amel Dechemi
Vikarn Bhakri
Ipsita Sahin
Arjun Modi
Julya Mestas
Pamodya Peiris
Dannya Enriquez Barrundia
Elena Kokkoni
Konstantinos Karydis
30
10
0
09 Aug 2022
Vision-Based Activity Recognition in Children with Autism-Related Behaviors
P. Wei
David Ahmedt-Aristizabal
Harshala Gammulle
Simon Denman
M. Armin
48
31
0
08 Aug 2022
Weakly Supervised Online Action Detection for Infant General Movements
Tong Luo
Jia Xiao
Chuncao Zhang
Siheng Chen
Yuan Tian
Guangjun Yu
K. Dang
Xiaowei Ding
24
2
0
07 Aug 2022
Frozen CLIP Models are Efficient Video Learners
Ziyi Lin
Shijie Geng
Renrui Zhang
Peng Gao
Gerard de Melo
Xiaogang Wang
Jifeng Dai
Yu Qiao
Hongsheng Li
CLIP
VLM
28
202
0
06 Aug 2022
Blockwise Temporal-Spatial Pathway Network
SeulGi Hong
Min-Kook Choi
31
1
0
05 Aug 2022
Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos
Juncheng Billy Li
Junlin Xie
Linchao Zhu
Long Qian
Siliang Tang
...
Haochen Shi
Shengyu Zhang
Longhui Wei
Qi Tian
Yueting Zhuang
41
12
0
03 Aug 2022
Multimodal Generation of Novel Action Appearances for Synthetic-to-Real Recognition of Activities of Daily Living
Zdravko Marinov
David Schneider
Alina Roitberg
Rainer Stiefelhagen
VGen
37
2
0
03 Aug 2022
Dyadic Movement Synchrony Estimation Under Privacy-preserving Conditions
Jicheng Li
Anjana Bhat
R. Barmaki
49
4
0
01 Aug 2022
Video Question Answering with Iterative Video-Text Co-Tokenization
A. Piergiovanni
K. Morton
Weicheng Kuo
Michael S. Ryoo
A. Angelova
39
18
0
01 Aug 2022
Uncertainty-Driven Action Quality Assessment
Caixia Zhou
Yaping Huang
23
10
0
29 Jul 2022
Reducing the Vision and Language Bias for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Wei Hu
23
49
0
27 Jul 2022
Skimming, Locating, then Perusing: A Human-Like Framework for Natural Language Video Localization
Daizong Liu
Wei Hu
37
39
0
27 Jul 2022
Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art Evaluation
Michal Balazia
Philippe Muller
Ákos Levente Tánczos
A. V. Liechtenstein
Franccois Brémond
27
22
0
26 Jul 2022
Object State Change Classification in Egocentric Videos using the Divided Space-Time Attention Mechanism
Md. Mohaiminul Islam
Gedas Bertasius
27
7
0
24 Jul 2022
MAR: Masked Autoencoders for Efficient Action Recognition
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuang Wang
Yiliang Lv
Changxin Gao
Nong Sang
37
42
0
24 Jul 2022
Audio-driven Neural Gesture Reenactment with Video Motion Graphs
Yang Zhou
Jimei Yang
Dingzeyu Li
Jun Saito
Deepali Aneja
E. Kalogerakis
DiffM
SLR
42
20
0
23 Jul 2022
EgoEnv: Human-centric environment representations from egocentric video
Tushar Nagarajan
Santhosh Kumar Ramakrishnan
Ruta Desai
James M. Hillis
Kristen Grauman
EgoV
45
19
0
22 Jul 2022
Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022
María Escobar
Laura Alexandra Daza
Cristina González
Jordi Pont-Tuset
Pablo Arbelaez
21
8
0
22 Jul 2022
Fact sheet: Automatic Self-Reported Personality Recognition Track
Francisca Pessanha
Gizem Sogancioglu
16
6
0
22 Jul 2022
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
Grant Van Horn
Rui Qian
Kimberly Wilber
Hartwig Adam
Oisin Mac Aodha
Serge Belongie
36
10
0
21 Jul 2022
Sequence Models for Drone vs Bird Classification
Fatih Çagatay Akyön
Erdem Akagündüz
S. Altinuc
A. Temi̇zel
26
1
0
21 Jul 2022
A Generalized & Robust Framework For Timestamp Supervision in Temporal Action Segmentation
R. Rahaman
Dipika Singhania
Alexandre Hoang Thiery
Angela Yao
44
2
0
20 Jul 2022
BRACE: The Breakdancing Competition Dataset for Dance Motion Synthesis
Davide Moltisanti
Jinyi Wu
Bo Dai
Chen Change Loy
DiffM
23
4
0
20 Jul 2022
Is an Object-Centric Video Representation Beneficial for Transfer?
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
ViT
42
27
0
20 Jul 2022
ViGAT: Bottom-up event recognition and explanation in video using factorized graph attention network
Nikolaos Gkalelis
Dimitrios Daskalakis
Vasileios Mezaris
24
10
0
20 Jul 2022
Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition
Huabin Liu
Weixian Lv
John See
W. Lin
TTA
34
11
0
20 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
32
1
0
20 Jul 2022
HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers
Tae-Kyung Kang
Gun-Hee Lee
Seong-Whan Lee
38
10
0
20 Jul 2022
Human-to-Robot Imitation in the Wild
Shikhar Bahl
Abhi Gupta
Deepak Pathak
35
166
0
19 Jul 2022
Action Quality Assessment with Temporal Parsing Transformer
Yang Bai
Desen Zhou
Songyang Zhang
Jian Wang
Errui Ding
Yu Guan
Yang Long
Jingdong Wang
ViT
29
39
0
19 Jul 2022
Time Is MattEr: Temporal Self-supervision for Video Transformers
Sukmin Yun
Jaehyung Kim
Dongyoon Han
Hwanjun Song
Jung-Woo Ha
Jinwoo Shin
ViT
19
12
0
19 Jul 2022
Zero-Shot Temporal Action Detection via Vision-Language Prompting
Sauradip Nag
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
VLM
33
65
0
17 Jul 2022
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Yansong Tang
Xingyu Liu
Xumin Yu
Danyang Zhang
Jiwen Lu
Jie Zhou
32
20
0
17 Jul 2022
SVGraph: Learning Semantic Graphs from Instructional Videos
Madeline Chantry Schiappa
Yogesh S Rawat
17
4
0
16 Jul 2022
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
Yuqi Liu
Pengfei Xiong
Luhui Xu
Shengming Cao
Qin Jin
44
114
0
16 Jul 2022
Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Xiaoyi Dong
Jianmin Bao
Ting Zhang
Dongdong Chen
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
22
75
0
14 Jul 2022
ReAct: Temporal Action Detection with Relational Queries
Ding Shi
Yujie Zhong
Qiong Cao
Jing Zhang
Lin Ma
Jia Li
Dacheng Tao
ViT
32
68
0
14 Jul 2022
Forcing the Whole Video as Background: An Adversarial Learning Strategy for Weakly Temporal Action Localization
Ziqiang Li
Yongxin Ge
Jiaruo Yu
Zhongming Chen
26
19
0
14 Jul 2022
Masked Autoencoders that Listen
Po-Yao (Bernie) Huang
Hu Xu
Juncheng Billy Li
Alexei Baevski
Michael Auli
Wojciech Galuba
Florian Metze
Christoph Feichtenhofer
28
270
0
13 Jul 2022
Compound Prototype Matching for Few-shot Action Recognition
Yifei Huang
Lijin Yang
Yoichi Sato
35
44
0
12 Jul 2022
Modality-Aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection
Jiashuo Yu
Jin-Yuan Liu
Ying Cheng
Rui Feng
Yuejie Zhang
29
35
0
12 Jul 2022
Hunting Group Clues with Transformers for Social Group Activity Recognition
Masato Tamura
Rahul Vishwakarma
Ravigopal Vennelakanti
32
23
0
12 Jul 2022
Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis
Long Zhuo
Guangcong Wang
Shikai Li
Wayne Wu
Ziwei Liu
VGen
58
20
0
11 Jul 2022
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval
Jinbin Bai
Chunhui Liu
Feiyue Ni
Haofan Wang
Mengying Hu
Xiaofeng Guo
Lele Cheng
53
11
0
11 Jul 2022
Beyond Transfer Learning: Co-finetuning for Action Localisation
Anurag Arnab
Xuehan Xiong
A. Gritsenko
Rob Romijnders
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
43
8
0
08 Jul 2022
VidConv: A modernized 2D ConvNet for Efficient Video Recognition
Chuong H. Nguyen
Su Huynh
Vinh Nguyen
Ngoc-Khanh Nguyen
ViT
29
3
0
08 Jul 2022
OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning
Mamshad Nayeem Rizve
Navid Kardan
Salman Khan
Fahad Shahbaz Khan
M. Shah
49
50
0
05 Jul 2022
Spatial-Temporal Frequency Forgery Clue for Video Forgery Detection in VIS and NIR Scenario
Yukai Wang
Chunlei Peng
Decheng Liu
N. Wang
Xinbo Gao
52
15
0
05 Jul 2022
Disentangled Action Recognition with Knowledge Bases
Zhekun Luo
Shalini Ghosh
Devin Guillory
Keizo Kato
Trevor Darrell
Huijuan Xu
21
7
0
04 Jul 2022
Previous
1
2
3
...
10
11
12
...
29
30
31
Next