1705.07750
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,647 papers shown
Title
Class Semantics-based Attention for Action Detection
Deepak Sridhar
N. Quader
S. Muralidharan
Yaoxin Li
Peng Dai
Juwei Lu
68
67
0
06 Sep 2021
Temporal Shift Reinforcement Learning
Deep Thomas
Tichakorn Wongpiromsarn
Ali Jannesari
OffRL
55
0
0
05 Sep 2021
Efficient Action Recognition Using Confidence Distillation
Shervin Manzuri Shalmani
Fei Chiang
Ronghuo Zheng
114
6
0
05 Sep 2021
Spatiotemporal Inconsistency Learning for DeepFake Video Detection
Zhihao Gu
Yang Chen
Taiping Yao
Shouhong Ding
Jilin Li
Feiyue Huang
Lizhuang Ma
124
156
0
04 Sep 2021
Revisiting 3D ResNets for Video Recognition
Xianzhi Du
Yeqing Li
Huayu Chen
Rui Qian
Jing Li
Irwan Bello
160
17
0
03 Sep 2021
Hierarchical 3D Feature Learning for Pancreas Segmentation
Federica Proietto Salanitri
Giovanni Bellitto
Ismail Irmakci
S. Palazzo
Ulas Bagci
C. Spampinato
MedIm
44
10
0
03 Sep 2021
Video Pose Distillation for Few-Shot, Fine-Grained Sports Action Recognition
James Hong
Matthew Fisher
Michael Gharbi
Kayvon Fatahalian
3DH
109
41
0
03 Sep 2021
TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions and U-GRUs for skeletal pedestrian crossing prediction
Joseph Gesnouin
Steve Pechberti
B. Stanciulescu
Fabien Moutarde
94
24
0
02 Sep 2021
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos
Nada Osman
Guglielmo Camporese
Pasquale Coscia
Lamberto Ballan
EgoV
108
21
0
02 Sep 2021
Spatio-Temporal Perturbations for Video Attribution
Zhenqiang Li
Weimin Wang
Zuoyue Li
Yifei Huang
Yoichi Sato
60
6
0
01 Sep 2021
LIGAR: Lightweight General-purpose Action Recognition
Evgeny Izutov
48
3
0
30 Aug 2021
Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions
Yang Wu
Dingheng Wang
Xiaotong Lu
Fan Yang
Guoqi Li
W. Dong
Jianbo Shi
108
18
0
30 Aug 2021
Searching for Two-Stream Models in Multivariate Space for Video Recognition
Xinyu Gong
Heng Wang
Zheng Shou
Matt Feiszli
Zhangyang Wang
Zhicheng Yan
87
9
0
30 Aug 2021
Zero-shot Natural Language Video Localization
Jinwoo Nam
Daechul Ahn
Dongyeop Kang
S. Ha
Jonghyun Choi
191
43
0
29 Aug 2021
GroupFormer: Group Activity Recognition with Clustered Spatial-Temporal Transformer
Shuaicheng Li
Qianggang Cao
Lingbo Liu
Kunlin Yang
Shinan Liu
Jun Hou
Shuai Yi
ViT
99
106
0
28 Aug 2021
Learning Cross-modal Contrastive Features for Video Domain Adaptation
Donghyun Kim
Yi-Hsuan Tsai
Bingbing Zhuang
Xiang Yu
Stan Sclaroff
Kate Saenko
Manmohan Chandraker
92
73
0
26 Aug 2021
Shifted Chunk Transformer for Spatio-Temporal Representational Learning
Xuefan Zha
Wentao Zhu
Tingxun Lv
Sen Yang
Ji Liu
AI4TS
ViT
92
27
0
26 Aug 2021
A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games
Henrik Biermann
Jonas Theiner
Manuel Bassek
Dominik Raabe
D. Memmert
Ralph Ewerth
64
11
0
25 Aug 2021
Spatio-Temporal Self-Attention Network for Video Saliency Prediction
Ziqiang Wang
Zhi Liu
Gongyang Li
Yang Wang
Tianhong Zhang
Lihua Xu
Jijun Wang
3DPC
106
47
0
24 Aug 2021
Support-Set Based Cross-Supervision for Video Grounding
Xinpeng Ding
N. Wang
Shiwei Zhang
De Cheng
Xiaomeng Li
Ziyuan Huang
Mingqian Tang
Xinbo Gao
86
42
0
24 Aug 2021
ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning
Zhiwu Qing
Ziyuan Huang
Shiwei Zhang
Mingqian Tang
Changxin Gao
M. Ang
Ronglei Ji
Nong Sang
83
3
0
24 Aug 2021
Dynamic Network Quantization for Efficient Video Inference
Ximeng Sun
Yikang Shen
Chun-Fu Chen
A. Oliva
Rogerio Feris
Kate Saenko
93
46
0
23 Aug 2021
TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment
Jianwei Yang
Yonatan Bisk
Jianfeng Gao
123
140
0
23 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
101
78
0
20 Aug 2021
Weakly-supervised Joint Anomaly Detection and Classification
Snehashis Majhi
Srijan Das
Francois Bremond
Ratnakar Dash
Pankaj K. Sa
33
20
0
20 Aug 2021
Few Shot Activity Recognition Using Variational Inference
Neeraj Kumar
Siddhansh Narang
BDL
VLM
45
5
0
20 Aug 2021
Video Relation Detection via Tracklet based Visual Transformer
Kaifeng Gao
Long Chen
Yifeng Huang
Jun Xiao
ViT
83
30
0
19 Aug 2021
Self-Supervised Video Representation Learning with Meta-Contrastive Network
Yuanze Lin
Xun Guo
Yan Lu
SSL
78
41
0
19 Aug 2021
Social Fabric: Tubelet Compositions for Video Relation Detection
Shuo Chen
Zenglin Shi
Pascal Mettes
Cees G. M. Snoek
ViT
83
21
0
18 Aug 2021
The Multi-Modal Video Reasoning and Analyzing Competition
Haoran Peng
He Huang
Li Xu
Tianjiao Li
Jing Liu
...
Yuanzhong Liu
Tao He
Fuwei Zhang
Xianbin Liu
Tao Lin
54
2
0
18 Aug 2021
Target Adaptive Context Aggregation for Video Scene Graph Generation
Yao Teng
Limin Wang
Zhifeng Li
Gangshan Wu
96
64
0
18 Aug 2021
Channel-Temporal Attention for First-Person Video Domain Adaptation
Xianyuan Liu
Shuo Zhou
Tao Lei
Haiping Lu
EgoV
44
0
0
17 Aug 2021
Group-aware Contrastive Regression for Action Quality Assessment
Xumin Yu
Yongming Rao
Wenliang Zhao
Jiwen Lu
Jie Zhou
AI4TS
85
101
0
17 Aug 2021
Look Who's Talking: Active Speaker Detection in the Wild
You Jin Kim
Hee-Soo Heo
Soyeon Choe
Soo-Whan Chung
Yoohwan Kwon
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
113
21
0
17 Aug 2021
Temporal Action Segmentation with High-level Complex Activity Labels
Guodong Ding
Angela Yao
87
18
0
15 Aug 2021
Exploring Temporal Coherence for More General Video Face Forgery Detection
Yinglin Zheng
Jianmin Bao
Dong Chen
Ming Zeng
Fang Wen
CVBM
ViT
84
216
0
15 Aug 2021
Few-Shot Fine-Grained Action Recognition via Bidirectional Attention and Contrastive Meta-Learning
Jiahao Wang
Yunhong Wang
Sheng Liu
Annan Li
73
17
0
15 Aug 2021
Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization
Linjiang Huang
Liang Wang
Hongsheng Li
100
78
0
14 Aug 2021
Conditional Temporal Variational AutoEncoder for Action Video Prediction
Xiaogang Xu
Yi Wang
Liwei Wang
Bei Yu
Jiaya Jia
VGen
83
5
0
12 Aug 2021
Deep Motion Prior for Weakly-Supervised Temporal Action Localization
Meng Cao
Can Zhang
Long Chen
Mike Zheng Shou
Yuexian Zou
85
22
0
12 Aug 2021
Mounting Video Metadata on Transformer-based Language Model for Open-ended Video Question Answering
Donggeon Lee
Seongho Choi
Youwon Jang
Byoung-Tak Zhang
91
2
0
11 Aug 2021
Preventing Catastrophic Forgetting and Distribution Mismatch in Knowledge Distillation via Synthetic Data
Kuluhan Binici
N. Pham
T. Mitra
K. Leman
86
42
0
11 Aug 2021
Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization
Pilhyeon Lee
H. Byun
134
65
0
11 Aug 2021
Learning to Cut by Watching Movies
Alejandro Pardo
Fabian Caba Heilbron
Juan Carlos León Alcázar
Ali K. Thabet
Guohao Li
VGen
125
20
0
09 Aug 2021
AutoVideo: An Automated Video Action Recognition System
Daochen Zha
Zaid Pervaiz Bhat
Yi-Wei Chen
Yicheng Wang
Sirui Ding
...
Mohammad Bhat
Anmoll Kumar Jain
Alfredo Costilla Reyes
Na Zou
Helen Zhou
118
11
0
09 Aug 2021
Pose is all you need: The pose only group activity recognition system (POGARS)
Haritha Thilakarathne
Aiden Nibali
Zhen He
Stuart Morgan
53
28
0
09 Aug 2021
Discriminative Latent Semantic Graph for Video Captioning
Yang Bai
Junyan Wang
Yang Long
Bingzhang Hu
Yang Song
Maurice Pagnucco
Yu Guan
90
31
0
08 Aug 2021
Skeleton-Contrastive 3D Action Representation Learning
Fida Mohammad Thoker
Hazel Doughty
Cees G. M. Snoek
SSL
83
133
0
08 Aug 2021
Learning an Augmented RGB Representation with Cross-Modal Knowledge Distillation for Action Detection
Rui Dai
Srijan Das
Francois Bremond
86
40
0
08 Aug 2021
Temporal Action Localization Using Gated Recurrent Units
Hassan Keshvari Khojasteh
Hoda Mohammadzade
H. Behroozi
119
3
0
07 Aug 2021