Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.07750
Cited By
v1
v2
v3 (latest)
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,647 papers shown
Title
SeqHAND:RGB-Sequence-Based 3D Hand Pose and Shape Estimation
John Yang
H. Chang
Seungeui Lee
Nojun Kwak
3DH
108
50
0
10 Jul 2020
Generalized Few-Shot Video Classification with Video Retrieval and Feature Generation
Yongqin Xian
Bruno Korbar
Matthijs Douze
Lorenzo Torresani
Bernt Schiele
Zeynep Akata
VGen
70
18
0
09 Jul 2020
Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision
Peng Wu
Jing Liu
Yujia Shi
Yujia Sun
Fang Shao
Zhaoyang Wu
Zhiwei Yang
107
324
0
09 Jul 2020
Aligning Videos in Space and Time
Senthil Purushwalkam
Tian-Chun Ye
Saurabh Gupta
Abhinav Gupta
77
23
0
09 Jul 2020
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers
Shijie Geng
Peng Gao
Moitreya Chatterjee
Chiori Hori
Jonathan Le Roux
Yongfeng Zhang
Hongsheng Li
A. Cherian
101
11
0
08 Jul 2020
Unsupervised object-centric video generation and decomposition in 3D
Paul Henderson
Christoph H. Lampert
OCL
107
36
0
07 Jul 2020
Decoupled Spatial-Temporal Attention Network for Skeleton-Based Action Recognition
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
86
49
0
07 Jul 2020
VPN: Learning Video-Pose Embedding for Activities of Daily Living
Srijan Das
Saurav Sharma
Rui Dai
Francois Bremond
Monique Thonnat
ViT
116
127
0
06 Jul 2020
Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos
Mahsa Ehsanpour
Alireza Abedin
F. Saleh
Javen Qinfeng Shi
Ian Reid
Hamid Rezatofighi
111
71
0
06 Jul 2020
Quo Vadis, Skeleton Action Recognition ?
Pranay Gupta
Anirudh Thatipelli
Aditya Aggarwal
Shubhanshu Maheshwari
Neel Trivedi
Sourav Das
Ravi Kiran Sarvadevabhatla
97
65
0
04 Jul 2020
Modality Shifting Attention Network for Multi-modal Video Question Answering
Junyeong Kim
Minuk Ma
T. Pham
Kyungsu Kim
Chang D. Yoo
84
72
0
04 Jul 2020
Weakly Supervised Temporal Action Localization with Segment-Level Labels
Xinpeng Ding
Nannan Wang
Xinbo Gao
Jie Li
Xiaoyu Wang
Tongliang Liu
60
12
0
03 Jul 2020
Attention-Oriented Action Recognition for Real-Time Human-Robot Interaction
Ziyang Song
Ziyi Yin
Zejian Yuan
Chong Zhang
Wanchao Chi
Yonggen Ling
Shenghao Zhang
112
21
0
02 Jul 2020
Low-light Environment Neural Surveillance
Michael Potter
Henry Gridley
Noah Lichtenstein
Kevin Hines
John Nguyen
Jacob Walsh
19
3
0
02 Jul 2020
Group Ensemble: Learning an Ensemble of ConvNets in a single ConvNet
Hao Chen
Abhinav Shrivastava
57
14
0
01 Jul 2020
The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose
Yizhak Ben-Shabat
Xin Yu
F. Saleh
Dylan Campbell
Cristian Rodriguez-Opazo
Hongdong Li
Stephen Gould
102
115
0
01 Jul 2020
Ultra2Speech -- A Deep Learning Framework for Formant Frequency Estimation and Tracking from Ultrasound Tongue Images
Pramit Saha
Yadong Liu
B. Gick
S. Fels
39
12
0
29 Jun 2020
Self-Supervised MultiModal Versatile Networks
Jean-Baptiste Alayrac
Adrià Recasens
R. Schneider
Relja Arandjelović
Jason Ramapuram
J. Fauw
Lucas Smaira
Sander Dieleman
Andrew Zisserman
SSL
204
375
0
29 Jun 2020
Automatic Operating Room Surgical Activity Recognition for Robot-Assisted Surgery
Aidean Sharghi
Helene Haugerud
Daniel Oh
Omid Mohareri
86
46
0
29 Jun 2020
Explainable 3D Convolutional Neural Networks by Learning Temporal Transformations
Gabrielle Ras
L. Ambrogioni
Pim Haselager
Marcel van Gerven
Umut Gucclu
3DPC
16
3
0
29 Jun 2020
Unsupervised Learning of Video Representations via Dense Trajectory Clustering
P. Tokmakov
M. Hebert
Cordelia Schmid
SSL
59
22
0
28 Jun 2020
Learning Goals from Failure
Dave Epstein
Carl Vondrick
27
3
0
28 Jun 2020
Dynamic Sampling Networks for Efficient Action Recognition in Videos
Yin-Dong Zheng
Zhaoyang Liu
Tong Lu
Limin Wang
77
77
0
28 Jun 2020
Video Representation Learning with Visual Tempo Consistency
Ceyuan Yang
Yinghao Xu
Bo Dai
Bolei Zhou
68
92
0
28 Jun 2020
Deepfake Detection using Spatiotemporal Convolutional Networks
Oscar de Lima
Sean Franklin
Shreshtha Basu
Blake Karwoski
A. George
3DPC
78
114
0
26 Jun 2020
Space-Time Correspondence as a Contrastive Random Walk
Allan Jabri
Andrew Owens
Alexei A. Efros
SSL
OT
152
304
0
25 Jun 2020
SmallBigNet: Integrating Core and Contextual Views for Video Classification
Xianhang Li
Yali Wang
Zhipeng Zhou
Yu Qiao
ViT
67
92
0
25 Jun 2020
Rescaling Egocentric Vision
Dima Damen
Hazel Doughty
G. Farinella
Antonino Furnari
Evangelos Kazakos
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
178
471
0
23 Jun 2020
Sequential Feature Filtering Classifier
Min-seok Seo
Jaemin Lee
Jongchan Park
Dong-Geol Choi
38
3
0
21 Jun 2020
Weak Supervision and Referring Attention for Temporal-Textual Association Learning
Zhiyuan Fang
Shu Kong
Zhe Wang
Charless C. Fowlkes
Yezhou Yang
68
17
0
21 Jun 2020
Motion Representation Using Residual Frames with 3D CNN
Li Tao
Xueting Wang
T. Yamasaki
3DPC
55
1
0
21 Jun 2020
Driver Intention Anticipation Based on In-Cabin and Driving Scene Monitoring
Yao Rong
Zeynep Akata
Enkelejda Kasneci
49
31
0
20 Jun 2020
Forward Prediction for Physical Reasoning
Rohit Girdhar
Laura Gustafson
Aaron B. Adcock
Laurens van der Maaten
LRM
AI4CE
129
21
0
18 Jun 2020
Language Guided Networks for Cross-modal Moment Retrieval
Kun Liu
Huadong Ma
Chuang Gan
32
2
0
18 Jun 2020
MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation
Shijie Li
Yazan Abu Farha
Yun-Hai Liu
Mingg-Ming Cheng
Juergen Gall
76
31
0
16 Jun 2020
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Andrew Rouditchenko
Angie Boggust
David Harwath
Brian Chen
D. Joshi
...
Rogerio Feris
Brian Kingsbury
M. Picheny
Antonio Torralba
James R. Glass
SSL
88
142
0
16 Jun 2020
1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020
Siyu Chen
Junting Pan
Guanglu Song
Manyuan Zhang
Hao Shao
Ziyi Lin
Jing Shao
Hongsheng Li
Yu Liu
3DPC
51
4
0
16 Jun 2020
Learn to cycle: Time-consistent feature discovery for action recognition
Alexandros Stergiou
R. Poppe
52
23
0
15 Jun 2020
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
Junting Pan
Siyu Chen
Zheng Shou
Yu Liu
Jing Shao
Hongsheng Li
3DPC
112
151
0
14 Jun 2020
Team RUC_AIM3 Technical Report at Activitynet 2020 Task 2: Exploring Sequential Events Detection for Dense Video Captioning
Yuqing Song
Shizhe Chen
Yida Zhao
Qin Jin
30
6
0
14 Jun 2020
Uncertainty-aware Score Distribution Learning for Action Quality Assessment
Yansong Tang
Zanlin Ni
Jiahuan Zhou
Danyang Zhang
Jiwen Lu
Ying Nian Wu
Jie Zhou
EDL
107
127
0
13 Jun 2020
DTG-Net: Differentiated Teachers Guided Self-Supervised Video Action Recognition
Ziming Liu
Guangyu Gao
•. A. K. Qin
Jinyang Li
ViT
60
1
0
13 Jun 2020
CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1)
Xiang Wang
Baiteng Ma
Zhiwu Qing
Yongpeng Sang
Changxin Gao
Shiwei Zhang
Nong Sang
36
18
0
13 Jun 2020
Temporal Fusion Network for Temporal Action Localization:Submission to ActivityNet Challenge 2020 (Task E)
Zhiwu Qing
Xiang Wang
Yongpeng Sang
Changxin Gao
Shiwei Zhang
Nong Sang
31
3
0
13 Jun 2020
The DeepFake Detection Challenge (DFDC) Dataset
Brian Dolhansky
Joanna Bitton
Ben Pflaum
Jikuo Lu
Russ Howes
Menglin Wang
Cristian Canton Ferrer
PICV
88
247
0
12 Jun 2020
Video Understanding as Machine Translation
Bruno Korbar
Fabio Petroni
Rohit Girdhar
Lorenzo Torresani
SSL
88
29
0
12 Jun 2020
Weakly-supervised Temporal Action Localization by Uncertainty Modeling
Pilhyeon Lee
Jinglu Wang
Yan Lu
H. Byun
EDL
62
11
0
12 Jun 2020
Disentangled Non-Local Neural Networks
Minghao Yin
Zhuliang Yao
Yue Cao
Xiu Li
Zheng Zhang
Stephen Lin
Han Hu
163
330
0
11 Jun 2020
Privacy-Aware Activity Classification from First Person Office Videos
Partho Ghosh
Md. Abrar Istiak
Nayeeb Rashid
Ahsan Habib Akash
Ridwan Abrar
Ankan Ghosh Dastider
Asif Sushmit
Taufiq Hasan
PICV
47
2
0
11 Jun 2020
TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model
Bo Pang
Yizhuo Li
Yifan Zhang
Muchen Li
Cewu Lu
VOT
51
239
0
10 Jun 2020
Previous
1
2
3
...
57
58
59
...
71
72
73
Next