1705.07750
Cited By
v1
v2
v3 (latest)
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
22 May 2017
João Carreira
Andrew Zisserman
Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"
50 / 3,647 papers shown
Title
PressureVision: Estimating Hand Pressure from a Single RGB Image
Patrick Grady
Chengcheng Tang
Samarth Brahmbhatt
Christopher D. Twigg
Chengde Wan
James Hays
Charles C. Kemp
3DH
87
20
0
19 Mar 2022
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
Thanh-Dat Truong
Quoc-Huy Bui
C. Duong
Han-Seok Seo
Son Lam Phung
Xin Li
Khoa Luu
ViT
135
51
0
19 Mar 2022
RoVISQ: Reduction of Video Service Quality via Adversarial Attacks on Deep Learning-based Video Compression
Jung-Woo Chang
Mojan Javaheripi
Seira Hidano
F. Koushanfar
98
8
0
18 Mar 2022
Multi-input segmentation of damaged brain in acute ischemic stroke patients using slow fusion with skip connection
Luca Tomasetti
M. Khanmohammadi
K. Engan
Liv Jorunn Høllesli
K. D. Kurz
52
5
0
18 Mar 2022
Local-Global Context Aware Transformer for Language-Guided Video Segmentation
Chen Liang
Wenguan Wang
Tianfei Zhou
Jiaxu Miao
Yawei Luo
Yi Yang
VOS
100
79
0
18 Mar 2022
Group Contextualization for Video Recognition
Y. Hao
Haotong Zhang
Chong-Wah Ngo
Xiangnan He
62
27
0
18 Mar 2022
FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos
Yan Wang
Yixuan Sun
Yiwen Huang
Zhongying Liu
Shuyong Gao
Wei Zhang
Weifeng Ge
Wenqiang Zhang
83
89
0
17 Mar 2022
Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image
Xuanchi Ren
Xiaolong Wang
VGen
108
58
0
17 Mar 2022
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation
Khoa T. Vo
Kashu Yamazaki
Sang Truong
M. Tran
Akihiro Sugimoto
Ngan Le
EgoV
71
9
0
16 Mar 2022
Gate-Shift-Fuse for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
90
24
0
16 Mar 2022
Know your sensORs -- A Modality Study For Surgical Action Classification
Lennart Bastian
Tobias Czempiel
C. Heiliger
K. Karcz
U. Eck
Benjamin Busam
Nassir Navab
83
5
0
16 Mar 2022
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis
Dominik Rivoir
Isabel Funke
Stefanie Speidel
108
19
0
15 Mar 2022
All in One: Exploring Unified Video-Language Pre-training
Alex Jinpeng Wang
Yixiao Ge
Rui Yan
Yuying Ge
Xudong Lin
Guanyu Cai
Jianping Wu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
131
202
0
14 Mar 2022
RCL: Recurrent Continuous Localization for Temporal Action Detection
Qiang Wang
Yanhao Zhang
Yun Zheng
Pan Pan
ObjD
77
38
0
14 Mar 2022
Active Learning by Feature Mixing
Amin Parvaneh
Ehsan Abbasnejad
Damien Teney
Reza Haffari
Anton Van Den Hengel
Javen Qinfeng Shi
81
94
0
14 Mar 2022
Decontextualized I3D ConvNet for ultra-distance runners performance analysis at a glance
David Freire-Obregón
J. Lorenzo-Navarro
Modesto Castrillón-Santana
43
5
0
13 Mar 2022
Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video
Bin Li
Yixuan Weng
Bin Sun
Shutao Li
167
33
0
13 Mar 2022
WLASL-LEX: a Dataset for Recognising Phonological Properties in American Sign Language
Federico Tavella
Viktor Schlegel
Marta Romeo
Aphrodite Galata
Angelo Cangelosi
92
10
0
11 Mar 2022
TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal Reasoning
Shiwen Zhang
AI4TS
96
12
0
11 Mar 2022
BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis
Haiyang Liu
Zihao Zhu
Naoya Iwamoto
Yichen Peng
Zhengqing Li
You Zhou
E. Bozkurt
Bo Zheng
SLR
CVBM
129
144
0
10 Mar 2022
A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach
Xiaohan Lan
Yitian Yuan
Xin Eric Wang
Long Chen
Zhi Wang
Lin Ma
Wenwu Zhu
CML
72
16
0
10 Mar 2022
End-to-End Semantic Video Transformer for Zero-Shot Action Recognition
Keval Doshi
Yasin Yılmaz
ViT
88
2
0
10 Mar 2022
OpenTAL: Towards Open Set Temporal Action Localization
Wentao Bao
Qi Yu
Yu Kong
EDL
74
29
0
10 Mar 2022
Do better ImageNet classifiers assess perceptual similarity better?
Manoj Kumar
N. Houlsby
Nal Kalchbrenner
E. D. Cubuk
113
34
0
09 Mar 2022
Human Gaze Guided Attention for Surgical Activity Recognition
Abdishakour Awale
Duygu Sarikaya
66
0
0
09 Mar 2022
Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition
Yuecong Xu
Jianfei Yang
Haozhi Cao
Keyu Wu
Min-man Wu
Zhenghua Chen
TTA
97
33
0
09 Mar 2022
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation
Yutong Chen
Fangyun Wei
Xiao Sun
Zhirong Wu
Stephen Lin
SLR
88
104
0
08 Mar 2022
End-to-End Semi-Supervised Learning for Video Action Detection
Akash Kumar
Yogesh S Rawat
77
32
0
08 Mar 2022
Gait Recognition with Mask-based Regularization
Chuanfu Shen
Beibei Lin
Shunli Zhang
George Q. Huang
Shiqi Yu
Xin-cen Yu
CVBM
169
19
0
08 Mar 2022
Universal Prototype Transport for Zero-Shot Action Recognition and Localization
Pascal Mettes
99
5
0
08 Mar 2022
Live Laparoscopic Video Retrieval with Compressed Uncertainty
Tong Yu
Pietro Mascagni
J. Verde
J. Marescaux
Didier Mutter
N. Padoy
84
7
0
08 Mar 2022
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos
Yunhao Du
Zhihang Tong
Jun-Jun Wan
Binyu Zhang
Yanyun Zhao
74
3
0
08 Mar 2022
Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
G. Moon
E. Cyr
59
5
0
07 Mar 2022
Behavior Recognition Based on the Integration of Multigranular Motion Features
Lizong Zhang
Yiming Wang
Bei Hui
Xiu Zhang
Sijuan Liu
Shuxin Feng
34
0
0
07 Mar 2022
Learnable Irrelevant Modality Dropout for Multimodal Action Recognition on Modality-Specific Annotated Videos
Saghir Alfasly
Jian Lu
C. Xu
Yuru Zou
103
20
0
06 Mar 2022
Exploring Optical-Flow-Guided Motion and Detection-Based Appearance for Temporal Sentence Grounding
Daizong Liu
Xiang Fang
Wei Hu
Pan Zhou
102
37
0
06 Mar 2022
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation
Linjiang Huang
Liang Wang
Hongsheng Li
AI4TS
137
69
0
06 Mar 2022
Machine Learning Applications in Lung Cancer Diagnosis, Treatment and Prognosis
Yawei Li
Xin Wu
P. Yang
Guoqian Jiang
Yuan Luo
AI4CE
94
2
0
05 Mar 2022
Audio-visual speech separation based on joint feature representation with cross-modal attention
Jun Xiong
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
Yanni Zhang
50
3
0
05 Mar 2022
A Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation Protocol for Segment-level Video Copy Detection
Sifeng He
Xudong Yang
Chenhan Jiang
Gang Liang
Wei Zhang
...
Kaiming Huang
Yuan Cheng
Feng Qian
Xiaobo Zhang
Lei Yang
60
12
0
05 Mar 2022
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
Yunze Liu
Yun-Hai Liu
Chen Jiang
Kangbo Lyu
Weikang Wan
Hao Shen
Bo-Hua Liang
Zhoujie Fu
He Wang
Li Yi
132
188
0
03 Mar 2022
SegTAD: Precise Temporal Action Detection via Semantic Segmentation
Chen Zhao
Merey Ramazanova
Mengmeng Xu
Guohao Li
43
7
0
03 Mar 2022
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
104
109
0
02 Mar 2022
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars
Le Yang
Junwei Han
Dingwen Zhang
90
38
0
02 Mar 2022
TransDARC: Transformer-based Driver Activity Recognition with Latent Space Feature Calibration
Kunyu Peng
Alina Roitberg
Kailun Yang
Jiaming Zhang
Rainer Stiefelhagen
ViT
85
34
0
02 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Jing Tan
Yuhong Wang
Gangshan Wu
Limin Wang
105
15
0
01 Mar 2022
Concept Graph Neural Networks for Surgical Video Understanding
Yutong Ban
J. Eckhoff
Thomas M. Ward
Daniel A. Hashimoto
O. Meireles
Daniela Rus
Guy Rosman
NAI
90
18
0
27 Feb 2022
Continuous Human Action Recognition for Human-Machine Interaction: A Review
Harshala Gammulle
David Ahmedt-Aristizabal
Akila Pemasiri
Lachlan Tychsen-Smith
L. Petersson
Clinton Fookes
124
28
0
26 Feb 2022
On Modality Bias Recognition and Reduction
Yangyang Guo
Liqiang Nie
Harry Cheng
Zhiyong Cheng
Mohan S. Kankanhalli
A. Bimbo
80
28
0
25 Feb 2022
Motion-driven Visual Tempo Learning for Video-based Action Recognition
Yuanzhong Liu
Junsong Yuan
Zhigang Tu
90
61
0
24 Feb 2022