ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.07750
  4. Cited By
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
v1v2v3 (latest)

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman
ArXiv (abs)PDFHTML

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown
Title
Challenges and Opportunities for Computer Vision in Real-life Soccer
  Analytics
Challenges and Opportunities for Computer Vision in Real-life Soccer Analytics
Neha Bhargava
Fabio Cuzzolin
95
1
0
13 Apr 2020
Event detection in coarsely annotated sports videos via parallel multi
  receptive field 1D convolutions
Event detection in coarsely annotated sports videos via parallel multi receptive field 1D convolutions
Kanav Vats
M. Fani
Pascale Walters
David A Clausi
John S. Zelek
AI4TS
75
37
0
13 Apr 2020
SpeedNet: Learning the Speediness in Videos
SpeedNet: Learning the Speediness in Videos
Sagie Benaim
Ariel Ephrat
Oran Lang
Inbar Mosseri
William T. Freeman
Michael Rubinstein
Michal Irani
Tali Dekel
92
261
0
13 Apr 2020
A Survey of Single-Scene Video Anomaly Detection
A Survey of Single-Scene Video Anomaly Detection
B. Ramachandra
Michael J. Jones
Ranga Raju Vatsavai
AI4TS
100
181
0
13 Apr 2020
YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in
  Domain-Specific Videos
YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos
Shizhe Chen
Weiying Wang
Ludan Ruan
Linli Yao
Qin Jin
37
3
0
12 Apr 2020
Towards Anomaly Detection in Dashcam Videos
Towards Anomaly Detection in Dashcam Videos
S. Haresh
Sateesh Kumar
M. Zia
Quoc-Huy Tran
94
30
0
11 Apr 2020
Attend and Decode: 4D fMRI Task State Decoding Using Attention Models
Attend and Decode: 4D fMRI Task State Decoding Using Attention Models
Sam Nguyen
Brenda Ng
Alan Kaplan
Priyadip Ray
63
25
0
10 Apr 2020
ASL Recognition with Metric-Learning based Lightweight Network
ASL Recognition with Metric-Learning based Lightweight Network
Evgeny Izutov
SLR
53
6
0
10 Apr 2020
Spatiotemporal Fusion in 3D CNNs: A Probabilistic View
Spatiotemporal Fusion in 3D CNNs: A Probabilistic View
Yizhou Zhou
Xiaoyan Sun
Chong Luo
Zhengjun Zha
Wenjun Zeng
3DPC
65
20
0
10 Apr 2020
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
Hirokatsu Kataoka
Tenga Wakamiya
Kensho Hara
Y. Satoh
3DPC
151
89
0
10 Apr 2020
Models Genesis
Models Genesis
Zongwei Zhou
V. Sodha
Jiaxuan Pang
Michael B. Gotway
Jianming Liang
MedIm
75
34
0
09 Apr 2020
X3D: Expanding Architectures for Efficient Video Recognition
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
187
1,029
0
09 Apr 2020
Temporal Pyramid Network for Action Recognition
Temporal Pyramid Network for Action Recognition
Ceyuan Yang
Yinghao Xu
Jianping Shi
Bo Dai
Bolei Zhou
59
376
0
07 Apr 2020
Dense Regression Network for Video Grounding
Dense Regression Network for Video Grounding
Runhao Zeng
Haoming Xu
Wenbing Huang
Peihao Chen
Mingkui Tan
Chuang Gan
92
284
0
07 Apr 2020
What and Where: Modeling Skeletons from Semantic and Spatial
  Perspectives for Action Recognition
What and Where: Modeling Skeletons from Semantic and Spatial Perspectives for Action Recognition
Lei Shi
Yifan Zhang
Jian Cheng
Hanqing Lu
35
1
0
07 Apr 2020
When, Where, and What? A New Dataset for Anomaly Detection in Driving
  Videos
When, Where, and What? A New Dataset for Anomaly Detection in Driving Videos
Yu Yao
Xizi Wang
Mingze Xu
Zelin Pu
E. Atkins
David J. Crandall
86
44
0
06 Apr 2020
Temporal Shift GAN for Large Scale Video Generation
Temporal Shift GAN for Large Scale Video Generation
Andres Munoz
Mohammadreza Zolfaghari
Max Argus
Thomas Brox
EGVM
73
0
0
04 Apr 2020
TimeGate: Conditional Gating of Segments in Long-range Activities
TimeGate: Conditional Gating of Segments in Long-range Activities
Noureldien Hussein
Mihir Jain
B. Bejnordi
AI4TS
108
16
0
03 Apr 2020
Two-Stream AMTnet for Action Detection
Two-Stream AMTnet for Action Detection
Suman Saha
Gurkirt Singh
Fabio Cuzzolin
ViT
79
13
0
03 Apr 2020
TEA: Temporal Excitation and Aggregation for Action Recognition
TEA: Temporal Excitation and Aggregation for Action Recognition
Yan-Ran Li
Bin Ji
Xintian Shi
Jianguo Zhang
Bin Kang
Limin Wang
ViT
102
450
0
03 Apr 2020
BosphorusSign22k Sign Language Recognition Dataset
BosphorusSign22k Sign Language Recognition Dataset
Ogulcan Özdemir
A. Kındıroglu
Necati Cihan Camgöz
L. Akarun
48
38
0
02 Apr 2020
Knowing What, Where and When to Look: Efficient Video Action Modeling
  with Attention
Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention
Juan-Manuel Perez-Rua
Brais Martínez
Xiatian Zhu
Antoine Toisoul
Victor Escorcia
Tao Xiang
129
19
0
02 Apr 2020
Temporal Accumulative Features for Sign Language Recognition
Temporal Accumulative Features for Sign Language Recognition
A. Kındıroglu
Ogulcan Özdemir
L. Akarun
SLR
43
18
0
02 Apr 2020
Learning Longterm Representations for Person Re-Identification Using
  Radio Signals
Learning Longterm Representations for Person Re-Identification Using Radio Signals
Lijie Fan
Tianhong Li
Rongyao Fang
Rumen Hristov
Yuan. Yuan
Dina Katabi
87
89
0
02 Apr 2020
PaStaNet: Toward Human Activity Knowledge Engine
PaStaNet: Toward Human Activity Knowledge Engine
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Shiyi Wang
Haoshu Fang
Ze Ma
Mingyang Chen
Cewu Lu
97
152
0
02 Apr 2020
Scene-Adaptive Video Frame Interpolation via Meta-Learning
Scene-Adaptive Video Frame Interpolation via Meta-Learning
Myungsub Choi
Janghoon Choi
Sungyong Baik
Tae Hyun Kim
Kyoung Mu Lee
93
46
0
02 Apr 2020
Spatio-temporal Tubelet Feature Aggregation and Object Linking in Videos
Spatio-temporal Tubelet Feature Aggregation and Object Linking in Videos
Daniel Cores
V. Brea
M. Mucientes
ViT
45
1
0
01 Apr 2020
Spatio-Temporal Action Detection with Multi-Object Interaction
Spatio-Temporal Action Detection with Multi-Object Interaction
Huijuan Xu
Lizhi Yang
Stan Sclaroff
Kate Saenko
Trevor Darrell
74
7
0
01 Apr 2020
Weakly-Supervised Action Localization with Expectation-Maximization
  Multi-Instance Learning
Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning
Zhekun Luo
Devin Guillory
Baifeng Shi
Wei Ke
Fang Wan
Trevor Darrell
Huijuan Xu
133
121
0
31 Mar 2020
Explaining Motion Relevance for Activity Recognition in Video Deep
  Learning Models
Explaining Motion Relevance for Activity Recognition in Video Deep Learning Models
Liam Hiley
Alun D. Preece
Y. Hicks
Supriyo Chakraborty
Prudhvi K. Gurram
Richard J. Tomsett
FAtt
90
15
0
31 Mar 2020
SCT: Set Constrained Temporal Transformer for Set Supervised Action
  Segmentation
SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation
Mohsen Fayyaz
Juergen Gall
ViT
75
71
0
31 Mar 2020
Long Short-Term Relation Networks for Video Action Detection
Long Short-Term Relation Networks for Video Action Detection
Dong Li
Ting Yao
Zhaofan Qiu
Houqiang Li
Tao Mei
60
22
0
31 Mar 2020
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation
Boxiao Pan
Haoye Cai
De-An Huang
Kuan-Hui Lee
Adrien Gaidon
Ehsan Adeli
Juan Carlos Niebles
79
236
0
31 Mar 2020
TITAN: Future Forecast using Action Priors
TITAN: Future Forecast using Action Priors
Srikanth Malla
Behzad Dariush
Chiho Choi
125
122
0
31 Mar 2020
RetinaTrack: Online Single Stage Joint Detection and Tracking
RetinaTrack: Online Single Stage Joint Detection and Tracking
Zhichao Lu
V. Rathod
Ronny Votel
Jonathan Huang
VOT
95
190
0
30 Mar 2020
Combining detection and tracking for human pose estimation in videos
Combining detection and tracking for human pose estimation in videos
Manchen Wang
Joseph Tighe
Davide Modolo
VOT
79
110
0
30 Mar 2020
Speech2Action: Cross-modal Supervision for Action Recognition
Speech2Action: Cross-modal Supervision for Action Recognition
Arsha Nagrani
Chen Sun
David A. Ross
Rahul Sukthankar
Cordelia Schmid
Andrew Zisserman
88
54
0
30 Mar 2020
TapLab: A Fast Framework for Semantic Video Segmentation Tapping into
  Compressed-Domain Knowledge
TapLab: A Fast Framework for Semantic Video Segmentation Tapping into Compressed-Domain Knowledge
Junyi Feng
Songyuan Li
Xi Li
Leilei Gan
Qi Tian
Ming-Hsuan Yang
Haibin Ling
VOS
52
24
0
30 Mar 2020
Learning Interactions and Relationships between Movie Characters
Learning Interactions and Relationships between Movie Characters
Anna Kukleva
Makarand Tapaswi
Ivan Laptev
82
51
0
29 Mar 2020
Learning a Weakly-Supervised Video Actor-Action Segmentation Model with
  a Wise Selection
Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection
Jie Chen
Zhiheng Li
Jiebo Luo
Chenliang Xu
54
13
0
29 Mar 2020
Omni-sourced Webly-supervised Learning for Video Recognition
Omni-sourced Webly-supervised Learning for Video Recognition
Haodong Duan
Yue Zhao
Yuanjun Xiong
Wentao Liu
Dahua Lin
VLM
99
88
0
29 Mar 2020
CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks
CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks
Qihang Yu
Yingwei Li
Jieru Mei
Yuyin Zhou
Alan Yuille
3DPC
66
3
0
28 Mar 2020
Actor-Transformers for Group Activity Recognition
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
73
182
0
28 Mar 2020
Weakly-Supervised Action Localization by Generative Attention Modeling
Weakly-Supervised Action Localization by Generative Attention Modeling
Baifeng Shi
Qi Dai
Yadong Mu
Jingdong Wang
WSOL
93
150
0
27 Mar 2020
Multi-Granularity Reference-Aided Attentive Feature Aggregation for
  Video-based Person Re-identification
Multi-Granularity Reference-Aided Attentive Feature Aggregation for Video-based Person Re-identification
Zhizheng Zhang
Cuiling Lan
Wenjun Zeng
Zhibo Chen
VOS
80
100
0
27 Mar 2020
Negative Margin Matters: Understanding Margin in Few-shot Classification
Negative Margin Matters: Understanding Margin in Few-shot Classification
Bin Liu
Yue Cao
Yutong Lin
Qi Li
Zheng Zhang
Mingsheng Long
Han Hu
105
323
0
26 Mar 2020
Coronary Artery Segmentation in Angiographic Videos Using A 3D-2D CE-Net
Coronary Artery Segmentation in Angiographic Videos Using A 3D-2D CE-Net
Lu Wang
Dongxue Liang
Xiao-Lei Yin
Jing Qiu
Zhi-Yun Yang
Jun-Hui Xing
Jian-Zeng Dong
Zhao-Yuan Ma
MedIm
23
0
0
26 Mar 2020
Modeling Cross-view Interaction Consistency for Paired Egocentric
  Interaction Recognition
Modeling Cross-view Interaction Consistency for Paired Egocentric Interaction Recognition
Zhongguo Li
Fan Lyu
Wei Feng
Song Wang
EgoV
27
1
0
24 Mar 2020
Learning Object Permanence from Video
Learning Object Permanence from Video
Aviv Shamsian
Ofri Kleinfeld
Amir Globerson
Gal Chechik
SSL
140
32
0
23 Mar 2020
Temporally Coherent Embeddings for Self-Supervised Video Representation
  Learning
Temporally Coherent Embeddings for Self-Supervised Video Representation Learning
Joshua Knights
Ben Harwood
Daniel Ward
Anthony Vanderkop
Olivia Mackenzie-Ross
Peyman Moghadam
AI4TS
97
38
0
21 Mar 2020
Previous
123...596061...717273
Next