Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 1,387 papers shown
BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames
Brent A. Griffin
Jason J. Corso
VOS
30
42
0
28 Mar 2019
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
Yao-Hung Hubert Tsai
S. Divvala
Louis-Philippe Morency
Ruslan Salakhutdinov
Ali Farhadi
27
103
0
25 Mar 2019
StartNet: Online Detection of Action Start in Untrimmed Videos
M. Gao
Mingze Xu
L. Davis
R. Socher
Caiming Xiong
38
51
0
23 Mar 2019
On the Importance of Video Action Recognition for Visual Lipreading
Xinshuo Weng
25
3
0
22 Mar 2019
Forecasting Time-to-Collision from Monocular Video: Feasibility, Dataset, and Challenges
A. Manglik
Xinshuo Weng
Eshed Ohn-Bar
Kris Kitani
27
15
0
21 Mar 2019
Cross-task weakly supervised learning from instructional videos
Dimitri Zhukov
Jean-Baptiste Alayrac
R. G. Cinbis
David Fouhey
Ivan Laptev
Josef Sivic
SSL
25
243
0
19 Mar 2019
Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
Jia-Xing Zhong
Nannan Li
Weijie Kong
Shan Liu
Thomas H. Li
Ge Li
NoLa
SSL
24
397
0
18 Mar 2019
Investigation on Combining 3D Convolution of Image Data and Optical Flow to Generate Temporal Action Proposals
Patrick Schlosser
David Münch
Michael Arens
3DPC
18
3
0
11 Mar 2019
COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Yansong Tang
Dajun Ding
Yongming Rao
Yu Zheng
Danyang Zhang
Lili Zhao
Jiwen Lu
Jie Zhou
18
305
0
07 Mar 2019
Video-based surgical skill assessment using 3D convolutional neural networks
Isabel Funke
S. T. Mees
Jürgen Weitz
Stefanie Speidel
16
173
0
06 Mar 2019
MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation
Yazan Abu Farha
Juergen Gall
28
651
0
05 Mar 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
Chong Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
27
82
0
04 Mar 2019
Less is More: Learning Highlight Detection from Video Duration
Bo Xiong
Yannis Kalantidis
Deepti Ghadiyaram
Kristen Grauman
14
108
0
03 Mar 2019
Efficient Video Classification Using Fewer Frames
S. Bhardwaj
Mukundhan Srinivasan
Mitesh M. Khapra
40
88
0
27 Feb 2019
IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition
Ke Yang
Peng Qiao
Dongsheng Li
Y. Dou
ViT
35
8
0
26 Feb 2019
Exploring Frame Segmentation Networks for Temporal Action Localization
Ke Yang
Xiaolong Shen
Peng Qiao
Shijie Li
Dongsheng Li
Y. Dou
41
10
0
14 Feb 2019
DistInit: Learning Video Representations Without a Single Labeled Video
Rohit Girdhar
Du Tran
Lorenzo Torresani
Deva Ramanan
27
54
0
26 Jan 2019
Audio-Visual Scene-Aware Dialog
Huda AlAmri
Vincent Cartillier
Abhishek Das
Jue Wang
A. Cherian
...
Tim K. Marks
Chiori Hori
Peter Anderson
Stefan Lee
Devi Parikh
VGen
25
189
0
25 Jan 2019
Skeleton-based Action Recognition of People Handling Objects
Sunoh Kim
Kimin Yun
Jongyoul Park
J. Choi
29
37
0
21 Jan 2019
Semantic Image Networks for Human Action Recognition
Sunder Ali Khowaja
Seok-Lyong Lee
21
32
0
21 Jan 2019
DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition
Zheng Shou
Xudong Lin
Yannis Kalantidis
Laura Sevilla-Lara
Marcus Rohrbach
Shih-Fu Chang
Zhicheng Yan
VGen
37
120
0
11 Jan 2019
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals
Lu Jin
Zechao Li
Jinhui Tang
16
71
0
09 Jan 2019
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
C. Chang
De-An Huang
Yanan Sui
Li Fei-Fei
Juan Carlos Niebles
22
156
0
09 Jan 2019
Dynamics are Important for the Recognition of Equine Pain in Video
Sofia Broomé
K. Gleerup
P. Andersen
Hedvig Kjellström
35
26
0
07 Jan 2019
EgoReID Dataset: Person Re-identification in Videos Acquired by Mobile Devices with First-Person Point-of-View
Emrah Basaran
Yonatan Tariku Tesfaye
M. Shah
27
0
0
22 Dec 2018
Temporal Hockey Action Recognition via Pose and Optical Flows
Zixi Cai
H. Neher
Kanav Vats
David A Clausi
John S. Zelek
11
38
0
22 Dec 2018
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
Guoyun Tu
Yanwei Fu
Boyang Albert Li
Jiarui Gao
Yu-Gang Jiang
Xiangyang Xue
11
29
0
21 Dec 2018
From FiLM to Video: Multi-turn Question Answering with Multi-modal Context
T. Nguyen
Shikhar Sharma
Hannes Schulz
Layla El Asri
15
33
0
17 Dec 2018
TAN: Temporal Aggregation Network for Dense Multi-label Action Recognition
Xiyang Dai
Bharat Singh
Joe Yue-Hei Ng
L. Davis
ViT
32
25
0
14 Dec 2018
Action Machine: Rethinking Action Recognition in Trimmed Videos
Jiagang Zhu
Wei Zou
Liang Xu
Yiming Hu
Zheng Zhu
Manyu Chang
Junjie Huang
Guan Huang
Dalong Du
35
37
0
14 Dec 2018
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition
Hao Huang
Luowei Zhou
Wei Zhang
Jason J. Corso
Chenliang Xu
24
3
0
13 Dec 2018
The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos
Hazel Doughty
W. Mayol-Cuevas
Dima Damen
36
138
0
13 Dec 2018
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
62
477
0
12 Dec 2018
A Structured Model For Action Detection
Yubo Zhang
P. Tokmakov
M. Hebert
Cordelia Schmid
28
101
0
09 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
36
20
0
07 Dec 2018
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
28
702
0
06 Dec 2018
An Empirical Study towards Understanding How Deep Convolutional Nets Recognize Falls
Yan Zhang
Heiko Neumann
26
5
0
05 Dec 2018
The Visual Centrifuge: Model-Free Layered Video Representations
Jean-Baptiste Alayrac
João Carreira
Andrew Zisserman
21
48
0
04 Dec 2018
Timeception for Complex Action Recognition
Noureldien Hussein
E. Gavves
A. Smeulders
21
212
0
04 Dec 2018
Towards Accurate Generative Models of Video: A New Metric & Challenges
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM
VGen
27
691
0
03 Dec 2018
Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision
M. Joneidi
Alireza Zaeemzadeh
Nazanin Rahnavard
M. Shah
30
13
0
29 Nov 2018
A Coarse-to-fine Deep Convolutional Neural Network Framework for Frame Duplication Detection and Localization in Forged Videos
Chengjiang Long
Arslan Basharat
A. Hoogs
14
5
0
27 Nov 2018
Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles
Dahun Kim
Donghyeon Cho
In So Kweon
SSL
17
343
0
24 Nov 2018
Driver Behavior Recognition via Interwoven Deep Convolutional Neural Nets with Multi-stream Inputs
Chaoyun Zhang
Rui Li
Woojin Kim
Daesub Yoon
P. Patras
31
49
0
22 Nov 2018
Rethinking ImageNet Pre-training
Kaiming He
Ross B. Girshick
Piotr Dollár
VLM
SSeg
47
1,077
0
21 Nov 2018
Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection
Khoi-Nguyen C. Mac
D. Joshi
Raymond A. Yeh
Jinjun Xiong
Rogerio Feris
Minh Do
27
42
0
21 Nov 2018
Compressing Recurrent Neural Networks with Tensor Ring for Action Recognition
Yu Pan
Jing Xu
Maolin Wang
Jinmian Ye
Fei Wang
Kun Bai
Zenglin Xu
MQ
18
104
0
19 Nov 2018
Multi-scale 3D Convolution Network for Video Based Person Re-Identification
Jianing Li
Shiliang Zhang
Tiejun Huang
3DPC
25
163
0
19 Nov 2018
Temporal Recurrent Networks for Online Action Detection
Mingze Xu
M. Gao
Yi-Ting Chen
L. Davis
David J. Crandall
OffRL
31
162
0
18 Nov 2018
Relational Long Short-Term Memory for Video Action Recognition
Zexi Chen
B. Ramachandra
Tianfu Wu
Ranga Raju Vatsavai
24
5
0
16 Nov 2018