ResearchTrend.AI
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman
arXiv: 1705.07750

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,645 papers shown
Efficient Video Classification Using Fewer Frames
S. Bhardwaj
Mukundhan Srinivasan
Mitesh M. Khapra
81
88
0
27 Feb 2019
Equi-normalization of Neural Networks
Pierre Stock
Benjamin Graham
Rémi Gribonval
Hervé Jégou
ODL
46
18
0
27 Feb 2019
STAR-Net: Action Recognition using Spatio-Temporal Activation Reprojection
William J. McNally
A. Wong
J. McPhee
HAI
3DH
65
26
0
26 Feb 2019
IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition
Ke Yang
Peng Qiao
Dongsheng Li
Y. Dou
ViT
59
8
0
26 Feb 2019
Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey
Longlong Jing
Yingli Tian
SSL
235
1,706
0
16 Feb 2019
Exploring Frame Segmentation Networks for Temporal Action Localization
Ke Yang
Xiaolong Shen
Peng Qiao
Shijie Li
Dongsheng Li
Y. Dou
68
10
0
14 Feb 2019
MOTS: Multi-Object Tracking and Segmentation
P. Voigtlaender
Michael Krause
Aljosa Osep
Jonathon Luiten
Berin Balachandar Gnana Sekar
Andreas Geiger
Bastian Leibe
VOT
94
581
0
10 Feb 2019
Saliency Tubes: Visual Explanations for Spatio-Temporal Convolutions
Alexandros Stergiou
G. Kapidis
Grigorios Kalliatakis
C. Chrysoulas
R. Veltkamp
R. Poppe
FAtt
69
47
0
04 Feb 2019
Differentiable Grammars for Videos
A. Piergiovanni
A. Angelova
Michael S. Ryoo
86
6
0
01 Feb 2019
Anomaly Locality in Video Surveillance
Federico Landi
Cees G. M. Snoek
Rita Cucchiara
74
54
0
29 Jan 2019
DistInit: Learning Video Representations Without a Single Labeled Video
Rohit Girdhar
Du Tran
Lorenzo Torresani
Deva Ramanan
68
54
0
26 Jan 2019
Audio-Visual Scene-Aware Dialog
Huda AlAmri
Vincent Cartillier
Abhishek Das
Jue Wang
A. Cherian
...
Tim K. Marks
Chiori Hori
Peter Anderson
Stefan Lee
Devi Parikh
VGen
61
195
0
25 Jan 2019
Skeleton-based Action Recognition of People Handling Objects
Sunoh Kim
Kimin Yun
Jongyoul Park
J. Choi
61
38
0
21 Jan 2019
Semantic Image Networks for Human Action Recognition
Sunder Ali Khowaja
Seok-Lyong Lee
41
33
0
21 Jan 2019
DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition
Zheng Shou
Xudong Lin
Yannis Kalantidis
Laura Sevilla-Lara
Marcus Rohrbach
Shih-Fu Chang
Zhicheng Yan
VGen
111
120
0
11 Jan 2019
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals
Lu Jin
Zechao Li
Jinhui Tang
46
72
0
09 Jan 2019
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
C. Chang
De-An Huang
Yanan Sui
Li Fei-Fei
Juan Carlos Niebles
145
157
0
09 Jan 2019
Thinking Outside the Pool: Active Training Image Creation for Relative Attributes
Aron Yu
Kristen Grauman
51
23
0
08 Jan 2019
Dynamics are Important for the Recognition of Equine Pain in Video
Sofia Broomé
K. Gleerup
P. Andersen
Hedvig Kjellström
67
26
0
07 Jan 2019
Mutual Context Network for Jointly Estimating Egocentric Gaze and Actions
Yifei Huang
Zhenqiang Li
Minjie Cai
Yoichi Sato
EgoV
82
64
0
07 Jan 2019
Action2Vec: A Crossmodal Embedding Approach to Action Learning
Meera Hahn
Andrew Silva
James M. Rehg
80
58
0
02 Jan 2019
Actor Conditioned Attention Maps for Video Action Detection
Oytun Ulutan
S. Rallapalli
Mudhakar Srivatsa
Carlos Torres
B. S. Manjunath
65
42
0
30 Dec 2018
A Multi-Stream Convolutional Neural Network Framework for Group Activity Recognition
Sina Mokhtarzadeh Azar
Mina Ghadimi Atigh
A. Nickabadi
40
22
0
26 Dec 2018
Coupled Recurrent Network (CRN)
Lin Sun
Kui Jia
Yuejia Shen
Silvio Savarese
Dit-Yan Yeung
Bertram E. Shi
40
4
0
25 Dec 2018
EgoReID Dataset: Person Re-identification in Videos Acquired by Mobile Devices with First-Person Point-of-View
Emrah Basaran
Yonatan Tariku Tesfaye
M. Shah
55
0
0
22 Dec 2018
Temporal Hockey Action Recognition via Pose and Optical Flows
Zixi Cai
H. Neher
Kanav Vats
David A Clausi
John S. Zelek
45
39
0
22 Dec 2018
A Multi-task Neural Approach for Emotion Attribution, Classification and Summarization
Guoyun Tu
Yanwei Fu
Boyang Albert Li
Jiarui Gao
Yu-Gang Jiang
Xiangyang Xue
36
29
0
21 Dec 2018
D3D: Distilled 3D Networks for Video Action Recognition
Jonathan C. Stroud
David A. Ross
Chen Sun
Jia Deng
Rahul Sukthankar
3DPC
58
160
0
19 Dec 2018
From FiLM to Video: Multi-turn Question Answering with Multi-modal Context
T. Nguyen
Shikhar Sharma
Hannes Schulz
Layla El Asri
69
33
0
17 Dec 2018
TAN: Temporal Aggregation Network for Dense Multi-label Action Recognition
Xiyang Dai
Bharat Singh
Joe Yue-Hei Ng
L. Davis
ViT
81
25
0
14 Dec 2018
Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition with Multimodal Training
Mahdi Abavisani
Hamid Reza Vaezi Joze
Vishal M. Patel
78
131
0
14 Dec 2018
Action Machine: Rethinking Action Recognition in Trimmed Videos
Jiagang Zhu
Wei Zou
Liang Xu
Yiming Hu
Zheng Zhu
Manyu Chang
Junjie Huang
Guan Huang
Dalong Du
97
37
0
14 Dec 2018
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition
Hao Huang
Luowei Zhou
Wei Zhang
Jason J. Corso
Chenliang Xu
59
3
0
13 Dec 2018
The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos
Hazel Doughty
W. Mayol-Cuevas
Dima Damen
82
140
0
13 Dec 2018
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
236
481
0
12 Dec 2018
Learning Discriminative Motion Features Through Detection
Gedas Bertasius
Christoph Feichtenhofer
Du Tran
Jianbo Shi
Lorenzo Torresani
77
15
0
11 Dec 2018
SlowFast Networks for Video Recognition
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
201
3,300
0
10 Dec 2018
A Structured Model For Action Detection
Yubo Zhang
P. Tokmakov
M. Hebert
Cordelia Schmid
118
101
0
09 Dec 2018
An Attempt towards Interpretable Audio-Visual Video Captioning
Yapeng Tian
Chenxiao Guan
Justin Goodman
Marc Moore
Chenliang Xu
91
20
0
07 Dec 2018
Tri-axial Self-Attention for Concurrent Activity Recognition
Yanyi Zhang
Xinyu Li
Kaixiang Huang
Yehan Wang
Shuhong Chen
I. Marsic
HAI
27
0
0
06 Dec 2018
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
183
709
0
06 Dec 2018
An Empirical Study towards Understanding How Deep Convolutional Nets Recognize Falls
Yan Zhang
Heiko Neumann
76
5
0
05 Dec 2018
The Visual Centrifuge: Model-Free Layered Video Representations
Jean-Baptiste Alayrac
João Carreira
Andrew Zisserman
87
48
0
04 Dec 2018
Timeception for Complex Action Recognition
Noureldien Hussein
E. Gavves
A. Smeulders
147
215
0
04 Dec 2018
Spatio-Temporal Action Graph Networks
Roei Herzig
Elad Levi
Huijuan Xu
Hang Gao
Eli Brosh
Xiaolong Wang
Amir Globerson
Trevor Darrell
GNN
79
20
0
04 Dec 2018
MS-ASL: A Large-Scale Data Set and Benchmark for Understanding American Sign Language
Hamid Reza Vaezi Joze
Oscar Koller
SLR
87
254
0
03 Dec 2018
SUSiNet: See, Understand and Summarize it
Petros Koutras
Petros Maragos
52
26
0
03 Dec 2018
Towards Accurate Generative Models of Video: A New Metric & Challenges
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM
VGen
101
748
0
03 Dec 2018
Pedestrian Detection with Autoregressive Network Phases
Garrick Brazil
Xiaoming Liu
83
72
0
02 Dec 2018
Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries
Bruce McIntosh
Kevin Duarte
Yogesh S Rawat
M. Shah
MedIm
64
17
0
02 Dec 2018