ResearchTrend.AI
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman
arXiv: 1705.07750

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown
Activity Graph Transformer for Temporal Action Localization
Megha Nawhal
Greg Mori
133
71
0
21 Jan 2021
Video Relation Detection with Trajectory-aware Multi-modal Features
W. Xie
Guanghui Ren
Si Liu
127
21
0
20 Jan 2021
Few-shot Action Recognition with Prototype-centered Attentive Learning
Xiatian Zhu
Antoine Toisoul
Juan-Manuel Pérez-Rúa
Li Zhang
Brais Martínez
Tao Xiang
95
53
0
20 Jan 2021
TCLR: Temporal Contrastive Learning for Video Representation
I. Dave
Rohit Gupta
Mamshad Nayeem Rizve
Mubarak Shah
SSL
AI4TS
123
180
0
20 Jan 2021
Machine-Generated Hierarchical Structure of Human Activities to Reveal How Machines Think
Mahsun Altin
Furkan Gursoy
Lina Xu
HAI
AI4CE
29
2
0
19 Jan 2021
Initialization Using Perlin Noise for Training Networks with a Limited Amount of Data
Nakamasa Inoue
Eisuke Yamagata
Hirokatsu Kataoka
43
11
0
19 Jan 2021
CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
Shreyank N. Gowda
Laura Sevilla-Lara
Frank Keller
Marcus Rohrbach
VLM
112
23
0
18 Jan 2021
Coarse Temporal Attention Network (CTA-Net) for Driver's Activity Recognition
Zachary Wharton
Ardhendu Behera
Yonghuai Liu
Nikolaos Bessis
64
36
0
17 Jan 2021
Temporal-Relational CrossTransformers for Few-Shot Action Recognition
Toby Perrett
A. Masullo
T. Burghardt
Majid Mirmehdi
Dima Damen
ViT
99
151
0
15 Jan 2021
Topological Deep Learning
Ephy R. Love
Benjamin Filippenko
Vasileios Maroulas
Gunnar Carlsson
82
11
0
14 Jan 2021
Exploration of Visual Features and their weighted-additive fusion for Video Captioning
V. PraveenS.
Akhilesh Bharadwaj
Harsh Raj
Janhavi Dadhania
Ganesh Samarth C.A
Nikhil Pareek
S. M. I. S. R. Mahadeva Prasanna
50
1
0
14 Jan 2021
Video action recognition for lane-change classification and prediction of surrounding vehicles
Mahdi Biparva
David Fernández-Llorca
Rubén Izquierdo-Gonzalo
John K. Tsotsos
109
47
0
13 Jan 2021
Learning from Weakly-labeled Web Videos via Exploring Sub-Concepts
Kunpeng Li
Zizhao Zhang
Guanhang Wu
Xuehan Xiong
Chen-Yu Lee
Zhichao Lu
Y. Fu
Tomas Pfister
78
5
0
11 Jan 2021
ArrowGAN : Learning to Generate Videos by Learning Arrow of Time
Kibeom Hong
Youngjung Uh
H. Byun
GAN
159
9
0
11 Jan 2021
Entropy-Based Uncertainty Calibration for Generalized Zero-Shot Learning
Zhi Chen
Zi Huang
Jingjing Li
Zheng Zhang
UQCV
41
12
0
09 Jan 2021
InMoDeGAN: Interpretable Motion Decomposition Generative Adversarial Network for Video Generation
Yaohui Wang
Francois Bremond
A. Dantcheva
VGen
GAN
225
25
0
08 Jan 2021
Reinforcement Learning with Latent Flow
Wenling Shang
Xiaofei Wang
A. Srinivas
Aravind Rajeswaran
Yang Gao
Pieter Abbeel
Michael Laskin
OffRL
75
23
0
06 Jan 2021
WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection
Bojia Zi
Minghao Chang
Jingjing Chen
Xingjun Ma
Yu-Gang Jiang
CVBM
153
394
0
05 Jan 2021
Global2Local: Efficient Structure Search for Video Action Segmentation
Shanghua Gao
Qi Han
Zhong-Yu Li
Pai Peng
Liang Wang
Ming-Ming Cheng
EgoV
145
74
0
04 Jan 2021
A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization
Ashraful Islam
Chengjiang Long
Richard J. Radke
117
127
0
03 Jan 2021
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models
Alina Roitberg
Monica Haurilet
Manuel Martínez
Rainer Stiefelhagen
UQCV
53
6
0
02 Jan 2021
Semantics for Robotic Mapping, Perception and Interaction: A Survey
Sourav Garg
Niko Sünderhauf
Feras Dayoub
D. Morrison
Akansel Cosgun
...
Tat-Jun Chin
Ian Reid
Stephen Gould
Peter Corke
Michael Milford
204
117
0
02 Jan 2021
Refining activation downsampling with SoftPool
Alexandros Stergiou
R. Poppe
Grigorios Kalliatakis
89
163
0
02 Jan 2021
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Hengduo Li
Zuxuan Wu
Abhinav Shrivastava
L. Davis
80
35
0
29 Dec 2020
Tensor Representations for Action Recognition
Piotr Koniusz
Lei Wang
A. Cherian
123
70
0
28 Dec 2020
Context-Aware Personality Inference in Dyadic Scenarios: Introducing the UDIVA Dataset
Cristina Palmero
Javier Selva
Sorina Smeureanu
Julio C. S. Jacques Junior
Albert Clapés
...
Zejian Zhang
D. Gallardo-Pujol
G. Guilera
D. Leiva
Sergio Escalera
87
55
0
28 Dec 2020
CNNs for JPEGs: A Study in Computational Cost
Samuel Felipe dos Santos
N. Sebe
Jurandy Almeida
72
2
0
26 Dec 2020
Faster and Accurate Compressed Video Action Recognition Straight from the Frequency Domain
Samuel Felipe dos Santos
Jurandy Almeida
58
16
0
26 Dec 2020
Global Context Networks
Yue Cao
Jiarui Xu
Stephen Lin
Fangyun Wei
Han Hu
ISeg
122
100
0
24 Dec 2020
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
237
2,294
0
23 Dec 2020
Human Action Recognition from Various Data Modalities: A Review
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
MU
184
536
0
22 Dec 2020
SMART Frame Selection for Action Recognition
Shreyank N. Gowda
Marcus Rohrbach
Laura Sevilla-Lara
78
148
0
19 Dec 2020
Temporal Bilinear Encoding Network of Audio-Visual Features at Low Sampling Rates
Feiyan Hu
Eva Mohedano
Noel E. O'Connor
Kevin McGuinness
43
1
0
18 Dec 2020
TDN: Temporal Difference Networks for Efficient Action Recognition
Limin Wang
Zhan Tong
Bin Ji
Gangshan Wu
149
402
0
18 Dec 2020
Exploring Motion Boundaries in an End-to-End Network for Vision-based Parkinson's Severity Assessment
Amirhossein Dadashzadeh
Alan Whone
M. Rolinski
Majid Mirmehdi
55
10
0
17 Dec 2020
Weakly-Supervised Action Localization and Action Recognition using Global-Local Attention of 3D CNN
N. Yudistira
M. Kavitha
Takio Kurita
3DPC
66
13
0
17 Dec 2020
Multi-shot Temporal Event Localization: a Benchmark
Xiaolong Liu
Yao Hu
S. Bai
Fei Ding
X. Bai
Philip Torr
116
84
0
17 Dec 2020
FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
Tarun Kalluri
Deepak Pathak
Manmohan Chandraker
Du Tran
VGen
89
148
0
15 Dec 2020
GTA: Global Temporal Attention for Video Action Understanding
Bo He
Xitong Yang
Zuxuan Wu
Hao Chen
Ser-Nam Lim
Abhinav Shrivastava
ViT
102
28
0
15 Dec 2020
Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses
Chen Ju
Peisen Zhao
Ya Zhang
Yanfeng Wang
Qi Tian
49
27
0
15 Dec 2020
NUTA: Non-uniform Temporal Aggregation for Action Recognition
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Hao Chen
Joseph Tighe
ViT
53
16
0
15 Dec 2020
Temporal Relational Modeling with Self-Supervision for Action Segmentation
Dong Wang
Di Hu
Xingjian Li
Dejing Dou
86
53
0
14 Dec 2020
TDAF: Top-Down Attention Framework for Vision Tasks
Bo Pang
Yizhuo Li
Jiefeng Li
Muchen Li
Hanwen Cao
Cewu Lu
83
10
0
14 Dec 2020
MSAF: Multimodal Split Attention Fusion
Lang Su
Chuqing Hu
Guofa Li
Dongpu Cao
90
39
0
13 Dec 2020
Iterative Knowledge Exchange Between Deep Learning and Space-Time Spectral Clustering for Unsupervised Segmentation in Videos
Emanuela Haller
A. Florea
Marius Leordeanu
VOS
49
8
0
13 Dec 2020
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
129
188
0
11 Dec 2020
D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations
Sanath Narayan
Hisham Cholakkal
Munawar Hayat
Fahad Shahbaz Khan
Ming-Hsuan Yang
Ling Shao
106
55
0
11 Dec 2020
Intrinsic Temporal Regularization for High-resolution Human Video Synthesis
Lingbo Yang
Zhanning Gao
Peiran Ren
Siwei Ma
Wen Gao
3DH
123
1
0
11 Dec 2020
Interactive Fusion of Multi-level Features for Compositional Activity Recognition
Rui Yan
Lingxi Xie
Xiangbo Shu
Jinhui Tang
65
17
0
10 Dec 2020
Developing Motion Code Embedding for Action Recognition in Videos
Maxat Alibayev
D. Paulius
Yu Sun
51
1
0
10 Dec 2020