ResearchTrend.AI
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown
Driving Behavior Explanation with Multi-level Fusion
H. Ben-younes
Éloi Zablocki
Patrick Pérez
Matthieu Cord
73
33
0
09 Dec 2020
Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies
Jinzheng Cai
Youbao Tang
K. Yan
Adam P. Harrison
Jing Xiao
Gigin Lin
Le Lu
MedIm
97
31
0
09 Dec 2020
VideoMix: Rethinking Data Augmentation for Video Classification
Sangdoo Yun
Seong Joon Oh
Byeongho Heo
Dongyoon Han
Jinhyung Kim
454
76
0
07 Dec 2020
Hyperspectral Classification Based on Lightweight 3-D-CNN With Transfer Learning
Haokui Zhang
Ying Li
Yenan Jiang
Peng Wang
Qiang Shen
Chunhua Shen
70
171
0
07 Dec 2020
Bifold and Semantic Reasoning for Pedestrian Behavior Prediction
Amir Rasouli
Mohsen Rohani
Jun Luo
113
53
0
06 Dec 2020
Skeleon-Based Typing Style Learning For Person Identification
Lior Gelberg
D. Mendlovic
D. Raviv
3DH
39
0
0
06 Dec 2020
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language
Songyang Zhang
Houwen Peng
Jianlong Fu
Yijuan Lu
Jiebo Luo
91
53
0
04 Dec 2020
Rethinking movie genre classification with fine-grained semantic clustering
Edward Fish
Jon Weinbren
Andrew Gilbert
VLM
79
7
0
04 Dec 2020
Spatial-Temporal Alignment Network for Action Recognition and Detection
Junwei Liang
Liangliang Cao
Xuehan Xiong
Ting Yu
Alexander G. Hauptmann
3DPC
70
9
0
04 Dec 2020
SAM: Self-supervised Learning of Pixel-wise Anatomical Embeddings in Radiological Images
K. Yan
Jinzheng Cai
D. Jin
S. Miao
Dazhou Guo
Adam P. Harrison
Youbao Tang
Jing Xiao
Jingjing Lu
Le Lu
MedIm
103
86
0
04 Dec 2020
SAFCAR: Structured Attention Fusion for Compositional Action Recognition
Tae Soo Kim
Gregory Hager
CoGe
69
10
0
03 Dec 2020
Learning Order Parameters from Videos of Dynamical Phases for Skyrmions with Neural Networks
Weidi Wang
Zeyuan Wang
Yinghui Zhang
Bo Sun
K. Xia
28
1
0
02 Dec 2020
Open-Ended Multi-Modal Relational Reasoning for Video Question Answering
Haozheng Luo
Ruiyang Qin
Chenwei Xu
Guo Ye
Zening Luo
106
4
0
01 Dec 2020
Pose-based Sign Language Recognition using GCN and BERT
Anirudh Tunga
Sai Vidyaranya Nuthalapati
J. Wachs
SLR
63
70
0
01 Dec 2020
Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification
Youngwan Lee
Hyungil Kim
Kimin Yun
Jinyoung Moon
51
12
0
01 Dec 2020
Driver Behavior Extraction from Videos in Naturalistic Driving Datasets with 3D ConvNets
H. Miao
Shenmin Zhang
Carol Flannagan
42
4
0
30 Nov 2020
Video Self-Stitching Graph Network for Temporal Action Localization
Chen Zhao
Ali K. Thabet
Guohao Li
93
141
0
30 Nov 2020
Just One Moment: Structural Vulnerability of Deep Action Recognition against One Frame Attack
Ian Ryu
Jun-Hyuk Kim
Jun-Ho Choi
Jong-Seok Lee
AAML
104
18
0
30 Nov 2020
Annotation-Efficient Untrimmed Video Action Recognition
Yixiong Zou
Shanghang Zhang
Guangyao Chen
Yonghong Tian
Kurt Keutzer
J. M. F. Moura
58
5
0
30 Nov 2020
Depth-Aware Action Recognition: Pose-Motion Encoding through Temporal Heatmaps
Mattia Segu
Federico Pirovano
Gianmario Fumagalli
Amedeo Fabris
59
2
0
26 Nov 2020
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos
Adrien Deliège
A. Cioppa
Silvio Giancola
M. J. Seikavandi
J. Dueholm
Kamal Nasrollahi
Guohao Li
T. Moeslund
Marc Van Droogenbroeck
106
154
0
26 Nov 2020
Group-Skeleton-Based Human Action Recognition in Complex Events
Tingtian Li
Zixun Sun
Xiao Chen
43
5
0
26 Nov 2020
t-EVA: Time-Efficient t-SNE Video Annotation
Soroosh Poorgholi
O. Kayhan
Jan van Gemert
48
5
0
26 Nov 2020
Sign language segmentation with temporal convolutional networks
Katrin Renz
N. Stache
Samuel Albanie
Gül Varol
SLR
55
25
0
25 Nov 2020
Recent Progress in Appearance-based Action Recognition
J. Humphreys
Zhe Chen
Dacheng Tao
57
0
0
25 Nov 2020
Independent Sign Language Recognition with 3D Body, Hands, and Face Reconstruction
Agelos Kratimenos
Georgios Pavlakos
Petros Maragos
SLR
CVBM
42
18
0
24 Nov 2020
A3D: Adaptive 3D Networks for Video Action Recognition
Sijie Zhu
Taojiannan Yang
Matías Mendieta
Chong Chen
3DH
70
13
0
24 Nov 2020
Play Fair: Frame Attributions in Video Models
Will Price
Dima Damen
FAtt
60
5
0
24 Nov 2020
Temporal Action Detection with Multi-level Supervision
Baifeng Shi
Qi Dai
Judy Hoffman
Kate Saenko
Trevor Darrell
Huijuan Xu
90
14
0
24 Nov 2020
Planar 3D Transfer Learning for End to End Unimodal MRI Unbalanced Data Segmentation
M. Kolarík
Radim Burget
C. Travieso-González
J. Kocica
18
6
0
23 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
94
126
0
23 Nov 2020
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning
Zehua Zhang
David J. Crandall
AI4TS
SSL
86
23
0
23 Nov 2020
Learnable Sampling 3D Convolution for Video Enhancement and Action Recognition
Shuyang Gu
Jianmin Bao
Dong Chen
40
2
0
22 Nov 2020
We don't Need Thousand Proposals: Single Shot Actor-Action Detection in Videos
A. J. Rana
Yogesh S Rawat
ViT
46
11
0
22 Nov 2020
The complementarity of a diverse range of deep learning features extracted from video content for video recommendation
A. Almeida
J. D. Villiers
A. Freitas
Mergandran Velayudan
34
17
0
21 Nov 2020
Boundary-sensitive Pre-training for Temporal Localization in Videos
Mengmeng Xu
Juan-Manuel Perez-Rua
Victor Escorcia
Brais Martínez
Xiatian Zhu
Li Zhang
Guohao Li
Tao Xiang
80
61
0
21 Nov 2020
Visual Recognition of Great Ape Behaviours in the Wild
Faizaan Sakib
T. Burghardt
77
24
0
21 Nov 2020
3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks
Pierre-Etienne Martin
J. Benois-Pineau
Renaud Péteri
J. Morlier
3DPC
45
12
0
20 Nov 2020
Consistency-Aware Graph Network for Human Interaction Understanding
Zhenhua Wang
Jiajun Meng
Dongyan Guo
Jianhua Zhang
Javen Qinfeng Shi
Shengyong Chen
GNN
42
3
0
20 Nov 2020
Action Duration Prediction for Segment-Level Alignment of Weakly-Labeled Videos
Reza Ghoddoosian
S. Sayed
V. Athitsos
AI4TS
32
7
0
20 Nov 2020
StressNet: Detecting Stress in Thermal Videos
Shylendra Kumar
A S M Iftekhar
Michael Goebel
Tom Bullock
M. MacLean
Michael B. Miller
Tyler Santander
B. Giesbrecht
Scott T. Grafton
B. S. Manjunath
43
15
0
18 Nov 2020
Master Thesis: Neural Sign Language Translation by Learning Tokenization
Alptekin Orbay
SLR
24
0
0
18 Nov 2020
Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks
Thomas Teixeira
Eric Granger
Alessandro Lameiras Koerich
CVBM
62
10
0
18 Nov 2020
Game Plan: What AI can do for Football, and What Football can do for AI
K. Tuyls
Shayegan Omidshafiei
Paul Muller
Zhe Wang
Jerome T. Connor
...
Simon Bouton
Nathalie Beauguerlange
Jackson Broshear
T. Graepel
Demis Hassabis
100
80
0
18 Nov 2020
A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus
Bowen Zhang
Hexiang Hu
Joonseok Lee
Mingde Zhao
Sheide Chammas
Vihan Jain
Eugene Ie
Fei Sha
92
34
0
18 Nov 2020
RAIST: Learning Risk Aware Traffic Interactions via Spatio-Temporal Graph Convolutional Networks
Videsh Suman
Phu-Cuong Pham
Aniket Bera
GNN
47
4
0
17 Nov 2020
3D CNNs with Adaptive Temporal Feature Resolutions
Mohsen Fayyaz
Emad Bahrami Rad
Ali Diba
M. Noroozi
Ehsan Adeli
Luc Van Gool
Juergen Gall
3DPC
80
31
0
17 Nov 2020
Semi-Supervised Few-Shot Atomic Action Recognition
Xiaoyuan Ni
Sizhe Song
Yu-Wing Tai
Chi-Keung Tang
70
3
0
17 Nov 2020
Multi-Modal Hybrid Architecture for Pedestrian Action Prediction
Amir Rasouli
Tiffany Yau
Mohsen Rohani
Jun Luo
80
43
0
16 Nov 2020
JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition
Jinmiao Cai
Nianjuan Jiang
Xiaoguang Han
Kui Jia
Jiangbo Lu
55
85
0
16 Nov 2020