ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.06950
  4. Cited By
The Kinetics Human Action Video Dataset

The Kinetics Human Action Video Dataset

19 May 2017
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
Sudheendra Vijayanarasimhan
Fabio Viola
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
ArXivPDFHTML

Papers citing "The Kinetics Human Action Video Dataset"

50 / 2,017 papers shown
Title
Just One Moment: Structural Vulnerability of Deep Action Recognition
  against One Frame Attack
Just One Moment: Structural Vulnerability of Deep Action Recognition against One Frame Attack
Jaehui Hwang
Jun-Hyuk Kim
Jun-Ho Choi
Jong-Seok Lee
AAML
21
15
0
30 Nov 2020
Annotation-Efficient Untrimmed Video Action Recognition
Annotation-Efficient Untrimmed Video Action Recognition
Yixiong Zou
Shanghang Zhang
Guangyao Chen
Yonghong Tian
Kurt Keutzer
J. M. F. Moura
11
5
0
30 Nov 2020
Semi-Supervised Learning for Sparsely-Labeled Sequential Data:
  Application to Healthcare Video Processing
Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing
Florian Dubost
Erin Hong
Nandita Bhaskhar
Siyi Tang
D. Rubin
Christopher Lee-Messer
NoLa
21
0
0
28 Nov 2020
Patch-VQ: 'Patching Up' the Video Quality Problem
Patch-VQ: 'Patching Up' the Video Quality Problem
Zhenqiang Ying
Maniratnam Mandal
Deepti Ghadiyaram
AI Facebook
16
164
0
27 Nov 2020
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of
  Broadcast Soccer Videos
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos
Adrien Deliège
A. Cioppa
Silvio Giancola
M. J. Seikavandi
J. Dueholm
Kamal Nasrollahi
Guohao Li
T. Moeslund
Marc Van Droogenbroeck
20
152
0
26 Nov 2020
Spatio-Temporal Inception Graph Convolutional Networks for
  Skeleton-Based Action Recognition
Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition
Zhen Huang
Xu Shen
Xinmei Tian
Houqiang Li
Jianqiang Huang
Xiansheng Hua
GNN
37
56
0
26 Nov 2020
Group-Skeleton-Based Human Action Recognition in Complex Events
Group-Skeleton-Based Human Action Recognition in Complex Events
Tingtian Li
Zixun Sun
Xiao Chen
32
5
0
26 Nov 2020
t-EVA: Time-Efficient t-SNE Video Annotation
t-EVA: Time-Efficient t-SNE Video Annotation
Soroosh Poorgholi
O. Kayhan
Jan van Gemert
16
5
0
26 Nov 2020
Can Temporal Information Help with Contrastive Self-Supervised Learning?
Can Temporal Information Help with Contrastive Self-Supervised Learning?
Yutong Bai
Haoqi Fan
Ishan Misra
Ganesh Venkatesh
Yongyi Lu
Yuyin Zhou
Qihang Yu
Vikas Chandra
Alan Yuille
24
40
0
25 Nov 2020
Sign language segmentation with temporal convolutional networks
Sign language segmentation with temporal convolutional networks
Katrin Renz
N. Stache
Samuel Albanie
Gül Varol
SLR
24
25
0
25 Nov 2020
Recent Progress in Appearance-based Action Recognition
Recent Progress in Appearance-based Action Recognition
J. Humphreys
Zhe Chen
Dacheng Tao
24
0
0
25 Nov 2020
A3D: Adaptive 3D Networks for Video Action Recognition
A3D: Adaptive 3D Networks for Video Action Recognition
Sijie Zhu
Taojiannan Yang
Matías Mendieta
Chong Chen
3DH
32
12
0
24 Nov 2020
Play Fair: Frame Attributions in Video Models
Play Fair: Frame Attributions in Video Models
Will Price
Dima Damen
FAtt
31
5
0
24 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization
  Tasks
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
35
123
0
23 Nov 2020
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised
  Video Representation Learning
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning
Zehua Zhang
David J. Crandall
AI4TS
SSL
28
23
0
23 Nov 2020
Learnable Sampling 3D Convolution for Video Enhancement and Action
  Recognition
Learnable Sampling 3D Convolution for Video Enhancement and Action Recognition
Shuyang Gu
Jianmin Bao
Dong Chen
28
2
0
22 Nov 2020
Zero-Shot Learning with Knowledge Enhanced Visual Semantic Embeddings
Zero-Shot Learning with Knowledge Enhanced Visual Semantic Embeddings
Karan Sikka
Jihua Huang
Andrew Silberfarb
Prateeth Nayak
Luke Rohrer
Pritish Sahu
John Byrnes
Ajay Divakaran
R. Rohwer
35
4
0
21 Nov 2020
Boundary-sensitive Pre-training for Temporal Localization in Videos
Boundary-sensitive Pre-training for Temporal Localization in Videos
Mengmeng Xu
Juan-Manuel Perez-Rua
Victor Escorcia
Brais Martínez
Xiatian Zhu
Li Zhang
Guohao Li
Tao Xiang
33
61
0
21 Nov 2020
Visual Recognition of Great Ape Behaviours in the Wild
Visual Recognition of Great Ape Behaviours in the Wild
Faizaan Sakib
T. Burghardt
22
24
0
21 Nov 2020
3D attention mechanism for fine-grained classification of table tennis
  strokes using a Twin Spatio-Temporal Convolutional Neural Networks
3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks
Pierre-Etienne Martin
J. Benois-Pineau
Renaud Péteri
J. Morlier
3DPC
30
12
0
20 Nov 2020
Neuro-Symbolic Representations for Video Captioning: A Case for
  Leveraging Inductive Biases for Vision and Language
Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language
Hassan Akbari
Hamid Palangi
Jianwei Yang
Sudha Rao
Asli Celikyilmaz
Roland Fernandez
P. Smolensky
Jianfeng Gao
Shih-Fu Chang
34
3
0
18 Nov 2020
A Hierarchical Multi-Modal Encoder for Moment Localization in Video
  Corpus
A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus
Bowen Zhang
Hexiang Hu
Joonseok Lee
Mingde Zhao
Sheide Chammas
Vihan Jain
Eugene Ie
Fei Sha
25
30
0
18 Nov 2020
RAIST: Learning Risk Aware Traffic Interactions via Spatio-Temporal
  Graph Convolutional Networks
RAIST: Learning Risk Aware Traffic Interactions via Spatio-Temporal Graph Convolutional Networks
Videsh Suman
Phu-Cuong Pham
Aniket Bera
GNN
19
4
0
17 Nov 2020
3D CNNs with Adaptive Temporal Feature Resolutions
3D CNNs with Adaptive Temporal Feature Resolutions
Mohsen Fayyaz
Emad Bahrami Rad
Ali Diba
M. Noroozi
Ehsan Adeli
Luc Van Gool
Juergen Gall
3DPC
24
30
0
17 Nov 2020
Video Big Data Analytics in the Cloud: A Reference Architecture, Survey,
  Opportunities, and Open Research Issues
Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues
A. Alam
I. Ullah
Young-Koo Lee
47
22
0
16 Nov 2020
ActBERT: Learning Global-Local Video-Text Representations
ActBERT: Learning Global-Local Video-Text Representations
Linchao Zhu
Yi Yang
ViT
49
417
0
14 Nov 2020
Unsupervised Video Representation Learning by Bidirectional Feature
  Prediction
Unsupervised Video Representation Learning by Bidirectional Feature Prediction
Nadine Behrmann
Juergen Gall
M. Noroozi
SSL
MDE
32
29
0
11 Nov 2020
Progressive Spatio-Temporal Graph Convolutional Network for
  Skeleton-Based Human Action Recognition
Progressive Spatio-Temporal Graph Convolutional Network for Skeleton-Based Human Action Recognition
Negar Heidari
Alexandros Iosifidis
GNN
3DH
38
14
0
11 Nov 2020
Multimodal Pretraining for Dense Video Captioning
Multimodal Pretraining for Dense Video Captioning
Gabriel Huang
Bo Pang
Zhenhai Zhu
Clara E. Rivera
Radu Soricut
21
81
0
10 Nov 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial
  Expression Recognition
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition
T. Ayral
M. Pedersoli
Simon L Bacon
Eric Granger
CVBM
3DH
13
11
0
10 Nov 2020
Mutual Modality Learning for Video Action Classification
Mutual Modality Learning for Video Action Classification
Stepan Alekseevich Komkov
Maksim Dzabraev
Aleksandr Petiushko
27
9
0
04 Nov 2020
Learning Representations from Audio-Visual Spatial Alignment
Learning Representations from Audio-Visual Spatial Alignment
Pedro Morgado
Yi Li
Nuno Vasconcelos
SSL
27
121
0
03 Nov 2020
Leveraging Activity Recognition to Enable Protective Behavior Detection
  in Continuous Data
Leveraging Activity Recognition to Enable Protective Behavior Detection in Continuous Data
Chongyang Wang
Yuan Gao
Akhil Mathur
A. Williams
Nicholas D. Lane
N. Bianchi-Berthouze
32
34
0
03 Nov 2020
Content-based Analysis of the Cultural Differences between TikTok and
  Douyin
Content-based Analysis of the Cultural Differences between TikTok and Douyin
Li-yao Sun
Haoqi Zhang
Songyang Zhang
Jiebo Luo
18
24
0
03 Nov 2020
PV-NAS: Practical Neural Architecture Search for Video Recognition
PV-NAS: Practical Neural Architecture Search for Video Recognition
Zihao Wang
Chen Lin
Lu Sheng
Junjie Yan
Jing Shao
ViT
25
7
0
02 Nov 2020
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom
  Interpretation
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation
Zhengyuan Yang
Amanda Kay
Yuncheng Li
Wendi F. Cross
Jiebo Luo
33
18
0
30 Oct 2020
Pretext-Contrastive Learning: Toward Good Practices in Self-supervised
  Video Representation Leaning
Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning
L. Tao
Xueting Wang
T. Yamasaki
VLM
SSL
23
14
0
29 Oct 2020
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture
  Searching
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture Searching
Haoyuan Zhang
Yonghong Hou
Pichao Wang
Zihui Guo
Wanqing Li
32
15
0
29 Oct 2020
Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity
  Detection
Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection
Rui Dai
Srijan Das
Saurav Sharma
Luca Minciullo
Lorenzo Garattoni
Francois Bremond
Gianpiero Francesca
23
50
0
28 Oct 2020
Cycle-Contrast for Self-Supervised Video Representation Learning
Cycle-Contrast for Self-Supervised Video Representation Learning
Quan Kong
Wen Wei
Ziwei Deng
Tomoaki Yoshinaga
Tomokazu Murakami
SSL
19
55
0
28 Oct 2020
ElderSim: A Synthetic Data Generation Platform for Human Action
  Recognition in Eldercare Applications
ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications
Hochul Hwang
Cheongjae Jang
Geonwoo Park
Junghyun Cho
Ig-Jae Kim
37
70
0
28 Oct 2020
Multi-object tracking with self-supervised associating network
Multi-object tracking with self-supervised associating network
Tae-Young Chung
Heansung Lee
Myeongah Cho
Suhwan Cho
Sangyoun Lee
VOT
16
0
0
26 Oct 2020
Temporal Attention-Augmented Graph Convolutional Network for Efficient
  Skeleton-Based Human Action Recognition
Temporal Attention-Augmented Graph Convolutional Network for Efficient Skeleton-Based Human Action Recognition
Negar Heidari
Alexandros Iosifidis
GNN
42
30
0
23 Oct 2020
Spatio-temporal Features for Generalized Detection of Deepfake Videos
Spatio-temporal Features for Generalized Detection of Deepfake Videos
Ipek Ganiyusufoglu
L. Ngô
N. Savov
Sezer Karaoglu
Theo Gevers
32
41
0
22 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action
  Recognition
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
Chun-Fu Chen
Yikang Shen
K. Ramakrishnan
Rogerio Feris
J. M. Cohn
A. Oliva
Quanfu Fan
23
95
0
22 Oct 2020
Learning to Sort Image Sequences via Accumulated Temporal Differences
Learning to Sort Image Sequences via Accumulated Temporal Differences
Gagan Kanojia
Shanmuganathan Raman
19
0
0
22 Oct 2020
Shedding Light on Blind Spots: Developing a Reference Architecture to
  Leverage Video Data for Process Mining
Shedding Light on Blind Spots: Developing a Reference Architecture to Leverage Video Data for Process Mining
Wolfgang Kratsch
Fabian König
Maximilian Röglinger
16
25
0
21 Oct 2020
A Short Note on the Kinetics-700-2020 Human Action Dataset
A Short Note on the Kinetics-700-2020 Human Action Dataset
Lucas Smaira
João Carreira
Eric Noland
Ellen Clancy
Amy Wu
Andrew Zisserman
32
137
0
21 Oct 2020
AttendAffectNet: Self-Attention based Networks for Predicting Affective
  Responses from Movies
AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies
Ha Thi Phuong Thao
Balamurali B.T.
Dorien Herremans
Gemma Roig
30
7
0
21 Oct 2020
Unsupervised Domain Adaptation for Spatio-Temporal Action Localization
Unsupervised Domain Adaptation for Spatio-Temporal Action Localization
Nakul Agarwal
Yi-Ting Chen
Behzad Dariush
Ming-Hsuan Yang
27
8
0
19 Oct 2020
Previous
123...293031...394041
Next