ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.06950
  4. Cited By
The Kinetics Human Action Video Dataset

The Kinetics Human Action Video Dataset

19 May 2017
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
Sudheendra Vijayanarasimhan
Fabio Viola
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
ArXivPDFHTML

Papers citing "The Kinetics Human Action Video Dataset"

50 / 2,015 papers shown
Title
Learning from Multiview Correlations in Open-Domain Videos
Learning from Multiview Correlations in Open-Domain Videos
Nils Holzenberger
Shruti Palaskar
Pranava Madhyastha
Florian Metze
R. Arora
SSL
6
11
0
21 Nov 2018
MAC: Mining Activity Concepts for Language-based Temporal Localization
MAC: Mining Activity Concepts for Language-based Temporal Localization
Runzhou Ge
J. Gao
Kan Chen
Ram Nevatia
19
177
0
21 Nov 2018
TSM: Temporal Shift Module for Efficient Video Understanding
TSM: Temporal Shift Module for Efficient Video Understanding
Ji Lin
Chuang Gan
Song Han
47
1,671
0
20 Nov 2018
Multi-Task Learning of Generalizable Representations for Video Action
  Recognition
Multi-Task Learning of Generalizable Representations for Video Action Recognition
Zhiyu Yao
Yunbo Wang
Mingsheng Long
Jianmin Wang
Philip S Yu
Jiaguang Sun
13
3
0
20 Nov 2018
Segregated Temporal Assembly Recurrent Networks for Weakly Supervised
  Multiple Action Detection
Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection
Yunlu Xu
Chengwei Zhang
Zhanzhan Cheng
Jianwen Xie
Yi Niu
Shiliang Pu
Fei Wu
33
80
0
19 Nov 2018
Recurrent Convolutions for Causal 3D CNNs
Recurrent Convolutions for Causal 3D CNNs
Gurkirt Singh
Fabio Cuzzolin
3DPC
19
0
0
17 Nov 2018
Natural Environment Benchmarks for Reinforcement Learning
Natural Environment Benchmarks for Reinforcement Learning
Amy Zhang
Yuxin Wu
Joelle Pineau
OffRL
OOD
12
69
0
14 Nov 2018
Skeleton-Based Action Recognition with Synchronous Local and Non-local
  Spatio-temporal Learning and Frequency Attention
Skeleton-Based Action Recognition with Synchronous Local and Non-local Spatio-temporal Learning and Frequency Attention
Guyue Hu
Bo Cui
Shan Yu
19
40
0
10 Nov 2018
Identify, locate and separate: Audio-visual object extraction in large
  video collections using weak supervision
Identify, locate and separate: Audio-visual object extraction in large video collections using weak supervision
Sanjeel Parekh
A. Ozerov
S. Essid
Ngoc Q. K. Duong
P. Pérez
G. Richard
23
16
0
09 Nov 2018
Cross and Learn: Cross-Modal Self-Supervision
Cross and Learn: Cross-Modal Self-Supervision
Nawid Sayed
Biagio Brattoli
Bjorn Ommer
SSL
25
78
0
09 Nov 2018
Multimodal Grounding for Sequence-to-Sequence Speech Recognition
Multimodal Grounding for Sequence-to-Sequence Speech Recognition
Ozan Caglayan
Ramon Sanabria
Shruti Palaskar
Loïc Barrault
Florian Metze
21
25
0
09 Nov 2018
BAR: Bayesian Activity Recognition using variational inference
BAR: Bayesian Activity Recognition using variational inference
R. Krishnan
Mahesh Subedar
S. Bhatnagar
BDL
UQCV
19
20
0
08 Nov 2018
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video
  Captioning
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning
Yoonchang Sung
Jiawei Wu
Da Zhang
Yu-Chuan Su
Pratap Tokekar
24
39
0
07 Nov 2018
Toward Driving Scene Understanding: A Dataset for Learning Driver
  Behavior and Causal Reasoning
Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning
Vasili Ramanishka
Yi-Ting Chen
Teruhisa Misu
Kate Saenko
14
277
0
06 Nov 2018
Leveraging Random Label Memorization for Unsupervised Pre-Training
Leveraging Random Label Memorization for Unsupervised Pre-Training
Vinaychandran Pondenkandath
Michele Alberti
Sammer Puran
Rolf Ingold
Marcus Liwicki
NoLa
14
14
0
05 Nov 2018
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition
Dongliang He
Zhichao Zhou
Chuang Gan
Fu Li
Xiao-Chang Liu
Yandong Li
Limin Wang
Shilei Wen
30
133
0
05 Nov 2018
Random Temporal Skipping for Multirate Video Analysis
Random Temporal Skipping for Multirate Video Analysis
Yi Zhu
Shawn D. Newsam
6
14
0
30 Oct 2018
GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in
  the Wild
GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild
Lianghua Huang
Xin Zhao
Kaiqi Huang
46
1,323
0
29 Oct 2018
$A^2$-Nets: Double Attention Networks
A2A^2A2-Nets: Double Attention Networks
Yunpeng Chen
Yannis Kalantidis
Jianshu Li
Shuicheng Yan
Jiashi Feng
24
529
0
27 Oct 2018
Fine-grained Video Categorization with Redundancy Reduction Attention
Fine-grained Video Categorization with Redundancy Reduction Attention
Chen Zhu
Xiao Tan
Feng Zhou
Xiao-Chang Liu
Kaiyu Yue
Errui Ding
Yi Ma
31
46
0
26 Oct 2018
Cross-Modal and Hierarchical Modeling of Video and Text
Cross-Modal and Hierarchical Modeling of Video and Text
Bowen Zhang
Hexiang Hu
Fei Sha
BDL
AI4TS
12
187
0
16 Oct 2018
CSI-Net: Unified Human Body Characterization and Pose Recognition
CSI-Net: Unified Human Body Characterization and Pose Recognition
Fei Wang
Jinsong Han
Shiyuan Zhang
Xuming He
Dong Huang
CVBM
19
13
0
07 Oct 2018
Deep Generative Video Compression
Deep Generative Video Compression
Jun Han
Salvator Lombardo
Christopher Schroers
Stephan Mandt
VGen
32
58
0
05 Oct 2018
Representation Flow for Action Recognition
Representation Flow for Action Recognition
A. Piergiovanni
Michael S. Ryoo
30
146
0
02 Oct 2018
Non-local NetVLAD Encoding for Video Classification
Non-local NetVLAD Encoding for Video Classification
Yongyi Tang
Xing Zhang
Jingwen Wang
Shaoxiang Chen
Lin Ma
Yu-Gang Jiang
16
41
0
29 Sep 2018
Using phase instead of optical flow for action recognition
Using phase instead of optical flow for action recognition
Omar Hommos
S. Pintea
Pascal Mettes
Jan van Gemert
45
13
0
10 Sep 2018
Optimizing deep video representation to match brain activity
Optimizing deep video representation to match brain activity
Hugo Richard
Ana Luísa Pinho
B. Thirion
Guillaume Charpiat
8
5
0
07 Sep 2018
Targeted Nonlinear Adversarial Perturbations in Images and Videos
Targeted Nonlinear Adversarial Perturbations in Images and Videos
R. Rey-de-Castro
H. Rabitz
AAML
16
10
0
27 Aug 2018
Predicting Action Tubes
Predicting Action Tubes
Gurkirt Singh
Suman Saha
Fabio Cuzzolin
ViT
27
22
0
23 Aug 2018
Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video
  Action Recognition
Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition
Unaiza Ahsan
Rishi Madhok
Irfan Essa
SSL
14
106
0
22 Aug 2018
Deep Adaptive Temporal Pooling for Activity Recognition
Deep Adaptive Temporal Pooling for Activity Recognition
Sibo Song
Ngai-man Cheung
V. Chandrasekhar
Bappaditya Mandal
6
16
0
22 Aug 2018
Temporal Sequence Distillation: Towards Few-Frame Action Recognition in
  Videos
Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos
Zhaoyang Zhang
Zhanghui Kuang
Ping Luo
Xue Jiang
Wayne Zhang
16
12
0
15 Aug 2018
Fast Video Shot Transition Localization with Deep Structured Models
Fast Video Shot Transition Localization with Deep Structured Models
Shitao Tang
Xue Jiang
Zhanghui Kuang
Yimin Chen
Wayne Zhang
21
45
0
13 Aug 2018
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Victor Escorcia
Ranjay Krishna
S. Buch
Cuong Duc Dao
42
65
0
11 Aug 2018
Video Re-localization
Video Re-localization
Yang Feng
Lin Ma
Wei Liu
Tong Zhang
Jiebo Luo
21
71
0
05 Aug 2018
A Short Note about Kinetics-600
A Short Note about Kinetics-600
João Carreira
Eric Noland
Andras Banki-Horvath
Chloe Hillier
Andrew Zisserman
7
517
0
03 Aug 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action
  Classification
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
Yang Du
Chunfen Yuan
Bing Li
Lili Zhao
Yangxi Li
Weiming Hu
81
79
0
03 Aug 2018
TraMNet - Transition Matrix Network for Efficient Action Tube Proposals
TraMNet - Transition Matrix Network for Efficient Action Tube Proposals
Gurkirt Singh
Suman Saha
Fabio Cuzzolin
22
10
0
01 Aug 2018
Analyzing Human-Human Interactions: A Survey
Analyzing Human-Human Interactions: A Survey
Alexandros Stergiou
R. Poppe
32
14
0
31 Jul 2018
Multi-Fiber Networks for Video Recognition
Multi-Fiber Networks for Video Recognition
Yunpeng Chen
Yannis Kalantidis
Jianshu Li
Shuicheng Yan
Jiashi Feng
CVBM
19
216
0
30 Jul 2018
Actor-Centric Relation Network
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
41
220
0
28 Jul 2018
Diagnosing Error in Temporal Action Detectors
Diagnosing Error in Temporal Action Detectors
Humam Alwassel
Fabian Caba Heilbron
Victor Escorcia
Guohao Li
37
106
0
27 Jul 2018
W-TALC: Weakly-supervised Temporal Activity Localization and
  Classification
W-TALC: Weakly-supervised Temporal Activity Localization and Classification
S. Paul
Sourya Roy
A. Roy-Chowdhury
27
305
0
27 Jul 2018
A Better Baseline for AVA
A Better Baseline for AVA
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
18
66
0
26 Jul 2018
Motion Feature Network: Fixed Motion Filter for Action Recognition
Motion Feature Network: Fixed Motion Filter for Action Recognition
Myunggi Lee
Seungeui Lee
S. Son
Gyutae Park
Nojun Kwak
21
121
0
26 Jul 2018
Few-Shot Adaptation for Multimedia Semantic Indexing
Few-Shot Adaptation for Multimedia Semantic Indexing
Nakamasa Inoue
Koichi Shinoda
VLM
20
6
0
19 Jul 2018
Video-based Person Re-identification via 3D Convolutional Networks and
  Non-local Attention
Video-based Person Re-identification via 3D Convolutional Networks and Non-local Attention
Xingyu Liao
Lingxiao He
Zhouwang Yang
Chi Zhang
3DPC
27
72
0
12 Jul 2018
Cooperative Learning of Audio and Video Models from Self-Supervised
  Synchronization
Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization
Bruno Korbar
Du Tran
Lorenzo Torresani
9
473
0
30 Jun 2018
A flexible model for training action localization with varying levels of
  supervision
A flexible model for training action localization with varying levels of supervision
Guilhem Chéron
Jean-Baptiste Alayrac
Ivan Laptev
Cordelia Schmid
16
41
0
29 Jun 2018
YH Technologies at ActivityNet Challenge 2018
YH Technologies at ActivityNet Challenge 2018
Ting Yao
Xue Li
14
11
0
29 Jun 2018
Previous
123...38394041
Next