ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.06950
  4. Cited By
The Kinetics Human Action Video Dataset

The Kinetics Human Action Video Dataset

19 May 2017
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
Sudheendra Vijayanarasimhan
Fabio Viola
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
ArXivPDFHTML

Papers citing "The Kinetics Human Action Video Dataset"

50 / 2,016 papers shown
Title
Learning Energy-based Spatial-Temporal Generative ConvNets for Dynamic
  Patterns
Learning Energy-based Spatial-Temporal Generative ConvNets for Dynamic Patterns
Jianwen Xie
Song-Chun Zhu
Ying Nian Wu
GAN
29
52
0
26 Sep 2019
Joint-task Self-supervised Learning for Temporal Correspondence
Joint-task Self-supervised Learning for Temporal Correspondence
Xueting Li
Sifei Liu
Shalini De Mello
Xiaolong Wang
Jan Kautz
Ming-Hsuan Yang
SSL
21
139
0
26 Sep 2019
Gated Channel Transformation for Visual Recognition
Gated Channel Transformation for Visual Recognition
Zongxin Yang
Linchao Zhu
Yu Wu
Yezhou Yang
ViT
22
203
0
25 Sep 2019
Retro-Actions: Learning 'Close' by Time-Reversing Ópen' Videos
Retro-Actions: Learning 'Close' by Time-Reversing Ópen' Videos
Will Price
Dima Damen
23
6
0
20 Sep 2019
Learning 3D-aware Egocentric Spatial-Temporal Interaction via Graph
  Convolutional Networks
Learning 3D-aware Egocentric Spatial-Temporal Interaction via Graph Convolutional Networks
Chengxi Li
Yue Meng
Stanley H. Chan
Yi-Ting Chen
GNN
16
61
0
20 Sep 2019
Adversarial Attack on Skeleton-based Human Action Recognition
Adversarial Attack on Skeleton-based Human Action Recognition
Jian Liu
Naveed Akhtar
Ajmal Mian
AAML
27
68
0
14 Sep 2019
Zero-Shot Action Recognition in Videos: A Survey
Zero-Shot Action Recognition in Videos: A Survey
Valter Estevam
Hélio Pedrini
David Menotti
33
57
0
13 Sep 2019
Reasoning About Human-Object Interactions Through Dual Attention
  Networks
Reasoning About Human-Object Interactions Through Dual Attention Networks
Tete Xiao
Quanfu Fan
Dan Gutfreund
Mathew Monfort
A. Oliva
Bolei Zhou
17
33
0
10 Sep 2019
Video Representation Learning by Dense Predictive Coding
Video Representation Learning by Dense Predictive Coding
Tengda Han
Weidi Xie
Andrew Zisserman
SSL
18
359
0
10 Sep 2019
Explainable Deep Learning for Video Recognition Tasks: A Framework &
  Recommendations
Explainable Deep Learning for Video Recognition Tasks: A Framework & Recommendations
Liam Hiley
Alun D. Preece
Y. Hicks
XAI
19
15
0
07 Sep 2019
Discriminative Video Representation Learning Using Support Vector
  Classifiers
Discriminative Video Representation Learning Using Support Vector Classifiers
Jue Wang
A. Cherian
25
5
0
05 Sep 2019
Out the Window: A Crowd-Sourced Dataset for Activity Classification in
  Security Video
Out the Window: A Crowd-Sourced Dataset for Activity Classification in Security Video
Greg Castañón
N. Shnidman
Tim Anderson
J. Byrne
19
1
0
28 Aug 2019
Controllable Video Captioning with POS Sequence Guidance Based on Gated
  Fusion Network
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
74
163
0
27 Aug 2019
Non-local Recurrent Neural Memory for Supervised Sequence Modeling
Non-local Recurrent Neural Memory for Supervised Sequence Modeling
Canmiao Fu
Wenjie Pei
Qiong Cao
Chaopeng Zhang
Yong Zhao
Xiaoyong Shen
Yu-Wing Tai
24
11
0
26 Aug 2019
Action recognition with spatial-temporal discriminative filter banks
Action recognition with spatial-temporal discriminative filter banks
Brais Martínez
Davide Modolo
Yuanjun Xiong
Joseph Tighe
23
66
0
20 Aug 2019
Proposal-free Temporal Moment Localization of a Natural-Language Query
  in Video using Guided Attention
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
Cristian Rodriguez-Opazo
Edison Marrese-Taylor
F. Saleh
Hongdong Li
Stephen Gould
30
147
0
20 Aug 2019
Weakly-supervised Action Localization with Background Modeling
Weakly-supervised Action Localization with Background Modeling
P. Nguyen
Deva Ramanan
Charless C. Fowlkes
SSL
WSOL
27
157
0
19 Aug 2019
Gradient Weighted Superpixels for Interpretability in CNNs
Gradient Weighted Superpixels for Interpretability in CNNs
Thomas Hartley
K. Sidorov
C. Willis
David Marshall
FAtt
12
3
0
16 Aug 2019
Differentiable Learning-to-Group Channels via Groupable Convolutional
  Neural Networks
Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks
Zhaoyang Zhang
Jingyu Li
Wenqi Shao
Zhanglin Peng
Ruimao Zhang
Xiaogang Wang
Ping Luo
22
37
0
16 Aug 2019
TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for
  Video Saliency Detection
TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection
Kyle Min
Jason J. Corso
30
149
0
15 Aug 2019
Bypass Enhancement RGB Stream Model for Pedestrian Action Recognition of
  Autonomous Vehicles
Bypass Enhancement RGB Stream Model for Pedestrian Action Recognition of Autonomous Vehicles
Dong Cao
Lisha Xu
17
2
0
15 Aug 2019
Three Branches: Detecting Actions With Richer Features
Three Branches: Detecting Actions With Richer Features
Jinchao Xia
Jiajun Tang
Cewu Lu
22
8
0
13 Aug 2019
Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal
  Action Localization
Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization
Chengwei Zhang
Yunlu Xu
Zhanzhan Cheng
Yi Niu
Shiliang Pu
Fei Wu
Futai Zou
27
51
0
07 Aug 2019
Discriminating Spatial and Temporal Relevance in Deep Taylor
  Decompositions for Explainable Activity Recognition
Discriminating Spatial and Temporal Relevance in Deep Taylor Decompositions for Explainable Activity Recognition
Liam Hiley
Alun D. Preece
Y. Hicks
D. Marshall
Harrison Taylor
FAtt
16
10
0
05 Aug 2019
Image to Video Domain Adaptation Using Web Supervision
Image to Video Domain Adaptation Using Web Supervision
Andrew Kae
Yale Song
18
5
0
05 Aug 2019
An Evaluation of Action Recognition Models on EPIC-Kitchens
An Evaluation of Action Recognition Models on EPIC-Kitchens
Will Price
Dima Damen
EgoV
22
13
0
02 Aug 2019
Two-Stream Video Classification with Cross-Modality Attention
Two-Stream Video Classification with Cross-Modality Attention
Lu Chi
Guiyu Tian
Yadong Mu
Qi Tian
21
22
0
01 Aug 2019
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective
  Untrimmed Video Recognition
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Shilei Wen
15
127
0
31 Jul 2019
Learning Question-Guided Video Representation for Multi-Turn Video
  Question Answering
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Guan-Lin Chao
Abhinav Rastogi
Semih Yavuz
Dilek Z. Hakkani-Tür
Jindong Chen
Ian Lane
16
6
0
31 Jul 2019
Open Set Domain Adaptation for Image and Action Recognition
Open Set Domain Adaptation for Image and Action Recognition
Pau Panareda Busto
Ahsan Iqbal
Juergen Gall
VLM
11
88
0
30 Jul 2019
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based
  Mechanism for Videos
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos
Sebastian Agethen
Winston H. Hsu
HAI
24
25
0
30 Jul 2019
Temporal Attentive Alignment for Large-Scale Video Domain Adaptation
Temporal Attentive Alignment for Large-Scale Video Domain Adaptation
Min-Hung Chen
Z. Kira
G. Al-Regib
Jaekwon Yoo
Ruxin Chen
Jian Zheng
TTA
AI4TS
21
179
0
30 Jul 2019
Learning Visual Actions Using Multiple Verb-Only Labels
Learning Visual Actions Using Multiple Verb-Only Labels
Michael Wray
Dima Damen
28
7
0
25 Jul 2019
Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action
  Localization
Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action Localization
Chunfei Ma
Joonhyang Choi
Byeongwon Lee
Seungji Yang
19
0
0
25 Jul 2019
Switchable Normalization for Learning-to-Normalize Deep Representation
Switchable Normalization for Learning-to-Normalize Deep Representation
Ping Luo
Ruimao Zhang
Jiamin Ren
Zhanglin Peng
Jingyu Li
30
73
0
22 Jul 2019
TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot
  Action Recognition
TARN: Temporal Attentive Relation Network for Few-Shot and Zero-Shot Action Recognition
M. Bishay
Georgios Zoumpourlis
Ioannis Patras
ViT
27
155
0
21 Jul 2019
An Efficient 3D CNN for Action/Object Segmentation in Video
An Efficient 3D CNN for Action/Object Segmentation in Video
Rui Hou
Chong Chen
Rahul Sukthankar
M. Shah
24
27
0
21 Jul 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Laura Sevilla-Lara
Shengxin Cindy Zha
Zhicheng Yan
Vedanuj Goswami
Matt Feiszli
Lorenzo Torresani
50
75
0
19 Jul 2019
Adversarial Video Generation on Complex Datasets
Adversarial Video Generation on Complex Datasets
Aidan Clark
Jeff Donahue
Karen Simonyan
VGen
GAN
27
74
0
15 Jul 2019
A Short Note on the Kinetics-700 Human Action Dataset
A Short Note on the Kinetics-700 Human Action Dataset
João Carreira
Eric Noland
Chloe Hillier
Andrew Zisserman
19
445
0
15 Jul 2019
AVD: Adversarial Video Distillation
AVD: Adversarial Video Distillation
M. Tavakolian
Mohammad Sabokrou
Abdenour Hadid
VGen
30
6
0
12 Jul 2019
Two-stream Spatiotemporal Feature for Video QA Task
Two-stream Spatiotemporal Feature for Video QA Task
Chiwan Song
Woobin Im
Sung-eui Yoon
19
0
0
11 Jul 2019
Video Action Recognition Via Neural Architecture Searching
Video Action Recognition Via Neural Architecture Searching
Wei Peng
Xiaopeng Hong
Guoying Zhao
41
36
0
10 Jul 2019
Sim2real transfer learning for 3D human pose estimation: motion to the
  rescue
Sim2real transfer learning for 3D human pose estimation: motion to the rescue
Carl Doersch
Andrew Zisserman
3DH
11
154
0
04 Jul 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue
  Systems
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
22
111
0
02 Jul 2019
Few-Shot Video Classification via Temporal Alignment
Few-Shot Video Classification via Temporal Alignment
Kaidi Cao
Jingwei Ji
Zhangjie Cao
C. Chang
Juan Carlos Niebles
AI4TS
27
235
0
27 Jun 2019
Multimodal Abstractive Summarization for How2 Videos
Multimodal Abstractive Summarization for How2 Videos
Shruti Palaskar
Jindrich Libovický
Spandana Gella
Florian Metze
22
95
0
19 Jun 2019
Trimmed Action Recognition, Dense-Captioning Events in Videos, and
  Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019
Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019
Zhaofan Qiu
Dong Li
Yehao Li
Qi Cai
Yingwei Pan
Ting Yao
27
8
0
14 Jun 2019
Learning Video Representations using Contrastive Bidirectional
  Transformer
Learning Video Representations using Contrastive Bidirectional Transformer
Chen Sun
Fabien Baradel
Kevin Patrick Murphy
Cordelia Schmid
SSL
ViT
27
133
0
13 Jun 2019
Specifying Weight Priors in Bayesian Deep Neural Networks with Empirical
  Bayes
Specifying Weight Priors in Bayesian Deep Neural Networks with Empirical Bayes
R. Krishnan
Mahesh Subedar
Omesh Tickoo
BDL
20
46
0
12 Jun 2019
Previous
123...353637...394041
Next