ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.07750
  4. Cited By
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
v1v2v3 (latest)

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman
ArXiv (abs)PDFHTML

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,645 papers shown
Title
Self-Paced Video Data Augmentation with Dynamic Images Generated by
  Generative Adversarial Networks
Self-Paced Video Data Augmentation with Dynamic Images Generated by Generative Adversarial Networks
Yumeng Zhang
Gaoguo Jia
Li Chen
Mingrui Zhang
Junhai Yong
63
5
0
16 Sep 2019
Multimodal Deep Models for Predicting Affective Responses Evoked by
  Movies
Multimodal Deep Models for Predicting Affective Responses Evoked by Movies
Ha Thi Phuong Thao
Dorien Herremans
Gemma Roig
67
18
0
16 Sep 2019
Multitask Learning to Improve Egocentric Action Recognition
Multitask Learning to Improve Egocentric Action Recognition
G. Kapidis
R. Poppe
E. V. Dam
L. Noldus
R. Veltkamp
EgoV
84
37
0
15 Sep 2019
Metric-Based Few-Shot Learning for Video Action Recognition
Metric-Based Few-Shot Learning for Video Action Recognition
Chris Careaga
Brian Hutchinson
Nathan Oken Hodas
Lawrence Phillips
143
22
0
14 Sep 2019
Zero-Shot Action Recognition in Videos: A Survey
Zero-Shot Action Recognition in Videos: A Survey
Valter Estevam
Hélio Pedrini
David Menotti
95
59
0
13 Sep 2019
Tactile-Based Insertion for Dense Box-Packing
Tactile-Based Insertion for Dense Box-Packing
Siyuan Dong
Alberto Rodriguez
161
55
0
12 Sep 2019
Comparative Analysis of CNN-based Spatiotemporal Reasoning in Videos
Comparative Analysis of CNN-based Spatiotemporal Reasoning in Videos
Okan Kopuklu
Fabian Herzog
Gerhard Rigoll
63
6
0
11 Sep 2019
Reasoning About Human-Object Interactions Through Dual Attention
  Networks
Reasoning About Human-Object Interactions Through Dual Attention Networks
Tete Xiao
Quanfu Fan
Dan Gutfreund
Mathew Monfort
A. Oliva
Bolei Zhou
54
35
0
10 Sep 2019
Extreme Low Resolution Activity Recognition with Confident
  Spatial-Temporal Attention Transfer
Extreme Low Resolution Activity Recognition with Confident Spatial-Temporal Attention Transfer
Yucai Bai
Qinglong Zou
Xieyuanli Chen
Lingxi Li
Zhengming Ding
Long Chen
69
3
0
09 Sep 2019
Explainable Deep Learning for Video Recognition Tasks: A Framework &
  Recommendations
Explainable Deep Learning for Video Recognition Tasks: A Framework & Recommendations
Liam Hiley
Alun D. Preece
Y. Hicks
XAI
34
15
0
07 Sep 2019
Graph Convolutional Networks for Temporal Action Localization
Graph Convolutional Networks for Temporal Action Localization
Runhao Zeng
Wenbing Huang
Mingkui Tan
Yu Rong
P. Zhao
Junzhou Huang
Chuang Gan
GNN
108
481
0
07 Sep 2019
Discriminative Video Representation Learning Using Support Vector
  Classifiers
Discriminative Video Representation Learning Using Support Vector Classifiers
Jue Wang
A. Cherian
32
5
0
05 Sep 2019
Utilizing Temporal Information in Deep Convolutional Network for
  Efficient Soccer Ball Detection and Tracking
Utilizing Temporal Information in Deep Convolutional Network for Efficient Soccer Ball Detection and Tracking
Anna Kukleva
M. A. Khan
Hafez Farazi
Sven Behnke
41
6
0
05 Sep 2019
Tensor Analysis with n-Mode Generalized Difference Subspace
Tensor Analysis with n-Mode Generalized Difference Subspace
B. Gatto
E. M. Santos
Alessandro Lameiras Koerich
Kazuhiro Fukui
Waldir S. S. Júnior
28
18
0
04 Sep 2019
Great Ape Detection in Challenging Jungle Camera Trap Footage via
  Attention-Based Spatial and Temporal Feature Blending
Great Ape Detection in Challenging Jungle Camera Trap Footage via Attention-Based Spatial and Temporal Feature Blending
Xinyu Yang
Majid Mirmehdi
T. Burghardt
51
21
0
29 Aug 2019
Out the Window: A Crowd-Sourced Dataset for Activity Classification in
  Security Video
Out the Window: A Crowd-Sourced Dataset for Activity Classification in Security Video
Greg Castañón
N. Shnidman
Tim Anderson
J. Byrne
44
2
0
28 Aug 2019
Explainable Video Action Reasoning via Prior Knowledge and State
  Transitions
Explainable Video Action Reasoning via Prior Knowledge and State Transitions
Tao Zhuo
Zhiyong Cheng
Peng Zhang
Yongkang Wong
Mohan Kankanhalli
FAtt
83
62
0
28 Aug 2019
Cooperative Cross-Stream Network for Discriminative Action
  Representation
Cooperative Cross-Stream Network for Discriminative Action Representation
Jingran Zhang
Fumin Shen
Xing Xu
Heng Tao Shen
62
5
0
27 Aug 2019
Controllable Video Captioning with POS Sequence Guidance Based on Gated
  Fusion Network
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
132
163
0
27 Aug 2019
Global-Local Temporal Representations For Video Person Re-Identification
Global-Local Temporal Representations For Video Person Re-Identification
Jianing Li
Jingdong Wang
Qi Tian
Tiejun Huang
Shiliang Zhang
ViT
68
48
0
27 Aug 2019
Temporal Reasoning Graph for Activity Recognition
Temporal Reasoning Graph for Activity Recognition
Jingran Zhang
Fumin Shen
Xing Xu
Heng Tao Shen
55
60
0
27 Aug 2019
Deep Concept-wise Temporal Convolutional Networks for Action
  Localization
Deep Concept-wise Temporal Convolutional Networks for Action Localization
Xin Li
Tianwei Lin
Xiao-Chang Liu
Chuang Gan
W. Zuo
Chong Li
Xiang Long
Dongliang He
Fu Li
Shilei Wen
89
31
0
26 Aug 2019
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action
  Recognition
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
Evangelos Kazakos
Arsha Nagrani
Andrew Zisserman
Dima Damen
EgoV
75
340
0
22 Aug 2019
3C-Net: Category Count and Center Loss for Weakly-Supervised Action
  Localization
3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization
Sanath Narayan
Hisham Cholakkal
Fahad Shahbaz Khan
Ling Shao
3DPC
85
157
0
22 Aug 2019
Entropy-Enhanced Multimodal Attention Model for Scene-Aware Dialogue
  Generation
Entropy-Enhanced Multimodal Attention Model for Scene-Aware Dialogue Generation
Kuan-Yen Lin
Chao-Chun Hsu
Yun-Nung Chen
Lun-Wei Ku
VGen
60
20
0
22 Aug 2019
Multi-Stream Single Shot Spatial-Temporal Action Detection
Multi-Stream Single Shot Spatial-Temporal Action Detection
Pengfei Zhang
Yu Cao
Benyuan Liu
3DPC
29
3
0
22 Aug 2019
Action recognition with spatial-temporal discriminative filter banks
Action recognition with spatial-temporal discriminative filter banks
Brais Martínez
Davide Modolo
Yuanjun Xiong
Joseph Tighe
88
66
0
20 Aug 2019
Multi-Modal Recognition of Worker Activity for Human-Centered
  Intelligent Manufacturing
Multi-Modal Recognition of Worker Activity for Human-Centered Intelligent Manufacturing
Wenjin Tao
M. C. Leu
Zhaozheng Yin
HAI
32
39
0
20 Aug 2019
ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning
Giorgos Kordopatis-Zilos
Symeon Papadopoulos
Ioannis Patras
Y. Kompatsiaris
85
79
0
20 Aug 2019
Proposal-free Temporal Moment Localization of a Natural-Language Query
  in Video using Guided Attention
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
Cristian Rodriguez-Opazo
Edison Marrese-Taylor
F. Saleh
Hongdong Li
Stephen Gould
94
147
0
20 Aug 2019
Cross-Enhancement Transform Two-Stream 3D ConvNets for Action
  Recognition
Cross-Enhancement Transform Two-Stream 3D ConvNets for Action Recognition
Dong Cao
Lisha Xu
Dongdong Zhang
ViT
46
1
0
19 Aug 2019
Gradient Weighted Superpixels for Interpretability in CNNs
Gradient Weighted Superpixels for Interpretability in CNNs
Thomas Hartley
K. Sidorov
C. Willis
David Marshall
FAtt
22
3
0
16 Aug 2019
GODS: Generalized One-class Discriminative Subspaces for Anomaly
  Detection
GODS: Generalized One-class Discriminative Subspaces for Anomaly Detection
Jue Wang
A. Cherian
CML
74
117
0
16 Aug 2019
TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for
  Video Saliency Detection
TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection
Kyle Min
Jason J. Corso
74
153
0
15 Aug 2019
Bypass Enhancement RGB Stream Model for Pedestrian Action Recognition of
  Autonomous Vehicles
Bypass Enhancement RGB Stream Model for Pedestrian Action Recognition of Autonomous Vehicles
Dong Cao
Lisha Xu
50
2
0
15 Aug 2019
Video Compression With Rate-Distortion Autoencoders
Video Compression With Rate-Distortion Autoencoders
A. Habibian
T. V. Rozendaal
Jakub M. Tomczak
Taco S. Cohen
VGen
90
202
0
14 Aug 2019
Reactive Multi-Stage Feature Fusion for Multimodal Dialogue Modeling
Reactive Multi-Stage Feature Fusion for Multimodal Dialogue Modeling
Yi-Ting Yeh
Tzu-Chuan Lin
Hsiao-Hua Cheng
Yuanyuan Deng
Shang-Yu Su
Yun-Nung Chen
74
16
0
14 Aug 2019
Three Branches: Detecting Actions With Richer Features
Three Branches: Detecting Actions With Richer Features
Jinchao Xia
Jiajun Tang
Cewu Lu
46
8
0
13 Aug 2019
Multi-Frame Content Integration with a Spatio-Temporal Attention
  Mechanism for Person Video Motion Transfer
Multi-Frame Content Integration with a Spatio-Temporal Attention Mechanism for Person Video Motion Transfer
K. Cheng
Haozhi Huang
C. Yuan
Lingyiqing Zhou
Wei Liu
DiffM
34
2
0
12 Aug 2019
Relation-Aware Pyramid Network (RapNet) for temporal action proposal
Relation-Aware Pyramid Network (RapNet) for temporal action proposal
Jialin Gao
Zhixiang Shi
Jiani Li
Yufeng Yuan
Jiwei Li
Xi Zhou
13
0
0
09 Aug 2019
Moviescope: Large-scale Analysis of Movies using Multiple Modalities
Moviescope: Large-scale Analysis of Movies using Multiple Modalities
Paola Cascante-Bonilla
Kalpathy Sitaraman
Mengjia Luo
Vicente Ordonez
59
40
0
08 Aug 2019
Progressive Relation Learning for Group Activity Recognition
Progressive Relation Learning for Group Activity Recognition
Guyue Hu
Bo Cui
Yuan He
Shan Yu
90
81
0
08 Aug 2019
STM: SpatioTemporal and Motion Encoding for Action Recognition
STM: SpatioTemporal and Motion Encoding for Action Recognition
Boyuan Jiang
Mengmeng Wang
Weihao Gan
Wei Wu
Junjie Yan
98
383
0
07 Aug 2019
Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal
  Action Localization
Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization
Chengwei Zhang
Yunlu Xu
Zhanzhan Cheng
Yi Niu
Shiliang Pu
Leilei Gan
Futai Zou
121
53
0
07 Aug 2019
Discriminating Spatial and Temporal Relevance in Deep Taylor
  Decompositions for Explainable Activity Recognition
Discriminating Spatial and Temporal Relevance in Deep Taylor Decompositions for Explainable Activity Recognition
Liam Hiley
Alun D. Preece
Y. Hicks
D. Marshall
Harrison Taylor
FAtt
33
11
0
05 Aug 2019
Image to Video Domain Adaptation Using Web Supervision
Image to Video Domain Adaptation Using Web Supervision
Andrew Kae
Yale Song
42
5
0
05 Aug 2019
Action Recognition in Untrimmed Videos with Composite Self-Attention
  Two-Stream Framework
Action Recognition in Untrimmed Videos with Composite Self-Attention Two-Stream Framework
Dong Cao
Lisha Xu
Haibo Chen
ViT
32
3
0
04 Aug 2019
Scale Matters: Temporal Scale Aggregation Network for Precise Action
  Localization in Untrimmed Videos
Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos
Guoqiang Gong
Liangfeng Zheng
Kun Bai
Yadong Mu
78
52
0
02 Aug 2019
Two-Stream Video Classification with Cross-Modality Attention
Two-Stream Video Classification with Cross-Modality Attention
Lu Chi
Guiyu Tian
Yadong Mu
Qi Tian
67
22
0
01 Aug 2019
Use What You Have: Video Retrieval Using Representations From
  Collaborative Experts
Use What You Have: Video Retrieval Using Representations From Collaborative Experts
Yang Liu
Samuel Albanie
Arsha Nagrani
Andrew Zisserman
91
391
0
31 Jul 2019
Previous
123...646566...717273
Next