Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown
Comprehensive Instructional Video Analysis: The COIN Dataset and Performance Evaluation
Yansong Tang
Jiwen Lu
Jie Zhou
80
33
0
20 Mar 2020
Fully Automated Hand Hygiene Monitoring in Operating Room using 3D Convolutional Neural Network
Minjee Kim
Joonmyeong Choi
Namkug Kim
13
4
0
20 Mar 2020
Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Longteng Guo
Jing Liu
Xinxin Zhu
Peng Yao
Shichen Lu
Hanqing Lu
ViT
203
192
0
19 Mar 2020
PIC: Permutation Invariant Convolution for Recognizing Long-range Activities
Noureldien Hussein
E. Gavves
A. Smeulders
VLM
74
13
0
18 Mar 2020
STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition
Xu Li
Jingwen Wang
Lin Ma
Kaihao Zhang
Fengzong Lian
Zhanhui Kang
Jinjun Wang
36
5
0
18 Mar 2020
Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification
Sanath Narayan
Akshita Gupta
Fahad Shahbaz Khan
Cees G. M. Snoek
Ling Shao
VLM
49
224
0
17 Mar 2020
Multi-modal Dense Video Captioning
Vladimir E. Iashin
Esa Rahtu
92
172
0
17 Mar 2020
Predictively Encoded Graph Convolutional Network for Noise-Robust Skeleton-based Action Recognition
Jongmin Yu
Yongsang Yoon
M. Jeon
194
47
0
17 Mar 2020
On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location
O. Kayhan
Jan van Gemert
341
237
0
16 Mar 2020
SF-Net: Single-Frame Supervision for Temporal Action Localization
Fan Ma
Linchao Zhu
Yi Yang
Shengxin Cindy Zha
Gourab Kundu
Matt Feiszli
Zheng Shou
141
142
0
15 Mar 2020
Interaction Graphs for Object Importance Estimation in On-road Driving Videos
Zehua Zhang
Ashish Tawari
Sujitha Martin
David J. Crandall
GNN, FAtt
138
23
0
12 Mar 2020
Top-1 Solution of Multi-Moments in Time Challenge 2019
Manyuan Zhang
Hao Shao
Guanglu Song
Yu Liu
Junjie Yan
40
3
0
12 Mar 2020
Beyond the Camera: Neural Networks in World Coordinates
Gunnar Sigurdsson
Abhinav Gupta
Cordelia Schmid
Alahari Karteek
41
2
0
12 Mar 2020
Visual Grounding in Video for Unsupervised Word Translation
Gunnar Sigurdsson
Jean-Baptiste Alayrac
Aida Nematzadeh
Lucas Smaira
Mateusz Malinowski
João Carreira
Phil Blunsom
Andrew Zisserman
VGen
105
50
0
11 Mar 2020
Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Jialin Gao
Zhixiang Shi
Jiani Li
Guanshuo Wang
Yufeng Yuan
Shiming Ge
Xiaoping Zhou
64
76
0
09 Mar 2020
Transformation-based Adversarial Video Prediction on Large-Scale Data
Pauline Luc
Aidan Clark
Sander Dieleman
Diego de Las Casas
Yotam Doron
Albin Cassirer
Karen Simonyan
VGen
336
89
0
09 Mar 2020
Better Captioning with Sequence-Level Exploration
Jia Chen
Qin Jin
66
12
0
08 Mar 2020
Transferring Cross-domain Knowledge for Video Sign Language Recognition
Dongxu Li
Xin Yu
Chenchen Xu
L. Petersson
Hongdong Li
SLR
112
105
0
08 Mar 2020
TTPP: Temporal Transformer with Progressive Prediction for Efficient Action Anticipation
Wen Wang
Xiaojiang Peng
Yanzhou Su
Yu Qiao
Jian Cheng
AI4TS
82
18
0
07 Mar 2020
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning
Elad Amrani
Rami Ben-Ari
Daniel Rotman
A. Bronstein
134
126
0
06 Mar 2020
Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation
Min-Hung Chen
Baopu Li
Sid Ying-Ze Bao
G. Al-Regib
Z. Kira
TTA
158
122
0
05 Mar 2020
Self-Supervised Visual Learning by Variable Playback Speeds Prediction of a Video
Hyeon Cho
Taehoon Kim
H. Chang
Wonjun Hwang
58
20
0
05 Mar 2020
Detecting Attended Visual Targets in Video
Eunji Chong
Yongxin Wang
Nataniel Ruiz
James M. Rehg
259
116
0
05 Mar 2020
ETRI-Activity3D: A Large-Scale RGB-D Dataset for Robots to Recognize Daily Activities of the Elderly
Jinhyeok Jang
Dohyung Kim
Cheonshu Park
Minsu Jang
Jaeyeon Lee
Jaehong Kim
84
67
0
04 Mar 2020
MoVi: A Large Multipurpose Motion and Video Dataset
Saeed Ghorbani
Kimia Mahdaviani
A. Thaler
Konrad Paul Kording
D. Cook
Gunnar Blohm
N. Troje
92
73
0
04 Mar 2020
Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
Biagio Brattoli
Joseph Tighe
Fedor Zhdanov
Pietro Perona
Krzysztof Chalupka
VLM
245
130
0
03 Mar 2020
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning
Shizhe Chen
Yida Zhao
Qin Jin
Qi Wu
124
319
0
01 Mar 2020
Joint Wasserstein Distribution Matching
Jingyun Liang
Langyuan Mo
Qing Du
Yong Guo
P. Zhao
Junzhou Huang
Mingkui Tan
28
0
0
01 Mar 2020
VideoSSL: Semi-Supervised Learning for Video Classification
Longlong Jing
T. Parag
Zhe Wu
Yingli Tian
Hongcheng Wang
64
52
0
29 Feb 2020
Infrared and 3D skeleton feature fusion for RGB-D action recognition
Alban Main De Boissiere
R. Noumeir
99
38
0
28 Feb 2020
Joint 2D-3D Breast Cancer Classification
G. Liang
Xiaoqin Wang
Yu Zhang
Xin Xing
Hunter Blanton
Tawfiq Salem
Nathan Jacobs
57
39
0
27 Feb 2020
Evolving Losses for Unsupervised Video Representation Learning
A. Piergiovanni
A. Angelova
Michael S. Ryoo
SSL
89
140
0
26 Feb 2020
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge
Hung Le
Nancy F. Chen
57
9
0
25 Feb 2020
Fine-Grained Instance-Level Sketch-Based Video Retrieval
Peng Xu
Kun Liu
Tao Xiang
Timothy M. Hospedales
Zhanyu Ma
Jun Guo
Yi-Zhe Song
105
33
0
21 Feb 2020
Strength from Weakness: Fast Learning Using Weak Supervision
Joshua Robinson
Stefanie Jegelka
S. Sra
67
32
0
19 Feb 2020
SummaryNet: A Multi-Stage Deep Learning Model for Automatic Video Summarisation
Ziyad Jappie
David Torpey
Turgay Celik
23
3
0
19 Feb 2020
Human Action Recognition using Local Two-Stream Convolution Neural Network Features and Support Vector Machines
David Torpey
Turgay Celik
13
8
0
19 Feb 2020
Knowledge Integration Networks for Action Recognition
Shiwen Zhang
Sheng Guo
Limin Wang
Weilin Huang
Matthew R. Scott
123
18
0
18 Feb 2020
V4D:4D Convolutional Neural Networks for Video-level Representation Learning
Shiwen Zhang
Sheng Guo
Weilin Huang
Matthew R. Scott
Limin Wang
50
73
0
18 Feb 2020
Bottom-Up Temporal Action Localization with Mutual Regularization
Peisen Zhao
Lingxi Xie
Chen Ju
Ya Zhang
Yanfeng Wang
Qi Tian
21
1
0
18 Feb 2020
Over-the-Air Adversarial Flickering Attacks against Video Recognition Networks
Roi Pony
I. Naeh
Shie Mannor
AAML
80
54
0
12 Feb 2020
An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos
Sicheng Zhao
Yunsheng Ma
Yang Gu
Jufeng Yang
Tengfei Xing
Pengfei Xu
Runbo Hu
Hua Chai
Kurt Keutzer
69
100
0
12 Feb 2020
Learning spatio-temporal representations with temporal squeeze pooling
Guoxi Huang
A. Bors
ViT
53
12
0
11 Feb 2020
Dynamic Inference: A New Approach Toward Efficient Video Action Recognition
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Yi Yang
Shilei Wen
78
35
0
09 Feb 2020
FSD-10: A Dataset for Competitive Sports Content Analysis
Shenlan Liu
Xiang Liu
Gao Huang
Lin Feng
Lianyu Hu
Dong Jiang
Ai-Xuan Zhang
Yang Liu
Hong Qiao
AI4TS
57
19
0
09 Feb 2020
Weakly-Supervised Multi-Person Action Recognition in 360° Videos
Junnan Li
Jianquan Liu
Yongkang Wong
Shoji Nishimura
Mohan S. Kankanhalli
120
13
0
09 Feb 2020
CTM: Collaborative Temporal Modeling for Action Recognition
Li-Yu Daisy Liu
Tao Wang
Jie Liu
Yang Guan
Qi Bu
Longfei Yang
TTA
26
0
0
08 Feb 2020
Symbiotic Attention with Privileged Information for Egocentric Action Recognition
Xiaohan Wang
Yu Wu
Linchao Zhu
Yi Yang
74
63
0
08 Feb 2020
Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling
Mu Yuan
Lan Zhang
Xiangyang Li
Hui Xiong
VLM
56
17
0
08 Feb 2020
iqiyi Submission to ActivityNet Challenge 2019 Kinetics-700 challenge: Hierarchical Group-wise Attention
Li-Yu Daisy Liu
Dongyang Cai
Jie Liu
Nan Ding
Tao Wang
20
0
0
07 Feb 2020