ResearchTrend.AI
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,645 papers shown
The Sound of Motions
Hang Zhao
Chuang Gan
Wei-Chiu Ma
Antonio Torralba
88
254
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
Alex Schwing
Tamir Hazan
87
71
0
11 Apr 2019
Recurrent Space-time Graph Neural Networks
Andrei Liviu Nicolicioiu
Iulia Duta
Marius Leordeanu
GNN
128
45
0
11 Apr 2019
Attentive Action and Context Factorization
Yanjie Wang
Vinh Tran
Gedas Bertasius
Lorenzo Torresani
Minh Hoai
31
6
0
10 Apr 2019
Black-box Adversarial Attacks on Video Recognition Models
Linxi Jiang
Xingjun Ma
Shaoxiang Chen
James Bailey
Yu-Gang Jiang
AAML
MLAU
76
147
0
10 Apr 2019
Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
Yunpeng Chen
Haoqi Fan
Bing Xu
Zhicheng Yan
Yannis Kalantidis
Marcus Rohrbach
Shuicheng Yan
Jiashi Feng
109
565
0
10 Apr 2019
Dynamic Gesture Recognition by Using CNNs and Star RGB: a Temporal Information Condensation
Clebeson Canuto dos Santos
J. L. A. Samatelo
R. Vassallo
57
57
0
10 Apr 2019
Knowledge Distillation for Human Action Anticipation
Vinh Tran
Yang Wang
Minh Hoai
64
6
0
09 Apr 2019
Learning from Videos with Deep Convolutional LSTM Networks
Logan Courtney
R. Sreenivas
23
7
0
09 Apr 2019
Action Recognition from Single Timestamp Supervision in Untrimmed Videos
Davide Moltisanti
Sanja Fidler
Dima Damen
84
61
0
09 Apr 2019
SCSampler: Sampling Salient Clips from Video for Efficient Action Recognition
Bruno Korbar
Du Tran
Lorenzo Torresani
80
225
0
08 Apr 2019
Relational Action Forecasting
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Rahul Sukthankar
Kevin Patrick Murphy
Cordelia Schmid
87
81
0
08 Apr 2019
Unsupervised learning of action classes with continuous temporal embedding
Anna Kukleva
Hilde Kuehne
Fadime Sener
Juergen Gall
87
107
0
08 Apr 2019
Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
Jiangliu Wang
Jianbo Jiao
Linchao Bao
Shengfeng He
Yunhui Liu
Wen Liu
SSL
59
206
0
07 Apr 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
125
558
0
06 Apr 2019
Convolutional Relational Machine for Group Activity Recognition
Sina Mokhtarzadeh Azar
Mina Ghadimi Atigh
A. Nickabadi
Alexandre Alahi
BDL
72
106
0
05 Apr 2019
Attention Distillation for Learning Video Representations
Miao Liu
Xin Chen
Yun C. Zhang
Yin Li
James M. Rehg
57
2
0
05 Apr 2019
Fast Weakly Supervised Action Segmentation Using Mutual Consistency
Yaser Souri
Mohsen Fayyaz
Luca Minciullo
Gianpiero Francesca
Juergen Gall
91
52
0
05 Apr 2019
Deep Predictive Video Compression with Bi-directional Prediction
Woonsung Park
Munchurl Kim
62
7
0
05 Apr 2019
Video Classification with Channel-Separated Convolutional Networks
Du Tran
Heng Wang
Lorenzo Torresani
Matt Feiszli
3DV
135
591
0
04 Apr 2019
ExCL: Extractive Clip Localization Using Natural Language Descriptions
Soham Ghosh
Anuva Agarwal
Zarana Parekh
Alexander G. Hauptmann
CLIP
61
153
0
04 Apr 2019
Resource Efficient 3D Convolutional Neural Networks
Okan Kopuklu
Neslihan Köse
Ahmet Gunduz
Gerhard Rigoll
69
190
0
04 Apr 2019
Activity Driven Weakly Supervised Object Detection
Zhenheng Yang
D. Mahajan
Deepti Ghadiyaram
Ram Nevatia
Vignesh Ramanathan
WSOD
68
32
0
02 Apr 2019
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization
Alejandro Pardo
Humam Alwassel
Fabian Caba Heilbron
Ali K. Thabet
Guohao Li
130
52
0
30 Mar 2019
Local Aggregation for Unsupervised Learning of Visual Embeddings
Chengxu Zhuang
Alex Zhai
Daniel L. K. Yamins
SSL
105
448
0
29 Mar 2019
BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames
Brent A. Griffin
Jason J. Corso
VOS
113
43
0
28 Mar 2019
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
Yao-Hung Hubert Tsai
S. Divvala
Louis-Philippe Morency
Ruslan Salakhutdinov
Ali Farhadi
89
103
0
25 Mar 2019
StartNet: Online Detection of Action Start in Untrimmed Videos
M. Gao
Mingze Xu
L. Davis
R. Socher
Caiming Xiong
67
52
0
23 Mar 2019
On the Importance of Video Action Recognition for Visual Lipreading
Xinshuo Weng
29
3
0
22 Mar 2019
Forecasting Time-to-Collision from Monocular Video: Feasibility, Dataset, and Challenges
A. Manglik
Xinshuo Weng
Eshed Ohn-Bar
Kris Kitani
105
15
0
21 Mar 2019
Value of Temporal Dynamics Information in Driving Scene Segmentation
Li Ding
Jack Terwilliger
Rini Sherony
B. Reimer
Alex Fridman
51
23
0
21 Mar 2019
Cross-task weakly supervised learning from instructional videos
Dimitri Zhukov
Jean-Baptiste Alayrac
R. G. Cinbis
David Fouhey
Ivan Laptev
Josef Sivic
SSL
183
250
0
19 Mar 2019
Human Activity Recognition for Edge Devices
Manjot Bilkhu
Hammababdullah Ayyubi
27
0
0
18 Mar 2019
Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
Jia-Xing Zhong
Nannan Li
Weijie Kong
Shan Liu
Thomas H. Li
Ge Li
NoLa
SSL
116
410
0
18 Mar 2019
Learning Super-resolution 3D Segmentation of Plant Root MRI Images from Few Examples
Ali Oguz Uzman
Jannis Horn
Sven Behnke
24
4
0
16 Mar 2019
GolfDB: A Video Database for Golf Swing Sequencing
William J. McNally
Kanav Vats
T. Pinto
Chris Dulhanty
J. McPhee
A. Wong
59
55
0
15 Mar 2019
Two-Stream Action Recognition-Oriented Video Super-Resolution
Haochen Zhang
Dong Liu
Zhiwei Xiong
SupR
64
46
0
13 Mar 2019
Asymmetric Residual Neural Network for Accurate Human Activity Recognition
J. Long
Wuqing Sun
Zhan Yang
Osolo Ian Raymond
HAI
57
21
0
13 Mar 2019
Video Generation from Single Semantic Label Map
Junting Pan
Chengyu Wang
Xu Jia
Jing Shao
Lu Sheng
Junjie Yan
Xiaogang Wang
VGen
55
104
0
11 Mar 2019
Investigation on Combining 3D Convolution of Image Data and Optical Flow to Generate Temporal Action Proposals
Patrick Schlosser
David Münch
Michael Arens
3DPC
20
3
0
11 Mar 2019
SSN: Learning Sparse Switchable Normalization via SparsestMax
Wenqi Shao
Jiamin Ren
Jingyu Li
Ruimao Zhang
Yudian Li
Xiaogang Wang
Ping Luo
69
56
0
09 Mar 2019
COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Yansong Tang
Dajun Ding
Yongming Rao
Yu Zheng
Danyang Zhang
Lili Zhao
Jiwen Lu
Jie Zhou
145
317
0
07 Mar 2019
Video-based surgical skill assessment using 3D convolutional neural networks
Isabel Funke
S. T. Mees
Jürgen Weitz
Stefanie Speidel
85
177
0
06 Mar 2019
Semantic Adversarial Network with Multi-scale Pyramid Attention for Video Classification
De Xie
Cheng Deng
Hao Wang
Chao Li
Dapeng Tao
41
16
0
06 Mar 2019
MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation
Yazan Abu Farha
Juergen Gall
90
670
0
05 Mar 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
Chong Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
83
82
0
04 Mar 2019
Less is More: Learning Highlight Detection from Video Duration
Bo Xiong
Yannis Kalantidis
Deepti Ghadiyaram
Kristen Grauman
61
111
0
03 Mar 2019
Attention-Based Structural-Plasticity
Soheil Kolouri
Nicholas A. Ketz
Xinyun Zou
J. Krichmar
Praveen K. Pilly
CLL
30
12
0
02 Mar 2019
Unsupervised Traffic Accident Detection in First-Person Videos
Yu Yao
Mingze Xu
Yuchen Wang
David J. Crandall
E. Atkins
101
143
0
02 Mar 2019
Progress Regression RNN for Online Spatial-Temporal Action Localization in Unconstrained Videos
Bo Hu
Jianfei Cai
Tat-Jen Cham
Junsong Yuan
81
3
0
01 Mar 2019