ResearchTrend.AI
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017
João Carreira
Andrew Zisserman

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown
Sequential Density Ratio Estimation for Simultaneous Optimization of Speed and Accuracy
Akinori F. Ebihara
Taiki Miyagawa
K. Sakurai
Hitoshi Imaoka
69
0
0
10 Jun 2020
Open-Narrow-Synechiae Anterior Chamber Angle Classification in AS-OCT Sequences
Huaying Hao
Huazhu Fu
Yanwu Xu
Jianlong Yang
Fei Li
Xiulan Zhang
Jiang-Dong Liu
Yitian Zhao
233
8
0
09 Jun 2020
PNL: Efficient Long-Range Dependencies Extraction with Pyramid Non-Local Module for Action Recognition
Yuecong Xu
Haozhi Cao
Jianfei Yang
K. Mao
Jianxiong Yin
Simon See
56
5
0
09 Jun 2020
Action Recognition with Deep Multiple Aggregation Networks
A. Mazari
H. Sahbi
61
0
0
08 Jun 2020
ARID: A New Dataset for Recognizing Action in the Dark
Yuecong Xu
Jianfei Yang
Haozhi Cao
K. Mao
Jianxiong Yin
Simon See
77
73
0
06 Jun 2020
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos
M. Gao
Yingbo Zhou
Ran Xu
R. Socher
Caiming Xiong
102
42
0
05 Jun 2020
Egocentric Object Manipulation Graphs
Eadom Dessalene
Michael Maynord
Chinmaya Devaraj
Cornelia Fermuller
Yiannis Aloimonos
EgoV
81
19
0
05 Jun 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network
Lingyu Zhu
Esa Rahtu
110
23
0
04 Jun 2020
Temporal Aggregate Representations for Long-Range Video Understanding
Fadime Sener
Dipika Singhania
Angela Yao
AI4TS
69
7
0
01 Jun 2020
In the Eye of the Beholder: Gaze and Actions in First Person Video
Yin Li
Miao Liu
James M. Rehg
EgoV
179
71
0
31 May 2020
Complex Sequential Understanding through the Awareness of Spatial and Temporal Concepts
Bo Pang
Kaiwen Zha
Hanwen Cao
Jiajun Tang
Minghui Yu
Cewu Lu
77
25
0
30 May 2020
Automatic Diagnosis of Pulmonary Embolism Using an Attention-guided Framework: A Large-scale Study
Luyao Shi
Deepta Rajan
Shafiq Abedin
Srikar Yellapragada
David Beymer
E. Dehghan
65
18
0
29 May 2020
AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings
Pratik Mazumder
Pravendra Singh
Kranti K. Parida
Vinay P. Namboodiri
82
35
0
27 May 2020
A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews
Edison Marrese-Taylor
Cristian Rodriguez-Opazo
Jorge A. Balazs
Stephen Gould
Y. Matsuo
63
3
0
27 May 2020
Unifying Few- and Zero-Shot Egocentric Action Recognition
Tyler R. Scott
Michael Shvartsman
Karl Ridgeway
EgoV
52
1
0
27 May 2020
SpotFast Networks with Memory Augmented Lateral Transformers for Lipreading
Peratham Wiriyathammabhum
62
8
0
21 May 2020
Intra- and Inter-Action Understanding via Temporal Action Parsing
Dian Shao
Yue Zhao
Bo Dai
Dahua Lin
54
71
0
20 May 2020
On Evaluating Weakly Supervised Action Segmentation Methods
Yaser Souri
Alexander Richard
Luca Minciullo
Juergen Gall
47
7
0
19 May 2020
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
Vladimir E. Iashin
Esa Rahtu
104
130
0
17 May 2020
Pedestrian Action Anticipation using Contextual Feature Fusion in Stacked RNNs
Amir Rasouli
Iuliia Kotseruba
John K. Tsotsos
103
112
0
13 May 2020
Robust Visual Object Tracking with Two-Stream Residual Convolutional Networks
Ning Zhang
Jingen Liu
Ke Wang
Dan Zeng
Tao Mei
51
7
0
13 May 2020
Project RISE: Recognizing Industrial Smoke Emissions
Yen-Chia Hsu
Ting-Hao 'Kenneth' Huang
Ting-Yao Hu
P. Dille
Sean Prendi
Ryan N. Hoffman
Anastasia Tsuhlares
Jessica Pachuta
Randy Sargent
I. Nourbakhsh
60
19
0
13 May 2020
Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events
Weiyao Lin
Huabin Liu
Shizhan Liu
Yuxi Li
Rui Qian
Tao Wang
Ning Xu
H. Xiong
Guojun Qi
N. Sebe
84
15
0
09 May 2020
Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition
Miao Yin
Siyu Liao
Xiao-Yang Liu
Xiaodong Wang
Bo Yuan
82
25
0
09 May 2020
Condensed Movies: Story Based Retrieval with Contextual Embeddings
Max Bain
Arsha Nagrani
A. Brown
Andrew Zisserman
128
102
0
08 May 2020
Learning to Segment Actions from Observation and Narration
Daniel Fried
Jean-Baptiste Alayrac
Phil Blunsom
Chris Dyer
S. Clark
Aida Nematzadeh
124
32
0
07 May 2020
Exploiting Inter-Frame Regional Correlation for Efficient Action Recognition
Yuecong Xu
Jianfei Yang
K. Mao
Jianxiong Yin
Simon See
35
11
0
06 May 2020
Adaptive Interaction Modeling via Graph Operations Search
Haoxin Li
Weishi Zheng
Yu Tao
Haifeng Hu
Jianhuang Lai
68
5
0
05 May 2020
Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video
Antonino Furnari
G. Farinella
EgoV
66
141
0
04 May 2020
Towards Visually Explaining Video Understanding Networks with Perturbation
Zhenqiang Li
Weimin Wang
Zuoyue Li
Yifei Huang
Yoichi Sato
FAtt
38
3
0
01 May 2020
Recognizing American Sign Language Nonmanual Signal Grammar Errors in Continuous Videos
Elahe Vahdani
Longlong Jing
Yingli Tian
Matt Huenerfauth
26
8
0
01 May 2020
Teaching Cameras to Feel: Estimating Tactile Physical Properties of Surfaces From Images
Matthew Purri
Kristin J. Dana
36
16
0
29 Apr 2020
Skeleton Focused Human Activity Recognition in RGB Video
Bruce X. B. Yu
Yan Liu
Keith C. C. Chan
67
4
0
29 Apr 2020
Span-based Localizing Network for Natural Language Video Localization
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
113
316
0
29 Apr 2020
Inferring Temporal Compositions of Actions Using Probabilistic Automata
Rodrigo Santa Cruz
A. Cherian
Basura Fernando
Dylan Campbell
Stephen Gould
39
2
0
28 Apr 2020
AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement with Neural Searching
Zitong Yu
Xiaobai Li
Xuesong Niu
Jingang Shi
Guoying Zhao
49
132
0
26 Apr 2020
Low-latency hand gesture recognition with a low resolution thermal imager
Maarten Vandersteegen
Wouter Reusen
Kristof Van Beeck
38
17
0
24 Apr 2020
Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos
Mamshad Nayeem Rizve
Ugur Demir
Praveen Tirupattur
A. J. Rana
Kevin Duarte
Ishan R. Dave
Yogesh S Rawat
M. Shah
47
19
0
23 Apr 2020
Action recognition in real-world videos
Waqas Sultani
Qazi Ammar Arshad
Chen Chen
80
2
0
22 Apr 2020
Human and Machine Action Prediction Independent of Object Information
Fatemeh Ziaeetabar
Jennifer Pomp
Stefan Pfeiffer
Nadiya El-Sourani
R. Schubotz
M. Tamosiunaite
Florentin Wörgötter
6
0
0
22 Apr 2020
Group Activity Detection from Trajectory and Video Data in Soccer
Ryan Sanford
Siavash Gorji
L. G. Hafemann
B. Pourbabaee
Mehrsan Javan
61
34
0
21 Apr 2020
TAEN: Temporal Aware Embedding Network for Few-Shot Action Recognition
Rami Ben-Ari
Mor Shpigel
Ophir Azulai
Udi Barzelay
Daniel Rotman
ViT
72
25
0
21 Apr 2020
CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition
Zhengwei Wang
Qi She
Tejo Chalasani
A. Smolic
3DPC
SLR
70
15
0
20 Apr 2020
Motion and Region Aware Adversarial Learning for Fall Detection with Thermal Imaging
V. Mehta
Abhinav Dhall
Sujata Pal
Shehroz S. Khan
59
25
0
17 Apr 2020
Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence
Huy Manh Nguyen
Tomo Miyazaki
Yoshihiro Sugaya
S. Omachi
144
1
0
16 Apr 2020
Local-Global Video-Text Interactions for Temporal Grounding
Jonghwan Mun
Minsu Cho
Bohyung Han
103
270
0
16 Apr 2020
Asynchronous Interaction Aggregation for Action Detection
Jiajun Tang
Jinchao Xia
Xinzhi Mu
Bo Pang
Cewu Lu
89
121
0
16 Apr 2020
ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos
Guillaume Vaudaux-Ruth
Adrien Chan-Hon-Tong
Catherine Achard
BDL
89
7
0
15 Apr 2020
FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding
Dian Shao
Yue Zhao
Bo Dai
Dahua Lin
78
331
0
14 Apr 2020
Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning
Kangning Liu
Shuhang Gu
Andrés Romero
Radu Timofte
51
9
0
14 Apr 2020