
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown

Title | Authors | Topic tags | Metrics | Date
Sequential Density Ratio Estimation for Simultaneous Optimization of Speed and Accuracy | Akinori F. Ebihara, Taiki Miyagawa, K. Sakurai, Hitoshi Imaoka | - | 69 0 0 | 10 Jun 2020
Open-Narrow-Synechiae Anterior Chamber Angle Classification in AS-OCT Sequences | Huaying Hao, Huazhu Fu, Yanwu Xu, Jianlong Yang, Fei Li, Xiulan Zhang, Jiang-Dong Liu, Yitian Zhao | - | 233 8 0 | 09 Jun 2020
PNL: Efficient Long-Range Dependencies Extraction with Pyramid Non-Local Module for Action Recognition | Yuecong Xu, Haozhi Cao, Jianfei Yang, K. Mao, Jianxiong Yin, Simon See | - | 56 5 0 | 09 Jun 2020
Action Recognition with Deep Multiple Aggregation Networks | A. Mazari, H. Sahbi | - | 61 0 0 | 08 Jun 2020
ARID: A New Dataset for Recognizing Action in the Dark | Yuecong Xu, Jianfei Yang, Haozhi Cao, K. Mao, Jianxiong Yin, Simon See | - | 77 73 0 | 06 Jun 2020
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos | M. Gao, Yingbo Zhou, Ran Xu, R. Socher, Caiming Xiong | - | 102 42 0 | 05 Jun 2020
Egocentric Object Manipulation Graphs | Eadom Dessalene, Michael Maynord, Chinmaya Devaraj, Cornelia Fermuller, Yiannis Aloimonos | EgoV | 81 19 0 | 05 Jun 2020
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network | Lingyu Zhu, Esa Rahtu | - | 110 23 0 | 04 Jun 2020
Temporal Aggregate Representations for Long-Range Video Understanding | Fadime Sener, Dipika Singhania, Angela Yao | AI4TS | 69 7 0 | 01 Jun 2020
In the Eye of the Beholder: Gaze and Actions in First Person Video | Yin Li, Miao Liu, James M. Rehg | EgoV | 179 71 0 | 31 May 2020
Complex Sequential Understanding through the Awareness of Spatial and Temporal Concepts | Bo Pang, Kaiwen Zha, Hanwen Cao, Jiajun Tang, Minghui Yu, Cewu Lu | - | 77 25 0 | 30 May 2020
Automatic Diagnosis of Pulmonary Embolism Using an Attention-guided Framework: A Large-scale Study | Luyao Shi, Deepta Rajan, Shafiq Abedin, Srikar Yellapragada, David Beymer, E. Dehghan | - | 65 18 0 | 29 May 2020
AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings | Pratik Mazumder, Pravendra Singh, Kranti K. Parida, Vinay P. Namboodiri | - | 82 35 0 | 27 May 2020
A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews | Edison Marrese-Taylor, Cristian Rodriguez-Opazo, Jorge A. Balazs, Stephen Gould, Y. Matsuo | - | 63 3 0 | 27 May 2020
Unifying Few- and Zero-Shot Egocentric Action Recognition | Tyler R. Scott, Michael Shvartsman, Karl Ridgeway | EgoV | 52 1 0 | 27 May 2020
SpotFast Networks with Memory Augmented Lateral Transformers for Lipreading | Peratham Wiriyathammabhum | - | 62 8 0 | 21 May 2020
Intra- and Inter-Action Understanding via Temporal Action Parsing | Dian Shao, Yue Zhao, Bo Dai, Dahua Lin | - | 54 71 0 | 20 May 2020
On Evaluating Weakly Supervised Action Segmentation Methods | Yaser Souri, Alexander Richard, Luca Minciullo, Juergen Gall | - | 47 7 0 | 19 May 2020
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer | Vladimir E. Iashin, Esa Rahtu | - | 104 130 0 | 17 May 2020
Pedestrian Action Anticipation using Contextual Feature Fusion in Stacked RNNs | Amir Rasouli, Iuliia Kotseruba, John K. Tsotsos | - | 103 112 0 | 13 May 2020
Robust Visual Object Tracking with Two-Stream Residual Convolutional Networks | Ning Zhang, Jingen Liu, Ke Wang, Dan Zeng, Tao Mei | - | 51 7 0 | 13 May 2020
Project RISE: Recognizing Industrial Smoke Emissions | Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang, Ting-Yao Hu, P. Dille, Sean Prendi, Ryan N. Hoffman, Anastasia Tsuhlares, Jessica Pachuta, Randy Sargent, I. Nourbakhsh | - | 60 19 0 | 13 May 2020
Human in Events: A Large-Scale Benchmark for Human-centric Video Analysis in Complex Events | Weiyao Lin, Huabin Liu, Shizhan Liu, Yuxi Li, Rui Qian, Tao Wang, Ning Xu, H. Xiong, Guojun Qi, N. Sebe | - | 84 15 0 | 09 May 2020
Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition | Miao Yin, Siyu Liao, Xiao-Yang Liu, Xiaodong Wang, Bo Yuan | - | 82 25 0 | 09 May 2020
Condensed Movies: Story Based Retrieval with Contextual Embeddings | Max Bain, Arsha Nagrani, A. Brown, Andrew Zisserman | - | 128 102 0 | 08 May 2020
Learning to Segment Actions from Observation and Narration | Daniel Fried, Jean-Baptiste Alayrac, Phil Blunsom, Chris Dyer, S. Clark, Aida Nematzadeh | - | 124 32 0 | 07 May 2020
Exploiting Inter-Frame Regional Correlation for Efficient Action Recognition | Yuecong Xu, Jianfei Yang, K. Mao, Jianxiong Yin, Simon See | - | 35 11 0 | 06 May 2020
Adaptive Interaction Modeling via Graph Operations Search | Haoxin Li, Weishi Zheng, Yu Tao, Haifeng Hu, Jianhuang Lai | - | 68 5 0 | 05 May 2020
Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video | Antonino Furnari, G. Farinella | EgoV | 66 141 0 | 04 May 2020
Towards Visually Explaining Video Understanding Networks with Perturbation | Zhenqiang Li, Weimin Wang, Zuoyue Li, Yifei Huang, Yoichi Sato | FAtt | 38 3 0 | 01 May 2020
Recognizing American Sign Language Nonmanual Signal Grammar Errors in Continuous Videos | Elahe Vahdani, Longlong Jing, Yingli Tian, Matt Huenerfauth | - | 26 8 0 | 01 May 2020
Teaching Cameras to Feel: Estimating Tactile Physical Properties of Surfaces From Images | Matthew Purri, Kristin J. Dana | - | 36 16 0 | 29 Apr 2020
Skeleton Focused Human Activity Recognition in RGB Video | Bruce X. B. Yu, Yan Liu, Keith C. C. Chan | - | 67 4 0 | 29 Apr 2020
Span-based Localizing Network for Natural Language Video Localization | Hao Zhang, Aixin Sun, Wei Jing, Qiufeng Wang | - | 113 316 0 | 29 Apr 2020
Inferring Temporal Compositions of Actions Using Probabilistic Automata | Rodrigo Santa Cruz, A. Cherian, Basura Fernando, Dylan Campbell, Stephen Gould | - | 39 2 0 | 28 Apr 2020
AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement with Neural Searching | Zitong Yu, Xiaobai Li, Xuesong Niu, Jingang Shi, Guoying Zhao | - | 49 132 0 | 26 Apr 2020
Low-latency hand gesture recognition with a low resolution thermal imager | Maarten Vandersteegen, Wouter Reusen, Kristof Van Beeck | - | 38 17 0 | 24 Apr 2020
Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos | Mamshad Nayeem Rizve, Ugur Demir, Praveen Tirupattur, A. J. Rana, Kevin Duarte, Ishan R. Dave, Yogesh S Rawat, M. Shah | - | 47 19 0 | 23 Apr 2020
Action recognition in real-world videos | Waqas Sultani, Qazi Ammar Arshad, Chen Chen | - | 80 2 0 | 22 Apr 2020
Human and Machine Action Prediction Independent of Object Information | Fatemeh Ziaeetabar, Jennifer Pomp, Stefan Pfeiffer, Nadiya El-Sourani, R. Schubotz, M. Tamosiunaite, Florentin Wörgötter | - | 6 0 0 | 22 Apr 2020
Group Activity Detection from Trajectory and Video Data in Soccer | Ryan Sanford, Siavash Gorji, L. G. Hafemann, B. Pourbabaee, Mehrsan Javan | - | 61 34 0 | 21 Apr 2020
TAEN: Temporal Aware Embedding Network for Few-Shot Action Recognition | Rami Ben-Ari, Mor Shpigel, Ophir Azulai, Udi Barzelay, Daniel Rotman | ViT | 72 25 0 | 21 Apr 2020
CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition | Zhengwei Wang, Qi She, Tejo Chalasani, A. Smolic | 3DPC SLR | 70 15 0 | 20 Apr 2020
Motion and Region Aware Adversarial Learning for Fall Detection with Thermal Imaging | V. Mehta, Abhinav Dhall, Sujata Pal, Shehroz S. Khan | - | 59 25 0 | 17 Apr 2020
Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence | Huy Manh Nguyen, Tomo Miyazaki, Yoshihiro Sugaya, S. Omachi | - | 144 1 0 | 16 Apr 2020
Local-Global Video-Text Interactions for Temporal Grounding | Jonghwan Mun, Minsu Cho, Bohyung Han | - | 103 270 0 | 16 Apr 2020
Asynchronous Interaction Aggregation for Action Detection | Jiajun Tang, Jinchao Xia, Xinzhi Mu, Bo Pang, Cewu Lu | - | 89 121 0 | 16 Apr 2020
ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos | Guillaume Vaudaux-Ruth, Adrien Chan-Hon-Tong, Catherine Achard | BDL | 89 7 0 | 15 Apr 2020
FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding | Dian Shao, Yue Zhao, Bo Dai, Dahua Lin | - | 78 331 0 | 14 Apr 2020
Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning | Kangning Liu, Shuhang Gu, Andrés Romero, Radu Timofte | - | 51 9 0 | 14 Apr 2020