v1v2v3 (latest)

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 3,647 papers shown

Title
Learning Class Regularized Features for Action Recognition Alexandros Stergiou R. Poppe R. Veltkamp 21 3 0 07 Feb 2020
Solving Raven's Progressive Matrices with Neural Networks Tao Zhuo Mohan S. Kankanhalli 105 26 0 05 Feb 2020
Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks M. Rashid Hedvig Kjellström Yong Jae Lee WSOL GNN 138 46 0 04 Feb 2020
Neural Sign Language Translation by Learning Tokenization Alptekin Orbay L. Akarun SLR 68 73 0 02 Feb 2020
Interpreting video features: a comparison of 3D convolutional networks and convolutional LSTM networks Joonatan Mänttäri Sofia Broomé John Folkesson Hedvig Kjellström FAtt 67 28 0 02 Feb 2020
ERA: A Dataset and Deep Learning Benchmark for Event Recognition in Aerial Videos Lichao Mou Yuansheng Hua P. Jin Xiaoxiang Zhu AI4TS 119 45 0 30 Jan 2020
Multi-Modal Domain Adaptation for Fine-Grained Action Recognition Jonathan Munro Dima Damen EgoV 84 196 0 27 Jan 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval Jie Lei Licheng Yu Tamara L. Berg Joey Tianyi Zhou 224 288 0 24 Jan 2020
Audiovisual SlowFast Networks for Video Recognition Fanyi Xiao Yong Jae Lee Kristen Grauman Jitendra Malik Christoph Feichtenhofer 282 209 0 23 Jan 2020
Lipreading using Temporal Convolutional Networks Brais Martínez Pingchuan Ma Stavros Petridis Maja Pantic 238 241 0 23 Jan 2020
Detecting Deficient Coverage in Colonoscopies Daniel Freedman Yochai Blau L. Katzir Amit Aides I. Shimshoni ... Tomer Golany A. Gordon Greg S. Corrado Yossi Matias Ehud Rivlin 78 55 0 23 Jan 2020
Zero-Shot Activity Recognition with Videos Evin Pınar Örnek 24 1 0 22 Jan 2020
Weakly Supervised Temporal Action Localization Using Deep Metric Learning Ashraful Islam Richard J. Radke 69 46 0 21 Jan 2020
A Comprehensive Study on Temporal Modeling for Online Action Detection Wen Wang Xiaojiang Peng Yu Qiao Jian Cheng 48 2 0 21 Jan 2020
The benefits of synthetic data for action categorization Mohamad Ballout Mohammad Tuqan Daniel C. Asmar Elie A. Shammas George E. Sakr 39 6 0 20 Jan 2020
MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recogntion Kaiyu Shan Yongtao Wang Zhuoying Wang Tingting Liang Zhi Tang Ying-Cong Chen Yangyan Li AI4TS 40 4 0 19 Jan 2020
Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video Jie Wu Guanbin Li Si Liu Liang Lin OffRL 71 104 0 18 Jan 2020
Temporal Interlacing Network Hao Shao Shengju Qian Yu Liu 60 94 0 17 Jan 2020
Sideways: Depth-Parallel Training of Video Models Mateusz Malinowski G. Swirszcz João Carreira Viorica Patraucean MDE 94 15 0 17 Jan 2020
Multi-step Joint-Modality Attention Network for Scene-Aware Dialogue System Yun-Wei Chu Kuan-Yen Lin Chao-Chun Hsu Lun-Wei Ku 137 22 0 17 Jan 2020
Spatio-Temporal Ranked-Attention Networks for Video Captioning A. Cherian Jue Wang Chiori Hori Tim K. Marks AI4TS 49 19 0 17 Jan 2020
Learning Spatiotemporal Features via Video and Text Pair Discrimination Tianhao Li Limin Wang VGen 68 57 0 16 Jan 2020
Rethinking Motion Representation: Residual Frames with 3D ConvNets for Better Action Recognition Li Tao Xueting Wang T. Yamasaki 3DPC 69 24 0 16 Jan 2020
EEV: A Large-Scale Dataset for Studying Evoked Expressions from Video Jennifer J. Sun Ting Liu Alan S. Cowen Florian Schroff Hartwig Adam Gautam Prasad 38 7 0 15 Jan 2020
Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors Lei Wang Piotr Koniusz 93 51 0 14 Jan 2020
Actions as Moving Points Yixuan Li Zixu Wang Limin Wang Gangshan Wu 163 106 0 14 Jan 2020
A Novel Inspection System For Variable Data Printing Using Deep Learning O. Haik Oded Perry Eli Chen Peter J. Klammer 38 4 0 13 Jan 2020
Compressive sensing based privacy for fall detection Ronak Gupta Prashant Anand S. Chaudhury Brejesh Lall Sanjay Singh 15 5 0 10 Jan 2020
STAViS: Spatio-Temporal AudioVisual Saliency Network A. Tsiami Petros Koutras Petros Maragos 99 73 0 09 Jan 2020
DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery Detection Liming Jiang Ren Li Wayne Wu Chao Qian Chen Change Loy CVBM PICV 137 447 0 09 Jan 2020
Res3ATN -- Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos Naina Dhingra A. Kunz 3DPC SLR 80 36 0 04 Jan 2020
DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection Ruben Tolosana R. Vera-Rodríguez Julian Fierrez Aythami Morales J. Ortega-Garcia 3DPC CVBM 120 801 0 01 Jan 2020
Short-Term Temporal Convolutional Networks for Dynamic Hand Gesture Recognition Yi Zhang Chong Wang Ye Zheng Jieyu Zhao Yuqi Li Xijiong Xie 3DH 23 4 0 31 Dec 2019
DMCL: Distillation Multiple Choice Learning for Multimodal Action Recognition Nuno C. Garcia Sarah Adel Bargal Vitaly Ablavsky Pietro Morerio Vittorio Murino Stan Sclaroff 66 49 0 23 Dec 2019
Adversarial Cross-Domain Action Recognition with Co-Attention Boxiao Pan Zhangjie Cao Ehsan Adeli Juan Carlos Niebles ViT 80 106 0 22 Dec 2019
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks Joanna Materzynska Tete Xiao Roei Herzig Huijuan Xu Xiaolong Wang Trevor Darrell CoGe 64 176 0 20 Dec 2019
Analysis of Video Feature Learning in Two-Stream CNNs on the Example of Zebrafish Swim Bout Classification Bennet Breier A. Onken 36 4 0 20 Dec 2019
Mimetics: Towards Understanding Human Actions Out of Context Philippe Weinzaepfel Grégory Rogez 84 72 0 16 Dec 2019
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs Jingwei Ji Ranjay Krishna Li Fei-Fei Juan Carlos Niebles 94 346 0 15 Dec 2019
Multi-task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition D. Luvizon Hedi Tabia David Picard 3DH 95 121 0 15 Dec 2019
Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks Lei Shi Yifan Zhang Jian Cheng Hanqing Lu 86 432 0 15 Dec 2019
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong S. Schwarcz Peng Xu Davide D‘Ambrosio Juhana Kangaspunta A. Angelova Huong Phan Navdeep Jaitly 65 7 0 13 Dec 2019
Action Modifiers: Learning from Adverbs in Instructional Videos Hazel Doughty Ivan Laptev W. Mayol-Cuevas Dima Damen 103 30 0 13 Dec 2019
End-to-End Learning of Visual Representations from Uncurated Instructional Videos Antoine Miech Jean-Baptiste Alayrac Lucas Smaira Ivan Laptev Josef Sivic Andrew Zisserman VGen SSL 156 713 0 13 Dec 2019
Identity Preserve Transform: Understand What Activity Classification Models Have Learnt Jialing Lyu Weichao Qiu Xinyue Wei Yi Zhang Alan Yuille Zhengjun Zha VLM 51 3 0 13 Dec 2019
VIBE: Video Inference for Human Body Pose and Shape Estimation Muhammed Kocabas Nikos Athanasiou Michael J. Black 3DH 145 932 0 11 Dec 2019
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition Jinwoo Choi Chen Gao Joseph C.E. Messou Jia-Bin Huang 145 182 0 11 Dec 2019
HyperCon: Image-To-Video Model Transfer for Video-To-Video Translation Tasks Ryan Szeto Mostafa El-Khamy Jungwon Lee Jason J. Corso DiffM 63 7 0 10 Dec 2019
Forecasting future action sequences with attention: a new approach to weakly supervised action forecasting Yan Bin Ng Basura Fernando AI4TS 43 34 0 10 Dec 2019
Appending Adversarial Frames for Universal Video Attack Zhikai Chen Lingxi Xie Shanmin Pang Yong He Qi Tian AAML 70 34 0 10 Dec 2019