Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

22 May 2017

Papers citing "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"

50 / 1,478 papers shown

Title
Generic Event Boundary Detection: A Benchmark for Event Segmentation Mike Zheng Shou Stan Weixian Lei Weiyao Wang Deepti Ghadiyaram Matt Feiszli VOS 93 76 0 26 Jan 2021
Weakly Supervised Learning for Facial Behavior Analysis : A Review G. Praveen Member Ieee Eric Granger Member Ieee Patrick Cardinal CVBM 37 6 0 25 Jan 2021
Bridging the gap between Human Action Recognition and Online Action Detection Alban Main De Boissiere R. Noumeir 22 0 0 21 Jan 2021
Video Relation Detection with Trajectory-aware Multi-modal Features W. Xie Guanghui Ren Si Liu 28 21 0 20 Jan 2021
TCLR: Temporal Contrastive Learning for Video Representation I. Dave Rohit Gupta Mamshad Nayeem Rizve Mubarak Shah SSL AI4TS 36 175 0 20 Jan 2021
Coarse Temporal Attention Network (CTA-Net) for Driver's Activity Recognition Zachary Wharton Ardhendu Behera Yonghuai Liu Nikolaos Bessis 39 35 0 17 Jan 2021
Temporal-Relational CrossTransformers for Few-Shot Action Recognition Toby Perrett A. Masullo T. Burghardt Majid Mirmehdi Dima Damen ViT 31 145 0 15 Jan 2021
Learning from Weakly-labeled Web Videos via Exploring Sub-Concepts Kunpeng Li Zizhao Zhang Guanhang Wu Xuehan Xiong Chen-Yu Lee Zhichao Lu Y. Fu Tomas Pfister 34 5 0 11 Jan 2021
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models Alina Roitberg Monica Haurilet Manuel Martínez Rainer Stiefelhagen UQCV 39 6 0 02 Jan 2021
Semantics for Robotic Mapping, Perception and Interaction: A Survey Sourav Garg Niko Sünderhauf Feras Dayoub D. Morrison Akansel Cosgun ... Tat-Jun Chin Ian Reid Stephen Gould Peter Corke Michael Milford 31 115 0 02 Jan 2021
Refining activation downsampling with SoftPool Alexandros Stergiou R. Poppe Grigorios Kalliatakis 36 159 0 02 Jan 2021
Tensor Representations for Action Recognition Piotr Koniusz Lei Wang A. Cherian 41 69 0 28 Dec 2020
Context-Aware Personality Inference in Dyadic Scenarios: Introducing the UDIVA Dataset Cristina Palmero Javier Selva Sorina Smeureanu Julio C. S. Jacques Junior Albert Clapés ... Zejian Zhang D. Gallardo-Pujol G. Guilera D. Leiva Sergio Escalera 35 53 0 28 Dec 2020
CNNs for JPEGs: A Study in Computational Cost Samuel Felipe dos Santos N. Sebe Jurandy Almeida 30 2 0 26 Dec 2020
SMART Frame Selection for Action Recognition Shreyank N. Gowda Marcus Rohrbach Laura Sevilla-Lara 31 142 0 19 Dec 2020
TDN: Temporal Difference Networks for Efficient Action Recognition Limin Wang Zhan Tong Bin Ji Gangshan Wu 28 391 0 18 Dec 2020
Multi-shot Temporal Event Localization: a Benchmark Xiaolong Liu Yao Hu S. Bai Fei Ding X. Bai Philip Torr 51 82 0 17 Dec 2020
GTA: Global Temporal Attention for Video Action Understanding Bo He Xitong Yang Zuxuan Wu Hao Chen Ser-Nam Lim Abhinav Shrivastava ViT 33 27 0 15 Dec 2020
MSAF: Multimodal Split Attention Fusion Lang Su Chuqing Hu Guofa Li Dongpu Cao 30 37 0 13 Dec 2020
A Comprehensive Study of Deep Video Action Recognition Yi Zhu Xinyu Li Chunhui Liu Mohammadreza Zolfaghari Yuanjun Xiong Chongruo Wu Zhi-Li Zhang Joseph Tighe R. Manmatha Mu Li VLM AI4TS 38 185 0 11 Dec 2020
D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations Sanath Narayan Hisham Cholakkal Munawar Hayat Fahad Shahbaz Khan Ming-Hsuan Yang Ling Shao 27 54 0 11 Dec 2020
Intrinsic Temporal Regularization for High-resolution Human Video Synthesis Lingbo Yang Zhanning Gao Peiran Ren Siwei Ma Wen Gao 3DH 24 1 0 11 Dec 2020
Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies Jinzheng Cai Youbao Tang K. Yan Adam P. Harrison Jing Xiao Gigin Lin Le Lu MedIm 41 29 0 09 Dec 2020
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language Songyang Zhang Houwen Peng Jianlong Fu Yijuan Lu Jiebo Luo 27 51 0 04 Dec 2020
Spatial-Temporal Alignment Network for Action Recognition and Detection Junwei Liang Liangliang Cao Xuehan Xiong Ting Yu Alexander G. Hauptmann 3DPC 18 9 0 04 Dec 2020
Video Self-Stitching Graph Network for Temporal Action Localization Chen Zhao Ali K. Thabet Guohao Li 26 138 0 30 Nov 2020
Depth-Aware Action Recognition: Pose-Motion Encoding through Temporal Heatmaps Mattia Segu Federico Pirovano Gianmario Fumagalli Amedeo Fabris 23 2 0 26 Nov 2020
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos Adrien Deliège A. Cioppa Silvio Giancola M. J. Seikavandi J. Dueholm Kamal Nasrollahi Guohao Li T. Moeslund Marc Van Droogenbroeck 18 152 0 26 Nov 2020
t-EVA: Time-Efficient t-SNE Video Annotation Soroosh Poorgholi O. Kayhan Jan van Gemert 16 5 0 26 Nov 2020
Sign language segmentation with temporal convolutional networks Katrin Renz N. Stache Samuel Albanie Gül Varol SLR 22 25 0 25 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks Humam Alwassel Silvio Giancola Guohao Li 33 123 0 23 Nov 2020
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning Zehua Zhang David J. Crandall AI4TS SSL 28 23 0 23 Nov 2020
$We don't Need Thousand Proposals$\colon$ Single Shot Actor-Action Detection in Videos$ We don't Need Thousand Proposals $\colon$ Single Shot Actor-Action Detection in Videos A. J. Rana Yogesh S Rawat ViT 13 11 0 22 Nov 2020
Visual Recognition of Great Ape Behaviours in the Wild Faizaan Sakib T. Burghardt 22 24 0 21 Nov 2020
Game Plan: What AI can do for Football, and What Football can do for AI K. Tuyls Shayegan Omidshafiei Paul Muller Zhe Wang Jerome T. Connor ... Simon Bouton Nathalie Beauguerlange Jackson Broshear T. Graepel Demis Hassabis 46 78 0 18 Nov 2020
3D CNNs with Adaptive Temporal Feature Resolutions Mohsen Fayyaz Emad Bahrami Rad Ali Diba M. Noroozi Ehsan Adeli Luc Van Gool Juergen Gall 3DPC 24 30 0 17 Nov 2020
Semi-Supervised Few-Shot Atomic Action Recognition Xiaoyuan Ni Sizhe Song Yu-Wing Tai Chi-Keung Tang 19 3 0 17 Nov 2020
Multi-Modal Hybrid Architecture for Pedestrian Action Prediction Amir Rasouli Tiffany Yau Mohsen Rohani Jun Luo 31 43 0 16 Nov 2020
JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition Jinmiao Cai Nianjuan Jiang Xiaoguang Han Kui Jia Jiangbo Lu 24 84 0 16 Nov 2020
Multimodal Pretraining for Dense Video Captioning Gabriel Huang Bo Pang Zhenhai Zhu Clara E. Rivera Radu Soricut 21 81 0 10 Nov 2020
Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos Di Yang Rui Dai Yaohui Wang Rupayan Mallick Luca Minciullo Gianpiero Francesca Francois Bremond 35 16 0 10 Nov 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition T. Ayral M. Pedersoli Simon L Bacon Eric Granger CVBM 3DH 13 11 0 10 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos Alexandros Stergiou R. Poppe 29 1 0 08 Nov 2020
AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection Hao Zhu Chaoyou Fu Qianyi Wu Wayne Wu Chao Qian Ran He 41 32 0 05 Nov 2020
Content-based Analysis of the Cultural Differences between TikTok and Douyin Li-yao Sun Haoqi Zhang Songyang Zhang Jiebo Luo 13 24 0 03 Nov 2020
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning Simon Ging Mohammadreza Zolfaghari Hamed Pirsiavash Thomas Brox ViT CLIP 31 169 0 01 Nov 2020
A Survey on Contrastive Self-supervised Learning Ashish Jaiswal Ashwin Ramesh Babu Mohammad Zaki Zadeh Debapriya Banerjee F. Makedon SSL 57 1,361 0 31 Oct 2020
Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning L. Tao Xueting Wang T. Yamasaki VLM SSL 23 14 0 29 Oct 2020
ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications Hochul Hwang Cheongjae Jang Geonwoo Park Junghyun Cho Ig-Jae Kim 34 70 0 28 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition Chun-Fu Chen Yikang Shen K. Ramakrishnan Rogerio Feris J. M. Cohn A. Oliva Quanfu Fan 23 95 0 22 Oct 2020