Title
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks Joanna Materzynska Tete Xiao Roei Herzig Huijuan Xu Xiaolong Wang Trevor Darrell CoGe 24 173 0 20 Dec 2019
Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatial-Temporal Graph Convolutional Network for Action Recognition Konstantinos Papadopoulos Enjie Ghorbel Djamila Aouada Björn E. Ottersten GNN 93 42 0 20 Dec 2019
Segmentations-Leak: Membership Inference Attacks and Defenses in Semantic Image Segmentation Yang He Shadi Rahimian Bernt Schiele Mario Fritz MIACV 21 49 0 20 Dec 2019
Self-Attention Network for Skeleton-based Human Action Recognition Sangwoo Cho M. H. Maqbool Fei Liu H. Foroosh 3DH 22 71 0 18 Dec 2019
Mimetics: Towards Understanding Human Actions Out of Context Philippe Weinzaepfel Grégory Rogez 19 71 0 16 Dec 2019
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs Jingwei Ji Ranjay Krishna Li Fei-Fei Juan Carlos Niebles 39 336 0 15 Dec 2019
Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks Lei Shi Yifan Zhang Jian Cheng Hanqing Lu 33 420 0 15 Dec 2019
Action Modifiers: Learning from Adverbs in Instructional Videos Hazel Doughty Ivan Laptev W. Mayol-Cuevas Dima Damen 27 30 0 13 Dec 2019
VIBE: Video Inference for Human Body Pose and Shape Estimation Muhammed Kocabas Nikos Athanasiou Michael J. Black 3DH 28 917 0 11 Dec 2019
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition Jinwoo Choi Chen Gao Joseph C.E. Messou Jia-Bin Huang 24 177 0 11 Dec 2019
PuckNet: Estimating hockey puck location from broadcast video Kanav Vats William J. McNally Chris Dulhanty Z. Q. Lin David A Clausi John S. Zelek 8 7 0 11 Dec 2019
Context-Dependent Models for Predicting and Characterizing Facial Expressiveness Victoria Lin J. Girard Louis-Philippe Morency 16 8 0 10 Dec 2019
Listen to Look: Action Recognition by Previewing Audio Ruohan Gao Tae-Hyun Oh Kristen Grauman Lorenzo Torresani VLM 29 251 0 10 Dec 2019
HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN Paritosh Parmar B. Morris 3DPC 18 9 0 10 Dec 2019
Video action detection by learning graph-based spatio-temporal interactions Matteo Tomei Lorenzo Baraldi Simone Calderara Simone Bronzin Rita Cucchiara 24 9 0 09 Dec 2019
Synthetic Humans for Action Recognition from Unseen Viewpoints Gül Varol Ivan Laptev Cordelia Schmid Andrew Zisserman 33 96 0 09 Dec 2019
VideoDG: Generalizing Temporal Relations in Videos to Novel Domains Zhiyu Yao Yunbo Wang Jianmin Wang Philip S. Yu Mingsheng Long OOD ViT 32 23 0 08 Dec 2019
ClusterFit: Improving Generalization of Visual Representations Xueting Yan Ishan Misra Abhinav Gupta Deepti Ghadiyaram D. Mahajan SSL VLM 27 132 0 06 Dec 2019
A Multigrid Method for Efficiently Training Video Models Chaoxia Wu Ross B. Girshick Kaiming He Christoph Feichtenhofer Philipp Krahenbuhl 32 94 0 02 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation Quanfu Fan Chun-Fu Chen Hilde Kuehne Marco Pistoia David D. Cox 32 126 0 02 Dec 2019
Probing the State of the Art: A Critical Look at Visual Representation Evaluation Cinjon Resnick Zeping Zhan Joan Bruna AI4TS 20 12 0 30 Nov 2019
Multimodal Machine Translation through Visuals and Speech U. Sulubacak Ozan Caglayan Stig-Arne Gronroos Aku Rouhe Desmond Elliott Lucia Specia Jörg Tiedemann 49 73 0 28 Nov 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering Humam Alwassel D. Mahajan Bruno Korbar Lorenzo Torresani Guohao Li Du Tran SSL 42 428 0 28 Nov 2019
Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense Supervision Lei Shi Yifan Zhang Jian Cheng Hanqing Lu 22 27 0 28 Nov 2019
Non-Autoregressive Coarse-to-Fine Video Captioning Bang-ju Yang Yuexian Zou Fenglin Liu Can Zhang 27 11 0 27 Nov 2019
G-TAD: Sub-Graph Localization for Temporal Action Detection Mengmeng Xu Chen Zhao D. Rojas Ali K. Thabet Guohao Li 39 435 0 26 Nov 2019
Learning Efficient Video Representation with Video Shuffle Networks Pingchuan Ma Yao Zhou Yu Lu Wayne Zhang 27 7 0 26 Nov 2019
Oops! Predicting Unintentional Action in Video Dave Epstein Boyuan Chen Carl Vondrick 27 99 0 25 Nov 2019
TEINet: Towards an Efficient Architecture for Video Recognition Zhaoyang Liu Donghao Luo Yabiao Wang Limin Wang Ying Tai Chengjie Wang Jilin Li Feiyue Huang Tong Lu ViT 36 236 0 21 Nov 2019
MMTM: Multimodal Transfer Module for CNN Fusion Hamid Reza Vaezi Joze Amirreza Shaban Michael L. Iuzzolino K. Koishida 18 277 0 20 Nov 2019
Mimic The Raw Domain: Accelerating Action Recognition in the Compressed Domain Barak Battash H. Barad Hanlin Tang Amit Bleiweiss 16 30 0 19 Nov 2019
SMART: Skeletal Motion Action Recognition aTtack He Wang Feixiang He Zexi Peng Yong-Liang Yang Tianjia Shao Kun Zhou David C. Hogg AAML 31 5 0 16 Nov 2019
Guided Weak Supervision for Action Recognition with Scarce Data to Assess Skills of Children with Autism Prashant Pandey P. PrathoshA. Manu Kohli Joshua K. Pritchard 24 33 0 11 Nov 2019
Learning Graph Convolutional Network for Skeleton-based Human Action Recognition by Neural Searching Wei Peng Xiaopeng Hong Haoyu Chen Guoying Zhao GNN 40 323 0 11 Nov 2019
Are we asking the right questions in MovieQA? Bhavan A. Jasani Rohit Girdhar Deva Ramanan 11 15 0 08 Nov 2019
A Spectral Nonlocal Block for Neural Networks Lei Zhu Qi She Lidan Zhang Ping Guo 18 2 0 04 Nov 2019
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding Mathew Monfort Bowen Pan K. Ramakrishnan A. Andonian Barry A. McNamara A. Lascelles Quanfu Fan Dan Gutfreund Rogerio Feris A. Oliva VLM 14 68 0 01 Nov 2019
Chirality Nets for Human Pose Regression Raymond A. Yeh Yuan-Ting Hu Alex Schwing 3DH 22 48 0 31 Oct 2019
A Self Validation Network for Object-Level Human Attention Estimation Zehua Zhang Chen Yu David J. Crandall EgoV 30 10 0 31 Oct 2019
Comprehensive Video Understanding: Video summarization with content-based video recommender design Yudong Jiang Kaixu Cui B. Peng Changliang Xu BDL 20 28 0 30 Oct 2019
Predictive Coding Networks Meet Action Recognition Xia Huang Hossein Mousavi Gemma Roig 21 1 0 22 Oct 2019
Volterra Neural Networks (VNNs) Siddharth Roheda Hamid Krim 19 10 0 21 Oct 2019
Adaptive and Iteratively Improving Recurrent Lateral Connections Barak Battash Lior Wolf 25 2 0 16 Oct 2019
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning Rohit Girdhar Deva Ramanan 22 176 0 10 Oct 2019
Human Action Sequence Classification Yan Bin Ng Basura Fernando 30 4 0 07 Oct 2019
Unsupervised Keypoint Learning for Guiding Class-Conditional Video Prediction G. Zucatelli Seonghyeon Nam R. Coelho Seon Joo Kim 16 59 0 04 Oct 2019
CLEVRER: CoLlision Events for Video REpresentation and Reasoning Kexin Yi Yuta Saito Yunzhu Li Pushmeet Kohli Jiajun Wu Antonio Torralba J. Tenenbaum NAI 43 457 0 03 Oct 2019
Learning Temporal Action Proposals With Fewer Labels Jingwei Ji Kaidi Cao Juan Carlos Niebles 6 36 0 03 Oct 2019
Training Kinetics in 15 Minutes: Large-scale Distributed Training on Videos Ji Lin Chuang Gan Song Han 12 10 0 01 Oct 2019
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition Alexandros Stergiou R. Poppe 3DH 20 19 0 30 Sep 2019