Title
Just One Moment: Structural Vulnerability of Deep Action Recognition against One Frame Attack Jaehui Hwang Jun-Hyuk Kim Jun-Ho Choi Jong-Seok Lee AAML 21 15 0 30 Nov 2020
Annotation-Efficient Untrimmed Video Action Recognition Yixiong Zou Shanghang Zhang Guangyao Chen Yonghong Tian Kurt Keutzer J. M. F. Moura 11 5 0 30 Nov 2020
Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing Florian Dubost Erin Hong Nandita Bhaskhar Siyi Tang D. Rubin Christopher Lee-Messer NoLa 21 0 0 28 Nov 2020
Patch-VQ: 'Patching Up' the Video Quality Problem Zhenqiang Ying Maniratnam Mandal Deepti Ghadiyaram AI Facebook 16 164 0 27 Nov 2020
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos Adrien Deliège A. Cioppa Silvio Giancola M. J. Seikavandi J. Dueholm Kamal Nasrollahi Guohao Li T. Moeslund Marc Van Droogenbroeck 20 152 0 26 Nov 2020
Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition Zhen Huang Xu Shen Xinmei Tian Houqiang Li Jianqiang Huang Xiansheng Hua GNN 37 56 0 26 Nov 2020
Group-Skeleton-Based Human Action Recognition in Complex Events Tingtian Li Zixun Sun Xiao Chen 32 5 0 26 Nov 2020
t-EVA: Time-Efficient t-SNE Video Annotation Soroosh Poorgholi O. Kayhan Jan van Gemert 16 5 0 26 Nov 2020
Can Temporal Information Help with Contrastive Self-Supervised Learning? Yutong Bai Haoqi Fan Ishan Misra Ganesh Venkatesh Yongyi Lu Yuyin Zhou Qihang Yu Vikas Chandra Alan Yuille 24 40 0 25 Nov 2020
Sign language segmentation with temporal convolutional networks Katrin Renz N. Stache Samuel Albanie Gül Varol SLR 24 25 0 25 Nov 2020
Recent Progress in Appearance-based Action Recognition J. Humphreys Zhe Chen Dacheng Tao 24 0 0 25 Nov 2020
A3D: Adaptive 3D Networks for Video Action Recognition Sijie Zhu Taojiannan Yang Matías Mendieta Chong Chen 3DH 32 12 0 24 Nov 2020
Play Fair: Frame Attributions in Video Models Will Price Dima Damen FAtt 31 5 0 24 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks Humam Alwassel Silvio Giancola Guohao Li 35 123 0 23 Nov 2020
Hierarchically Decoupled Spatial-Temporal Contrast for Self-supervised Video Representation Learning Zehua Zhang David J. Crandall AI4TS SSL 28 23 0 23 Nov 2020
Learnable Sampling 3D Convolution for Video Enhancement and Action Recognition Shuyang Gu Jianmin Bao Dong Chen 28 2 0 22 Nov 2020
Zero-Shot Learning with Knowledge Enhanced Visual Semantic Embeddings Karan Sikka Jihua Huang Andrew Silberfarb Prateeth Nayak Luke Rohrer Pritish Sahu John Byrnes Ajay Divakaran R. Rohwer 35 4 0 21 Nov 2020
Boundary-sensitive Pre-training for Temporal Localization in Videos Mengmeng Xu Juan-Manuel Perez-Rua Victor Escorcia Brais Martínez Xiatian Zhu Li Zhang Guohao Li Tao Xiang 33 61 0 21 Nov 2020
Visual Recognition of Great Ape Behaviours in the Wild Faizaan Sakib T. Burghardt 22 24 0 21 Nov 2020
3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks Pierre-Etienne Martin J. Benois-Pineau Renaud Péteri J. Morlier 3DPC 30 12 0 20 Nov 2020
Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language Hassan Akbari Hamid Palangi Jianwei Yang Sudha Rao Asli Celikyilmaz Roland Fernandez P. Smolensky Jianfeng Gao Shih-Fu Chang 34 3 0 18 Nov 2020
A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus Bowen Zhang Hexiang Hu Joonseok Lee Mingde Zhao Sheide Chammas Vihan Jain Eugene Ie Fei Sha 25 30 0 18 Nov 2020
RAIST: Learning Risk Aware Traffic Interactions via Spatio-Temporal Graph Convolutional Networks Videsh Suman Phu-Cuong Pham Aniket Bera GNN 19 4 0 17 Nov 2020
3D CNNs with Adaptive Temporal Feature Resolutions Mohsen Fayyaz Emad Bahrami Rad Ali Diba M. Noroozi Ehsan Adeli Luc Van Gool Juergen Gall 3DPC 24 30 0 17 Nov 2020
Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues A. Alam I. Ullah Young-Koo Lee 47 22 0 16 Nov 2020
ActBERT: Learning Global-Local Video-Text Representations Linchao Zhu Yi Yang ViT 49 417 0 14 Nov 2020
Unsupervised Video Representation Learning by Bidirectional Feature Prediction Nadine Behrmann Juergen Gall M. Noroozi SSL MDE 32 29 0 11 Nov 2020
Progressive Spatio-Temporal Graph Convolutional Network for Skeleton-Based Human Action Recognition Negar Heidari Alexandros Iosifidis GNN 3DH 38 14 0 11 Nov 2020
Multimodal Pretraining for Dense Video Captioning Gabriel Huang Bo Pang Zhenhai Zhu Clara E. Rivera Radu Soricut 21 81 0 10 Nov 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition T. Ayral M. Pedersoli Simon L Bacon Eric Granger CVBM 3DH 13 11 0 10 Nov 2020
Mutual Modality Learning for Video Action Classification Stepan Alekseevich Komkov Maksim Dzabraev Aleksandr Petiushko 27 9 0 04 Nov 2020
Learning Representations from Audio-Visual Spatial Alignment Pedro Morgado Yi Li Nuno Vasconcelos SSL 27 121 0 03 Nov 2020
Leveraging Activity Recognition to Enable Protective Behavior Detection in Continuous Data Chongyang Wang Yuan Gao Akhil Mathur A. Williams Nicholas D. Lane N. Bianchi-Berthouze 32 34 0 03 Nov 2020
Content-based Analysis of the Cultural Differences between TikTok and Douyin Li-yao Sun Haoqi Zhang Songyang Zhang Jiebo Luo 18 24 0 03 Nov 2020
PV-NAS: Practical Neural Architecture Search for Video Recognition Zihao Wang Chen Lin Lu Sheng Junjie Yan Jing Shao ViT 25 7 0 02 Nov 2020
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation Zhengyuan Yang Amanda Kay Yuncheng Li Wendi F. Cross Jiebo Luo 33 18 0 30 Oct 2020
Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning L. Tao Xueting Wang T. Yamasaki VLM SSL 23 14 0 29 Oct 2020
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture Searching Haoyuan Zhang Yonghong Hou Pichao Wang Zihui Guo Wanqing Li 32 15 0 29 Oct 2020
Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection Rui Dai Srijan Das Saurav Sharma Luca Minciullo Lorenzo Garattoni Francois Bremond Gianpiero Francesca 23 50 0 28 Oct 2020
Cycle-Contrast for Self-Supervised Video Representation Learning Quan Kong Wen Wei Ziwei Deng Tomoaki Yoshinaga Tomokazu Murakami SSL 19 55 0 28 Oct 2020
ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications Hochul Hwang Cheongjae Jang Geonwoo Park Junghyun Cho Ig-Jae Kim 37 70 0 28 Oct 2020
Multi-object tracking with self-supervised associating network Tae-Young Chung Heansung Lee Myeongah Cho Suhwan Cho Sangyoun Lee VOT 16 0 0 26 Oct 2020
Temporal Attention-Augmented Graph Convolutional Network for Efficient Skeleton-Based Human Action Recognition Negar Heidari Alexandros Iosifidis GNN 42 30 0 23 Oct 2020
Spatio-temporal Features for Generalized Detection of Deepfake Videos Ipek Ganiyusufoglu L. Ngô N. Savov Sezer Karaoglu Theo Gevers 32 41 0 22 Oct 2020
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition Chun-Fu Chen Yikang Shen K. Ramakrishnan Rogerio Feris J. M. Cohn A. Oliva Quanfu Fan 23 95 0 22 Oct 2020
Learning to Sort Image Sequences via Accumulated Temporal Differences Gagan Kanojia Shanmuganathan Raman 19 0 0 22 Oct 2020
Shedding Light on Blind Spots: Developing a Reference Architecture to Leverage Video Data for Process Mining Wolfgang Kratsch Fabian König Maximilian Röglinger 16 25 0 21 Oct 2020
A Short Note on the Kinetics-700-2020 Human Action Dataset Lucas Smaira João Carreira Eric Noland Ellen Clancy Amy Wu Andrew Zisserman 32 137 0 21 Oct 2020
AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies Ha Thi Phuong Thao Balamurali B.T. Dorien Herremans Gemma Roig 30 7 0 21 Oct 2020
Unsupervised Domain Adaptation for Spatio-Temporal Action Localization Nakul Agarwal Yi-Ting Chen Behzad Dariush Ming-Hsuan Yang 27 8 0 19 Oct 2020