YouTube-8M: A Large-Scale Video Classification Benchmark

27 September 2016

Joonseok Lee

Balakrishnan Varadarajan

Sudheendra Vijayanarasimhan

VLM

ArXiv PDF HTML

Papers citing "YouTube-8M: A Large-Scale Video Classification Benchmark"

50 / 211 papers shown

Title
Large Scale Holistic Video Understanding Ali Diba Mohsen Fayyaz Vivek Sharma Manohar Paluri Jurgen Gall Rainer Stiefelhagen Luc Van Gool 29 35 0 25 Apr 2019
Free-form Video Inpainting with 3D Gated Convolution and Temporal PatchGAN Ya-Liang Chang Zhe-Yu Liu Kuan-Ying Lee Winston H. Hsu DiffM 23 172 0 23 Apr 2019
Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency Jia Li K. Fu Shengwei Zhao Shiming Ge 38 26 0 10 Apr 2019
Self-Supervised Learning via Conditional Motion Propagation Xiaohang Zhan Xingang Pan Ziwei Liu Dahua Lin Chen Change Loy SSL 42 47 0 27 Mar 2019
Less is More: Learning Highlight Detection from Video Duration Bo Xiong Yannis Kalantidis Deepti Ghadiyaram Kristen Grauman 14 108 0 03 Mar 2019
Efficient Video Classification Using Fewer Frames S. Bhardwaj Mukundhan Srinivasan Mitesh M. Khapra 40 88 0 27 Feb 2019
Single-frame Regularization for Temporally Stable CNNs Gabriel Eilertsen Rafał K. Mantiuk Jonas Unger 19 43 0 27 Feb 2019
Understanding and Training Deep Diagonal Circulant Neural Networks Alexandre Araujo Benjamin Négrevergne Y. Chevaleyre Jamal Atif 27 4 0 29 Jan 2019
DistInit: Learning Video Representations Without a Single Labeled Video Rohit Girdhar Du Tran Lorenzo Torresani Deva Ramanan 21 54 0 26 Jan 2019
Cricket stroke extraction: Towards creation of a large-scale cricket actions dataset Arpan Gupta S. Muthiah 22 6 0 10 Jan 2019
Video Action Transformer Network Rohit Girdhar João Carreira Carl Doersch Andrew Zisserman ViT 28 702 0 06 Dec 2018
Morph: Flexible Acceleration for 3D CNN-based Video Understanding Kartik Hegde R. Agrawal Yulun Yao Christopher W. Fletcher 30 70 0 16 Oct 2018
Non-local NetVLAD Encoding for Video Classification Yongyi Tang Xing Zhang Jingwen Wang Shaoxiang Chen Lin Ma Yu-Gang Jiang 13 41 0 29 Sep 2018
Towards Good Practices for Multi-modal Fusion in Large-scale Video Classification Jinlai Liu Zehuan Yuan Changhu Wang 24 9 0 16 Sep 2018
YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark N. Xu L. Yang Yuchen Fan Dingcheng Yue Yuchen Liang Jianchao Yang Thomas Huang VOS 22 522 0 06 Sep 2018
ARBEE: Towards Automated Recognition of Bodily Expression of Emotion In the Wild Yu Luo Jianbo Ye Reginald B. Adams Jia Li M. Newman Jianmin Wang 56 86 0 28 Aug 2018
CBinfer: Exploiting Frame-to-Frame Locality for Faster Convolutional Network Inference on Video Streams Lukas Cavigelli Luca Benini 24 26 0 15 Aug 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification Yang Du Chunfen Yuan Bing Li Lili Zhao Yangxi Li Weiming Hu 81 79 0 03 Aug 2018
Competitive Analysis System for Theatrical Movie Releases Based on Movie Trailer Deep Video Representation Miguel Campo C. Hsieh Matt Nickens J. J. Espinoza Abhinav Taliyan J. Rieger Jean Ho Bettina Sherick HAI 23 8 0 12 Jul 2018
Spatio-Temporal Instance Learning: Action Tubes from Class Supervision Pascal Mettes Cees G. M. Snoek 18 4 0 08 Jul 2018
Spatio-Temporal Channel Correlation Networks for Action Classification Ali Diba Mohsen Fayyaz Vivek Sharma M. M. Arzani Rahman Yousefzadeh Juergen Gall Luc Van Gool 3DPC 20 181 0 19 Jun 2018
Object Level Visual Reasoning in Videos Fabien Baradel Natalia Neverova Christian Wolf J. Mille Greg Mori 24 163 0 16 Jun 2018
Mining for meaning: from vision to language through multiple networks consensus Iulia Duta Andrei Liviu Nicolicioiu Simion-Vlad Bogolin Marius Leordeanu 18 3 0 05 Jun 2018
Gradient-Leaks: Understanding and Controlling Deanonymization in Federated Learning Tribhuvanesh Orekondy Seong Joon Oh Yang Zhang Bernt Schiele Mario Fritz PICV FedML 359 37 0 15 May 2018
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning Feng Yu Haofeng Chen Xin Wang Wenqi Xian Yingying Chen Fangchen Liu Vashisht Madhavan Trevor Darrell VLM 72 2,095 0 12 May 2018
I Have Seen Enough: A Teacher Student Network for Video Classification Using Fewer Frames S. Bhardwaj Mitesh M. Khapra 23 3 0 12 May 2018
Weakly-supervised Visual Instrument-playing Action Detection in Videos Jen-Yu Liu Yi-Hsuan Yang Shyh-Kang Jeng 21 13 0 05 May 2018
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events Sanjeel Parekh S. Essid A. Ozerov Ngoc Q. K. Duong P. Pérez G. Richard SSL 8 19 0 19 Apr 2018
SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos Silvio Giancola Mohieddine Amine Tarek Dghaily Guohao Li AI4TS 21 194 0 12 Apr 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset Dima Damen Hazel Doughty G. Farinella Sanja Fidler Antonino Furnari ... Davide Moltisanti Jonathan Munro Toby Perrett Will Price Michael Wray EgoV 25 997 0 08 Apr 2018
FaceForensics: A Large-scale Video Dataset for Forgery Detection in Human Faces Andreas Rossler D. Cozzolino L. Verdoliva Christian Riess Justus Thies Matthias Nießner PICV AAML CVBM 13 375 0 24 Mar 2018
Towards Universal Representation for Unseen Action Recognition Yi Zhu Yang Long Yu Guan Shawn D. Newsam Ling Shao AI4TS 22 100 0 22 Mar 2018
Recurrent Residual Module for Fast Inference in Videos Bowen Pan Wuwei Lin Xiaolin Fang Chaoqin Huang Bolei Zhou Cewu Lu ObjD 25 33 0 27 Feb 2018
Fine-Grained Land Use Classification at the City Scale Using Ground-Level Images Yi Zhu XueQing Deng Shawn D. Newsam 26 51 0 07 Feb 2018
DeepType: Multilingual Entity Linking by Neural Type System Evolution Jonathan Raiman O. Raiman BDL HAI 127 183 0 03 Feb 2018
Moments in Time Dataset: one million videos for event understanding Mathew Monfort A. Andonian Bolei Zhou K. Ramakrishnan Sarah Adel Bargal ... L. Brown Quanfu Fan Dan Gutfreund Carl Vondrick A. Oliva 47 538 0 09 Jan 2018
Cross-modal Embeddings for Video and Audio Retrieval Dídac Surís A. Duarte Amaia Salvador Jordi Torres Xavier Giró-i-Nieto SSL 16 69 0 07 Jan 2018
NSML: A Machine Learning Platform That Enables You to Focus on Your Models Nako Sung Minkyu Kim Hyunwoo Jo Youngil Yang Jingwoong Kim ... Youngkwan Kim Gayoung Lee Donghyun Kwak Jung-Woo Ha Sunghun Kim 35 86 0 16 Dec 2017
Compressed Video Action Recognition Chao-Yuan Wu Manzil Zaheer Hexiang Hu R. Manmatha Alex Smola Philipp Krahenbuhl 29 325 0 02 Dec 2017
Paris-Lille-3D: a large and high-quality ground truth urban point cloud dataset for automatic segmentation and classification Xavier Roynard Jean-Emmanuel Deschaud F. Goulette 3DPC 3DV 16 280 0 30 Nov 2017
FearNet: Brain-Inspired Model for Incremental Learning Ronald Kemker Christopher Kanan CLL 38 474 0 28 Nov 2017
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification Xiang Long Chuang Gan Gerard de Melo Jiajun Wu Xiao-Chang Liu Shilei Wen 31 208 0 27 Nov 2017
Summarizing First-Person Videos from Third Persons' Points of Views Hsuan-I Ho Wei-Chen Chiu Y. Wang EgoV 3DH 29 28 0 24 Nov 2017
Frame Interpolation with Multi-Scale Deep Loss Functions and Generative Adversarial Networks Joost R. van Amersfoort Wenzhe Shi Alejandro Acosta Francisco Massa J. Totz Zehan Wang Jose Caballero GAN 13 40 0 16 Nov 2017
Data-Free Knowledge Distillation for Deep Neural Networks Raphael Gontijo-Lopes Stefano Fenu Thad Starner 19 270 0 19 Oct 2017
A Read-Write Memory Network for Movie Story Understanding Seil Na Sangho Lee Jisung Kim Gunhee Kim AIMat 21 98 0 27 Sep 2017
Multi-Label Zero-Shot Human Action Recognition via Joint Latent Ranking Embedding Qian Wang Ke Chen BDL 30 7 0 15 Sep 2017
Unsupervised Representation Learning by Sorting Sequences Hsin-Ying Lee Jia-Bin Huang Maneesh Kumar Singh Ming-Hsuan Yang SSL DRL 20 533 0 03 Aug 2017
Deep Learning Methods for Efficient Large Scale Video Labeling Miha Škalič M. Pekalski Xin Pan VLM 10 17 0 14 Jun 2017
Large-Scale YouTube-8M Video Understanding with Deep Neural Networks Manuk Akopyan Eshsou Khashba 22 7 0 14 Jun 2017