YouTube-8M: A Large-Scale Video Classification Benchmark


27 September 2016
Sami Abu-El-Haija
Nisarg Kothari
Joonseok Lee
Apostol Natsev
G. Toderici
Balakrishnan Varadarajan
Sudheendra Vijayanarasimhan
    VLM

Papers citing "YouTube-8M: A Large-Scale Video Classification Benchmark"

50 / 211 papers shown
Large Scale Holistic Video Understanding
Ali Diba
Mohsen Fayyaz
Vivek Sharma
Manohar Paluri
Jurgen Gall
Rainer Stiefelhagen
Luc Van Gool
29
35
0
25 Apr 2019
Free-form Video Inpainting with 3D Gated Convolution and Temporal PatchGAN
Ya-Liang Chang
Zhe-Yu Liu
Kuan-Ying Lee
Winston H. Hsu
DiffM
23
172
0
23 Apr 2019
Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency
Jia Li
K. Fu
Shengwei Zhao
Shiming Ge
38
26
0
10 Apr 2019
Self-Supervised Learning via Conditional Motion Propagation
Xiaohang Zhan
Xingang Pan
Ziwei Liu
Dahua Lin
Chen Change Loy
SSL
42
47
0
27 Mar 2019
Less is More: Learning Highlight Detection from Video Duration
Bo Xiong
Yannis Kalantidis
Deepti Ghadiyaram
Kristen Grauman
14
108
0
03 Mar 2019
Efficient Video Classification Using Fewer Frames
S. Bhardwaj
Mukundhan Srinivasan
Mitesh M. Khapra
40
88
0
27 Feb 2019
Single-frame Regularization for Temporally Stable CNNs
Gabriel Eilertsen
Rafał K. Mantiuk
Jonas Unger
19
43
0
27 Feb 2019
Understanding and Training Deep Diagonal Circulant Neural Networks
Alexandre Araujo
Benjamin Négrevergne
Y. Chevaleyre
Jamal Atif
27
4
0
29 Jan 2019
DistInit: Learning Video Representations Without a Single Labeled Video
Rohit Girdhar
Du Tran
Lorenzo Torresani
Deva Ramanan
21
54
0
26 Jan 2019
Cricket stroke extraction: Towards creation of a large-scale cricket actions dataset
Arpan Gupta
S. Muthiah
22
6
0
10 Jan 2019
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
28
702
0
06 Dec 2018
Morph: Flexible Acceleration for 3D CNN-based Video Understanding
Kartik Hegde
R. Agrawal
Yulun Yao
Christopher W. Fletcher
30
70
0
16 Oct 2018
Non-local NetVLAD Encoding for Video Classification
Yongyi Tang
Xing Zhang
Jingwen Wang
Shaoxiang Chen
Lin Ma
Yu-Gang Jiang
13
41
0
29 Sep 2018
Towards Good Practices for Multi-modal Fusion in Large-scale Video Classification
Jinlai Liu
Zehuan Yuan
Changhu Wang
24
9
0
16 Sep 2018
YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark
N. Xu
L. Yang
Yuchen Fan
Dingcheng Yue
Yuchen Liang
Jianchao Yang
Thomas Huang
VOS
22
522
0
06 Sep 2018
ARBEE: Towards Automated Recognition of Bodily Expression of Emotion In the Wild
Yu Luo
Jianbo Ye
Reginald B. Adams
Jia Li
M. Newman
Jianmin Wang
56
86
0
28 Aug 2018
CBinfer: Exploiting Frame-to-Frame Locality for Faster Convolutional Network Inference on Video Streams
Lukas Cavigelli
Luca Benini
24
26
0
15 Aug 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
Yang Du
Chunfen Yuan
Bing Li
Lili Zhao
Yangxi Li
Weiming Hu
81
79
0
03 Aug 2018
Competitive Analysis System for Theatrical Movie Releases Based on Movie Trailer Deep Video Representation
Miguel Campo
C. Hsieh
Matt Nickens
J. J. Espinoza
Abhinav Taliyan
J. Rieger
Jean Ho
Bettina Sherick
HAI
23
8
0
12 Jul 2018
Spatio-Temporal Instance Learning: Action Tubes from Class Supervision
Pascal Mettes
Cees G. M. Snoek
18
4
0
08 Jul 2018
Spatio-Temporal Channel Correlation Networks for Action Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
M. M. Arzani
Rahman Yousefzadeh
Juergen Gall
Luc Van Gool
3DPC
20
181
0
19 Jun 2018
Object Level Visual Reasoning in Videos
Fabien Baradel
Natalia Neverova
Christian Wolf
J. Mille
Greg Mori
24
163
0
16 Jun 2018
Mining for meaning: from vision to language through multiple networks consensus
Iulia Duta
Andrei Liviu Nicolicioiu
Simion-Vlad Bogolin
Marius Leordeanu
18
3
0
05 Jun 2018
Gradient-Leaks: Understanding and Controlling Deanonymization in Federated Learning
Tribhuvanesh Orekondy
Seong Joon Oh
Yang Zhang
Bernt Schiele
Mario Fritz
PICV
FedML
359
37
0
15 May 2018
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
Feng Yu
Haofeng Chen
Xin Wang
Wenqi Xian
Yingying Chen
Fangchen Liu
Vashisht Madhavan
Trevor Darrell
VLM
72
2,095
0
12 May 2018
I Have Seen Enough: A Teacher Student Network for Video Classification Using Fewer Frames
S. Bhardwaj
Mitesh M. Khapra
23
3
0
12 May 2018
Weakly-supervised Visual Instrument-playing Action Detection in Videos
Jen-Yu Liu
Yi-Hsuan Yang
Shyh-Kang Jeng
21
13
0
05 May 2018
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events
Sanjeel Parekh
S. Essid
A. Ozerov
Ngoc Q. K. Duong
P. Pérez
G. Richard
SSL
8
19
0
19 Apr 2018
SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos
Silvio Giancola
Mohieddine Amine
Tarek Dghaily
Guohao Li
AI4TS
21
194
0
12 Apr 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
Dima Damen
Hazel Doughty
G. Farinella
Sanja Fidler
Antonino Furnari
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
25
997
0
08 Apr 2018
FaceForensics: A Large-scale Video Dataset for Forgery Detection in Human Faces
Andreas Rossler
D. Cozzolino
L. Verdoliva
Christian Riess
Justus Thies
Matthias Nießner
PICV
AAML
CVBM
13
375
0
24 Mar 2018
Towards Universal Representation for Unseen Action Recognition
Yi Zhu
Yang Long
Yu Guan
Shawn D. Newsam
Ling Shao
AI4TS
22
100
0
22 Mar 2018
Recurrent Residual Module for Fast Inference in Videos
Bowen Pan
Wuwei Lin
Xiaolin Fang
Chaoqin Huang
Bolei Zhou
Cewu Lu
ObjD
25
33
0
27 Feb 2018
Fine-Grained Land Use Classification at the City Scale Using Ground-Level Images
Yi Zhu
XueQing Deng
Shawn D. Newsam
26
51
0
07 Feb 2018
DeepType: Multilingual Entity Linking by Neural Type System Evolution
Jonathan Raiman
O. Raiman
BDL
HAI
127
183
0
03 Feb 2018
Moments in Time Dataset: one million videos for event understanding
Mathew Monfort
A. Andonian
Bolei Zhou
K. Ramakrishnan
Sarah Adel Bargal
...
L. Brown
Quanfu Fan
Dan Gutfreund
Carl Vondrick
A. Oliva
47
538
0
09 Jan 2018
Cross-modal Embeddings for Video and Audio Retrieval
Dídac Surís
A. Duarte
Amaia Salvador
Jordi Torres
Xavier Giró-i-Nieto
SSL
16
69
0
07 Jan 2018
NSML: A Machine Learning Platform That Enables You to Focus on Your Models
Nako Sung
Minkyu Kim
Hyunwoo Jo
Youngil Yang
Jingwoong Kim
...
Youngkwan Kim
Gayoung Lee
Donghyun Kwak
Jung-Woo Ha
Sunghun Kim
35
86
0
16 Dec 2017
Compressed Video Action Recognition
Chao-Yuan Wu
Manzil Zaheer
Hexiang Hu
R. Manmatha
Alex Smola
Philipp Krahenbuhl
29
325
0
02 Dec 2017
Paris-Lille-3D: a large and high-quality ground truth urban point cloud dataset for automatic segmentation and classification
Xavier Roynard
Jean-Emmanuel Deschaud
F. Goulette
3DPC
3DV
16
280
0
30 Nov 2017
FearNet: Brain-Inspired Model for Incremental Learning
Ronald Kemker
Christopher Kanan
CLL
38
474
0
28 Nov 2017
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification
Xiang Long
Chuang Gan
Gerard de Melo
Jiajun Wu
Xiao-Chang Liu
Shilei Wen
31
208
0
27 Nov 2017
Summarizing First-Person Videos from Third Persons' Points of Views
Hsuan-I Ho
Wei-Chen Chiu
Y. Wang
EgoV
3DH
29
28
0
24 Nov 2017
Frame Interpolation with Multi-Scale Deep Loss Functions and Generative Adversarial Networks
Joost R. van Amersfoort
Wenzhe Shi
Alejandro Acosta
Francisco Massa
J. Totz
Zehan Wang
Jose Caballero
GAN
13
40
0
16 Nov 2017
Data-Free Knowledge Distillation for Deep Neural Networks
Raphael Gontijo-Lopes
Stefano Fenu
Thad Starner
19
270
0
19 Oct 2017
A Read-Write Memory Network for Movie Story Understanding
Seil Na
Sangho Lee
Jisung Kim
Gunhee Kim
AIMat
21
98
0
27 Sep 2017
Multi-Label Zero-Shot Human Action Recognition via Joint Latent Ranking Embedding
Qian Wang
Ke Chen
BDL
30
7
0
15 Sep 2017
Unsupervised Representation Learning by Sorting Sequences
Hsin-Ying Lee
Jia-Bin Huang
Maneesh Kumar Singh
Ming-Hsuan Yang
SSL
DRL
20
533
0
03 Aug 2017
Deep Learning Methods for Efficient Large Scale Video Labeling
Miha Škalič
M. Pekalski
Xin Pan
VLM
10
17
0
14 Jun 2017
Large-Scale YouTube-8M Video Understanding with Deep Neural Networks
Manuk Akopyan
Eshsou Khashba
22
7
0
14 Jun 2017