1503.08909
Beyond Short Snippets: Deep Networks for Video Classification
31 March 2015
Joe Yue-Hei Ng
Matthew J. Hausknecht
Sudheendra Vijayanarasimhan
Oriol Vinyals
R. Monga
G. Toderici
Papers citing
"Beyond Short Snippets: Deep Networks for Video Classification"
50 / 737 papers shown
Efficient Action Recognition Using Confidence Distillation
Shervin Manzuri Shalmani
Fei Chiang
Ronghuo Zheng
109
6
0
05 Sep 2021
TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions and U-GRUs for skeletal pedestrian crossing prediction
Joseph Gesnouin
Steve Pechberti
B. Stanciulescu
Fabien Moutarde
92
24
0
02 Sep 2021
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos
Nada Osman
Guglielmo Camporese
Pasquale Coscia
Lamberto Ballan
EgoV
108
21
0
02 Sep 2021
Social Fabric: Tubelet Compositions for Video Relation Detection
Shuo Chen
Zenglin Shi
Pascal Mettes
Cees G. M. Snoek
ViT
81
21
0
18 Aug 2021
Vision-Guided Forecasting -- Visual Context for Multi-Horizon Time Series Forecasting
Eitan Kosman
Dotan Di Castro
AI4TS
43
1
0
27 Jul 2021
Delta Sampling R-BERT for limited data and low-light action recognition
Sanchit Hira
Ritwik Das
Abhinav Modi
D. Pakhomov
109
17
0
12 Jul 2021
Long Short-Term Transformer for Online Action Detection
Mingze Xu
Yuanjun Xiong
Hao Chen
Xinyu Li
Wei Xia
Zhuowen Tu
Stefano Soatto
ViT
154
137
0
07 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
169
101
0
01 Jul 2021
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Martine Toering
Ioannis Gatopoulos
M. Stol
Vincent Tao Hu
SSL
108
11
0
18 Jun 2021
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection
Mahsa Ehsanpour
F. Saleh
Silvio Savarese
Ian Reid
Hamid Rezatofighi
81
44
0
16 Jun 2021
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Mathilde Brousmiche
Jean Rouat
Stéphane Dupont
154
11
0
12 Jun 2021
Video Imprint
Zhanning Gao
Le Wang
Nebojsa Jojic
Zhenxing Niu
N. Zheng
G. Hua
36
5
0
07 Jun 2021
Continual 3D Convolutional Neural Networks for Real-time Processing of Videos
Lukas Hedegaard
Alexandros Iosifidis
3DPC
87
15
0
31 May 2021
SSAN: Separable Self-Attention Network for Video Representation Learning
Xudong Guo
Xun Guo
Yan Lu
ViT
AI4TS
55
26
0
27 May 2021
Temporal Action Proposal Generation with Transformers
Lining Wang
Haosen Yang
Wenhao Wu
Huanjin Yao
Hujie Huang
ViT
85
28
0
25 May 2021
Unsupervised Video Summarization with a Convolutional Attentive Adversarial Network
Guoqiang Liang
Yanbing Lv
Shucheng Li
Shizhou Zhang
Yanning Zhang
GAN
41
9
0
24 May 2021
A multimodal deep learning framework for scalable content based visual media retrieval
Ambareesh Ravi
Amith Nandakumar
50
3
0
18 May 2021
PLSM: A Parallelized Liquid State Machine for Unintentional Action Detection
Dipayan Das
Saumik Bhattacharya
Umapada Pal
S. Chanda
80
8
0
06 May 2021
Action in Mind: A Neural Network Approach to Action Recognition and Segmentation
Zahra Gharaee
44
3
0
30 Apr 2021
FrameExit: Conditional Early Exiting for Efficient Video Recognition
Amir Ghodrati
B. Bejnordi
A. Habibian
147
81
0
27 Apr 2021
Three-stream network for enriched Action Recognition
Ivaxi Sheth
27
4
0
27 Apr 2021
VidTr: Video Transformer Without Convolutions
Yanyi Zhang
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Biagio Brattoli
Hao Chen
I. Marsic
Joseph Tighe
ViT
262
198
0
23 Apr 2021
Modeling long-term interactions to enhance action recognition
Alejandro Cartas
Petia Radeva
Mariella Dimiccoli
EgoV
51
6
0
23 Apr 2021
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
Yuan Zhi
Zhan Tong
Limin Wang
Gangshan Wu
TTA
64
74
0
20 Apr 2021
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition
Zejia Weng
Zuxuan Wu
Hengduo Li
Jingjing Chen
Yu-Gang Jiang
80
4
0
20 Apr 2021
Temporal Query Networks for Fine-grained Video Understanding
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
119
87
0
19 Apr 2021
Non-linear Functional Modeling using Neural Networks
Aniruddha Rajendra Rao
M. Reimherr
62
31
0
19 Apr 2021
Towards Extremely Compact RNNs for Video Recognition with Fully Decomposed Hierarchical Tucker Structure
Miao Yin
Siyu Liao
Xiao-Yang Liu
Xiaodong Wang
Bo Yuan
AI4TS
87
31
0
12 Apr 2021
Multimodal Object Detection via Probabilistic Ensembling
Yi-Ting Chen
Jing Shi
Zelin Ye
Christoph Mertz
Deva Ramanan
Shu Kong
108
115
0
07 Apr 2021
Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories
Xitong Yang
Haoqi Fan
Lorenzo Torresani
L. Davis
Heng Wang
VLM
84
21
0
02 Apr 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
242
2,177
0
29 Mar 2021
Busy-Quiet Video Disentangling for Video Classification
Guoxi Huang
A. Bors
56
7
0
29 Mar 2021
An Image is Worth 16x16 Words, What is a Video Worth?
Gilad Sharir
Asaf Noy
Lihi Zelnik-Manor
ViT
102
125
0
25 Mar 2021
PGT: A Progressive Method for Training Models on Long Videos
Bo Pang
Gao Peng
Yizhuo Li
Cewu Lu
VLM
42
12
0
21 Mar 2021
Hopper: Multi-hop Transformer for Spatiotemporal Reasoning
Honglu Zhou
Asim Kadav
Farley Lai
Alexandru Niculescu-Mizil
Martin Renqiang Min
Mubbasir Kapadia
H. Graf
LRM
89
18
0
19 Mar 2021
CLTA: Contents and Length-based Temporal Attention for Few-shot Action Recognition
Yang Bo
Yangdi Lu
Wenbo He
VLM
92
0
0
18 Mar 2021
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning
Yunbo Wang
Haixu Wu
Jianjin Zhang
Zhifeng Gao
Jianmin Wang
Philip S. Yu
Mingsheng Long
163
397
0
17 Mar 2021
Coarse-Fine Networks for Temporal Activity Detection in Videos
Kumara Kahatapitiya
Michael S. Ryoo
AI4TS
97
39
0
01 Mar 2021
Phase Space Reconstruction Network for Lane Intrusion Action Recognition
Ruiwen Zhang
Zhidong Deng
Hongsen Lin
Hongchao Lu
34
0
0
22 Feb 2021
Win-Fail Action Recognition
Paritosh Parmar
B. Morris
64
5
0
15 Feb 2021
Driving Style Representation in Convolutional Recurrent Neural Network Model of Driver Identification
Sobhan Moosavi
P. Mahajan
Srinivasan Parthasarathy
Colleen Saunders-Chukwu
R. Ramnath
AILaw
53
16
0
11 Feb 2021
Face Recognition using 3D CNNs
N. Mishra
S. Singh
3DH
CVBM
52
5
0
02 Feb 2021
CyclingNet: Detecting cycling near misses from video streams in complex urban scenes with deep learning
M. Ibrahim
James Haworth
Nicola Christie
T. Cheng
81
15
0
31 Jan 2021
Position, Padding and Predictions: A Deeper Look at Position Information in CNNs
Md. Amirul Islam
M. Kowal
Sen Jia
Konstantinos G. Derpanis
Neil D. B. Bruce
61
58
0
28 Jan 2021
A Case Study of Deep Learning Based Multi-Modal Methods for Predicting the Age-Suitability Rating of Movie Trailers
Mahsa Shafaei
C. Smailis
I. Kakadiaris
Thamar Solorio
403
1
0
26 Jan 2021
Bridging the gap between Human Action Recognition and Online Action Detection
Alban Main De Boissiere
R. Noumeir
97
0
0
21 Jan 2021
Personal Privacy Protection via Irrelevant Faces Tracking and Pixelation in Video Live Streaming
Jizhe Zhou
Chi-Man Pun
PICV
115
38
0
04 Jan 2021
Privacy-sensitive Objects Pixelation for Live Video Streaming
Jizhe Zhou
Chi-Man Pun
Yu Tong
74
9
0
03 Jan 2021
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models
Alina Roitberg
Monica Haurilet
Manuel Martínez
Rainer Stiefelhagen
UQCV
48
6
0
02 Jan 2021
3D Human motion anticipation and classification
Emad Barsoum
J. Kender
Zicheng Liu
3DH
50
1
0
31 Dec 2020