ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1503.08909
  4. Cited By
Beyond Short Snippets: Deep Networks for Video Classification

Beyond Short Snippets: Deep Networks for Video Classification

31 March 2015
Joe Yue-Hei Ng
Matthew J. Hausknecht
Sudheendra Vijayanarasimhan
Oriol Vinyals
R. Monga
G. Toderici
ArXivPDFHTML

Papers citing "Beyond Short Snippets: Deep Networks for Video Classification"

50 / 739 papers shown
Title
TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions
  and U-GRUs for skeletal pedestrian crossing prediction
TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions and U-GRUs for skeletal pedestrian crossing prediction
Joseph Gesnouin
Steve Pechberti
B. Stanciulescu
Fabien Moutarde
45
22
0
02 Sep 2021
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric
  Videos
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos
Nada Osman
Guglielmo Camporese
Pasquale Coscia
Lamberto Ballan
EgoV
39
20
0
02 Sep 2021
Social Fabric: Tubelet Compositions for Video Relation Detection
Social Fabric: Tubelet Compositions for Video Relation Detection
Shuo Chen
Zenglin Shi
Pascal Mettes
Cees G. M. Snoek
ViT
36
21
0
18 Aug 2021
Vision-Guided Forecasting -- Visual Context for Multi-Horizon Time
  Series Forecasting
Vision-Guided Forecasting -- Visual Context for Multi-Horizon Time Series Forecasting
Eitan Kosman
Dotan Di Castro
AI4TS
24
1
0
27 Jul 2021
Delta Sampling R-BERT for limited data and low-light action recognition
Delta Sampling R-BERT for limited data and low-light action recognition
Sanchit Hira
Ritwik Das
Abhinav Modi
D. Pakhomov
75
17
0
12 Jul 2021
Long Short-Term Transformer for Online Action Detection
Long Short-Term Transformer for Online Action Detection
Mingze Xu
Yuanjun Xiong
Hao Chen
Xinyu Li
Wei Xia
Zhuowen Tu
Stefano Soatto
ViT
40
130
0
07 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
54
95
0
01 Jul 2021
Self-supervised Video Representation Learning with Cross-Stream
  Prototypical Contrasting
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Martine Toering
Ioannis Gatopoulos
M. Stol
Vincent Tao Hu
SSL
40
11
0
18 Jun 2021
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group
  and Activity Detection
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection
Mahsa Ehsanpour
F. Saleh
Silvio Savarese
Ian Reid
Hamid Rezatofighi
30
42
0
16 Jun 2021
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Mathilde Brousmiche
Jean Rouat
Stéphane Dupont
27
11
0
12 Jun 2021
Video Imprint
Video Imprint
Zhanning Gao
Le Wang
Nebojsa Jojic
Zhenxing Niu
N. Zheng
G. Hua
32
5
0
07 Jun 2021
Continual 3D Convolutional Neural Networks for Real-time Processing of
  Videos
Continual 3D Convolutional Neural Networks for Real-time Processing of Videos
Lukas Hedegaard
Alexandros Iosifidis
3DPC
23
14
0
31 May 2021
SSAN: Separable Self-Attention Network for Video Representation Learning
SSAN: Separable Self-Attention Network for Video Representation Learning
Xudong Guo
Xun Guo
Yan Lu
ViT
AI4TS
14
26
0
27 May 2021
Temporal Action Proposal Generation with Transformers
Temporal Action Proposal Generation with Transformers
Lining Wang
Haosen Yang
Wenhao Wu
H. Yao
Hujie Huang
ViT
38
27
0
25 May 2021
Unsupervised Video Summarization with a Convolutional Attentive
  Adversarial Network
Unsupervised Video Summarization with a Convolutional Attentive Adversarial Network
Guoqiang Liang
Yanbing Lv
Shucheng Li
Shizhou Zhang
Yanning Zhang
GAN
13
9
0
24 May 2021
A multimodal deep learning framework for scalable content based visual
  media retrieval
A multimodal deep learning framework for scalable content based visual media retrieval
Ambareesh Ravi
Amith Nandakumar
21
3
0
18 May 2021
PLSM: A Parallelized Liquid State Machine for Unintentional Action
  Detection
PLSM: A Parallelized Liquid State Machine for Unintentional Action Detection
Dipayan Das
Saumik Bhattacharya
Umapada Pal
S. Chanda
21
8
0
06 May 2021
Action in Mind: A Neural Network Approach to Action Recognition and
  Segmentation
Action in Mind: A Neural Network Approach to Action Recognition and Segmentation
Zahra Gharaee
24
3
0
30 Apr 2021
FrameExit: Conditional Early Exiting for Efficient Video Recognition
FrameExit: Conditional Early Exiting for Efficient Video Recognition
Amir Ghodrati
B. Bejnordi
A. Habibian
42
81
0
27 Apr 2021
Three-stream network for enriched Action Recognition
Three-stream network for enriched Action Recognition
Ivaxi Sheth
19
4
0
27 Apr 2021
VidTr: Video Transformer Without Convolutions
VidTr: Video Transformer Without Convolutions
Yanyi Zhang
Xinyu Li
Chunhui Liu
Bing Shuai
Yi Zhu
Biagio Brattoli
Hao Chen
I. Marsic
Joseph Tighe
ViT
148
193
0
23 Apr 2021
Modeling long-term interactions to enhance action recognition
Modeling long-term interactions to enhance action recognition
Alejandro Cartas
Petia Radeva
Mariella Dimiccoli
EgoV
27
6
0
23 Apr 2021
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
MGSampler: An Explainable Sampling Strategy for Video Action Recognition
Yuan Zhi
Zhan Tong
Limin Wang
Gangshan Wu
TTA
19
72
0
20 Apr 2021
HCMS: Hierarchical and Conditional Modality Selection for Efficient
  Video Recognition
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition
Zejia Weng
Zuxuan Wu
Hengduo Li
Jingjing Chen
Yu-Gang Jiang
32
4
0
20 Apr 2021
Temporal Query Networks for Fine-grained Video Understanding
Temporal Query Networks for Fine-grained Video Understanding
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
24
83
0
19 Apr 2021
Non-linear Functional Modeling using Neural Networks
Non-linear Functional Modeling using Neural Networks
Aniruddha Rajendra Rao
M. Reimherr
25
29
0
19 Apr 2021
Towards Extremely Compact RNNs for Video Recognition with Fully
  Decomposed Hierarchical Tucker Structure
Towards Extremely Compact RNNs for Video Recognition with Fully Decomposed Hierarchical Tucker Structure
Miao Yin
Siyu Liao
Xiao-Yang Liu
Xiaodong Wang
Bo Yuan
AI4TS
27
31
0
12 Apr 2021
Multimodal Object Detection via Probabilistic Ensembling
Multimodal Object Detection via Probabilistic Ensembling
Yi-Ting Chen
Jing Shi
Zelin Ye
Christoph Mertz
Deva Ramanan
Shu Kong
13
101
0
07 Apr 2021
Beyond Short Clips: End-to-End Video-Level Learning with Collaborative
  Memories
Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories
Xitong Yang
Haoqi Fan
Lorenzo Torresani
L. Davis
Heng Wang
VLM
27
20
0
02 Apr 2021
ViViT: A Video Vision Transformer
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
30
2,093
0
29 Mar 2021
Busy-Quiet Video Disentangling for Video Classification
Busy-Quiet Video Disentangling for Video Classification
Guoxi Huang
A. Bors
28
6
0
29 Mar 2021
An Image is Worth 16x16 Words, What is a Video Worth?
An Image is Worth 16x16 Words, What is a Video Worth?
Gilad Sharir
Asaf Noy
Lihi Zelnik-Manor
ViT
24
121
0
25 Mar 2021
MoViNets: Mobile Video Networks for Efficient Video Recognition
MoViNets: Mobile Video Networks for Efficient Video Recognition
Dan Kondratyuk
Liangzhe Yuan
Yandong Li
Li Zhang
Mingxing Tan
Matthew A. Brown
Boqing Gong
18
228
0
21 Mar 2021
PGT: A Progressive Method for Training Models on Long Videos
PGT: A Progressive Method for Training Models on Long Videos
Bo Pang
Gao Peng
Yizhuo Li
Cewu Lu
VLM
24
12
0
21 Mar 2021
Hopper: Multi-hop Transformer for Spatiotemporal Reasoning
Hopper: Multi-hop Transformer for Spatiotemporal Reasoning
Honglu Zhou
Asim Kadav
Farley Lai
Alexandru Niculescu-Mizil
Martin Renqiang Min
Mubbasir Kapadia
H. Graf
LRM
48
18
0
19 Mar 2021
CLTA: Contents and Length-based Temporal Attention for Few-shot Action
  Recognition
CLTA: Contents and Length-based Temporal Attention for Few-shot Action Recognition
Yang Bo
Yangdi Lu
Wenbo He
VLM
27
0
0
18 Mar 2021
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive
  Learning
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning
Yunbo Wang
Haixu Wu
Jianjin Zhang
Zhifeng Gao
Jianmin Wang
Philip S. Yu
Mingsheng Long
28
378
0
17 Mar 2021
Coarse-Fine Networks for Temporal Activity Detection in Videos
Coarse-Fine Networks for Temporal Activity Detection in Videos
Kumara Kahatapitiya
Michael S. Ryoo
AI4TS
53
38
0
01 Mar 2021
Phase Space Reconstruction Network for Lane Intrusion Action Recognition
Phase Space Reconstruction Network for Lane Intrusion Action Recognition
Ruiwen Zhang
Zhidong Deng
Hongsen Lin
Hongchao Lu
24
0
0
22 Feb 2021
Win-Fail Action Recognition
Win-Fail Action Recognition
Paritosh Parmar
B. Morris
24
5
0
15 Feb 2021
Driving Style Representation in Convolutional Recurrent Neural Network
  Model of Driver Identification
Driving Style Representation in Convolutional Recurrent Neural Network Model of Driver Identification
Sobhan Moosavi
P. Mahajan
Srinivasan Parthasarathy
Colleen Saunders-Chukwu
R. Ramnath
AILaw
26
16
0
11 Feb 2021
Face Recognition using 3D CNNs
Face Recognition using 3D CNNs
N. Mishra
S. Singh
3DH
CVBM
22
5
0
02 Feb 2021
CyclingNet: Detecting cycling near misses from video streams in complex
  urban scenes with deep learning
CyclingNet: Detecting cycling near misses from video streams in complex urban scenes with deep learning
M. Ibrahim
James Haworth
Nicola Christie
T. Cheng
34
14
0
31 Jan 2021
Position, Padding and Predictions: A Deeper Look at Position Information
  in CNNs
Position, Padding and Predictions: A Deeper Look at Position Information in CNNs
Md. Amirul Islam
M. Kowal
Sen Jia
Konstantinos G. Derpanis
Neil D. B. Bruce
30
55
0
28 Jan 2021
A Case Study of Deep Learning Based Multi-Modal Methods for Predicting
  the Age-Suitability Rating of Movie Trailers
A Case Study of Deep Learning Based Multi-Modal Methods for Predicting the Age-Suitability Rating of Movie Trailers
Mahsa Shafaei
C. Smailis
I. Kakadiaris
Thamar Solorio
165
1
0
26 Jan 2021
Bridging the gap between Human Action Recognition and Online Action
  Detection
Bridging the gap between Human Action Recognition and Online Action Detection
Alban Main De Boissiere
R. Noumeir
22
0
0
21 Jan 2021
Personal Privacy Protection via Irrelevant Faces Tracking and Pixelation
  in Video Live Streaming
Personal Privacy Protection via Irrelevant Faces Tracking and Pixelation in Video Live Streaming
Jizhe Zhou
Chi-Man Pun
PICV
32
34
0
04 Jan 2021
Privacy-sensitive Objects Pixelation for Live Video Streaming
Privacy-sensitive Objects Pixelation for Live Video Streaming
Jizhe Zhou
Chi-Man Pun
Yu Tong
38
9
0
03 Jan 2021
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and
  the CARING Models
Uncertainty-sensitive Activity Recognition: a Reliability Benchmark and the CARING Models
Alina Roitberg
Monica Haurilet
Manuel Martínez
Rainer Stiefelhagen
UQCV
39
6
0
02 Jan 2021
3D Human motion anticipation and classification
3D Human motion anticipation and classification
Emad Barsoum
J. Kender
Zicheng Liu
3DH
21
1
0
31 Dec 2020
Previous
12345...131415
Next