Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.01289
Cited By
v1
v2 (latest)
Timeception for Complex Action Recognition
4 December 2018
Noureldien Hussein
E. Gavves
A. Smeulders
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Timeception for Complex Action Recognition"
39 / 39 papers shown
Title
HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding
Shehreen Azad
Vibhav Vineet
Yogesh S Rawat
VLM
466
2
0
11 Mar 2025
Situational Scene Graph for Structured Human-centric Situation Understanding
Chinthani Sugandhika
Chen Li
Deepu Rajan
Basura Fernando
462
1
0
30 Oct 2024
Video Time: Properties, Encoders and Evaluation
Amir Ghodrati
E. Gavves
Cees G. M. Snoek
128
26
0
18 Jul 2018
Videos as Space-Time Region Graphs
Xinyu Wang
Abhinav Gupta
106
756
0
05 Jun 2018
What have we learned from deep representations for action recognition?
Christoph Feichtenhofer
A. Pinz
Richard P. Wildes
Andrew Zisserman
SSL
61
47
0
04 Jan 2018
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
226
3,033
0
30 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
126
1,936
0
27 Nov 2017
Temporal Relational Reasoning in Videos
Bolei Zhou
A. Andonian
Aude Oliva
Antonio Torralba
NAI
98
1,040
0
22 Nov 2017
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
289
8,916
0
21 Nov 2017
Attentional Pooling for Action Recognition
Rohit Girdhar
Deva Ramanan
95
320
0
04 Nov 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
AI4TS
141
6,878
0
04 Jul 2017
Learnable pooling with Context Gating for video classification
Antoine Miech
Ivan Laptev
Josef Sivic
74
327
0
21 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
722
132,199
0
12 Jun 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
235
8,037
0
22 May 2017
The Kinetics Human Action Video Dataset
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
...
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
250
3,815
0
19 May 2017
Temporal Segment Networks for Action Recognition in Videos
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
114
812
0
08 May 2017
Unified Embedding and Metric Learning for Zero-Exemplar Event Detection
Noureldien Hussein
E. Gavves
A. Smeulders
52
15
0
05 May 2017
ActionVLAD: Learning spatio-temporal aggregation for action classification
Rohit Girdhar
Deva Ramanan
Abhinav Gupta
Josef Sivic
Bryan C. Russell
AI4TS
75
451
0
10 Apr 2017
Asynchronous Temporal Fields for Action Recognition
Gunnar Sigurdsson
S. Divvala
Ali Farhadi
Abhinav Gupta
BDL
73
170
0
19 Dec 2016
Action Recognition with Dynamic Image Networks
Hakan Bilen
Basura Fernando
Efstratios Gavves
Andrea Vedaldi
FAtt
65
224
0
02 Dec 2016
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
522
10,345
0
16 Nov 2016
Xception: Deep Learning with Depthwise Separable Convolutions
François Chollet
MDE
BDL
PINN
1.4K
14,575
0
07 Oct 2016
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
406
7,405
0
12 Sep 2016
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
PINN
3DV
775
36,861
0
25 Aug 2016
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
105
3,838
0
02 Aug 2016
Convolutional Two-Stream Network Fusion for Video Action Recognition
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
163
2,612
0
22 Apr 2016
The THUMOS Challenge on Action Recognition for Videos "in the Wild"
Haroon Idrees
Amir Zamir
Yu-Gang Jiang
Alexander N. Gorban
Ivan Laptev
Rahul Sukthankar
M. Shah
97
776
0
21 Apr 2016
Long-term Temporal Convolutions for Action Recognition
Gül Varol
Ivan Laptev
Cordelia Schmid
80
912
0
15 Apr 2016
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
Gunnar Sigurdsson
Gül Varol
Xinyu Wang
Ali Farhadi
Ivan Laptev
Abhinav Gupta
VGen
106
1,245
0
06 Apr 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,322
0
10 Dec 2015
Rank Pooling for Action Recognition
Basura Fernando
E. Gavves
José Oramas
Amir Ghodrati
Tinne Tuytelaars
60
300
0
06 Dec 2015
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
883
27,412
0
02 Dec 2015
Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos
Serena Yeung
Olga Russakovsky
Ning Jin
Mykhaylo Andriluka
Greg Mori
Li Fei-Fei
VLM
97
439
0
21 Jul 2015
Towards Good Practices for Very Deep Two-Stream ConvNets
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
95
445
0
08 Jul 2015
EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video
Guangnan Ye
Yitong Li
Hongliang Xu
Dong Liu
Shih-Fu Chang
43
118
0
08 Jun 2015
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
165
6,056
0
17 Nov 2014
Going Deeper with Convolutions
Christian Szegedy
Wei Liu
Yangqing Jia
P. Sermanet
Scott E. Reed
Dragomir Anguelov
D. Erhan
Vincent Vanhoucke
Andrew Rabinovich
480
43,685
0
17 Sep 2014
Two-Stream Convolutional Networks for Action Recognition in Videos
Karen Simonyan
Andrew Zisserman
247
7,541
0
09 Jun 2014
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
K. Soomro
Amir Zamir
M. Shah
CLIP
VGen
157
6,162
0
03 Dec 2012
1