v1v2 (latest)

Timeception for Complex Action Recognition

4 December 2018

Papers citing "Timeception for Complex Action Recognition"

39 / 39 papers shown

Title
HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding Shehreen Azad Vibhav Vineet Yogesh S Rawat VLM 466 2 0 11 Mar 2025
Situational Scene Graph for Structured Human-centric Situation Understanding Chinthani Sugandhika Chen Li Deepu Rajan Basura Fernando 462 1 0 30 Oct 2024
Video Time: Properties, Encoders and Evaluation Amir Ghodrati E. Gavves Cees G. M. Snoek 128 26 0 18 Jul 2018
Videos as Space-Time Region Graphs Xinyu Wang Abhinav Gupta 106 756 0 05 Jun 2018
What have we learned from deep representations for action recognition? Christoph Feichtenhofer A. Pinz Richard P. Wildes Andrew Zisserman SSL 61 47 0 04 Jan 2018
A Closer Look at Spatiotemporal Convolutions for Action Recognition Du Tran Heng Wang Lorenzo Torresani Jamie Ray Yann LeCun Manohar Paluri 226 3,033 0 30 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? Kensho Hara Hirokatsu Kataoka Y. Satoh 3DPC 126 1,936 0 27 Nov 2017
Temporal Relational Reasoning in Videos Bolei Zhou A. Andonian Aude Oliva Antonio Torralba NAI 98 1,040 0 22 Nov 2017
Non-local Neural Networks Xinyu Wang Ross B. Girshick Abhinav Gupta Kaiming He OffRL 289 8,916 0 21 Nov 2017
Attentional Pooling for Action Recognition Rohit Girdhar Deva Ramanan 95 320 0 04 Nov 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices Xiangyu Zhang Xinyu Zhou Mengxiao Lin Jian Sun AI4TS 141 6,878 0 04 Jul 2017
Learnable pooling with Context Gating for video classification Antoine Miech Ivan Laptev Josef Sivic 74 327 0 21 Jun 2017
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 722 132,199 0 12 Jun 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset João Carreira Andrew Zisserman 235 8,037 0 22 May 2017
The Kinetics Human Action Video Dataset W. Kay João Carreira Karen Simonyan Brian Zhang Chloe Hillier ... Tim Green T. Back Apostol Natsev Mustafa Suleyman Andrew Zisserman 250 3,815 0 19 May 2017
Temporal Segment Networks for Action Recognition in Videos Limin Wang Yuanjun Xiong Zhe Wang Yu Qiao Dahua Lin Xiaoou Tang Luc Van Gool ViT 114 812 0 08 May 2017
Unified Embedding and Metric Learning for Zero-Exemplar Event Detection Noureldien Hussein E. Gavves A. Smeulders 52 15 0 05 May 2017
ActionVLAD: Learning spatio-temporal aggregation for action classification Rohit Girdhar Deva Ramanan Abhinav Gupta Josef Sivic Bryan C. Russell AI4TS 75 451 0 10 Apr 2017
Asynchronous Temporal Fields for Action Recognition Gunnar Sigurdsson S. Divvala Ali Farhadi Abhinav Gupta BDL 73 170 0 19 Dec 2016
Action Recognition with Dynamic Image Networks Hakan Bilen Basura Fernando Efstratios Gavves Andrea Vedaldi FAtt 65 224 0 02 Dec 2016
Aggregated Residual Transformations for Deep Neural Networks Saining Xie Ross B. Girshick Piotr Dollár Zhuowen Tu Kaiming He 522 10,345 0 16 Nov 2016
Xception: Deep Learning with Depthwise Separable Convolutions François Chollet MDE BDL PINN 1.4K 14,575 0 07 Oct 2016
WaveNet: A Generative Model for Raw Audio Aaron van den Oord Sander Dieleman Heiga Zen Karen Simonyan Oriol Vinyals Alex Graves Nal Kalchbrenner A. Senior Koray Kavukcuoglu DiffM 406 7,405 0 12 Sep 2016
Densely Connected Convolutional Networks Gao Huang Zhuang Liu Laurens van der Maaten Kilian Q. Weinberger PINN 3DV 775 36,861 0 25 Aug 2016
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition Limin Wang Yuanjun Xiong Zhe Wang Yu Qiao Dahua Lin Xiaoou Tang Luc Van Gool ViT 105 3,838 0 02 Aug 2016
Convolutional Two-Stream Network Fusion for Video Action Recognition Christoph Feichtenhofer A. Pinz Andrew Zisserman 163 2,612 0 22 Apr 2016
The THUMOS Challenge on Action Recognition for Videos "in the Wild" Haroon Idrees Amir Zamir Yu-Gang Jiang Alexander N. Gorban Ivan Laptev Rahul Sukthankar M. Shah 97 776 0 21 Apr 2016
Long-term Temporal Convolutions for Action Recognition Gül Varol Ivan Laptev Cordelia Schmid 80 912 0 15 Apr 2016
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding Gunnar Sigurdsson Gül Varol Xinyu Wang Ali Farhadi Ivan Laptev Abhinav Gupta VGen 106 1,245 0 06 Apr 2016
Deep Residual Learning for Image Recognition Kaiming He Xinming Zhang Shaoqing Ren Jian Sun MedIm 2.2K 194,322 0 10 Dec 2015
Rank Pooling for Action Recognition Basura Fernando E. Gavves José Oramas Amir Ghodrati Tinne Tuytelaars 60 300 0 06 Dec 2015
Rethinking the Inception Architecture for Computer Vision Christian Szegedy Vincent Vanhoucke Sergey Ioffe Jonathon Shlens Z. Wojna 3DV BDL 883 27,412 0 02 Dec 2015
Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos Serena Yeung Olga Russakovsky Ning Jin Mykhaylo Andriluka Greg Mori Li Fei-Fei VLM 97 439 0 21 Jul 2015
Towards Good Practices for Very Deep Two-Stream ConvNets Limin Wang Yuanjun Xiong Zhe Wang Yu Qiao 95 445 0 08 Jul 2015
EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video Guangnan Ye Yitong Li Hongliang Xu Dong Liu Shih-Fu Chang 43 118 0 08 Jun 2015
Long-term Recurrent Convolutional Networks for Visual Recognition and Description Jeff Donahue Lisa Anne Hendricks Marcus Rohrbach Subhashini Venugopalan S. Guadarrama Kate Saenko Trevor Darrell VLM 165 6,056 0 17 Nov 2014
Going Deeper with Convolutions Christian Szegedy Wei Liu Yangqing Jia P. Sermanet Scott E. Reed Dragomir Anguelov D. Erhan Vincent Vanhoucke Andrew Rabinovich 480 43,685 0 17 Sep 2014
Two-Stream Convolutional Networks for Action Recognition in Videos Karen Simonyan Andrew Zisserman 247 7,541 0 09 Jun 2014
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild K. Soomro Amir Zamir M. Shah CLIP VGen 157 6,162 0 03 Dec 2012