Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1411.6031
Cited By
Finding Action Tubes
21 November 2014
Georgia Gkioxari
Jitendra Malik
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Finding Action Tubes"
14 / 14 papers shown
Title
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts
Taein Son
Soo Won Seo
Jisong Kim
S. Lee
Jun Won Choi
VGen
111
0
0
18 Dec 2024
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Andong Deng
Tongjia Chen
Shoubin Yu
Taojiannan Yang
Lincoln Spencer
Yapeng Tian
Ajmal Mian
Joey Tianyi Zhou
Chen Chen
LRM
89
2
0
15 Nov 2024
Discovering Spatio-Temporal Action Tubes
Yuancheng Ye
Xiaodong Yang
Yingli Tian
61
14
0
29 Nov 2018
Action Tubelet Detector for Spatio-Temporal Action Localization
Vicky Kalogeiton
Philippe Weinzaepfel
V. Ferrari
Cordelia Schmid
66
325
0
04 May 2017
Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Video
Hou-Ning Hu
Yen-Chen Lin
Ming-Yuan Liu
Hsien-Tzu Cheng
Yung-Ju Chang
Min Sun
70
177
0
04 May 2017
Deep Motion Features for Visual Tracking
Susanna Gladh
Martin Danelljan
Fahad Shahbaz Khan
Michael Felsberg
86
89
0
20 Dec 2016
Asynchronous Temporal Fields for Action Recognition
Gunnar Sigurdsson
S. Divvala
Ali Farhadi
Abhinav Gupta
BDL
73
170
0
19 Dec 2016
Spatio-Temporal Attention Models for Grounded Video Captioning
M. Zanfir
Elisabeta Marinoiu
C. Sminchisescu
89
50
0
17 Oct 2016
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
1.7K
39,547
0
01 Sep 2014
Caffe: Convolutional Architecture for Fast Feature Embedding
Yangqing Jia
Evan Shelhamer
Jeff Donahue
Sergey Karayev
Jonathan Long
Ross B. Girshick
S. Guadarrama
Trevor Darrell
VLM
BDL
3DV
274
14,711
0
20 Jun 2014
Two-Stream Convolutional Networks for Action Recognition in Videos
Karen Simonyan
Andrew Zisserman
244
7,535
0
09 Jun 2014
Visualizing and Understanding Convolutional Networks
Matthew D. Zeiler
Rob Fergus
FAtt
SSL
595
15,882
0
12 Nov 2013
Rich feature hierarchies for accurate object detection and semantic segmentation
Ross B. Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
ObjD
289
26,193
0
11 Nov 2013
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
K. Soomro
Amir Zamir
M. Shah
CLIP
VGen
152
6,148
0
03 Dec 2012
1