ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.04608
  4. Cited By
Actions as Moving Points

Actions as Moving Points

14 January 2020
Yixuan Li
Zixu Wang
Limin Wang
Gangshan Wu
ArXivPDFHTML

Papers citing "Actions as Moving Points"

36 / 36 papers shown
Title
Action tube generation by person query matching for spatio-temporal action detection
Action tube generation by person query matching for spatio-temporal action detection
Kazuki Omi
Jion Oshima
Toru Tamaki
107
0
0
17 Mar 2025
TACNet: Transition-Aware Context Network for Spatio-Temporal Action
  Detection
TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection
Lin Song
Shiwei Zhang
Gang Yu
Hongbin Sun
126
82
0
31 May 2019
STEP: Spatio-Temporal Progressive Learning for Video Action Detection
STEP: Spatio-Temporal Progressive Learning for Video Action Detection
Xitong Yang
Xiaodong Yang
Ming-Yuan Liu
Fanyi Xiao
L. Davis
Jan Kautz
56
138
0
19 Apr 2019
CenterNet: Keypoint Triplets for Object Detection
CenterNet: Keypoint Triplets for Object Detection
Kaiwen Duan
S. Bai
Lingxi Xie
H. Qi
Qingming Huang
Q. Tian
NoLa
109
2,684
0
17 Apr 2019
Objects as Points
Objects as Points
Xingyi Zhou
Dequan Wang
Philipp Krahenbuhl
3DPC
101
3,249
0
16 Apr 2019
FCOS: Fully Convolutional One-Stage Object Detection
FCOS: Fully Convolutional One-Stage Object Detection
Zhi Tian
Chunhua Shen
Hao Chen
Tong He
ObjD
114
4,997
0
02 Apr 2019
Dance with Flow: Two-in-One Stream Action Detection
Dance with Flow: Two-in-One Stream Action Detection
Jiaojiao Zhao
Cees G. M. Snoek
62
83
0
01 Apr 2019
Bottom-up Object Detection by Grouping Extreme and Center Points
Bottom-up Object Detection by Grouping Extreme and Center Points
Xingyi Zhou
Jiacheng Zhuo
Philipp Krahenbuhl
ObjD
107
865
0
23 Jan 2019
CornerNet: Detecting Objects as Paired Keypoints
CornerNet: Detecting Objects as Paired Keypoints
Hei Law
Jia Deng
ObjD
65
3,613
0
03 Aug 2018
Actor-Centric Relation Network
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
83
220
0
28 Jul 2018
Simple Baselines for Human Pose Estimation and Tracking
Simple Baselines for Human Pose Estimation and Tracking
Bin Xiao
Haiping Wu
Yichen Wei
3DH
VOT
113
1,784
0
17 Apr 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in
  Video Classification
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
137
1,325
0
13 Dec 2017
Cascade R-CNN: Delving into High Quality Object Detection
Cascade R-CNN: Delving into High Quality Object Detection
Zhaowei Cai
Nuno Vasconcelos
ObjD
128
4,912
0
03 Dec 2017
Focal Loss for Dense Object Detection
Focal Loss for Dense Object Detection
Nayeon Lee
Priya Goyal
Ross B. Girshick
Kaiming He
Piotr Dollár
ObjD
112
2,997
0
07 Aug 2017
Deep Layer Aggregation
Deep Layer Aggregation
Feng Yu
Dequan Wang
Evan Shelhamer
Trevor Darrell
AI4CE
FAtt
114
1,325
0
20 Jul 2017
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual
  Actions
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Chunhui Gu
Chen Sun
David A. Ross
Carl Vondrick
C. Pantofaru
...
G. Toderici
Susanna Ricco
Rahul Sukthankar
Cordelia Schmid
Jitendra Malik
VGen
94
1,028
0
23 May 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
214
7,986
0
22 May 2017
Action Tubelet Detector for Spatio-Temporal Action Localization
Action Tubelet Detector for Spatio-Temporal Action Localization
Vicky Kalogeiton
Philippe Weinzaepfel
V. Ferrari
Cordelia Schmid
66
324
0
04 May 2017
AMTnet: Action-Micro-Tube Regression by End-to-end Trainable Deep
  Architecture
AMTnet: Action-Micro-Tube Regression by End-to-end Trainable Deep Architecture
Suman Saha
Gurkirt Singh
Fabio Cuzzolin
56
70
0
17 Apr 2017
Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos
Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos
Rui Hou
Chong Chen
M. Shah
MedIm
65
333
0
30 Mar 2017
Online Real-time Multiple Spatiotemporal Action Localisation and
  Prediction
Online Real-time Multiple Spatiotemporal Action Localisation and Prediction
Gurkirt Singh
Suman Saha
Michael Sapienza
Philip Torr
Fabio Cuzzolin
61
286
0
25 Nov 2016
Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos
Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos
Suman Saha
Gurkirt Singh
Michael Sapienza
Philip Torr
Fabio Cuzzolin
ViT
62
208
0
04 Aug 2016
Actionness Estimation Using Hybrid Fully Convolutional Networks
Actionness Estimation Using Hybrid Fully Convolutional Networks
Limin Wang
Yu Qiao
Xiaoou Tang
Luc Van Gool
35
98
0
25 Apr 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.7K
193,390
0
10 Dec 2015
SSD: Single Shot MultiBox Detector
SSD: Single Shot MultiBox Detector
Wen Liu
Dragomir Anguelov
D. Erhan
Christian Szegedy
Scott E. Reed
Cheng-Yang Fu
Alexander C. Berg
ObjD
BDL
186
29,740
0
08 Dec 2015
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon
S. Divvala
Ross B. Girshick
Ali Farhadi
ObjD
624
36,797
0
08 Jun 2015
Learning to track for spatio-temporal action localization
Learning to track for spatio-temporal action localization
Philippe Weinzaepfel
Zaïd Harchaoui
Cordelia Schmid
96
339
0
05 Jun 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal
  Networks
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
443
62,115
0
04 Jun 2015
Sequence to Sequence -- Video to Text
Sequence to Sequence -- Video to Text
Subhashini Venugopalan
Marcus Rohrbach
Jeff Donahue
Raymond J. Mooney
Trevor Darrell
Kate Saenko
105
1,417
0
03 May 2015
Fast R-CNN
Fast R-CNN
Ross B. Girshick
ObjD
284
25,008
0
30 Apr 2015
Describing Videos by Exploiting Temporal Structure
Describing Videos by Exploiting Temporal Structure
L. Yao
Atousa Torabi
Kyunghyun Cho
Nicolas Ballas
C. Pal
Hugo Larochelle
Aaron Courville
139
1,063
0
27 Feb 2015
Finding Action Tubes
Finding Action Tubes
Georgia Gkioxari
Jitendra Malik
52
598
0
21 Nov 2014
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual
  Recognition
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
ObjD
323
11,199
0
18 Jun 2014
Microsoft COCO: Common Objects in Context
Microsoft COCO: Common Objects in Context
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
351
43,506
0
01 May 2014
Rich feature hierarchies for accurate object detection and semantic
  segmentation
Rich feature hierarchies for accurate object detection and semantic segmentation
Ross B. Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
ObjD
250
26,147
0
11 Nov 2013
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
K. Soomro
Amir Zamir
M. Shah
CLIP
VGen
108
6,134
0
03 Dec 2012
1