ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.01861
  4. Cited By
Action Tubelet Detector for Spatio-Temporal Action Localization

Action Tubelet Detector for Spatio-Temporal Action Localization

4 May 2017
Vicky Kalogeiton
Philippe Weinzaepfel
V. Ferrari
Cordelia Schmid
ArXivPDFHTML

Papers citing "Action Tubelet Detector for Spatio-Temporal Action Localization"

50 / 62 papers shown
Title
Beyond Pixels: Leveraging the Language of Soccer to Improve Spatio-Temporal Action Detection in Broadcast Videos
Beyond Pixels: Leveraging the Language of Soccer to Improve Spatio-Temporal Action Detection in Broadcast Videos
Jeremie Ochin
Raphael Chekroun
Bogdan Stanciulescu
Sotiris Manitsaris
14
0
0
14 May 2025
Action tube generation by person query matching for spatio-temporal action detection
Action tube generation by person query matching for spatio-temporal action detection
Kazuki Omi
Jion Oshima
Toru Tamaki
65
0
0
17 Mar 2025
Query matching for spatio-temporal action detection with query-based
  object detector
Query matching for spatio-temporal action detection with query-based object detector
Shimon Hori
Kazuki Omi
Toru Tamaki
36
0
0
27 Sep 2024
Weakly Supervised Video Anomaly Detection and Localization with
  Spatio-Temporal Prompts
Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts
Peng Wu
Xuerong Zhou
Guansong Pang
Zhiwei Yang
Qingsen Yan
Peng Wang
Yanning Zhang
33
9
0
12 Aug 2024
Learning to Visually Connect Actions and their Effects
Learning to Visually Connect Actions and their Effects
Eric Peh
Paritosh Parmar
Basura Fernando
24
2
0
19 Jan 2024
Semi-supervised Active Learning for Video Action Detection
Semi-supervised Active Learning for Video Action Detection
Aayush Singh
A. J. Rana
Akash Kumar
Shruti Vyas
Yogesh S Rawat
36
7
0
12 Dec 2023
A Survey on Deep Learning-based Spatio-temporal Action Detection
A Survey on Deep Learning-based Spatio-temporal Action Detection
Peng Wang
Fanwei Zeng
Yu Qian
34
5
0
03 Aug 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
End-to-End Spatio-Temporal Action Localisation with Video Transformers
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
40
13
0
24 Apr 2023
Efficient Video Action Detection with Token Dropout and Context
  Refinement
Efficient Video Action Detection with Token Dropout and Context Refinement
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
36
14
0
17 Apr 2023
DOAD: Decoupled One Stage Action Detection Network
DOAD: Decoupled One Stage Action Detection Network
Shuning Chang
Pichao Wang
Fan Wang
Jiashi Feng
Mike Zheng Show
26
4
0
01 Apr 2023
What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in
  Untrimmed Multi-Action Videos from Narrated Instructions
What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
Brian Chen
Nina Shvetsova
Andrew Rouditchenko
D. Kondermann
Samuel Thomas
Shih-Fu Chang
Rogerio Feris
James R. Glass
Hilde Kuehne
40
7
0
29 Mar 2023
YOWOv2: A Stronger yet Efficient Multi-level Detection Framework for
  Real-time Spatio-temporal Action Detection
YOWOv2: A Stronger yet Efficient Multi-level Detection Framework for Real-time Spatio-temporal Action Detection
Jianhua Yang
Kun Dai
ObjD
27
17
0
14 Feb 2023
PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers
  using Synthetic Scene Data
PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data
Roei Herzig
Ofir Abramovich
Elad Ben-Avraham
Assaf Arbelle
Leonid Karlinsky
Ariel Shamir
Trevor Darrell
Amir Globerson
41
16
0
08 Dec 2022
Holistic Interaction Transformer Network for Action Detection
Holistic Interaction Transformer Network for Action Detection
Gueter Josmy Faure
Min-Hung Chen
S. Lai
33
37
0
23 Oct 2022
ROAD-R: The Autonomous Driving Dataset with Logical Requirements
ROAD-R: The Autonomous Driving Dataset with Logical Requirements
Eleonora Giunchiglia
Mihaela C. Stoian
Salman Khan
Fabio Cuzzolin
Thomas Lukasiewicz
AI4TS
47
31
0
04 Oct 2022
Spatio-Temporal Action Detection Under Large Motion
Spatio-Temporal Action Detection Under Large Motion
Gurkirt Singh
Vasileios Choutas
Suman Saha
Feng Yu
Luc Van Gool
20
12
0
06 Sep 2022
Actor-identified Spatiotemporal Action Detection -- Detecting Who Is
  Doing What in Videos
Actor-identified Spatiotemporal Action Detection -- Detecting Who Is Doing What in Videos
Fan Yang
Norimichi Ukita
S. Sakti
Satoshi Nakamura
19
0
0
27 Aug 2022
RADNet: A Deep Neural Network Model for Robust Perception in Moving
  Autonomous Systems
RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems
B. Mudassar
Sho Ko
Maojingjing Li
Priyabrata Saha
Saibal Mukhopadhyay
16
2
0
30 Apr 2022
Tragedy Plus Time: Capturing Unintended Human Activities from
  Weakly-labeled Videos
Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Arnav Chakravarthy
Zhiyuan Fang
Yezhou Yang
35
2
0
28 Apr 2022
Point3D: tracking actions as moving points with 3D CNNs
Point3D: tracking actions as moving points with 3D CNNs
Shentong Mo
Jingfei Xia
Xiaoqing Ellen Tan
Bhiksha Raj
3DPC
20
5
0
20 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary
  Detection
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Jing Tan
Yuhong Wang
Gangshan Wu
Limin Wang
43
14
0
01 Mar 2022
Multiple Style Transfer via Variational AutoEncoder
Multiple Style Transfer via Variational AutoEncoder
Zhi-Song Liu
Vicky Kalogeiton
Marie-Paule Cani
BDL
DRL
34
18
0
13 Oct 2021
Deep Learning-based Action Detection in Untrimmed Videos: A Survey
Deep Learning-based Action Detection in Untrimmed Videos: A Survey
Elahe Vahdani
Yingli Tian
52
60
0
30 Sep 2021
Negative Sample Matters: A Renaissance of Metric Learning for Temporal
  Grounding
Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding
Zhenzhi Wang
Limin Wang
Tao Wu
Tianhao Li
Gangshan Wu
AI4TS
28
116
0
10 Sep 2021
Hierarchical Object-oriented Spatio-Temporal Reasoning for Video
  Question Answering
Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering
Long Hoang Dang
T. Le
Vuong Le
T. Tran
30
60
0
25 Jun 2021
Continual 3D Convolutional Neural Networks for Real-time Processing of
  Videos
Continual 3D Convolutional Neural Networks for Real-time Processing of Videos
Lukas Hedegaard
Alexandros Iosifidis
3DPC
23
14
0
31 May 2021
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized
  Sports Actions
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions
Yixuan Li
Lei Chen
Runyu He
Zhenzhi Wang
Gangshan Wu
Limin Wang
27
97
0
16 May 2021
Object-Centric Representation Learning for Video Question Answering
Object-Centric Representation Learning for Video Question Answering
Long Hoang Dang
T. Le
Vuong Le
T. Tran
27
7
0
12 Apr 2021
Object Priors for Classifying and Localizing Unseen Actions
Object Priors for Classifying and Localizing Unseen Actions
Pascal Mettes
William Thong
Cees G. M. Snoek
27
20
0
10 Apr 2021
The SARAS Endoscopic Surgeon Action Detection (ESAD) dataset: Challenges
  and methods
The SARAS Endoscopic Surgeon Action Detection (ESAD) dataset: Challenges and methods
V. Bawa
Gurkirt Singh
Francis KapingA
I. Skarga-Bandurova
Elettra Oleari
...
Li Li
Armando Stabile
Francesco Setti
R. Muradore
Fabio Cuzzolin
25
36
0
07 Apr 2021
Local Metrics for Multi-Object Tracking
Local Metrics for Multi-Object Tracking
Jack Valmadre
Alex Bewley
Jonathan Huang
Chen Sun
C. Sminchisescu
Cordelia Schmid
VOT
50
13
0
06 Apr 2021
Time and Frequency Network for Human Action Detection in Videos
Time and Frequency Network for Human Action Detection in Videos
Changhai Li
Huawei Chen
Jingqing Lu
Yang Huang
Yingying Liu
3DH
AI4TS
13
2
0
08 Mar 2021
ROAD: The ROad event Awareness Dataset for Autonomous Driving
ROAD: The ROad event Awareness Dataset for Autonomous Driving
Gurkirt Singh
Stephen Akrigg
Manuele Di Maio
Valentina Fontana
Reza Javanmard Alitappeh
...
Salman Khan
S. Grazioso
Andrew Bradley
G. Gironimo
Fabio Cuzzolin
32
89
0
23 Feb 2021
Multi-shot Temporal Event Localization: a Benchmark
Multi-shot Temporal Event Localization: a Benchmark
Xiaolong Liu
Yao Hu
S. Bai
Fei Ding
X. Bai
Philip Torr
46
81
0
17 Dec 2020
Human-centric Spatio-Temporal Video Grounding With Visual Transformers
Human-centric Spatio-Temporal Video Grounding With Visual Transformers
Zongheng Tang
Yue Liao
Si Liu
Guanbin Li
Xiaojie Jin
Hongxu Jiang
Qian Yu
Dong Xu
21
94
0
10 Nov 2020
Unsupervised Domain Adaptation for Spatio-Temporal Action Localization
Unsupervised Domain Adaptation for Spatio-Temporal Action Localization
Nakul Agarwal
Yi-Ting Chen
Behzad Dariush
Ming-Hsuan Yang
27
8
0
19 Oct 2020
Context-Aware RCNN: A Baseline for Action Detection in Videos
Context-Aware RCNN: A Baseline for Action Detection in Videos
Jianchao Wu
Zhanghui Kuang
Limin Wang
Wayne Zhang
Gangshan Wu
30
79
0
20 Jul 2020
Actor-Context-Actor Relation Network for Spatio-Temporal Action
  Localization
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
Junting Pan
Siyu Chen
Zheng Shou
Yu Liu
Jing Shao
Hongsheng Li
3DPC
19
150
0
14 Jun 2020
STINet: Spatio-Temporal-Interactive Network for Pedestrian Detection and
  Trajectory Prediction
STINet: Spatio-Temporal-Interactive Network for Pedestrian Detection and Trajectory Prediction
Zhishuai Zhang
Jiyang Gao
Junhua Mao
Yukai Liu
Dragomir Anguelov
Congcong Li
3DPC
3DH
29
59
0
08 May 2020
Long Short-Term Relation Networks for Video Action Detection
Long Short-Term Relation Networks for Video Action Detection
Dong Li
Ting Yao
Zhaofan Qiu
Houqiang Li
Tao Mei
12
22
0
31 Mar 2020
Actions as Moving Points
Actions as Moving Points
Yixuan Li
Zixu Wang
Limin Wang
Gangshan Wu
22
104
0
14 Jan 2020
You Only Watch Once: A Unified CNN Architecture for Real-Time
  Spatiotemporal Action Localization
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization
Okan Kopuklu
Xiangyu Wei
Gerhard Rigoll
28
143
0
15 Nov 2019
An Efficient 3D CNN for Action/Object Segmentation in Video
An Efficient 3D CNN for Action/Object Segmentation in Video
Rui Hou
Chong Chen
Rahul Sukthankar
M. Shah
24
27
0
21 Jul 2019
Deformable Tube Network for Action Detection in Videos
Deformable Tube Network for Action Detection in Videos
Wei Li
Zehuan Yuan
Dashan Guo
Lei Huang
Xiangzhong Fang
Changhu Wang
ViT
MedIm
33
5
0
03 Jul 2019
Recurrent Convolutional Strategies for Face Manipulation Detection in
  Videos
Recurrent Convolutional Strategies for Face Manipulation Detection in Videos
Ekraam Sabir
Jiaxin Cheng
Ayush Jaiswal
Wael AbdAlmageed
I. Masi
Premkumar Natarajan
AAML
CVBM
28
451
0
02 May 2019
STEP: Spatio-Temporal Progressive Learning for Video Action Detection
STEP: Spatio-Temporal Progressive Learning for Video Action Detection
Xitong Yang
Xiaodong Yang
Ming Liu
Fanyi Xiao
L. Davis
Jan Kautz
30
138
0
19 Apr 2019
Dance with Flow: Two-in-One Stream Action Detection
Dance with Flow: Two-in-One Stream Action Detection
Jiaojiao Zhao
Cees G. M. Snoek
20
83
0
01 Apr 2019
Long-Term Feature Banks for Detailed Video Understanding
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
62
477
0
12 Dec 2018
A Structured Model For Action Detection
A Structured Model For Action Detection
Yubo Zhang
P. Tokmakov
M. Hebert
Cordelia Schmid
28
101
0
09 Dec 2018
Video Action Transformer Network
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
28
702
0
06 Dec 2018
12
Next