ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.10271
  4. Cited By
End-to-end Temporal Action Detection with Transformer

End-to-end Temporal Action Detection with Transformer

18 June 2021
Xiaolong Liu
Qimeng Wang
Yao Hu
Xu Tang
Shiwei Zhang
S. Bai
X. Bai
    ViT
ArXivPDFHTML

Papers citing "End-to-end Temporal Action Detection with Transformer"

39 / 39 papers shown
Title
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Ho-Joong Kim
Y. E. Lee
Jung-Ho Hong
Seong-Whan Lee
40
0
0
09 May 2025
WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network
WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network
Zhendong Liu
Le Zhang
Bing Li
Yingjie Zhou
Z. Chen
Ce Zhu
69
1
0
28 Jan 2025
Training Strategies for Isolated Sign Language Recognition
Training Strategies for Isolated Sign Language Recognition
Karina Kvanchiani
Roman Kraynov
Elizaveta Petrova
Petr Surovcev
Aleksandr Nagaev
A. Kapitanov
76
1
0
16 Dec 2024
Uncertainty-Guided Appearance-Motion Association Network for Out-of-Distribution Action Detection
Uncertainty-Guided Appearance-Motion Association Network for Out-of-Distribution Action Detection
Xiang Fang
Arvind Easwaran
B. Genest
36
4
0
16 Sep 2024
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
45
1
0
12 Sep 2024
Introducing Gating and Context into Temporal Action Detection
Introducing Gating and Context into Temporal Action Detection
Aglind Reka
Diana Laura Borza
Dominick Reilly
Michal Balazia
Francois Bremond
20
0
0
06 Sep 2024
Towards Student Actions in Classroom Scenes: New Dataset and Baseline
Towards Student Actions in Classroom Scenes: New Dataset and Baseline
Zhuolin Tan
Chenqiang Gao
Anyong Qin
Ruixin Chen
Tiecheng Song
Feng Yang
Deyu Meng
29
0
0
02 Sep 2024
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in
  Streaming Videos
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Hyolim Kang
Jeongseok Hyun
Joungbin An
Youngjae Yu
Seon Joo Kim
38
0
0
17 Jul 2024
MMAD: Multi-label Micro-Action Detection in Videos
MMAD: Multi-label Micro-Action Detection in Videos
Kun Li
Pengyu Liu
Pengyu Liu
Guoliang Chen
Zhiliang Wu
Hehe Fan
Meng Wang
42
7
0
07 Jul 2024
MALT: Multi-scale Action Learning Transformer for Online Action
  Detection
MALT: Multi-scale Action Learning Transformer for Online Action Detection
Zhipeng Yang
Ruoyu Wang
Yang Tan
Liping Xie
OffRL
43
1
0
31 May 2024
Two in One Go: Single-stage Emotion Recognition with Decoupled
  Subject-context Transformer
Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer
Xinpeng Li
Teng Wang
Jian Zhao
Shuyi Mao
Jinbao Wang
Feng Zheng
Xiaojiang Peng
Xuelong Li
28
1
0
26 Apr 2024
Benchmarking the Robustness of Temporal Action Detection Models Against
  Temporal Corruptions
Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions
Runhao Zeng
Xiaoyong Chen
Jiaming Liang
Huisi Wu
Guangzhong Cao
Yong Guo
AAML
39
3
0
29 Mar 2024
Hyperspectral Image Denoising via Spatial-Spectral Recurrent Transformer
Hyperspectral Image Denoising via Spatial-Spectral Recurrent Transformer
Guanyiman Fu
Fengchao Xiong
Jianfeng Lu
Jun Zhou
Jiantao Zhou
Yuntao Qian
ViT
19
11
0
31 Dec 2023
SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization
SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization
David Pujol-Perich
Albert Clapés
Sergio Escalera
31
0
0
20 Dec 2023
Enhancing Single-Frame Supervision for Better Temporal Action
  Localization
Enhancing Single-Frame Supervision for Better Temporal Action Localization
Changjian Chen
Jiashu Chen
Weikai Yang
Haoze Wang
Johannes Knittel
Xibin Zhao
Steffen Koch
Thomas Ertl
Shixia Liu
26
3
0
08 Dec 2023
Low-power, Continuous Remote Behavioral Localization with Event Cameras
Low-power, Continuous Remote Behavioral Localization with Event Cameras
Friedhelm Hamann
Suman Ghosh
Ignacio Juarez Martinez
Tom Hart
Alex Kacelnik
Guillermo Gallego
24
7
0
06 Dec 2023
End-to-End Temporal Action Detection with 1B Parameters Across 1000
  Frames
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
Shuming Liu
Chen-Da Liu-Zhang
Chen Zhao
Bernard Ghanem
33
25
0
28 Nov 2023
Boundary Discretization and Reliable Classification Network for Temporal
  Action Detection
Boundary Discretization and Reliable Classification Network for Temporal Action Detection
Zhenying Fang
Jun Yu
Richang Hong
26
0
0
10 Oct 2023
UnLoc: A Unified Framework for Video Localization Tasks
UnLoc: A Unified Framework for Video Localization Tasks
Shengjia Yan
Xuehan Xiong
Arsha Nagrani
Anurag Arnab
Zhonghao Wang
Weina Ge
David A. Ross
Cordelia Schmid
31
53
0
21 Aug 2023
Video-Specific Query-Key Attention Modeling for Weakly-Supervised
  Temporal Action Localization
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
Xijun Wang
Aggelos K. Katsaggelos
34
0
0
07 May 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
End-to-End Spatio-Temporal Action Localisation with Video Transformers
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
32
13
0
24 Apr 2023
Decomposed Cross-modal Distillation for RGB-based Temporal Action
  Detection
Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection
Pilhyeon Lee
Taeoh Kim
Minho Shim
Dongyoon Wee
H. Byun
30
11
0
30 Mar 2023
TemporalMaxer: Maximize Temporal Context with only Max Pooling for
  Temporal Action Localization
TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
Tuan N. Tang
Kwonyoung Kim
K. Sohn
18
29
0
16 Mar 2023
Co-Occurrence Matters: Learning Action Relation for Temporal Action
  Localization
Co-Occurrence Matters: Learning Action Relation for Temporal Action Localization
Congqi Cao
Yizhe Wang
Yuelie Lu
X. Zhang
Yanning Zhang
30
4
0
15 Mar 2023
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense
  Video Captioning
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Antoine Yang
Arsha Nagrani
Paul Hongsuck Seo
Antoine Miech
Jordi Pont-Tuset
Ivan Laptev
Josef Sivic
Cordelia Schmid
AI4TS
VLM
36
220
0
27 Feb 2023
MINOTAUR: Multi-task Video Grounding From Multimodal Queries
MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Raghav Goyal
E. Mavroudi
Xitong Yang
Sainbayar Sukhbaatar
Leonid Sigal
Matt Feiszli
Lorenzo Torresani
Du Tran
23
7
0
16 Feb 2023
Vision Transformers for Action Recognition: A Survey
Vision Transformers for Action Recognition: A Survey
Anwaar Ulhaq
Naveed Akhtar
Ganna Pogrebna
Ajmal Saeed Mian
ViT
19
44
0
13 Sep 2022
ReAct: Temporal Action Detection with Relational Queries
ReAct: Temporal Action Detection with Relational Queries
Ding Shi
Yujie Zhong
Qiong Cao
Jing Zhang
Lin Ma
Jia Li
Dacheng Tao
ViT
28
68
0
14 Jul 2022
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
Mingdong Yang
Guo Chen
Yin-Dong Zheng
Tong Lu
Limin Wang
36
45
0
05 May 2022
An Empirical Study of End-to-End Temporal Action Detection
An Empirical Study of End-to-End Temporal Action Detection
Xiaolong Liu
S. Bai
Xiang Bai
19
58
0
06 Apr 2022
Colar: Effective and Efficient Online Action Detection by Consulting
  Exemplars
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars
Le Yang
Junwei Han
Dingwen Zhang
21
35
0
02 Mar 2022
ActionFormer: Localizing Moments of Actions with Transformers
ActionFormer: Localizing Moments of Actions with Transformers
Chen-Da Liu-Zhang
Jianxin Wu
Yin Li
ViT
25
329
0
16 Feb 2022
SWAT: Spatial Structure Within and Among Tokens
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
25
6
0
26 Nov 2021
PhysFormer: Facial Video-based Physiological Measurement with Temporal
  Difference Transformer
PhysFormer: Facial Video-based Physiological Measurement with Temporal Difference Transformer
Zitong Yu
Yuming Shen
Jingang Shi
Hengshuang Zhao
Philip H. S. Torr
Guoying Zhao
ViT
MedIm
137
167
0
23 Nov 2021
Deep Learning-based Action Detection in Untrimmed Videos: A Survey
Deep Learning-based Action Detection in Untrimmed Videos: A Survey
Elahe Vahdani
Yingli Tian
46
60
0
30 Sep 2021
Is Space-Time Attention All You Need for Video Understanding?
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,982
0
09 Feb 2021
Relaxed Transformer Decoders for Direct Action Proposal Generation
Relaxed Transformer Decoders for Direct Action Proposal Generation
Jing Tan
Jiaqi Tang
Limin Wang
Gangshan Wu
ViT
75
178
0
03 Feb 2021
Gaussian Temporal Awareness Networks for Action Localization
Gaussian Temporal Awareness Networks for Action Localization
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
143
319
0
09 Sep 2019
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation
Tianwei Lin
Xu Zhao
Haisheng Su
Chongjing Wang
Ming Yang
139
700
0
08 Jun 2018
1