ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.03377
  4. Cited By
Long Short-Term Transformer for Online Action Detection

Long Short-Term Transformer for Online Action Detection

7 July 2021
Mingze Xu
Yuanjun Xiong
Hao Chen
Xinyu Li
Wei Xia
Z. Tu
Stefano Soatto
    ViT
ArXivPDFHTML

Papers citing "Long Short-Term Transformer for Online Action Detection"

50 / 72 papers shown
Title
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant
Haibo Wang
Bo Feng
Zhengfeng Lai
Mingze Xu
Shiyu Li
Weifeng Ge
Afshin Dehghan
Meng Cao
Ping-Chia Huang
OffRL
51
0
0
08 May 2025
Learning Streaming Video Representation via Multitask Training
Learning Streaming Video Representation via Multitask Training
Yibin Yan
Jilan Xu
Shangzhe Di
Yikun Liu
Yudi Shi
Qirui Chen
Zeqian Li
Yifei Huang
Weidi Xie
CLL
84
0
0
28 Apr 2025
Action Anticipation from SoccerNet Football Video Broadcasts
Action Anticipation from SoccerNet Football Video Broadcasts
Mohamad Dalal
Artur Xarles
A. Cioppa
Silvio Giancola
Marc Van Droogenbroeck
Bernard Ghanem
Albert Clapés
Sergio Escalera
T. Moeslund
AI4TS
36
0
0
16 Apr 2025
Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
Dibyadip Chatterjee
Edoardo Remelli
Yale Song
Bugra Tekin
Abhay Mittal
...
Shreyas Hampali
Eric Sauser
Shugao Ma
Angela Yao
Fadime Sener
VLM
46
0
0
10 Apr 2025
REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding
REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding
Sakib Reza
Xiyun Song
Heather Yu
Zongfang Lin
Mohsen Moghaddam
Mario Sznaier
29
0
0
07 Apr 2025
SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding
SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding
Mingze Xu
Mingfei Gao
Shiyu Li
Jiasen Lu
Zhe Gan
Zhengfeng Lai
Meng Cao
Kai Kang
Yuqing Yang
Afshin Dehghan
59
2
0
24 Mar 2025
Context-Enhanced Memory-Refined Transformer for Online Action Detection
Context-Enhanced Memory-Refined Transformer for Online Action Detection
Zhanzhong Pang
Fadime Sener
Angela Yao
OffRL
56
1
0
24 Mar 2025
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
Enes Erdogan
E. Aksoy
Sanem Sariel
44
0
0
15 Mar 2025
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
Junhyeok Kim
Min Soo Kim
Jiwan Chung
Jungbin Cho
Jisoo Kim
Sungwoong Kim
Gyeongbo Sim
Youngjae Yu
EgoV
60
0
0
17 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
45
0
0
11 Feb 2025
HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent
  Action Anticipation
HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Zirui Wang
Xinran Zhao
Simon Stepputtis
Woojun Kim
Tongshuang Wu
Katia P. Sycara
Yaqi Xie
OffRL
49
0
0
03 Nov 2024
OnlineTAS: An Online Baseline for Temporal Action Segmentation
OnlineTAS: An Online Baseline for Temporal Action Segmentation
Qing Zhong
Guodong Ding
Angela Yao
40
2
0
02 Nov 2024
Human Action Anticipation: A Survey
Human Action Anticipation: A Survey
Bolin Lai
Sam Toyer
Tushar Nagarajan
Rohit Girdhar
S. Zha
James M. Rehg
Kris M. Kitani
Kristen Grauman
Ruta Desai
Miao Liu
AI4TS
41
1
0
17 Oct 2024
AI Foundation Model for Heliophysics: Applications, Design, and
  Implementation
AI Foundation Model for Heliophysics: Applications, Design, and Implementation
Sujit Roy
Talwinder Singh
Marcus Freitag
J. Schmude
Rohit Lal
...
Berkay Aydin
Nikolai Pogorelov
Juan Bernabé-Moreno
M. Maskey
Rahul Ramachandran
MedIm
AI4CE
26
0
0
30 Sep 2024
Real-Time Human Action Recognition on Embedded Platforms
Real-Time Human Action Recognition on Embedded Platforms
Ruiqi Wang
Zichen Wang
Peiqi Gao
Mingzhen Li
Jaehwan Jeong
Yihang Xu
Yejin Lee
Carolyn M. Baum
Lisa Connor
Chenyang Lu
41
2
0
09 Sep 2024
HERMES: temporal-coHERent long-forM understanding with Episodes and
  Semantics
HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics
Gueter Josmy Faure
Jia-Fong Yeh
Min-Hung Chen
Hung-Ting Su
Winston H. Hsu
Shang-Hong Lai
26
3
0
30 Aug 2024
HAT: History-Augmented Anchor Transformer for Online Temporal Action
  Localization
HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization
Sakib Reza
Yuexi Zhang
Mohsen Moghaddam
Mario Sznaier
38
1
0
12 Aug 2024
Online Temporal Action Localization with Memory-Augmented Transformer
Online Temporal Action Localization with Memory-Augmented Transformer
Youngkil Song
Dongkeun Kim
Minsu Cho
Suha Kwak
31
0
0
06 Aug 2024
Classification Matters: Improving Video Action Detection with
  Class-Specific Attention
Classification Matters: Improving Video Action Detection with Class-Specific Attention
Jinsung Lee
Taeoh Kim
Inwoong Lee
Minho Shim
Dongyoon Wee
Minsu Cho
Suha Kwak
54
0
0
29 Jul 2024
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in
  Streaming Videos
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Hyolim Kang
Jeongseok Hyun
Joungbin An
Youngjae Yu
Seon Joo Kim
38
0
0
17 Jul 2024
Object Aware Egocentric Online Action Detection
Object Aware Egocentric Online Action Detection
Joungbin An
Yunsu Park
Hyolim Kang
Seon Joo Kim
EgoV
31
0
0
03 Jun 2024
MALT: Multi-scale Action Learning Transformer for Online Action
  Detection
MALT: Multi-scale Action Learning Transformer for Online Action Detection
Zhipeng Yang
Ruoyu Wang
Yang Tan
Liping Xie
OffRL
43
1
0
31 May 2024
A Semantic and Motion-Aware Spatiotemporal Transformer Network for
  Action Detection
A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection
Matthew Korban
Peter Youngs
Scott T. Acton
ViT
29
6
0
13 May 2024
O-TALC: Steps Towards Combating Oversegmentation within Online Action
  Segmentation
O-TALC: Steps Towards Combating Oversegmentation within Online Action Segmentation
Matthew Kent Myers
Nick Wright
A. Mcgough
Nicholas Martin
31
1
0
10 Apr 2024
Spatial-Temporal Multi-level Association for Video Object Segmentation
Spatial-Temporal Multi-level Association for Video Object Segmentation
Deshui Miao
Xin Li
Zhenyu He
Huchuan Lu
Ming-Hsuan Yang
VOS
31
0
0
09 Apr 2024
Memory Consolidation Enables Long-Context Video Understanding
Memory Consolidation Enables Long-Context Video Understanding
Ivana Balavzević
Yuge Shi
Pinelopi Papalampidi
Rahma Chaabouni
Skanda Koppula
Olivier J. Hénaff
105
24
0
08 Feb 2024
RotaTR: Detection Transformer for Dense and Rotated Object
RotaTR: Detection Transformer for Dense and Rotated Object
Yuke Zhu
Yumeng Ruan
Lei Yang
Sheng Guo
29
0
0
05 Dec 2023
A Survey on Deep Learning Techniques for Action Anticipation
A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
24
7
0
29 Sep 2023
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
Haodong Duan
Mingze Xu
Bing Shuai
Davide Modolo
Zhuowen Tu
Joseph Tighe
Alessandro Bergamo
ViT
35
1
0
20 Sep 2023
JOADAA: joint online action detection and action anticipation
JOADAA: joint online action detection and action anticipation
Mohammed Guermal
François Brémond
Rui Dai
Abid Ali
34
6
0
12 Sep 2023
Phase-Specific Augmented Reality Guidance for Microscopic Cataract
  Surgery Using Long-Short Spatiotemporal Aggregation Transformer
Phase-Specific Augmented Reality Guidance for Microscopic Cataract Surgery Using Long-Short Spatiotemporal Aggregation Transformer
Puxun Tu
Hongfei Ye
Haochen Shi
Jeff Young
Meng Xie
Peiquan Zhao
Ce Zheng
Xiaoyi Jiang
Xiaojun Chen
12
1
0
11 Sep 2023
COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action
  Spotting using Transformers
COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting using Transformers
J. Denize
Mykola Liashuha
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
ViT
25
13
0
03 Sep 2023
PFL-LSTR: A privacy-preserving framework for driver intention inference
  based on in-vehicle and out-vehicle information
PFL-LSTR: A privacy-preserving framework for driver intention inference based on in-vehicle and out-vehicle information
Runjia Du
Pei Li
Sikai Chen
S. Labi
6
0
0
02 Sep 2023
Memory-and-Anticipation Transformer for Online Action Understanding
Memory-and-Anticipation Transformer for Online Action Understanding
Jiahao Wang
Guo Chen
Yifei Huang
Liming Wang
Tong Lu
OffRL
62
37
0
15 Aug 2023
Temporal Sentence Grounding in Streaming Videos
Temporal Sentence Grounding in Streaming Videos
Tian Gan
Xiao Wang
Yan Sun
Jianlong Wu
Qingpei Guo
Liqiang Nie
46
2
0
14 Aug 2023
Efficient Online Processing with Deep Neural Networks
Efficient Online Processing with Deep Neural Networks
Lukas Hedegaard
26
0
0
23 Jun 2023
E2E-LOAD: End-to-End Long-form Online Action Detection
E2E-LOAD: End-to-End Long-form Online Action Detection
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
33
5
0
13 Jun 2023
A Multi-Modal Transformer Network for Action Detection
A Multi-Modal Transformer Network for Action Detection
Matthew Korban
Scott T. Acton
Peter Youngs
ViT
43
15
0
31 May 2023
VideoLLM: Modeling Video Sequence with Large Language Models
VideoLLM: Modeling Video Sequence with Large Language Models
Guo Chen
Yin-Dong Zheng
Jiahao Wang
Jilan Xu
Yifei Huang
...
Yi Wang
Yali Wang
Yu Qiao
Tong Lu
Limin Wang
MLLM
103
77
0
22 May 2023
CEMFormer: Learning to Predict Driver Intentions from In-Cabin and
  External Cameras via Spatial-Temporal Transformers
CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers
Yunsheng Ma
Wenqian Ye
Xu Cao
Amr Abdelraouf
Kyungtae Han
Rohit Gupta
Ziran Wang
43
11
0
13 May 2023
Memory-augmented Online Video Anomaly Detection
Memory-augmented Online Video Anomaly Detection
L. Rossi
Vittorio Bernuzzi
Tomaso Fontanini
Massimo Bertozzi
Andrea Prati
24
3
0
21 Feb 2023
PhysFormer++: Facial Video-based Physiological Measurement with SlowFast
  Temporal Difference Transformer
PhysFormer++: Facial Video-based Physiological Measurement with SlowFast Temporal Difference Transformer
Zitong Yu
Yuming Shen
Jingang Shi
Hengshuang Zhao
Yawen Cui
Jiehua Zhang
Philip Torr
Guoying Zhao
ViT
MedIm
29
80
0
07 Feb 2023
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for
  Human Action Recognition
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition
Dasom Ahn
Sangwon Kim
H. Hong
ByoungChul Ko
ViT
28
97
0
14 Oct 2022
Sparse in Space and Time: Audio-visual Synchronisation with Trainable
  Selectors
Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors
Vladimir E. Iashin
Weidi Xie
Esa Rahtu
Andrew Zisserman
36
20
0
13 Oct 2022
An In-depth Study of Stochastic Backpropagation
An In-depth Study of Stochastic Backpropagation
J. Fang
Ming Xu
Hao Chen
Bing Shuai
Z. Tu
Joseph Tighe
BDL
32
1
0
30 Sep 2022
Real-time Online Video Detection with Temporal Smoothing Transformers
Real-time Online Video Detection with Temporal Smoothing Transformers
Yue Zhao
Philipp Krahenbuhl
ViT
69
57
0
19 Sep 2022
A Circular Window-based Cascade Transformer for Online Action Detection
A Circular Window-based Cascade Transformer for Online Action Detection
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
42
6
0
30 Aug 2022
Weakly Supervised Online Action Detection for Infant General Movements
Weakly Supervised Online Action Detection for Infant General Movements
Tong Luo
Jia Xiao
Chuncao Zhang
Siheng Chen
Yuan Tian
Guangjun Yu
K. Dang
Xiaowei Ding
24
2
0
07 Aug 2022
An Efficient Framework for Few-shot Skeleton-based Temporal Action
  Segmentation
An Efficient Framework for Few-shot Skeleton-based Temporal Action Segmentation
Leiyang Xu
Qianqian Wang
Xiaotian Lin
Lin Yuan
MedIm
32
6
0
20 Jul 2022
One-stage Action Detection Transformer
One-stage Action Detection Transformer
Lijun Li
Lian Zhuo
Bangyin Zhang
ViT
32
0
0
21 Jun 2022
12
Next