Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2107.03377
Cited By
Long Short-Term Transformer for Online Action Detection
7 July 2021
Mingze Xu
Yuanjun Xiong
Hao Chen
Xinyu Li
Wei Xia
Z. Tu
Stefano Soatto
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Long Short-Term Transformer for Online Action Detection"
50 / 72 papers shown
Title
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant
Haibo Wang
Bo Feng
Zhengfeng Lai
Mingze Xu
Shiyu Li
Weifeng Ge
Afshin Dehghan
Meng Cao
Ping-Chia Huang
OffRL
51
0
0
08 May 2025
Learning Streaming Video Representation via Multitask Training
Yibin Yan
Jilan Xu
Shangzhe Di
Yikun Liu
Yudi Shi
Qirui Chen
Zeqian Li
Yifei Huang
Weidi Xie
CLL
84
0
0
28 Apr 2025
Action Anticipation from SoccerNet Football Video Broadcasts
Mohamad Dalal
Artur Xarles
A. Cioppa
Silvio Giancola
Marc Van Droogenbroeck
Bernard Ghanem
Albert Clapés
Sergio Escalera
T. Moeslund
AI4TS
36
0
0
16 Apr 2025
Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
Dibyadip Chatterjee
Edoardo Remelli
Yale Song
Bugra Tekin
Abhay Mittal
...
Shreyas Hampali
Eric Sauser
Shugao Ma
Angela Yao
Fadime Sener
VLM
46
0
0
10 Apr 2025
REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding
Sakib Reza
Xiyun Song
Heather Yu
Zongfang Lin
Mohsen Moghaddam
Mario Sznaier
29
0
0
07 Apr 2025
SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding
Mingze Xu
Mingfei Gao
Shiyu Li
Jiasen Lu
Zhe Gan
Zhengfeng Lai
Meng Cao
Kai Kang
Yi Yang
Afshin Dehghan
59
2
0
24 Mar 2025
Context-Enhanced Memory-Refined Transformer for Online Action Detection
Zhanzhong Pang
Fadime Sener
Angela Yao
OffRL
56
1
0
24 Mar 2025
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
Enes Erdogan
E. Aksoy
Sanem Sariel
44
0
0
15 Mar 2025
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
Junhyeok Kim
Min Soo Kim
Jiwan Chung
Jungbin Cho
Jisoo Kim
Sungwoong Kim
Gyeongbo Sim
Youngjae Yu
EgoV
57
0
0
17 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
45
0
0
11 Feb 2025
HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation
Zirui Wang
Xinran Zhao
Simon Stepputtis
Woojun Kim
Tongshuang Wu
Katia P. Sycara
Yaqi Xie
OffRL
49
0
0
03 Nov 2024
OnlineTAS: An Online Baseline for Temporal Action Segmentation
Qing Zhong
Guodong Ding
Angela Yao
40
2
0
02 Nov 2024
Human Action Anticipation: A Survey
Bolin Lai
Sam Toyer
Tushar Nagarajan
Rohit Girdhar
S. Zha
James M. Rehg
Kris M. Kitani
Kristen Grauman
Ruta Desai
Miao Liu
AI4TS
41
1
0
17 Oct 2024
AI Foundation Model for Heliophysics: Applications, Design, and Implementation
Sujit Roy
Talwinder Singh
Marcus Freitag
J. Schmude
Rohit Lal
...
Berkay Aydin
Nikolai Pogorelov
Juan Bernabé-Moreno
M. Maskey
Rahul Ramachandran
MedIm
AI4CE
26
0
0
30 Sep 2024
Real-Time Human Action Recognition on Embedded Platforms
Ruiqi Wang
Zichen Wang
Peiqi Gao
Mingzhen Li
Jaehwan Jeong
Yihang Xu
Yejin Lee
Carolyn M. Baum
Lisa Connor
Chenyang Lu
41
2
0
09 Sep 2024
HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics
Gueter Josmy Faure
Jia-Fong Yeh
Min-Hung Chen
Hung-Ting Su
Winston H. Hsu
Shang-Hong Lai
26
3
0
30 Aug 2024
HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization
Sakib Reza
Yuexi Zhang
Mohsen Moghaddam
Mario Sznaier
38
1
0
12 Aug 2024
Online Temporal Action Localization with Memory-Augmented Transformer
Youngkil Song
Dongkeun Kim
Minsu Cho
Suha Kwak
31
0
0
06 Aug 2024
Classification Matters: Improving Video Action Detection with Class-Specific Attention
Jinsung Lee
Taeoh Kim
Inwoong Lee
Minho Shim
Dongyoon Wee
Minsu Cho
Suha Kwak
52
0
0
29 Jul 2024
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Hyolim Kang
Jeongseok Hyun
Joungbin An
Youngjae Yu
Seon Joo Kim
38
0
0
17 Jul 2024
Object Aware Egocentric Online Action Detection
Joungbin An
Yunsu Park
Hyolim Kang
Seon Joo Kim
EgoV
29
0
0
03 Jun 2024
MALT: Multi-scale Action Learning Transformer for Online Action Detection
Zhipeng Yang
Ruoyu Wang
Yang Tan
Liping Xie
OffRL
43
1
0
31 May 2024
A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection
Matthew Korban
Peter Youngs
Scott T. Acton
ViT
29
6
0
13 May 2024
O-TALC: Steps Towards Combating Oversegmentation within Online Action Segmentation
Matthew Kent Myers
Nick Wright
A. Mcgough
Nicholas Martin
29
1
0
10 Apr 2024
Spatial-Temporal Multi-level Association for Video Object Segmentation
Deshui Miao
Xin Li
Zhenyu He
Huchuan Lu
Ming-Hsuan Yang
VOS
31
0
0
09 Apr 2024
Memory Consolidation Enables Long-Context Video Understanding
Ivana Balavzević
Yuge Shi
Pinelopi Papalampidi
Rahma Chaabouni
Skanda Koppula
Olivier J. Hénaff
105
22
0
08 Feb 2024
RotaTR: Detection Transformer for Dense and Rotated Object
Yuke Zhu
Yumeng Ruan
Lei Yang
Sheng Guo
29
0
0
05 Dec 2023
A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
24
7
0
29 Sep 2023
SkeleTR: Towrads Skeleton-based Action Recognition in the Wild
Haodong Duan
Mingze Xu
Bing Shuai
Davide Modolo
Zhuowen Tu
Joseph Tighe
Alessandro Bergamo
ViT
35
1
0
20 Sep 2023
JOADAA: joint online action detection and action anticipation
Mohammed Guermal
François Brémond
Rui Dai
Abid Ali
31
6
0
12 Sep 2023
Phase-Specific Augmented Reality Guidance for Microscopic Cataract Surgery Using Long-Short Spatiotemporal Aggregation Transformer
Puxun Tu
Hongfei Ye
Haochen Shi
Jeff Young
Meng Xie
Peiquan Zhao
Ce Zheng
Xiaoyi Jiang
Xiaojun Chen
12
1
0
11 Sep 2023
COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting using Transformers
J. Denize
Mykola Liashuha
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
ViT
25
13
0
03 Sep 2023
PFL-LSTR: A privacy-preserving framework for driver intention inference based on in-vehicle and out-vehicle information
Runjia Du
Pei Li
Sikai Chen
S. Labi
6
0
0
02 Sep 2023
Memory-and-Anticipation Transformer for Online Action Understanding
Jiahao Wang
Guo Chen
Yifei Huang
Liming Wang
Tong Lu
OffRL
62
37
0
15 Aug 2023
Temporal Sentence Grounding in Streaming Videos
Tian Gan
Xiao Wang
Yan Sun
Jianlong Wu
Qingpei Guo
Liqiang Nie
46
2
0
14 Aug 2023
Efficient Online Processing with Deep Neural Networks
Lukas Hedegaard
26
0
0
23 Jun 2023
E2E-LOAD: End-to-End Long-form Online Action Detection
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
30
5
0
13 Jun 2023
A Multi-Modal Transformer Network for Action Detection
Matthew Korban
Scott T. Acton
Peter Youngs
ViT
43
15
0
31 May 2023
VideoLLM: Modeling Video Sequence with Large Language Models
Guo Chen
Yin-Dong Zheng
Jiahao Wang
Jilan Xu
Yifei Huang
...
Yi Wang
Yali Wang
Yu Qiao
Tong Lu
Limin Wang
MLLM
103
77
0
22 May 2023
CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers
Yunsheng Ma
Wenqian Ye
Xu Cao
Amr Abdelraouf
Kyungtae Han
Rohit Gupta
Ziran Wang
43
11
0
13 May 2023
Memory-augmented Online Video Anomaly Detection
L. Rossi
Vittorio Bernuzzi
Tomaso Fontanini
Massimo Bertozzi
Andrea Prati
22
3
0
21 Feb 2023
PhysFormer++: Facial Video-based Physiological Measurement with SlowFast Temporal Difference Transformer
Zitong Yu
Yuming Shen
Jingang Shi
Hengshuang Zhao
Yawen Cui
Jiehua Zhang
Philip Torr
Guoying Zhao
ViT
MedIm
29
80
0
07 Feb 2023
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition
Dasom Ahn
Sangwon Kim
H. Hong
ByoungChul Ko
ViT
28
97
0
14 Oct 2022
Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors
Vladimir E. Iashin
Weidi Xie
Esa Rahtu
Andrew Zisserman
33
20
0
13 Oct 2022
An In-depth Study of Stochastic Backpropagation
J. Fang
Ming Xu
Hao Chen
Bing Shuai
Z. Tu
Joseph Tighe
BDL
32
1
0
30 Sep 2022
Real-time Online Video Detection with Temporal Smoothing Transformers
Yue Zhao
Philipp Krahenbuhl
ViT
69
57
0
19 Sep 2022
A Circular Window-based Cascade Transformer for Online Action Detection
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
42
6
0
30 Aug 2022
Weakly Supervised Online Action Detection for Infant General Movements
Tong Luo
Jia Xiao
Chuncao Zhang
Siheng Chen
Yuan Tian
Guangjun Yu
K. Dang
Xiaowei Ding
24
2
0
07 Aug 2022
An Efficient Framework for Few-shot Skeleton-based Temporal Action Segmentation
Leiyang Xu
Qianqian Wang
Xiaotian Lin
Lin Yuan
MedIm
32
6
0
20 Jul 2022
One-stage Action Detection Transformer
Lijun Li
Lian Zhuo
Bangyin Zhang
ViT
30
0
0
21 Jun 2022
1
2
Next