Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.08568
Cited By
ASFormer: Transformer for Action Segmentation
16 October 2021
Fangqiu Yi
Hongyu Wen
Tingting Jiang
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ASFormer: Transformer for Action Segmentation"
35 / 35 papers shown
Title
Multi-Stage Boundary-Aware Transformer Network for Action Segmentation in Untrimmed Surgical Videos
Rezowan Shuvo
M S Mekala
Eyad Elyan
MedIm
131
0
0
26 Apr 2025
F
3
^3
3
Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos
Zhaoyu Liu
Kan Jiang
Murong Ma
Zhé Hóu
Yun Lin
J. Dong
37
0
0
11 Apr 2025
Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks
Wei-Jin Huang
Yuan-Ming Li
Zhi-Wei Xia
Yu-Ming Tang
Kun-Yu Lin
Jian-Fang Hu
Wei-Shi Zheng
47
0
0
28 Mar 2025
Cost-Sensitive Learning for Long-Tailed Temporal Action Segmentation
Zhanzhong Pang
Fadime Sener
Shrinivas Ramasubramanian
Angela Yao
56
1
0
24 Mar 2025
End-to-End Action Segmentation Transformer
Tieqiao Wang
Sinisa Todorovic
ViT
39
0
0
08 Mar 2025
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
Baoqi Pei
Y. Huang
Jilan Xu
Guo Chen
Yuping He
...
Yali Wang
Weidi Xie
Yu Qiao
Fei Wu
Limin Wang
41
0
0
02 Mar 2025
Improving action segmentation via explicit similarity measurement
Kamel Aouaidjia
Wenhao Zhang
Aofan Li
Chongsheng Zhang
44
0
0
15 Feb 2025
Hierarchical Vector Quantization for Unsupervised Action Segmentation
Federico Spurio
Emad Bahrami
Gianpiero Francesca
Juergen Gall
44
0
0
23 Dec 2024
Video LLMs for Temporal Reasoning in Long Videos
Fawad Javed Fateh
Umer Ahmed
Hamza Khan
M. Zia
Quoc-Huy Tran
VLM
89
0
0
04 Dec 2024
Faster Diffusion Action Segmentation
Shuai Wang
Shunli Wang
Mingcheng Li
Dingkang Yang
Haopeng Kuang
Ziyun Qian
Lihua Zhang
39
0
0
04 Aug 2024
Con4m: Context-aware Consistency Learning Framework for Segmented Time Series Classification
Junru Chen
Tianyu Cao
Ninon De Mecquenem
Jiahe Li
Zhilong Chen
F. Friederici
Yang Yang
40
1
0
31 Jul 2024
Unifying Global and Local Scene Entities Modelling for Precise Action Spotting
Kim Hoang Tran
Phuc Vuong Do
Ngoc Quoc Ly
Ngan Le
36
4
0
15 Apr 2024
Activity Grammars for Temporal Action Segmentation
Dayoung Gong
Joonseok Lee
Deunsol Jung
Suha Kwak
Minsu Cho
33
7
0
07 Dec 2023
BIT: Bi-Level Temporal Modeling for Efficient Supervised Action Segmentation
Zijia Lu
Ehsan Elhamifar
48
2
0
28 Aug 2023
Enhancing Transformer Backbone for Egocentric Video Action Segmentation
Sakib Reza
Balaji Sundareshan
Mohsen Moghaddam
Mario Sznaier
ViT
30
4
0
19 May 2023
DIR-AS: Decoupling Individual Identification and Temporal Reasoning for Action Segmentation
Peiyao Wang
Haibin Ling
15
2
0
04 Apr 2023
Diffusion Action Segmentation
Dao-jun Liu
Qiyue Li
A. Dinh
Ting Jiang
Mubarak Shah
Chan Xu
VGen
DiffM
19
68
0
31 Mar 2023
TAEC: Unsupervised Action Segmentation with Temporal-Aware Embedding and Clustering
Wei Lin
Anna Kukleva
Horst Possegger
Hilde Kuehne
Horst Bischof
48
2
0
09 Mar 2023
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition
Dasom Ahn
Sangwon Kim
H. Hong
ByoungChul Ko
ViT
28
97
0
14 Oct 2022
Robust Action Segmentation from Timestamp Supervision
Yaser Souri
Yazan Abu Farha
Emad Bahrami
Gianpiero Francesca
Juergen Gall
27
6
0
12 Oct 2022
Hand Hygiene Assessment via Joint Step Segmentation and Key Action Scorer
Chenglong Li
Qiwen Zhu
Tubiao Liu
Jin Tang
Yu Su
32
1
0
25 Sep 2022
Vision Transformers for Action Recognition: A Survey
Anwaar Ulhaq
Naveed Akhtar
Ganna Pogrebna
Ajmal Saeed Mian
ViT
19
44
0
13 Sep 2022
Semantic2Graph: Graph-based Multi-modal Feature Fusion for Action Segmentation in Videos
Jun-Bin Zhang
Pei-Hsuan Tsai
Meng-Hsun Tsai
31
20
0
13 Sep 2022
Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation
Nadine Behrmann
S. Golestaneh
Zico Kolter
Juergen Gall
M. Noroozi
22
72
0
01 Sep 2022
A Circular Window-based Cascade Transformer for Online Action Detection
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
42
6
0
30 Aug 2022
Action Quality Assessment with Temporal Parsing Transformer
Yang Bai
Desen Zhou
Songyang Zhang
Jian Wang
Errui Ding
Yu Guan
Yang Long
Jingdong Wang
ViT
18
39
0
19 Jul 2022
Do we really need temporal convolutions in action segmentation?
Dazhao Du
Bing-Huang Su
Yu Li
Zhongang Qi
Hui Xiong
Ying Shan
ViT
21
16
0
26 May 2022
Cross-Enhancement Transformer for Action Segmentation
Jiahui Wang
Zhenyou Wang
Shanna Zhuang
Hui Wang
ViT
54
23
0
19 May 2022
On the Pitfalls of Batch Normalization for End-to-End Video Learning: A Study on Surgical Workflow Analysis
Dominik Rivoir
Isabel Funke
Stefanie Speidel
24
16
0
15 Mar 2022
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction
Yunze Liu
Yun-Hai Liu
Chen Jiang
Kangbo Lyu
Weikang Wan
Hao Shen
Bo-Hua Liang
Zhoujie Fu
He-Nan Wang
Li Yi
45
173
0
03 Mar 2022
SVIP: Sequence VerIfication for Procedures in Videos
Yichen Qian
Weixin Luo
Dongze Lian
Xu Tang
P. Zhao
Shenghua Gao
ViT
29
17
0
13 Dec 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
283
3,623
0
24 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,982
0
09 Feb 2021
Global2Local: Efficient Structure Search for Video Action Segmentation
Shanghua Gao
Qi Han
Zhong-Yu Li
Pai Peng
Liang Wang
Ming-Ming Cheng
EgoV
86
73
0
04 Jan 2021
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
282
2,015
0
28 Jul 2020
1