Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.14598
Cited By
Video Self-Stitching Graph Network for Temporal Action Localization
30 November 2020
Chen Zhao
Ali K. Thabet
Bernard Ghanem
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Video Self-Stitching Graph Network for Temporal Action Localization"
50 / 94 papers shown
Title
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Ho-Joong Kim
Y. E. Lee
Jung-Ho Hong
Seong-Whan Lee
40
0
0
09 May 2025
FDDet: Frequency-Decoupling for Boundary Refinement in Temporal Action Detection
Xinnan Zhu
Yicheng Zhu
Tixin Chen
Wentao Wu
Yuanjie Dang
49
0
0
01 Apr 2025
TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos
Chen-Da Liu-Zhang
Lin Sui
Shuming Liu
Fangzhou Mu
Z. Wang
Bernard Ghanem
52
1
0
09 Mar 2025
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
Baoqi Pei
Y. Huang
Jilan Xu
Guo Chen
Yuping He
...
Yali Wang
Weidi Xie
Yu Qiao
Fei Wu
Limin Wang
41
0
0
02 Mar 2025
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu
Chen Zhao
Fatimah Zohra
Mattia Soldan
Alejandro Pardo
...
Juan Carlos León Alcázar
A. Cioppa
Silvio Giancola
Carlos Hinojosa
Bernard Ghanem
68
3
0
27 Feb 2025
Temporal Action Localization with Cross Layer Task Decoupling and Refinement
Qiang Li
Di Liu
Jun Kong
Sen Li
Hui Xu
Jianzhong Wang
77
0
0
12 Dec 2024
ContextDet: Temporal Action Detection with Adaptive Context Aggregation
Ning Wang
Yun Xiao
Xiaopeng Peng
Xiaojun Chang
Xuanhong Wang
Dingyi Fang
28
2
0
20 Oct 2024
Temporal2Seq: A Unified Framework for Temporal Video Understanding Tasks
Min Yang
Zichen Zhang
Limin Wang
AI4TS
36
0
0
27 Sep 2024
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
45
1
0
12 Sep 2024
HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization
Sakib Reza
Yuexi Zhang
Mohsen Moghaddam
Octavia Camps
32
1
0
12 Aug 2024
Online Temporal Action Localization with Memory-Augmented Transformer
Youngkil Song
Dongkeun Kim
Minsu Cho
Suha Kwak
23
0
0
06 Aug 2024
Harnessing Temporal Causality for Advanced Temporal Action Detection
Shuming Liu
Lin Sui
Chen-Da Liu-Zhang
Fangzhou Mu
Chen Zhao
Bernard Ghanem
CML
40
2
0
25 Jul 2024
Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization
Feixiang Zhou
Bryan M. Williams
Hossein Rahmani
40
1
0
10 Jul 2024
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Le Yang
Ziwei Zheng
Yizeng Han
Hao-Ran Cheng
Shiji Song
Gao Huang
Fan Li
58
8
0
03 Jul 2024
Open-Vocabulary Temporal Action Localization using Multimodal Guidance
Akshita Gupta
Aditya Arora
Sanath Narayan
Salman Khan
F. Khan
Graham W. Taylor
38
3
0
21 Jun 2024
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
Ming Hu
Peng Xia
Lin Wang
Siyuan Yan
Feilong Tang
...
Xuelian Cheng
Jun Cheng
Chi Liu
Kaijing Zhou
Zongyuan Ge
43
17
0
11 Jun 2024
One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features
Trung Thanh Nguyen
Yasutomo Kawanishi
Takahiro Komamizu
Ichiro Ide
VLM
33
3
0
30 Apr 2024
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
Yingsen Zeng
Yujie Zhong
Chengjian Feng
Lin Ma
63
7
0
07 Apr 2024
UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization
Tiantian Geng
Teng Wang
Yanfu Zhang
Jinming Duan
Weili Guan
Feng Zheng
23
0
0
04 Apr 2024
TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression
Ho-Joong Kim
Jung-Ho Hong
Heejo Kong
Seong-Whan Lee
57
5
0
03 Apr 2024
Action Detection via an Image Diffusion Process
Lin Geng Foo
Tianjiao Li
Hossein Rahmani
Jun Liu
22
4
0
01 Apr 2024
Dual DETRs for Multi-Label Temporal Action Detection
Yuhan Zhu
Guozhen Zhang
Jing Tan
Gangshan Wu
Limin Wang
35
11
0
31 Mar 2024
Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions
Runhao Zeng
Xiaoyong Chen
Jiaming Liang
Huisi Wu
Guangzhong Cao
Yong Guo
AAML
39
3
0
29 Mar 2024
AVicuna: Audio-Visual LLM with Interleaver and Context-Boundary Alignment for Temporal Referential Dialogue
Yunlong Tang
Daiki Shimada
Jing Bi
Chenliang Xu
VGen
34
10
0
24 Mar 2024
Dr
2
^2
2
Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning
Chen Zhao
Shuming Liu
K. Mangalam
Guocheng Qian
Fatimah Zohra
Abdulmohsen Alghannam
Jitendra Malik
Bernard Ghanem
51
3
0
08 Jan 2024
Adapting Short-Term Transformers for Action Detection in Untrimmed Videos
Min Yang
Huan Gao
Ping Guo
Limin Wang
ViT
28
5
0
04 Dec 2023
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
Shuming Liu
Chen-Da Liu-Zhang
Chen Zhao
Bernard Ghanem
33
25
0
28 Nov 2023
ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization
Elahe Vahdani
Yingli Tian
21
0
0
27 Nov 2023
Temporal Action Localization for Inertial-based Human Activity Recognition
Marius Bock
Michael Moeller
Kristof Van Laerhoven
30
0
0
27 Nov 2023
POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization
Elahe Vahdani
Yingli Tian
27
1
0
20 Oct 2023
Boundary Discretization and Reliable Classification Network for Temporal Action Detection
Zhenying Fang
Jun Yu
Richang Hong
23
0
0
10 Oct 2023
Proposal-based Temporal Action Localization with Point-level Supervision
Yuan Yin
Yifei Huang
Ryosuke Furuta
Yoichi Sato
17
1
0
09 Oct 2023
Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization
Edward Fish
Jon Weinbren
Andrew Gilbert
31
0
0
05 Oct 2023
Automatic Animation of Hair Blowing in Still Portrait Photos
Wenpeng Xiao
Wentao Liu
Yitong Wang
Bernard Ghanem
Bing Li
3DH
31
10
0
25 Sep 2023
Temporal Action Localization with Enhanced Instant Discriminability
Ding Shi
Qiong Cao
Yujie Zhong
Shan An
Jian Cheng
Haogang Zhu
Dacheng Tao
36
9
0
11 Sep 2023
UnLoc: A Unified Framework for Video Localization Tasks
Shengjia Yan
Xuehan Xiong
Arsha Nagrani
Anurag Arnab
Zhonghao Wang
Weina Ge
David A. Ross
Cordelia Schmid
26
53
0
21 Aug 2023
Self-Feedback DETR for Temporal Action Detection
Jihwan Kim
Miso Lee
Jae-Pil Heo
37
17
0
21 Aug 2023
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
VLM
26
19
0
15 Aug 2023
Learning to Identify Critical States for Reinforcement Learning from Videos
Haozhe Liu
Mingchen Zhuge
Bing Li
Yu‐Han Wang
Francesco Faccio
Bernard Ghanem
Jürgen Schmidhuber
OffRL
15
6
0
15 Aug 2023
DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization
Xiaojun Tang
Junsong Fan
Chuanchen Luo
Zhaoxiang Zhang
Man Zhang
Zongyuan Yang
38
8
0
31 Jul 2023
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Shraman Pramanick
Yale Song
Sayan Nag
Kevin Qinghong Lin
Hardik Shah
Mike Zheng Shou
Ramalingam Chellappa
Pengchuan Zhang
VLM
39
86
0
11 Jul 2023
A Multi-Modal Transformer Network for Action Detection
Matthew Korban
Scott T. Acton
Peter Youngs
ViT
40
15
0
31 May 2023
Action Sensitivity Learning for Temporal Action Localization
Jiayi Shao
Xiaohan Wang
Ruijie Quan
Junjun Zheng
Jiang Yang
Yezhou Yang
28
22
0
25 May 2023
VideoLLM: Modeling Video Sequence with Large Language Models
Guo Chen
Yin-Dong Zheng
Jiahao Wang
Jilan Xu
Yifei Huang
...
Yi Wang
Yali Wang
Yu Qiao
Tong Lu
Limin Wang
MLLM
97
76
0
22 May 2023
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
Xijun Wang
Aggelos K. Katsaggelos
34
0
0
07 May 2023
WEAR: An Outdoor Sports Dataset for Wearable and Egocentric Activity Recognition
Marius Bock
Hilde Kuehne
Kristof Van Laerhoven
Michael Moeller
EgoV
38
24
0
11 Apr 2023
Boundary-Denoising for Video Activity Localization
Mengmeng Xu
Mattia Soldan
Jialin Gao
Shuming Liu
Juan-Manuel Perez-Rua
Bernard Ghanem
21
10
0
06 Apr 2023
Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection
Pilhyeon Lee
Taeoh Kim
Minho Shim
Dongyoon Wee
H. Byun
26
11
0
30 Mar 2023
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion
Sauradip Nag
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
DiffM
VGen
41
21
0
27 Mar 2023
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline
Tiantian Geng
Teng Wang
Jinming Duan
Runmin Cong
Feng Zheng
25
28
0
22 Mar 2023
1
2
Next