Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.07485
Cited By
Actor and Action Video Segmentation from a Sentence
20 March 2018
Kirill Gavrilyuk
Amir Ghodrati
Zhenyang Li
Cees G. M. Snoek
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Actor and Action Video Segmentation from a Sentence"
34 / 34 papers shown
Title
ReferDINO-Plus: 2nd Solution for 4th PVUW MeViS Challenge at CVPR 2025
Tianming Liang
Haichao Jiang
Wei-Shi Zheng
Jian-Fang Hu
44
0
0
30 Mar 2025
Referring Video Object Segmentation via Language-aligned Track Selection
Seongchan Kim
Woojeong Jin
Sangbeom Lim
Heeji Yoon
Hyunwook Choi
Seungryong Kim
VOS
94
0
0
02 Dec 2024
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
Claudia Cuttano
Gabriele Trivigno
Gabriele Rosi
Carlo Masone
Giuseppe Averta
VOS
106
2
0
26 Nov 2024
1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
Mingqi Gao
Jingnan Luo
Jinyu Yang
Jungong Han
Feng Zheng
42
2
0
11 Jun 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
42
8
0
18 Mar 2024
EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving
Jiacheng Lin
Jiajun Chen
Kunyu Peng
Xuan He
Zhiyong Li
Rainer Stiefelhagen
Kailun Yang
52
6
0
28 Feb 2024
1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation
Zhuoyan Luo
Yicheng Xiao
Yong Liu
Yitong Wang
Yansong Tang
Xiu Li
Yujiu Yang
VOS
33
2
0
01 Jan 2024
Cross-modal Cognitive Consensus guided Audio-Visual Segmentation
Zhaofeng Shi
Qingbo Wu
Fanman Meng
Linfeng Xu
Hongliang Li
VOS
33
3
0
10 Oct 2023
Temporal Collection and Distribution for Referring Video Object Segmentation
Jiajin Tang
Ge Zheng
Sibei Yang
VOS
36
14
0
07 Sep 2023
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples
Guanghui Li
Mingqi Gao
Heng Liu
Xiantong Zhen
Feng Zheng
VOS
34
3
0
05 Sep 2023
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Chen Change Loy
VOS
44
101
0
16 Aug 2023
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
Yonglin Li
Jing Zhang
Xiao Teng
Long Lan
VOS
VLM
23
17
0
03 Jul 2023
Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for Referring Video Object Segmentation
Meng Lan
Fu Rong
Zuchao Li
Wei Yu
Lefei Zhang
VOS
33
5
0
02 Jul 2023
Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation
Zhao Yang
Jiaqi Wang
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Philip Torr
48
23
0
11 Mar 2023
Referring Multi-Object Tracking
Dongming Wu
Wencheng Han
Tiancai Wang
Xingping Dong
Xiangyu Zhang
Jianbing Shen
40
71
0
06 Mar 2023
1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation
Zhiwei Hu
Bo Chen
Yuan Gao
Zhilong Ji
Jinfeng Bai
VOS
37
5
0
27 Dec 2022
Multi-Attention Network for Compressed Video Referring Object Segmentation
Weidong Chen
Dexiang Hong
Yuankai Qi
Zhenjun Han
Shuhui Wang
Laiyun Qing
Qingming Huang
Guorong Li
VOS
20
35
0
26 Jul 2022
Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Zihan Ding
Tianrui Hui
Junshi Huang
Xiaoming Wei
Jizhong Han
Si Liu
VOS
33
51
0
08 Jun 2022
Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation
Wangbo Zhao
Kai Wang
Xiangxiang Chu
Fuzhao Xue
Xinchao Wang
Yang You
29
21
0
06 Apr 2022
Local-Global Context Aware Transformer for Language-Guided Video Segmentation
Chen Liang
Wenguan Wang
Tianfei Zhou
Jiaxu Miao
Yawei Luo
Yi Yang
VOS
29
74
0
18 Mar 2022
Language as Queries for Referring Video Object Segmentation
Jiannan Wu
Yi-Xin Jiang
Pei Sun
Zehuan Yuan
Ping Luo
28
141
0
03 Jan 2022
End-to-End Referring Video Object Segmentation with Multimodal Transformers
Adam Botach
Evgenii Zheltonozhskii
Chaim Baskin
VOS
34
140
0
29 Nov 2021
Video Generation from Text Employing Latent Path Construction for Temporal Modeling
Amir Mazaheri
M. Shah
30
8
0
29 Jul 2021
A Survey on Deep Learning Technique for Video Segmentation
Tianfei Zhou
Fatih Porikli
David J. Crandall
Luc Van Gool
Wenguan Wang
VOS
34
232
0
02 Jul 2021
Cross-Modal Progressive Comprehension for Referring Segmentation
Si Liu
Tianrui Hui
Shaofei Huang
Yunchao Wei
Bo-wen Li
Guanbin Li
EgoV
VOS
28
123
0
15 May 2021
SBNet: Segmentation-based Network for Natural Language-based Vehicle Search
Sangrok Lee
Taekang Woo
Sang Hun Lee
24
4
0
22 Apr 2021
Decoupled Spatial Temporal Graphs for Generic Visual Grounding
Qi Feng
Yunchao Wei
Mingming Cheng
Yi Yang
27
5
0
18 Mar 2021
We don't Need Thousand Proposals
:
\colon
:
Single Shot Actor-Action Detection in Videos
A. J. Rana
Yogesh S Rawat
ViT
13
11
0
22 Nov 2020
RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation
Míriam Bellver
Carles Ventura
Carina Silberer
Ioannis V. Kazakos
Jordi Torres
Xavier Giró-i-Nieto
VOS
26
32
0
01 Oct 2020
Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
Daizong Liu
Xiaoye Qu
Xiao-Yang Liu
Jianfeng Dong
Pan Zhou
Zichuan Xu
33
129
0
04 Aug 2020
Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection
Jie Chen
Zhiheng Li
Jiebo Luo
Chenliang Xu
27
13
0
29 Mar 2020
Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
Yitian Yuan
Lin Ma
Jingwen Wang
Wei Liu
Wenwu Zhu
30
242
0
31 Oct 2019
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
Cristian Rodriguez-Opazo
Edison Marrese-Taylor
F. Saleh
Hongdong Li
Stephen Gould
27
147
0
20 Aug 2019
An Efficient 3D CNN for Action/Object Segmentation in Video
Rui Hou
Chong Chen
Rahul Sukthankar
M. Shah
24
27
0
21 Jul 2019
1