ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.03789
  4. Cited By
Language-Bridged Spatial-Temporal Interaction for Referring Video Object
  Segmentation

Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation

8 June 2022
Zihan Ding
Tianrui Hui
Junshi Huang
Xiaoming Wei
Jizhong Han
Si Liu
    VOS
ArXiv (abs)PDFHTML

Papers citing "Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation"

30 / 30 papers shown
Title
Referring Video Object Segmentation via Language-aligned Track Selection
Referring Video Object Segmentation via Language-aligned Track Selection
Seongchan Kim
Woojeong Jin
Sangbeom Lim
Heeji Yoon
Hyunwook Choi
Seungryong Kim
VOS
145
0
0
02 Dec 2024
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
Claudia Cuttano
Gabriele Trivigno
Gabriele Rosi
Carlo Masone
Giuseppe Averta
VOS
167
2
0
26 Nov 2024
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Andong Deng
Tongjia Chen
Shoubin Yu
Taojiannan Yang
Lincoln Spencer
Yapeng Tian
Ajmal Mian
Joey Tianyi Zhou
Chen Chen
LRM
96
3
0
15 Nov 2024
ViLLa: Video Reasoning Segmentation with Large Language Model
ViLLa: Video Reasoning Segmentation with Large Language Model
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
VOSLRM
112
5
0
18 Jul 2024
CLIP-It! Language-Guided Video Summarization
CLIP-It! Language-Guided Video Summarization
Medhini Narasimhan
Anna Rohrbach
Trevor Darrell
CLIP
110
119
0
01 Jul 2021
Cross-Modal Progressive Comprehension for Referring Segmentation
Cross-Modal Progressive Comprehension for Referring Segmentation
Si Liu
Tianrui Hui
Shaofei Huang
Yunchao Wei
Yue Liu
Guanbin Li
EgoVVOS
64
129
0
15 May 2021
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor
  Segmentation
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
Tianrui Hui
Shaofei Huang
Si Liu
Zihan Ding
Guanbin Li
Wenguan Wang
Jizhong Han
Fei Wang
70
49
0
14 May 2021
M3L: Language-based Video Editing via Multi-Modal Multi-Level
  Transformers
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers
Tsu-Jui Fu
Xinze Wang
Scott T. Grafton
Miguel P. Eckstein
Wenjie Wang
92
9
0
02 Apr 2021
Locate then Segment: A Strong Pipeline for Referring Image Segmentation
Locate then Segment: A Strong Pipeline for Referring Image Segmentation
Ya Jing
Tao Kong
Wei Wang
Liang Wang
Lei Li
Tieniu Tan
70
136
0
30 Mar 2021
Referring Segmentation in Images and Videos with Cross-Modal
  Self-Attention Network
Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network
Linwei Ye
Mrigank Rochan
Zhi Liu
Xiaoqin Zhang
Yang Wang
VOSEgoV
54
57
0
09 Feb 2021
SwiftNet: Real-time Video Object Segmentation
SwiftNet: Real-time Video Object Segmentation
Haochen Wang
Xiaolong Jiang
Haibing Ren
Yao Hu
S. Bai
VOS
72
150
0
09 Feb 2021
SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation
SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation
Brendan Duke
Abdalla Ahmed
Christian Wolf
P. Aarabi
Graham W. Taylor
VOS
61
166
0
21 Jan 2021
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised
  Video Object Segmentation
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object Segmentation
Hyojin Park
Jayeon Yoo
Seohyeong Jeong
Ganesh Venkatesh
Nojun Kwak
VOS
69
36
0
21 Dec 2020
Linguistic Structure Guided Context Modeling for Referring Image
  Segmentation
Linguistic Structure Guided Context Modeling for Referring Image Segmentation
Tianrui Hui
Si Liu
Shaofei Huang
Guanbin Li
Sansi Yu
Faxi Zhang
Jizhong Han
72
154
0
01 Oct 2020
Referring Image Segmentation via Cross-Modal Progressive Comprehension
Referring Image Segmentation via Cross-Modal Progressive Comprehension
Shaofei Huang
Tianrui Hui
Si Liu
Guanbin Li
Yunchao Wei
Jizhong Han
Luoqi Liu
Yue Liu
EgoV
78
183
0
01 Oct 2020
Multi-task Collaborative Network for Joint Referring Expression
  Comprehension and Segmentation
Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Liujuan Cao
Chenglin Wu
Cheng Deng
Rongrong Ji
ObjD
253
293
0
19 Mar 2020
Motion-Attentive Transition for Zero-Shot Video Object Segmentation
Motion-Attentive Transition for Zero-Shot Video Object Segmentation
Tianfei Zhou
Shunzhou Wang
Yi Zhou
Yazhou Yao
Jianwu Li
Ling Shao
VOS
180
189
0
09 Mar 2020
Motion Guided Attention for Video Salient Object Detection
Motion Guided Attention for Video Salient Object Detection
Haofeng Li
Guanqi Chen
Guanbin Li
Yizhou Yu
93
167
0
16 Sep 2019
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels
Pengxiang Yan
Guanbin Li
Yuan Xie
Zhen Li
Chuan Wang
Tianshui Chen
Liang Lin
59
108
0
12 Aug 2019
Multi-Fiber Networks for Video Recognition
Multi-Fiber Networks for Video Recognition
Yunpeng Chen
Yannis Kalantidis
Jianshu Li
Shuicheng Yan
Jiashi Feng
CVBM
110
218
0
30 Jul 2018
Video Object Segmentation with Language Referring Expressions
Video Object Segmentation with Language Referring Expressions
Anna Khoreva
Anna Rohrbach
Bernt Schiele
VOS
69
196
0
21 Mar 2018
Actor and Action Video Segmentation from a Sentence
Actor and Action Video Segmentation from a Sentence
Kirill Gavrilyuk
Amir Ghodrati
Zhenyang Li
Cees G. M. Snoek
VLM
73
150
0
20 Mar 2018
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
728
132,199
0
12 Jun 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
235
8,037
0
22 May 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Y. Jang
Yale Song
Youngjae Yu
Youngjin Kim
Gunhee Kim
77
561
0
14 Apr 2017
V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image
  Segmentation
V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
Fausto Milletari
Nassir Navab
Seyed-Ahmad Ahmadi
229
8,706
0
15 Jun 2016
Fully Convolutional Networks for Semantic Segmentation
Fully Convolutional Networks for Semantic Segmentation
Evan Shelhamer
Jonathan Long
Trevor Darrell
VOSSSeg
741
37,886
0
20 May 2016
Segmentation from Natural Language Expressions
Segmentation from Natural Language Expressions
Ronghang Hu
Marcus Rohrbach
Trevor Darrell
VLMEgoV
74
437
0
20 Mar 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,322
0
10 Dec 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.0K
150,260
0
22 Dec 2014
1