Cross-Modal Progressive Comprehension for Referring Segmentation


15 May 2021
Si Liu
Tianrui Hui
Shaofei Huang
Yunchao Wei
Bo-wen Li
Guanbin Li
    EgoV
    VOS

Papers citing "Cross-Modal Progressive Comprehension for Referring Segmentation"

50 / 71 papers shown
SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model
Kaiyu Li
Zepeng Xin
Li Pang
Chao Pang
Yupeng Deng
Jing Yao
Guisong Xia
Deyu Meng
Zhi Wang
Xiangyong Cao
VLM
LRM
37
0
0
13 Apr 2025
Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities
Jing Liu
Wenxuan Wang
Yisi Zhang
Yepeng Tang
Xingjian He
Longteng Guo
Tongtian Yue
Xinlong Wang
ObjD
46
0
0
02 Apr 2025
CADFormer: Fine-Grained Cross-modal Alignment and Decoding Transformer for Referring Remote Sensing Image Segmentation
Maofu Liu
Xin Jiang
Xiaokang Zhang
46
0
0
30 Mar 2025
Customized SAM 2 for Referring Remote Sensing Image Segmentation
Fu Rong
Meng Lan
Q. Zhang
L. Zhang
42
0
0
10 Mar 2025
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Suhwan Cho
Seunghoon Lee
Minhyeok Lee
Jungho Lee
Sangyoun Lee
VOS
77
0
0
05 Mar 2025
RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models
Keyan Chen
Jiafan Zhang
Chenyang Liu
Zhengxia Zou
Zhenwei Shi
VLM
34
3
0
12 Jan 2025
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
Claudia Cuttano
Gabriele Trivigno
Gabriele Rosi
Carlo Masone
Giuseppe Averta
VOS
101
2
0
26 Nov 2024
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Andong Deng
Tongjia Chen
Shoubin Yu
Taojiannan Yang
Lincoln Spencer
Yapeng Tian
Ajmal Saeed Mian
Mohit Bansal
Chen Chen
LRM
54
1
0
15 Nov 2024
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation
Zhe Dong
Yuzhe Sun
Yanfeng Gu
Tianzhu Liu
25
4
0
11 Oct 2024
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
Zechen Bai
Tong He
Haiyang Mei
Pichao Wang
Ziteng Gao
Joya Chen
Lei Liu
Zheng Zhang
Mike Zheng Shou
VLM
VOS
MLLM
37
17
0
29 Sep 2024
Fully Aligned Network for Referring Image Segmentation
Yong-Jin Liu
Ruihao Xu
Yansong Tang
29
0
0
29 Sep 2024
Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation
Sen Lei
Xinyu Xiao
Heng-Chao Li
Z. Shi
Qing Zhu
18
12
0
20 Sep 2024
HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models
V. Bhat
P. Krishnamurthy
Ramesh Karri
Farshad Khorrami
42
3
0
16 Sep 2024
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Hongyu Li
Tianrui Hui
Zihan Ding
Jing Zhang
Bin Ma
Xiaoming Wei
Jizhong Han
Si Liu
DiffM
40
1
0
12 Sep 2024
Language-Driven Interactive Shadow Detection
Hongqiu Wang
Wei Wang
Haipeng Zhou
Huihui Xu
Shaozhi Wu
Lei Zhu
31
6
0
16 Aug 2024
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
Shuting He
Henghui Ding
VOS
27
23
0
04 Apr 2024
ReMamber: Referring Image Segmentation with Mamba Twister
Yu-Hao Yang
Chaofan Ma
Jiangchao Yao
Zhun Zhong
Ya-Qin Zhang
Yanfeng Wang
Mamba
58
20
0
26 Mar 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
37
7
0
18 Mar 2024
Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning
Chen Zhao
Shuming Liu
K. Mangalam
Guocheng Qian
Fatimah Zohra
Abdulmohsen Alghannam
Jitendra Malik
Bernard Ghanem
46
3
0
08 Jan 2024
Tracking with Human-Intent Reasoning
Jiawen Zhu
Zhi-Qi Cheng
Jun-Yan He
Chenyang Li
Bin Luo
Huchuan Lu
Yifeng Geng
Xuansong Xie
LRM
VOS
32
7
0
29 Dec 2023
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
Jiannan Wu
Yi-Xin Jiang
Bin Yan
Huchuan Lu
Zehuan Yuan
Ping Luo
VOS
27
17
0
25 Dec 2023
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
Sihan Liu
Yiwei Ma
Xiaoqing Zhang
Haowei Wang
Jiayi Ji
Xiaoshuai Sun
Rongrong Ji
16
38
0
19 Dec 2023
Universal Segmentation at Arbitrary Granularity with Language Instruction
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLM
VOS
47
15
0
04 Dec 2023
Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding
Tianrui Hui
Zihan Ding
Junshi Huang
Xiaoming Wei
Xiaolin K. Wei
Jiao Dai
Jizhong Han
Si Liu
29
4
0
02 Nov 2023
Semi-Supervised Panoptic Narrative Grounding
Danni Yang
Jiayi Ji
Xiaoshuai Sun
Haowei Wang
Yinan Li
Yiwei Ma
Rongrong Ji
22
5
0
27 Oct 2023
Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation
P. Li
Yu Zhang
L. Yuan
Xianghua Xu
VOS
13
6
0
21 Sep 2023
Temporal Collection and Distribution for Referring Video Object Segmentation
Jiajin Tang
Ge Zheng
Sibei Yang
VOS
26
14
0
07 Sep 2023
Referring Image Segmentation Using Text Supervision
Fang Liu
Yuhao Liu
Yuqiu Kong
Ke Xu
L. Zhang
Baocai Yin
Gerhard Hancke
Rynson W. H. Lau
32
25
0
28 Aug 2023
Beyond One-to-One: Rethinking the Referring Image Segmentation
Yutao Hu
Qixiong Wang
Wenqi Shao
Enze Xie
Zhenguo Li
Jungong Han
Ping Luo
3DV
14
37
0
26 Aug 2023
Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery
Hongqiu Wang
Lei Zhu
Guang Yang
Yi-Ting Guo
Shenmin Zhang
Bo Xu
Yueming Jin
VOS
28
0
0
18 Aug 2023
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Chen Change Loy
VOS
33
101
0
16 Aug 2023
Learning Referring Video Object Segmentation from Weak Annotation
Wangbo Zhao
Ke Nan
Songyang Zhang
Kai-xiang Chen
Dahua Lin
Yang You
VOS
19
2
0
04 Aug 2023
VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes
Yuhao Lu
Yixuan Fan
Beixing Deng
F. Liu
Yali Li
Shengjin Wang
31
28
0
01 Aug 2023
Spectrum-guided Multi-granularity Referring Video Object Segmentation
Bo Miao
Bennamoun
Yongsheng Gao
Ajmal Saeed Mian
VOS
29
34
0
25 Jul 2023
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
Zunnan Xu
Zhihong Chen
Yong Zhang
Yibing Song
Xiang Wan
Guanbin Li
VLM
27
47
0
21 Jul 2023
OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
Dongming Wu
Tiancai Wang
Yuang Zhang
Xiangyu Zhang
Jianbing Shen
VOS
27
33
0
18 Jul 2023
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation
Yonglin Li
Jing Zhang
Xiao Teng
Long Lan
VOS
VLM
23
17
0
03 Jul 2023
Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for Referring Video Object Segmentation
Meng Lan
Fu Rong
Zuchao Li
Wei Yu
L. Zhang
VOS
29
5
0
02 Jul 2023
Towards Open Vocabulary Learning: A Survey
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Bernard Ghanem
Dacheng Tao
ObjD
VLM
27
135
0
28 Jun 2023
LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
Linfeng Yuan
Miaojing Shi
Zijie Yue
Qijun Chen
VOS
27
8
0
14 Jun 2023
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Seoyeon Kim
Minguk Kang
Dongwon Kim
Jaesik Park
Suha Kwak
VLM
20
10
0
14 Jun 2023
GRES: Generalized Referring Expression Segmentation
Chang Liu
Henghui Ding
Xudong Jiang
34
139
0
01 Jun 2023
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
Zhuoyan Luo
Yicheng Xiao
Yong-Jin Liu
Shuyan Li
Yitong Wang
Yansong Tang
Xiu Li
Yujiu Yang
VOS
14
32
0
26 May 2023
Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation
Chang Liu
Henghui Ding
Yulun Zhang
Xudong Jiang
19
47
0
24 May 2023
Advancing Referring Expression Segmentation Beyond Single Image
YiXuan Wu
Zhao Zhang
Xie Chi
Feng Zhu
Rui Zhao
VLM
27
18
0
21 May 2023
Meta Compositional Referring Expression Segmentation
Li Xu
Mark He Huang
Xindi Shang
Zehuan Yuan
Ying Sun
Jun Liu
31
22
0
10 Apr 2023
Universal Instance Perception as Object Discovery and Retrieval
B. Yan
Yi-Xin Jiang
Jiannan Wu
D. Wang
Ping Luo
Zehuan Yuan
Huchuan Lu
VOS
VLM
LRM
27
161
0
12 Mar 2023
Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation
Zhao Yang
Jiaqi Wang
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Philip H. S. Torr
31
23
0
11 Mar 2023
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
Jiang Liu
Hui Ding
Zhaowei Cai
Yuting Zhang
R. Satzoda
Vijay Mahadevan
R. Manmatha
ObjD
15
120
0
14 Feb 2023
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Philip H. S. Torr
S. Bai
VOS
25
132
0
03 Feb 2023