GRES: Generalized Referring Expression Segmentation

1 June 2023
Chang Liu
Henghui Ding
Xudong Jiang

Papers citing "GRES: Generalized Referring Expression Segmentation"

28 / 128 papers shown
EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment
M. Lavrenyuk
Shariq Farooq Bhat
Matthias Müller
Peter Wonka
ObjD, MDE
86
9
0
13 Dec 2023
See, Say, and Segment: Teaching LMMs to Overcome False Premises
Tsung-Han Wu
Giscard Biamby
David M. Chan
Lisa Dunlap
Ritwik Gupta
Xudong Wang
Joseph E. Gonzalez
Trevor Darrell
VLM, MLLM
113
21
0
13 Dec 2023
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation
Wenxuan Wang
Tongtian Yue
Yisi Zhang
Longteng Guo
Xingjian He
Xinlong Wang
Jing Liu
ObjD
85
15
0
13 Dec 2023
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Shuyang Sun
Runjia Li
Philip Torr
Xiuye Gu
Siyang Li
VLM, CLIP
140
34
0
12 Dec 2023
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
Yuzhou Huang
Liangbin Xie
Xintao Wang
Ziyang Yuan
Xiaodong Cun
...
Jiantao Zhou
Chao Dong
Rui Huang
Ruimao Zhang
Ying Shan
DiffM
74
77
0
11 Dec 2023
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Hao Zhang
Hongyang Li
Feng Li
Tianhe Ren
Xueyan Zou
...
Shijia Huang
Jianfeng Gao
Lei Zhang
Chun-yue Li
Jianwei Yang
189
76
0
05 Dec 2023
Unveiling Objects with SOLA: An Annotation-Free Image Search on the Object Level for Automotive Data Sets
Philipp Rigoll
Jacob Langner
Eric Sax
82
4
0
04 Dec 2023
Universal Segmentation at Arbitrary Granularity with Language Instruction
Yong Liu
Cairong Zhang
Yitong Wang
Jiahao Wang
Yujiu Yang
Yansong Tang
VLM, VOS
117
20
0
04 Dec 2023
PixelLM: Pixel Reasoning with Large Multimodal Model
Zhongwei Ren
Zhicheng Huang
Yunchao Wei
Yao-Min Zhao
Dongmei Fu
Jiashi Feng
Xiaojie Jin
VLM, MLLM, LRM
118
109
0
04 Dec 2023
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation
Rongyao Fang
Shilin Yan
Zhaoyang Huang
Jingqiu Zhou
Hao Tian
Jifeng Dai
Hongsheng Li
MLLM
106
14
0
30 Nov 2023
Synchronizing Vision and Language: Bidirectional Token-Masking AutoEncoder for Referring Image Segmentation
Minhyeok Lee
Dogyoon Lee
Jungho Lee
Suhwan Cho
Heeseung Choi
Ig-Jae Kim
Sangyoun Lee
70
0
0
29 Nov 2023
VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search
Shuting He
Hao Luo
Wei Jiang
Xudong Jiang
Henghui Ding
99
40
0
13 Nov 2023
u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model
Jinjin Xu
Liwu Xu
Yuzhe Yang
Xiang Li
Fanyi Wang
Yanchun Xie
Yi-Jie Huang
Yaqian Li
MoE, MLLM, VLM
133
16
0
09 Nov 2023
NExT-Chat: An LMM for Chat, Detection and Segmentation
Ao Zhang
Yuan Yao
Wei Ji
Zhiyuan Liu
Tat-Seng Chua
MLLM, VLM
123
55
0
08 Nov 2023
GLaMM: Pixel Grounding Large Multimodal Model
H. Rasheed
Muhammad Maaz
Sahal Shaji Mullappilly
Abdelrahman M. Shaker
Salman Khan
Hisham Cholakkal
Rao M. Anwer
Eric Xing
Ming-Hsuan Yang
Fahad S. Khan
MLLM, VLM
153
239
0
06 Nov 2023
Towards Omni-supervised Referring Expression Segmentation
Minglang Huang
Yiyi Zhou
Gen Luo
Guannan Jiang
Weilin Zhuang
Xiaoshuai Sun
84
0
0
01 Nov 2023
Language Prompt for Autonomous Driving
Dongming Wu
Wencheng Han
Tiancai Wang
Yingfei Liu
Cheng-zhong Xu
Jianbing Shen
VLM
134
87
0
08 Sep 2023
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng
Binxin Yang
Tiankai Hang
Chen Li
Shuyang Gu
...
Jianmin Bao
Zheng Zhang
Han Hu
DongDong Chen
Baining Guo
DiffM, VLM
123
107
0
07 Sep 2023
Region Generation and Assessment Network for Occluded Person Re-Identification
Shuting He
Weihua Chen
Kai Wang
Haowen Luo
F. Wang
Wei Jiang
Henghui Ding
98
39
0
07 Sep 2023
GREC: Generalized Referring Expression Comprehension
Shuting He
Henghui Ding
Chang Liu
Xudong Jiang
ObjD
92
17
0
30 Aug 2023
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Chen Change Loy
VOS
134
117
0
16 Aug 2023
LISA: Reasoning Segmentation via Large Language Model
Xin Lai
Zhuotao Tian
Yukang Chen
Yanwei Li
Yuhui Yuan
Shu Liu
Jiaya Jia
LM&Ro, VLM, MLLM, LRM
169
463
0
01 Aug 2023
Towards Open Vocabulary Learning: A Survey
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Guohao Li
Dacheng Tao
ObjD, VLM
154
151
0
28 Jun 2023
Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation
Shuting He
Henghui Ding
Wei Jiang
VLM
138
38
0
19 Jun 2023
Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation
Chang Liu
Henghui Ding
Yulun Zhang
Xudong Jiang
107
50
0
24 May 2023
Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
Shuting He
Henghui Ding
Wei Jiang
ISeg
130
21
0
22 May 2023
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT, MedIm
170
147
0
19 Apr 2023
Towards Language-guided Visual Recognition via Dynamic Convolutions
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Yongjian Wu
Yue Gao
Rongrong Ji
ObjD
98
19
0
17 Oct 2021