ResearchTrend.AI
Segment Anything in High Quality

2 June 2023
Lei Ke
Mingqiao Ye
Martin Danelljan
Yifan Liu
Yu-Wing Tai
Chi-Keung Tang
Fisher Yu
    VLM
ArXiv (abs) | PDF | HTML | GitHub (3956★)

Papers citing "Segment Anything in High Quality"

50 / 214 papers shown
Robust Box Prompt based SAM for Medical Image Segmentation
Yuhao Huang
Xin Yang
Han Zhou
Yan Cao
Haoran Dou
Fajin Dong
Dong Ni
VLM, MedIm
96
5
0
31 Jul 2024
GP-VLS: A general-purpose vision language model for surgery
Samuel Schmidgall
Joseph Cho
C. Zakka
W. Hiesinger
LM&MA
149
6
0
27 Jul 2024
General Geometry-aware Weakly Supervised 3D Object Detection
Guowen Zhang
Junsong Fan
Liyi Chen
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
3DPC
116
2
0
18 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM, VGen
206
17
0
17 Jul 2024
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes
Zhi Cai
Yingjie Gao
Yaoyan Zheng
Nan Zhou
Di Huang
VLM
99
6
0
16 Jul 2024
WSESeg: Introducing a Dataset for the Segmentation of Winter Sports Equipment with a Baseline for Interactive Segmentation
Robin Schon
Daniel Kienzle
Rainer Lienhart
VLM
69
1
0
12 Jul 2024
IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection
Mingjin Zhang
Yuchun Wang
Jie-Ru Guo
Yunsong Li
Xinbo Gao
Jing Zhang
VLM
104
29
0
10 Jul 2024
Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation
Zihan Gao
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Yuwei Guo
Shuyuan Yang
49
1
0
01 Jul 2024
OccFusion: Rendering Occluded Humans with Generative Diffusion Priors
Adam Sun
Tiange Xiang
Scott Delp
Li Fei-Fei
Ehsan Adeli
111
2
0
29 Jun 2024
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Yuxuan Zhang
Tianheng Cheng
Lianghui Zhu
Lei Liu
Heng Liu
Longjin Ran
Xiaoxin Chen
Wenyu Liu
Xinggang Wang
VLM
205
31
0
28 Jun 2024
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Haobo Yuan
Xiangtai Li
Lu Qi
Tao Zhang
Ming-Hsuan Yang
Shuicheng Yan
Chen Change Loy
VLM
120
10
0
27 Jun 2024
PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Henghui Ding
Chang Liu
Yunchao Wei
Nikhila Ravi
Shuting He
...
Bo Zhao
Jing Liu
Feiyu Pan
Hao Fang
Xiankai Lu
123
8
0
24 Jun 2024
2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation
Bin Cao
Yisi Zhang
Xuanxu Lin
Xingjian He
Bo Zhao
Jing Liu
135
2
0
20 Jun 2024
Imagination Policy: Using Generative Point Cloud Models for Learning Manipulation Policies
Haojie Huang
Karl Schmeckpeper
Dian Wang
Ondrej Biza
Yaoyao Qian
Haotian Liu
Mingxi Jia
Robert Platt
Robin Walters
VGen, LM&Ro
99
8
0
17 Jun 2024
RobustSAM: Segment Anything Robustly on Degraded Images
Wei-Ting Chen
Yu-Jiet Vong
Sy-Yen Kuo
Sizhuo Ma
Jian Wang
VLM
100
11
0
13 Jun 2024
Training-Free Robust Interactive Video Object Segmentation
Xiaoli Wei
Zhaoqing Wang
Yandong Guo
Chunxia Zhang
Tongliang Liu
Mingming Gong
VLM, VOS
86
1
0
08 Jun 2024
Matching Anything by Segmenting Anything
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
109
27
0
06 Jun 2024
Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling
Xinhang Liu
Yu-Wing Tai
Chi-Keung Tang
Pedro Miraldo
Suhas Lohit
Moitreya Chatterjee
3DGS
181
5
0
06 Jun 2024
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
An-Chieh Cheng
Hongxu Yin
Yang Fu
Qiushan Guo
Ruihan Yang
Jan Kautz
Xiaolong Wang
Sifei Liu
LRM
122
75
0
03 Jun 2024
Hyper-Transformer for Amodal Completion
Jianxiong Gao
Xuelin Qian
Longfei Liang
Junwei Han
Yanwei Fu
ViT
90
1
0
30 May 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM, ISeg
157
5
0
28 May 2024
How Much You Ate? Food Portion Estimation on Spoons
Aaryam Sharma
Chris Czarnecki
Yuhao Chen
Pengcheng Xi
Linlin Xu
Alexander Wong
98
1
0
12 May 2024
Structured Click Control in Transformer-based Interactive Segmentation
Long Xu
Yong-Xiang Chen
Rui Huang
Feng Wu
Shiwu Lai
74
0
0
07 May 2024
PTQ4SAM: Post-Training Quantization for Segment Anything
Chengtao Lv
Hong Chen
Jinyang Guo
Yifu Ding
Xianglong Liu
VLM, MQ
87
16
0
06 May 2024
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
Wen-Hsuan Chu
Lei Ke
Katerina Fragkiadaki
3DGS, VGen
109
33
0
03 May 2024
MoPEFT: A Mixture-of-PEFTs for the Segment Anything Model
Rajat Sahay
Andreas E. Savakis
MoE
94
0
0
01 May 2024
ASAM: Boosting Segment Anything Model with Adversarial Tuning
Bo Li
Haoke Xiao
Lv Tang
116
11
0
01 May 2024
PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
Zhangjing Yang
Dun Liu
Wensheng Cheng
Jinqiao Wang
Yi Wu
VLM
67
2
0
22 Apr 2024
Interpreting COVID Lateral Flow Tests' Results with Foundation Models
Stuti Pandey
Josh Myers-Dean
Jarek Reynolds
Danna Gurari
59
0
0
21 Apr 2024
Learning from Unlabelled Data with Transformers: Domain Adaptation for Semantic Segmentation of High Resolution Aerial Images
Nikolaos Dionelis
Francesco Pro
Luca Maiano
Irene Amerini
B. L. Saux
111
4
0
17 Apr 2024
VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection
Bonan Ding
Jin Xie
Jing Nie
Jiale Cao
114
2
0
15 Apr 2024
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
Junchi Wang
Lei Ke
MLLM, LRM, VLM
85
29
0
12 Apr 2024
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
102
17
0
12 Apr 2024
Adapting the Segment Anything Model During Usage in Novel Situations
Robin Schon
Julian Lorenz
K. Ludwig
Rainer Lienhart
VLM
77
7
0
12 Apr 2024
Practical Region-level Attack against Segment Anything Models
Yifan Shen
Zhengyuan Li
Gang Wang
VLM
78
10
0
12 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
128
2
0
06 Apr 2024
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Fei Chen
Jingyu Sun
Gerasimos Lampouras
Ignacio Iacobacci
Sarah Parisot
95
12
0
03 Apr 2024
Unsegment Anything by Simulating Deformation
Jiahao Lu
Xingyi Yang
Xinchao Wang
106
4
0
03 Apr 2024
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts
Qin Liu
Jaemin Cho
Mohit Bansal
Marc Niethammer
VLM
103
12
0
31 Mar 2024
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias
Sang-Kee Jo
Soohyun Ryu
Sungyub Kim
Eunho Yang
Kyungsu Kim
107
2
0
30 Mar 2024
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation
Sang-Kee Jo
Fei Pan
In-Jae Yu
Kyungsu Kim
111
2
0
30 Mar 2024
Efficient 3D Instance Mapping and Localization with Neural Fields
George Tang
Krishna Murthy Jatavallabhula
Antonio Torralba
ISeg
96
5
0
28 Mar 2024
SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects
Avinash Ummadisingu
Jongkeum Choi
Koki Yamane
Shimpei Masuda
Naoki Fukaya
Kuniyuki Takahashi
84
3
0
28 Mar 2024
Annolid: Annotate, Segment, and Track Anything You Need
Chen Yang
Thomas A. Cleland
VOS
55
2
0
27 Mar 2024
A Roadmap Towards Automated and Regulated Robotic Systems
Yihao Liu
Mehran Armand
92
2
0
21 Mar 2024
Opti-Acoustic Semantic SLAM with Unknown Objects in Underwater Environments
Kurran Singh
Jungseok Hong
Nick Rypkema
John J. Leonard
82
2
0
19 Mar 2024
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Siyuan Huang
Iaroslav Ponomarenko
Zhengkai Jiang
Xiaoqi Li
Xiaobin Hu
Peng Gao
Hongsheng Li
Hao Dong
LM&Ro
120
21
0
17 Mar 2024
WSI-SAM: Multi-resolution Segment Anything Model (SAM) for histopathology whole-slide images
Hong Liu
Haosen Yang
P. Diest
J. Pluim
M. Veta
VLM
103
8
0
14 Mar 2024
Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything
Zijian Wu
Adam Schmidt
Peter Kazanzides
Septimiu E. Salcudean
67
2
0
12 Mar 2024
Reframe Anything: LLM Agent for Open World Video Reframing
Jiawang Cao
Yongliang Wu
Weiheng Chi
Wenbo Zhu
Ziyue Su
Jay Wu
85
4
0
10 Mar 2024