ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.02643
  4. Cited By
Segment Anything

Segment Anything

5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
    MLLM
    VLM
ArXivPDFHTML

Papers citing "Segment Anything"

50 / 4,267 papers shown
Title
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
Ayca Takmaz
Elisabetta Fedele
R. Sumner
Marc Pollefeys
F. Tombari
Francis Engelmann
ISeg
VLM
41
164
0
23 Jun 2023
A Survey on Multimodal Large Language Models
A Survey on Multimodal Large Language Models
Shukang Yin
Chaoyou Fu
Sirui Zhao
Ke Li
Xing Sun
Tong Xu
Enhong Chen
MLLM
LRM
64
565
0
23 Jun 2023
Differentiable Display Photometric Stereo
Differentiable Display Photometric Stereo
Seokjun Choi
S. Yoon
Giljoo Nam
Seungyong Lee
Seung-Hwan Baek
65
1
0
23 Jun 2023
Robustness of Segment Anything Model (SAM) for Autonomous Driving in
  Adverse Weather Conditions
Robustness of Segment Anything Model (SAM) for Autonomous Driving in Adverse Weather Conditions
Xinru Shan
Chaoning Zhang
VLM
33
12
0
23 Jun 2023
Ladder Fine-tuning approach for SAM integrating complementary network
Ladder Fine-tuning approach for SAM integrating complementary network
Shurong Chai
R. Jain
Shiyu Teng
Jiaqing Liu
Yinhao Li
T. Tateyama
Yen-wei Chen
MedIm
35
31
0
22 Jun 2023
Comparative Analysis of Segment Anything Model and U-Net for Breast
  Tumor Detection in Ultrasound and Mammography Images
Comparative Analysis of Segment Anything Model and U-Net for Breast Tumor Detection in Ultrasound and Mammography Images
Mohsen Ahmadi
Masoumeh Farhadi Nia
Sara Asgarian
Kasra Danesh
Elyas Irankhah
Ahmad Gholizadeh Lonbar
Abbas Sharifi
70
8
0
21 Jun 2023
One-shot Imitation Learning via Interaction Warping
One-shot Imitation Learning via Interaction Warping
Ondrej Biza
Skye Thompson
Kishore Reddy Pagidi
Abhinav Kumar
Elise van der Pol
Robin Walters
Thomas Kipf
Jan-Willem van de Meent
Lawson L. S. Wong
Robert Platt
42
13
0
21 Jun 2023
One Policy to Dress Them All: Learning to Dress People with Diverse
  Poses and Garments
One Policy to Dress Them All: Learning to Dress People with Diverse Poses and Garments
Yufei Wang
Zhanyi Sun
Zackory M. Erickson
David Held
43
25
0
21 Jun 2023
Fast Segment Anything
Fast Segment Anything
Xu Zhao
Wen-Yan Ding
Yongqi An
Yinglong Du
Tao Yu
Min Li
Ming Tang
Jinqiao Wang
MLLM
VLM
38
265
0
21 Jun 2023
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical
  Imaging via Second-order Graph Matching
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching
D. M. Nguyen
Hoang Nguyen
Nghiem Tuong Diep
Tan Ngoc Pham
T. Cao
...
Nhat Ho
Shadi Albarqouni
P. Xie
Daniel Sonntag
Mathias Niepert
VLM
MedIm
39
49
0
20 Jun 2023
Segment Anything Model (SAM) for Radiation Oncology
Segment Anything Model (SAM) for Radiation Oncology
Lian-Cheng Zhang
Zheng Liu
Lu Zhang
Zihao Wu
Xiao-Xing Yu
...
Xiang Li
Quanzheng Li
Dajiang Zhu
Tianming Liu
Wen Liu
VLM
MedIm
35
33
0
20 Jun 2023
HomeRobot: Open-Vocabulary Mobile Manipulation
HomeRobot: Open-Vocabulary Mobile Manipulation
Sriram Yenamandra
A. Ramachandran
Karmesh Yadav
Austin S. Wang
Mukul Khanna
...
Devendra Singh Chaplot
Dhruv Batra
Roozbeh Mottaghi
Yonatan Bisk
Chris Paxton
LM&Ro
65
79
0
20 Jun 2023
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing
Fan Liu
Delong Chen
Zhan-Rong Guan
Xiaocong Zhou
Jiale Zhu
Qiaolin Ye
Liyong Fu
Jun Zhou
VLM
77
201
0
19 Jun 2023
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot
  Vision-and-Language Navigation
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation
Xiwen Liang
Liang Ma
Shanshan Guo
Jianhua Han
Hang Xu
Shikui Ma
Xiaodan Liang
LM&Ro
LLMAG
114
4
0
17 Jun 2023
NBMOD: Find It and Grasp It in Noisy Background
NBMOD: Find It and Grasp It in Noisy Background
Boyuan Cao
Xinyu Zhou
Congmin Guo
Baohua Zhang
Yuchen Liu
Qianqiu Tan
71
4
0
17 Jun 2023
Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress,
  and Prospects
Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects
Kexin Zhang
Qingsong Wen
Chaoli Zhang
Rongyao Cai
Ming Jin
...
James Y. Zhang
Yuxuan Liang
Guansong Pang
Dongjin Song
Shirui Pan
AI4TS
126
103
0
16 Jun 2023
Group Orthogonalization Regularization For Vision Models Adaptation and
  Robustness
Group Orthogonalization Regularization For Vision Models Adaptation and Robustness
Yoav Kurtz
Noga Bar
Raja Giryes
37
0
0
16 Jun 2023
LabelBench: A Comprehensive Framework for Benchmarking Adaptive
  Label-Efficient Learning
LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning
Jifan Zhang
Yifang Chen
Gregory H. Canal
Stephen Mussmann
Arnav M. Das
...
Yinglun Zhu
Jeffrey Bilmes
S. Du
Kevin Jamieson
Robert D. Nowak
VLM
57
10
0
16 Jun 2023
The Big Data Myth: Using Diffusion Models for Dataset Generation to
  Train Deep Detection Models
The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models
Roy Voetman
Maya Aghaei
K. Dijkstra
DiffM
41
11
0
16 Jun 2023
Scaling Open-Vocabulary Object Detection
Scaling Open-Vocabulary Object Detection
Matthias Minderer
A. Gritsenko
N. Houlsby
VLM
ObjD
37
182
0
16 Jun 2023
Granger-Causal Hierarchical Skill Discovery
Granger-Causal Hierarchical Skill Discovery
Caleb Chuck
Kevin Black
Aditya Arjun
Yuke Zhu
S. Niekum
OffRL
65
1
0
15 Jun 2023
Seeing the World through Your Eyes
Seeing the World through Your Eyes
Hadi Alzayer
Kevin Zhang
Brandon Yushan Feng
Christopher A. Metzler
Jia-Bin Huang
CVBM
64
16
0
15 Jun 2023
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models
You-Chen Liu
Lingdong Kong
Jun Cen
Runnan Chen
Wenwei Zhang
Liang Pan
Kai-xiang Chen
Ziwei Liu
48
84
0
15 Jun 2023
Robustness Analysis on Foundational Segmentation Models
Robustness Analysis on Foundational Segmentation Models
Madeline Chantry Schiappa
Shehreen Azad
V. Sachidanand
Yunhao Ge
O. Mikšík
Yogesh S Rawat
Vibhav Vineet
OOD
VLM
AAML
35
6
0
15 Jun 2023
Zero-Shot Anomaly Detection with Pre-trained Segmentation Models
Zero-Shot Anomaly Detection with Pre-trained Segmentation Models
Matthew Baugh
James Batten
Johanna P. Müller
Bernhard Kainz
37
6
0
15 Jun 2023
Text Promptable Surgical Instrument Segmentation with Vision-Language
  Models
Text Promptable Surgical Instrument Segmentation with Vision-Language Models
Zijian Zhou
Oluwatosin O. Alabi
Meng Wei
Tom Vercauteren
Miaojing Shi
MedIm
47
24
0
15 Jun 2023
2nd Place Winning Solution for the CVPR2023 Visual Anomaly and Novelty
  Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection
2nd Place Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection
Yunkang Cao
Xiaohao Xu
Chen Sun
Y. Cheng
Liang Gao
Nong Sang
61
1
0
15 Jun 2023
Temporally-Extended Prompts Optimization for SAM in Interactive Medical
  Image Segmentation
Temporally-Extended Prompts Optimization for SAM in Interactive Medical Image Segmentation
Chuyun Shen
Wenhao Li
Ya Zhang
Xiangfeng Wang
VLM
LLMAG
MedIm
49
6
0
15 Jun 2023
RecFusion: A Binomial Diffusion Process for 1D Data for Recommendation
RecFusion: A Binomial Diffusion Process for 1D Data for Recommendation
Gabriel Bénédict
Olivier Jeunen
Samuele Papa
Samarth Bhargav
Daan Odijk
Maarten de Rijke
DiffM
47
9
0
15 Jun 2023
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to
  Enhance Visio-Linguistic Compositional Understanding
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding
Le Zhang
Rabiul Awal
Aishwarya Agrawal
CoGe
VLM
41
12
0
15 Jun 2023
A Self-Supervised Miniature One-Shot Texture Segmentation (MOSTS) Model
  for Real-Time Robot Navigation and Embedded Applications
A Self-Supervised Miniature One-Shot Texture Segmentation (MOSTS) Model for Real-Time Robot Navigation and Embedded Applications
Yu Chen
Chirag Rastogi
Zheyuan Zhou
William R. Norris
39
4
0
15 Jun 2023
EPIC Fields: Marrying 3D Geometry and Video Understanding
EPIC Fields: Marrying 3D Geometry and Video Understanding
Vadim Tschernezki
Ahmad Darkhalil
Zhifan Zhu
David Fouhey
Iro Laina
Diane Larlus
Dima Damen
Andrea Vedaldi
EgoV
58
30
0
14 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large
  Language Models
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
67
7
0
14 Jun 2023
AssistGPT: A General Multi-modal Assistant that can Plan, Execute,
  Inspect, and Learn
AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn
Difei Gao
Lei Ji
Luowei Zhou
Kevin Lin
Joya Chen
Zihan Fan
Mike Zheng Shou
MLLM
45
72
0
14 Jun 2023
TomoSAM: a 3D Slicer extension using SAM for tomography segmentation
TomoSAM: a 3D Slicer extension using SAM for tomography segmentation
Federico Semeraro
Alexandre Quintart
Sergio Izquierdo
J. Ferguson
29
6
0
14 Jun 2023
POP: Prompt Of Prompts for Continual Learning
POP: Prompt Of Prompts for Continual Learning
Zhiyuan Hu
J. Lyu
Dashan Gao
Nuno Vasconcelos
CLL
LRM
VLM
40
5
0
14 Jun 2023
Robustness of SAM: Segment Anything Under Corruptions and Beyond
Robustness of SAM: Segment Anything Under Corruptions and Beyond
Yu Qiao
Chaoning Zhang
Taegoo Kang
Donghun Kim
Chenshuang Zhang
Choong Seon Hong
AAML
33
33
0
13 Jun 2023
Scalable 3D Captioning with Pretrained Models
Scalable 3D Captioning with Pretrained Models
Tiange Luo
C. Rockwell
Honglak Lee
Justin Johnson
37
155
0
12 Jun 2023
VPUFormer: Visual Prompt Unified Transformer for Interactive Image
  Segmentation
VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation
Xu Zhang
Kailun Yang
Jiacheng Lin
Jin Yuan
Zhiyong Li
Shutao Li
29
3
0
11 Jun 2023
AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt
  Encoder
AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder
Tal Shaharabany
Aviad Dahan
Raja Giryes
Lior Wolf
MedIm
VLM
24
68
0
10 Jun 2023
Leveraging Large Language Models for Scalable Vector Graphics-Driven
  Image Understanding
Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding
Mu Cai
Zeyi Huang
Yuheng Li
Utkarsh Ojha
Haohan Wang
Yong Jae Lee
VLM
22
2
0
09 Jun 2023
Adaptive Contextual Perception: How to Generalize to New Backgrounds and
  Ambiguous Objects
Adaptive Contextual Perception: How to Generalize to New Backgrounds and Ambiguous Objects
Zhuofan Ying
Peter Hase
Joey Tianyi Zhou
57
1
0
09 Jun 2023
Single-Image-Based Deep Learning for Segmentation of Early Esophageal
  Cancer Lesions
Single-Image-Based Deep Learning for Segmentation of Early Esophageal Cancer Lesions
Haipeng Li
Di Liu
Yunzhi Zeng
Shuaicheng Liu
Tao Gan
N. Rao
Jin-lin Yang
Bing Zeng
47
6
0
09 Jun 2023
Customizing General-Purpose Foundation Models for Medical Report
  Generation
Customizing General-Purpose Foundation Models for Medical Report Generation
Bang-ju Yang
Asif Raza
Yuexian Zou
Tong Zhang
MedIm
45
11
0
09 Jun 2023
Transferring Foundation Models for Generalizable Robotic Manipulation
Transferring Foundation Models for Generalizable Robotic Manipulation
Jiange Yang
Wenhui Tan
Chuhao Jin
Keling Yao
Bei Liu
Jianlong Fu
Ruihua Song
Gangshan Wu
Limin Wang
LM&Ro
65
6
0
09 Jun 2023
Artificial General Intelligence for Medical Imaging
Artificial General Intelligence for Medical Imaging
Xiang Li
Lu Zhang
Zihao Wu
Zheng Liu
Lin Zhao
...
Pingkuan Yan
Quanzheng Li
Wen Liu
Tianming Liu
Dinggang Shen
LM&MA
AI4CE
71
40
0
08 Jun 2023
Weakly Supervised 3D Object Detection with Multi-Stage Generalization
Weakly Supervised 3D Object Detection with Multi-Stage Generalization
Jiawei He
Yu-Quan Wang
Yuntao Chen
Zhaoxiang Zhang
3DPC
50
2
0
08 Jun 2023
R-MAE: Regions Meet Masked Autoencoders
R-MAE: Regions Meet Masked Autoencoders
Duy-Kien Nguyen
Vaibhav Aggarwal
Yanghao Li
Martin R. Oswald
Alexander Kirillov
Cees G. M. Snoek
Xinlei Chen
TPM
55
11
0
08 Jun 2023
Matting Anything
Matting Anything
Jiacheng Li
Jitesh Jain
Humphrey Shi
VLM
49
16
0
08 Jun 2023
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data
  Generation
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation
Kai Chen
Enze Xie
Zhe Chen
Yibo Wang
Lanqing Hong
Zhenguo Li
Dit-Yan Yeung
DiffM
30
21
0
07 Jun 2023
Previous
123...798081...848586
Next