Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.02643
Cited By
Segment Anything
5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Segment Anything"
50 / 1,373 papers shown
Title
PKU-GoodsAD: A Supermarket Goods Dataset for Unsupervised Anomaly Detection and Segmentation
Jian Zhang
Runwei Ding
Miaoju Ban
Linhui Dai
110
16
0
11 Jul 2023
Large AI Model-Based Semantic Communications
Feibo Jiang
Yubo Peng
Li Dong
Kezhi Wang
Kun Yang
Cunhua Pan
Xiaohu You
108
54
0
07 Jul 2023
A Survey of Deep Learning in Sports Applications: Perception, Comprehension, and Decision
Zhonghan Zhao
Wenhao Chai
Shengyu Hao
Wenhao Hu
Guanhong Wang
Shidong Cao
Min-Gyoo Song
Lei Li
Gaoang Wang
138
18
0
07 Jul 2023
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
Chunhui Zhang
Xin Sun
Li Liu
Yiqian Yang
Qiong Liu
Xiaoping Zhou
Yanfeng Wang
221
17
0
07 Jul 2023
PSDR-Room: Single Photo to Scene using Differentiable Rendering
Kai Yan
Fujun Luan
MiloŠ HaŠAn
Thibault Groueix
Valentin Deschaintre
Shuang Zhao
69
18
0
06 Jul 2023
MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake Detection
Ruiyang Xia
Decheng Liu
Jie Li
Lin Yuan
N. Wang
Xinbo Gao
73
21
0
06 Jul 2023
AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images
Ao Cheng
Guoqiang Zhao
Lirong Wang
Ruobing Zhang
54
3
0
05 Jul 2023
Embodied Task Planning with Large Language Models
Zhenyu Wu
Ziwei Wang
Xiuwei Xu
Jiwen Lu
Haibin Yan
LM&Ro
LLMAG
81
76
0
04 Jul 2023
Cross-modality Attention Adapter: A Glioma Segmentation Fine-tuning Method for SAM Using Multimodal Brain MR Images
Xiaoyu Shi
Shurong Chai
Yinhao Li
Jingliang Cheng
J. Bai
Guohua Zhao
Yen-Wei Chen
71
7
0
03 Jul 2023
Real-time Vision-based Navigation for a Robot in an Indoor Environment
Sagar Manglani
95
4
0
02 Jul 2023
RH20T: A Comprehensive Robotic Dataset for Learning Diverse Skills in One-Shot
Haoshu Fang
Hongjie Fang
Zhenyu Tang
Jirong Liu
Chenxi Wang
Junbo Wang
Haoyi Zhu
Cewu Lu
137
79
0
02 Jul 2023
Image Background Serves as Good Proxy for Out-of-distribution Data
Sen Pei
85
2
0
02 Jul 2023
What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation
Benedikt Blumenstiel
Johannes Jakubik
Hilde Kuhne
Michael Vossing
VLM
137
18
0
27 Jun 2023
CellViT: Vision Transformers for Precise Cell Segmentation and Classification
Fabian Horst
Moritz Rempe
Lukas Heine
C. Seibold
J. Keyl
...
S. Ugurel
J. Siveke
Barbara Grünwald
Jan Egger
Jens Kleesiek
MedIm
ViT
87
112
0
27 Jun 2023
When Foundation Model Meets Federated Learning: Motivations, Challenges, and Future Directions
Weiming Zhuang
Chen Chen
Lingjuan Lyu
Chong Chen
Yaochu Jin
Lingjuan Lyu
AIFin
AI4CE
254
99
0
27 Jun 2023
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
Ayca Takmaz
Elisabetta Fedele
R. Sumner
Marc Pollefeys
F. Tombari
Francis Engelmann
ISeg
VLM
104
173
0
23 Jun 2023
Comparative Analysis of Segment Anything Model and U-Net for Breast Tumor Detection in Ultrasound and Mammography Images
Mohsen Ahmadi
Masoumeh Farhadi Nia
Sara Asgarian
Kasra Danesh
Elyas Irankhah
Ahmad Gholizadeh Lonbar
Abbas Sharifi
128
8
0
21 Jun 2023
One-shot Imitation Learning via Interaction Warping
Ondrej Biza
Skye Thompson
Kishore Reddy Pagidi
Abhinav Kumar
Elise van der Pol
Robin Walters
Thomas Kipf
Jan-Willem van de Meent
Lawson L. S. Wong
Robert Platt
119
14
0
21 Jun 2023
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching
D. M. Nguyen
Hoang Nguyen
Nghiem Tuong Diep
Tan Ngoc Pham
T. Cao
...
Nhat Ho
Shadi Albarqouni
P. Xie
Daniel Sonntag
Mathias Niepert
VLM
MedIm
153
55
0
20 Jun 2023
HomeRobot: Open-Vocabulary Mobile Manipulation
Sriram Yenamandra
A. Ramachandran
Karmesh Yadav
Austin S. Wang
Mukul Khanna
...
Devendra Singh Chaplot
Dhruv Batra
Roozbeh Mottaghi
Yonatan Bisk
Chris Paxton
LM&Ro
149
83
0
20 Jun 2023
Zero-Shot Anomaly Detection with Pre-trained Segmentation Models
Matthew Baugh
James Batten
Johanna P. Müller
Bernhard Kainz
81
6
0
15 Jun 2023
A Self-Supervised Miniature One-Shot Texture Segmentation (MOSTS) Model for Real-Time Robot Navigation and Embedded Applications
Yu Chen
Chirag Rastogi
Zheyuan Zhou
William R. Norris
134
4
0
15 Jun 2023
AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn
Difei Gao
Lei Ji
Luowei Zhou
Kevin Lin
Joya Chen
Zihan Fan
Mike Zheng Shou
MLLM
141
76
0
14 Jun 2023
Transferring Foundation Models for Generalizable Robotic Manipulation
Jiange Yang
Wenhui Tan
Chuhao Jin
Keling Yao
Bei Liu
Jianlong Fu
Ruihua Song
Gangshan Wu
Limin Wang
LM&Ro
165
9
0
09 Jun 2023
Matting Anything
Jiacheng Li
Jitesh Jain
Humphrey Shi
VLM
99
18
0
08 Jun 2023
PhenoBench -- A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain
J. Weyler
Federico Magistri
E. Marks
Yue Linn Chong
Matteo Sodano
Gianmarco Roggiolani
Nived Chebrolu
C. Stachniss
Jens Behley
128
33
0
07 Jun 2023
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Yang Li
Shao Zhang
Jichen Sun
Wenhao Zhang
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
105
17
0
05 Jun 2023
A survey of Generative AI Applications
Roberto Gozalo-Brizuela
Eduardo C. Garrido-Merchán
3DV
MedIm
108
91
0
05 Jun 2023
3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW
Shijie Chang
Zeqi Hao
Ben Kang
Xiaoqi Zhao
Jiawen Zhu
Zhe Chen
Lihe Zhang
Lu Zhang
Huchuan Lu
66
1
0
04 Jun 2023
Segment Anything Meets Semantic Communication
Shehbaz Tariq
Brian E. Arfeto
Chaoning Zhang
Hyundong Shin
VLM
104
16
0
03 Jun 2023
Unifying (Machine) Vision via Counterfactual World Modeling
Daniel M. Bear
Kevin T. Feigelis
Honglin Chen
Wanhee Lee
R. Venkatesh
Klemen Kotar
Alex Durango
Daniel L. K. Yamins
VGen
77
14
0
02 Jun 2023
Segment Anything in High Quality
Lei Ke
Mingqiao Ye
Martin Danelljan
Yifan Liu
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
VLM
169
342
0
02 Jun 2023
AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation
Yuanwen Yue
Sabarinath Mahadevan
Jonas Schult
Francis Engelmann
Bastian Leibe
Konrad Schindler
Theodora Kontogianni
3DPC
127
31
0
01 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
92
21
0
01 Jun 2023
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset
Jiakang Yuan
Bo Zhang
Xiangchao Yan
Tao Chen
Botian Shi
Yikang Li
Yu Qiao
3DPC
127
27
0
01 Jun 2023
DeSAM: Decoupled Segment Anything Model for Generalizable Medical Image Segmentation
Yifan Gao
W. Xia
Dingdu Hu
Wenkui Wang
Xin Gao
OOD
VLM
MedIm
104
37
0
01 Jun 2023
Pix2Repair: Implicit Shape Restoration from Images
Xinchao Song
N. Lamb
Sean Banerjee
N. Banerjee
3DV
87
0
0
29 May 2023
GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions
Tao Wang
Kaihao Zhang
Ziqian Shao
Wenhan Luo
B. Stenger
Tong Lu
Tae-Kyun Kim
Wei Liu
Hongdong Li
ViT
112
43
0
29 May 2023
AIMS: All-Inclusive Multi-Level Segmentation
Lu Qi
Jason Kuen
Weidong Guo
Jiuxiang Gu
Zhe Lin
Bo Du
Yu-Syuan Xu
Ming-Hsuan Yang
VLM
108
6
0
28 May 2023
Building One-class Detector for Anything: Open-vocabulary Zero-shot OOD Detection Using Text-image Models
Yunhao Ge
Jie Jessie Ren
Jiaping Zhao
Kaifeng Chen
Andrew Gallagher
Laurent Itti
Balaji Lakshminarayanan
VLM
ObjD
64
1
0
26 May 2023
Pulse shape discrimination based on the Tempotron: a powerful classifier on GPU
Haoran Liu
Peng Li
Ming-Yuan Liu
Kai-Ming Wang
Zhuo Zuo
Bingqi Liu
92
2
0
26 May 2023
Detect Any Shadow: Segment Anything for Video Shadow Detection
Yonghui Wang
Wen-gang Zhou
Yunyao Mao
Houqiang Li
VLM
98
24
0
26 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
128
37
0
25 May 2023
Imitating Task and Motion Planning with Visuomotor Transformers
Murtaza Dalal
Ajay Mandlekar
Caelan Reed Garrett
Ankur Handa
Ruslan Salakhutdinov
Dieter Fox
183
57
0
25 May 2023
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
Xingqian Xu
Jiayi Guo
Zhangyang Wang
Gao Huang
Irfan Essa
Humphrey Shi
VLM
DiffM
127
62
0
25 May 2023
On the Robustness of Segment Anything
Yihao Huang
Yue Cao
Tianlin Li
Felix Juefei Xu
Di Lin
Ivor W.Tsang
Yang Liu
Qing Guo
AAML
VLM
101
27
0
25 May 2023
ChatCAD+: Towards a Universal and Reliable Interactive CAD using LLMs
Zihao Zhao
Sheng Wang
Jinchen Gu
Yitao Zhu
Lanzhuju Mei
Zixu Zhuang
Zhiming Cui
Qian Wang
Dinggang Shen
LM&MA
122
43
0
25 May 2023
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang
Zekang Chen
Chen Chen
Jiancang Ma
H. Lu
Xiaodong Lin
DiffM
115
74
0
23 May 2023
VDD: Varied Drone Dataset for Semantic Segmentation
Wenxiao Cai
Ke Jin
Jinyan Hou
Cong Guo
Letian Wu
Wankou Yang
90
13
0
23 May 2023
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
Yang Liu
Muzhi Zhu
Hengtao Li
Hao Chen
Xinlong Wang
Chunhua Shen
VLM
MLLM
188
90
0
22 May 2023
Previous
1
2
3
...
25
26
27
28
Next