Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.02643
Cited By
Segment Anything
5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Segment Anything"
50 / 4,263 papers shown
Title
SAM3D: Segment Anything in 3D Scenes
Yu-nuo Yang
Xiaoyang Wu
Tongyao He
Hengshuang Zhao
Xihui Liu
3DPC
37
86
0
06 Jun 2023
Towards Label-free Scene Understanding by Vision Foundation Models
Runnan Chen
You-Chen Liu
Lingdong Kong
Nenglun Chen
Xinge Zhu
Yuexin Ma
Tongliang Liu
Wenping Wang
VLM
35
42
0
06 Jun 2023
FAMO: Fast Adaptive Multitask Optimization
B. Liu
Yihao Feng
Peter Stone
Qian Liu
66
32
0
06 Jun 2023
Adversarial attacks and defenses in explainable artificial intelligence: A survey
Hubert Baniecki
P. Biecek
AAML
50
65
0
06 Jun 2023
Recognize Anything: A Strong Image Tagging Model
Youcai Zhang
Xinyu Huang
Jinyu Ma
Zhaoyang Li
Zhaochuan Luo
...
Tong Luo
Yaqian Li
Siyi Liu
Yandong Guo
Lei Zhang
VLM
52
230
0
06 Jun 2023
Zero-Shot 3D Shape Correspondence
Ahmed Abdelreheem
Abdelrahman Eldesokey
M. Ovsjanikov
Peter Wonka
38
24
0
05 Jun 2023
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Yang Li
Shao Zhang
Jichen Sun
Wenhao Zhang
Yali Du
Ying Wen
Xinbing Wang
Wei Pan
42
13
0
05 Jun 2023
A survey of Generative AI Applications
Roberto Gozalo-Brizuela
Eduardo C. Garrido-Merchán
3DV
MedIm
48
80
0
05 Jun 2023
Calib-Anything: Zero-training LiDAR-Camera Extrinsic Calibration Method Using Segment Anything
Zhaotong Luo
Guohang Yan
Yikang Li
VLM
36
18
0
05 Jun 2023
DAGrid: Directed Accumulator Grid
Hang Zhang
Renjiu Hu
Xiang Chen
Rongguang Wang
Jinwei Zhang
Jiahao Nick Li
MedIm
46
0
0
05 Jun 2023
Multi-View Representation is What You Need for Point-Cloud Pre-Training
Siming Yan
Chen Song
Youkang Kong
Qi-Xing Huang
3DPC
71
2
0
05 Jun 2023
Training Like a Medical Resident: Context-Prior Learning Toward Universal Medical Image Segmentation
Yunhe Gao
Zhuowei Li
Di Liu
Mu Zhou
Shaoting Zhang
Dimitris N. Metaxas
MedIm
43
12
0
04 Jun 2023
Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation
Haochen Wang
Yuchao Wang
Yujun Shen
Junsong Fan
Yuxi Wang
Zhaoxiang Zhang
UQCV
45
10
0
04 Jun 2023
3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW
Shijie Chang
Zeqi Hao
Ben Kang
Xiaoqi Zhao
Jiawen Zhu
Zhe Chen
Lihe Zhang
Lu Zhang
Huchuan Lu
21
1
0
04 Jun 2023
SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model
Dingyuan Zhang
Dingkang Liang
Hongcheng Yang
Zhikang Zou
Xiaoqing Ye
Zhe Liu
Xiang Bai
VLM
55
42
0
04 Jun 2023
Segment Anything Meets Semantic Communication
Shehbaz Tariq
Brian E. Arfeto
Chaoning Zhang
Hyundong Shin
VLM
24
15
0
03 Jun 2023
Understanding Segment Anything Model: SAM is Biased Towards Texture Rather than Shape
Chaoning Zhang
Yu Qiao
Shehbaz Tariq
Sheng Zheng
Chenshuang Zhang
Chenghao Li
Hyundong Shin
Choong Seon Hong
VLM
45
10
0
03 Jun 2023
Unifying (Machine) Vision via Counterfactual World Modeling
Daniel M. Bear
Kevin T. Feigelis
Honglin Chen
Wanhee Lee
R. Venkatesh
Klemen Kotar
Alex Durango
Daniel L. K. Yamins
VGen
30
13
0
02 Jun 2023
Segment Anything in High Quality
Lei Ke
Mingqiao Ye
Martin Danelljan
Yifan Liu
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
VLM
61
314
0
02 Jun 2023
On the Clean Generalization and Robust Overfitting in Adversarial Training from Two Theoretical Views: Representation Complexity and Training Dynamics
Binghui Li
Yuanzhi Li
AAML
49
3
0
02 Jun 2023
White-Box Transformers via Sparse Rate Reduction
Yaodong Yu
Sam Buchanan
Druv Pai
Tianzhe Chu
Ziyang Wu
Shengbang Tong
B. Haeffele
Yi Ma
ViT
57
81
0
01 Jun 2023
AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation
Yuanwen Yue
Sabarinath Mahadevan
Jonas Schult
Francis Engelmann
Bastian Leibe
Konrad Schindler
Theodora Kontogianni
3DPC
50
29
0
01 Jun 2023
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation
Shaozhe Hao
Kai Han
Shihao Zhao
Kwan-Yee K. Wong
39
10
0
01 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
50
20
0
01 Jun 2023
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset
Jiakang Yuan
Bo Zhang
Xiangchao Yan
Tao Chen
Botian Shi
Yikang Li
Yu Qiao
3DPC
34
26
0
01 Jun 2023
DeSAM: Decoupled Segment Anything Model for Generalizable Medical Image Segmentation
Yifan Gao
W. Xia
Dingdu Hu
Wenkui Wang
Xin Gao
OOD
VLM
MedIm
29
29
0
01 Jun 2023
SAM-helps-Shadow:When Segment Anything Model meet shadow removal
Xiaofeng Zhang
Chaochen Gu
Shanying Zhu
VLM
44
11
0
01 Jun 2023
Sea Ice Extraction via Remote Sensed Imagery: Algorithms, Datasets, Applications and Challenges
Anzhu Yu
Wenjun Huang
Qing Xu
Qun Sun
Wenyue Guo
Song Ji
Bowei Wen
C. Qiu
43
3
0
01 Jun 2023
Using Visual Cropping to Enhance Fine-Detail Question Answering of BLIP-Family Models
Jiarui Zhang
Mahyar Khayatkhoei
P. Chhikara
Filip Ilievski
37
1
0
31 May 2023
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias
Zhongwei Wan
Che Liu
Mi Zhang
Jie Fu
Benyou Wang
Sibo Cheng
Lei Ma
César Quilodrán-Casas
Rossella Arcucci
65
72
0
31 May 2023
A Survey of Label-Efficient Deep Learning for 3D Point Clouds
Aoran Xiao
Xiaoqin Zhang
Ling Shao
Shijian Lu
3DPC
56
19
0
31 May 2023
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models
Sivan Doveh
Assaf Arbelle
Sivan Harary
Roei Herzig
Donghyun Kim
...
Yikang Shen
Raja Giryes
Rogerio Feris
S. Ullman
Leonid Karlinsky
VLM
CoGe
67
53
0
31 May 2023
PaintSeg: Training-free Segmentation via Painting
Xiang Li
Chung-Ching Lin
Yinpeng Chen
Zicheng Liu
Jinglu Wang
Bhiksha Raj
62
5
0
30 May 2023
AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Chuhao Jin
Wenhui Tan
Jiange Yang
Bei Liu
Ruihua Song
Limin Wang
Jianlong Fu
LM&Ro
LRM
30
24
0
30 May 2023
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction
Rui Yang
Lin Song
Yanwei Li
Sijie Zhao
Yixiao Ge
Xiu Li
Ying Shan
SyDa
MLLM
41
212
0
30 May 2023
LayerDiffusion: Layered Controlled Image Editing with Diffusion Models
Pengzhi Li
Qinxuan Huang
Yikang Ding
Zhiheng Li
DiffM
41
36
0
30 May 2023
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang
Wei Li
Jun Han
Kaiyang Zhou
Chen Change Loy
ObjD
VLM
MLLM
61
82
0
29 May 2023
Pix2Repair: Implicit Shape Restoration from Images
Xinchao Song
N. Lamb
Sean Banerjee
N. Banerjee
3DV
36
0
0
29 May 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
58
91
0
29 May 2023
Explicit Visual Prompting for Universal Foreground Segmentations
Weihuang Liu
Xi Shen
Chi-Man Pun
Xiaodong Cun
VPVLM
VLM
46
14
0
29 May 2023
GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions
Tao Wang
Kaihao Zhang
Ziqian Shao
Wenhan Luo
B. Stenger
Tong Lu
Tae-Kyun Kim
Wei Liu
Hongdong Li
ViT
39
31
0
29 May 2023
AIMS: All-Inclusive Multi-Level Segmentation
Lu Qi
Jason Kuen
Weidong Guo
Jiuxiang Gu
Zhe Lin
Bo Du
Yu-Syuan Xu
Ming-Hsuan Yang
VLM
39
6
0
28 May 2023
Text-to-image Editing by Image Information Removal
Zhongping Zhang
Jian Zheng
Jacob Zhiyuan Fang
Bryan A. Plummer
DiffM
42
12
0
27 May 2023
VoxDet: Voxel Learning for Novel Instance Detection
Bowen Li
Jiashun Wang
Yaoyu Hu
Chen Wang
Sebastian Scherer
52
6
0
26 May 2023
Building One-class Detector for Anything: Open-vocabulary Zero-shot OOD Detection Using Text-image Models
Yunhao Ge
Jie Jessie Ren
Jiaping Zhao
Kaifeng Chen
Andrew Gallagher
Laurent Itti
Balaji Lakshminarayanan
VLM
ObjD
31
1
0
26 May 2023
Pulse shape discrimination based on the Tempotron: a powerful classifier on GPU
Haoran Liu
Peng Li
Ming-Yuan Liu
Kai-Ming Wang
Zhuo Zuo
Bingqi Liu
49
2
0
26 May 2023
OpenVIS: Open-vocabulary Video Instance Segmentation
Pinxue Guo
Tony Huang
Peiyang He
Xuefeng Liu
Tianjun Xiao
Zhaoyu Chen
Wenqiang Zhang
VLM
62
16
0
26 May 2023
Detect Any Shadow: Segment Anything for Video Shadow Detection
Yonghui Wang
Wen-gang Zhou
Yunyao Mao
Houqiang Li
VLM
30
22
0
26 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
36
33
0
25 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLM
DiffM
46
167
0
25 May 2023
Previous
1
2
3
...
80
81
82
...
84
85
86
Next