Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.02841
Cited By
v1
v2 (latest)
Boltzmann Attention Sampling for Image Analysis with Small Objects
4 March 2025
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Boltzmann Attention Sampling for Image Analysis with Small Objects"
26 / 26 papers shown
Title
Medical SAM 2: Segment medical images as video via Segment Anything Model 2
Jiayuan Zhu
Yunli Qi
A. El Abbadi
VLM
MedIm
120
82
0
01 Aug 2024
SAM 2: Segment Anything in Images and Videos
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM
MLLM
172
948
0
01 Aug 2024
ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image
Hallee E. Wong
Marianne Rakic
John Guttag
Adrian V. Dalca
78
21
0
12 Dec 2023
Window Attention is Bugged: How not to Interpolate Position Embeddings
Daniel Bolya
Chaitanya K. Ryali
Judy Hoffman
Christoph Feichtenhofer
87
11
0
09 Nov 2023
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Chaitanya K. Ryali
Yuan-Ting Hu
Daniel Bolya
Chen Wei
Haoqi Fan
...
Omid Poursaeed
Judy Hoffman
Jitendra Malik
Yanghao Li
Christoph Feichtenhofer
3DH
128
188
0
01 Jun 2023
Segment Everything Everywhere All at Once
Xueyan Zou
Jianwei Yang
Hao Zhang
Feng Li
Linjie Li
Jianfeng Wang
Lijuan Wang
Jianfeng Gao
Yong Jae Lee
MLLM
VLM
122
493
0
13 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
474
7,476
0
05 Apr 2023
MP-Former: Mask-Piloted Transformer for Image Segmentation
Hao Zhang
Feng Li
Hu-Sheng Xu
Shijia Huang
Siyi Liu
L. Ni
Lei Zhang
ViT
MedIm
113
60
0
13 Mar 2023
Generalized Decoding for Pixel, Image, and Language
Xueyan Zou
Zi-Yi Dou
Jianwei Yang
Zhe Gan
Linjie Li
...
Lu Yuan
Nanyun Peng
Lijuan Wang
Yong Jae Lee
Jianfeng Gao
VLM
MLLM
ObjD
124
259
0
21 Dec 2022
AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation
Yuanfeng Ji
Haotian Bai
Jie Yang
Chongjian Ge
Ye Zhu
...
Zhuguo Li
Lingyan Zhang
Wanling Ma
Xiang Wan
Ping Luo
OOD
106
324
0
16 Jun 2022
Unified Contrastive Learning in Image-Text-Label Space
Jianwei Yang
Chunyuan Li
Pengchuan Zhang
Bin Xiao
Ce Liu
Lu Yuan
Jianfeng Gao
VLM
SSL
148
227
0
07 Apr 2022
Focal Modulation Networks
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
3DPC
109
280
0
22 Mar 2022
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
329
2,402
0
02 Dec 2021
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
636
7,877
0
11 Nov 2021
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Bowen Cheng
Alex Schwing
Alexander Kirillov
VLM
ViT
214
1,559
0
13 Jul 2021
The Medical Segmentation Decathlon
Michela Antonelli
Annika Reinke
Spyridon Bakas
Keyvan Farahani
AnnetteKopp-Schneider
...
Zhanwei Xu
Yefeng Zheng
Amber L. Simpson
Lena Maier-Hein
M. Jorge Cardoso
OOD
123
1,002
0
10 Jun 2021
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
Pei Sun
Rufeng Zhang
Yi Jiang
Tao Kong
Chenfeng Xu
...
Masayoshi Tomizuka
Lei Li
Zehuan Yuan
Changhu Wang
Ping Luo
ObjD
109
1,106
0
25 Nov 2020
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
357
5,139
0
08 Oct 2020
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
509
13,230
0
26 May 2020
Longformer: The Long-Document Transformer
Iz Beltagy
Matthew E. Peters
Arman Cohan
RALM
VLM
230
4,109
0
10 Apr 2020
Generating Long Sequences with Sparse Transformers
R. Child
Scott Gray
Alec Radford
Ilya Sutskever
142
1,925
0
23 Apr 2019
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
976
133,429
0
12 Jun 2017
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
451
27,355
0
20 Mar 2017
V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
Fausto Milletari
Nassir Navab
Seyed-Ahmad Ahmadi
356
8,762
0
15 Jun 2016
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
2.0K
77,813
0
18 May 2015
Fast R-CNN
Ross B. Girshick
ObjD
385
25,161
0
30 Apr 2015
1