Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.16249
Cited By
FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding
28 September 2023
Ana Ezquerro
Carlos Gómez-Rodríguez
Kevin Dela Rosa
Derek Hao Hu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding"
8 / 8 papers shown
Title
HoneyBee: A Scalable Modular Framework for Creating Multimodal Oncology Datasets with Foundational Embedding Models
Aakash Tripathi
Asim Waqas
Yasin Yilmaz
Ghulam Rasool
36
5
0
13 May 2024
Unicom: Universal and Compact Representation Learning for Image Retrieval
Xiang An
Jiankang Deng
Kaicheng Yang
Jaiwei Li
Ziyong Feng
Jia Guo
Jing Yang
Tongliang Liu
VLM
SSL
37
26
0
12 Apr 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
272
4,244
0
30 Jan 2023
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
392
4,137
0
28 Jan 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,443
0
11 Nov 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
317
5,785
0
29 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
304
3,708
0
11 Feb 2021
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
L. V. D. van der Maaten
Kilian Q. Weinberger
PINN
3DV
270
36,371
0
25 Aug 2016
1