Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.10305
Cited By
Analyzing Multimodal Objectives Through the Lens of Generative Diffusion Guidance
10 February 2023
Chaerin Kong
Nojun Kwak
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Analyzing Multimodal Objectives Through the Lens of Generative Diffusion Guidance"
3 / 3 papers shown
Title
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLM
BDL
VLM
CLIP
392
4,171
0
28 Jan 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,805
0
24 Feb 2021
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
307
39,238
0
01 Sep 2014
1