2408.10605
MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
20 August 2024
Yanbo Ding
Shaobin Zhuang
Kunchang Li
Zhengrong Yue
Yu Qiao
Yali Wang
VGen
Papers citing "MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration"
12 / 12 papers shown
Title
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Zhe Chen
Weiyun Wang
Hao Tian
Shenglong Ye
Zhangwei Gao
...
Tong Lu
Dahua Lin
Yu Qiao
Jifeng Dai
Wenhai Wang
MLLM
VLM
133
642
0
25 Apr 2024
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Zhengqing Yuan
Ruoxi Chen
Zhaoxu Li
Haolong Jia
Lifang He
Chi Wang
Lichao Sun
VGen
95
28
0
20 Mar 2024
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Ling Yang
Zhaochen Yu
Chenlin Meng
Minkai Xu
Stefano Ermon
Tengjiao Wang
CoGe
DiffM
105
137
0
22 Jan 2024
DiffusionGPT: LLM-Driven Text-to-Image Generation System
Jie Qin
Jie Wu
Weifeng Chen
Yuxi Ren
Huixian Li
Hefeng Wu
Xuefeng Xiao
Rui Wang
S. Wen
DiffM
89
34
0
18 Jan 2024
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLM
BDL
VLM
CLIP
557
4,413
0
28 Jan 2022
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
1.0K
29,926
0
26 Feb 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
420
5,000
0
24 Feb 2021
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
898
42,463
0
28 May 2020
Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning
Praveen Palanisamy
86
145
0
11 Nov 2019
Generative Adversarial Networks: An Overview
Antonia Creswell
Tom White
Vincent Dumoulin
Kai Arulkumaran
B. Sengupta
Anil A Bharath
GAN
128
3,064
0
19 Oct 2017
Controllable Generative Adversarial Network
Minhyeok Lee
Junhee Seok
GAN
53
77
0
02 Aug 2017
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDa
DiffM
312
7,031
0
12 Mar 2015