Title |
---|
![]() EvolveDirector: Approaching Advanced Text-to-Image Generation with Large
Vision-Language Models Rui Zhao Hangjie Yuan Yujie Wei Shiwei Zhang Yuchao Gu ...Xiang Wang Zhangjie Wu Junhao Zhang Yingya Zhang Mike Zheng Shou |
![]() Revisit Large-Scale Image-Caption Data in Pre-training Multimodal
Foundation Models Zhengfeng Lai Vasileios Saveris Chen Chen Hong-You Chen Haotian Zhang ...Wenze Hu Zhe Gan Peter Grasch Meng Cao Yinfei Yang |
![]() Diffusion & Adversarial Schr\"odinger Bridges via Iterative Proportional Markovian Fitting Sergei Kholkin Grigoriy Ksenofontov David Li Nikita Kornilov Nikita Gushchin Alexandra Suvorikova Alexey Kroshnin Evgeny Burnaev Alexander Korotin |
![]() ControlAR: Controllable Image Generation with Autoregressive Models Zongming Li Tianheng Cheng Shoufa Chen Peize Sun Haocheng Shen Longjin Ran Xiaoxin Chen Wenyu Liu Xinggang Wang |
![]() Emu3: Next-Token Prediction is All You Need Xinlong Wang Xiaosong Zhang Zhengxiong Luo Quan-Sen Sun Yufeng Cui ...Xi Yang Jingjing Liu Yonghua Lin Tiejun Huang Zhongyuan Wang |