
v1v2 (latest)
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
Oğuzhan Fatih Kar
Mingfei Gao
Papers citing "4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities"
50 / 57 papers shown
Title |
---|
![]() PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions Weifeng Lin Xinyu Wei Renrui Zhang Le Zhuo Shitian Zhao ...Junlin Xie Junlin Xie Yu Qiao Peng Gao Hongsheng Li |
![]() GAIA-1: A Generative World Model for Autonomous Driving Masane Fuchi Lloyd Russell Hudson Yeo Zak Murez Hiroto Minami Alex Kendall Tomohiro Takagi Gianluca Corrado |
![]() StyleDrop: Text-to-Image Generation in Any Style Kihyuk Sohn Nataniel Ruiz Kimin Lee Daniel Castro Chin Irina Blok ...Yuanzhen Li Yuan Hao Irfan Essa Michael Rubinstein Dilip Krishnan |