ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.15807
  4. Cited By
Emu: Enhancing Image Generation Models Using Photogenic Needles in a
  Haystack

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

27 September 2023
Xiaoliang Dai
Ji Hou
Chih-Yao Ma
Sam S. Tsai
Jialiang Wang
Rui Wang
Peizhao Zhang
Simon Vandenhende
Xiaofang Wang
Abhimanyu Dubey
Matthew Yu
Abhishek Kadian
Filip Radenovic
D. Mahajan
Kunpeng Li
Yue Zhao
Vladan Petrovic
Mitesh Kumar Singh
Simran Motwani
Yiqian Wen
Yi-Zhe Song
Roshan Sumbaly
Vignesh Ramanathan
Zijian He
Peter Vajda
Devi Parikh
    VLM
ArXivPDFHTML

Papers citing "Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack"

50 / 173 papers shown
Title
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse
  Controls to Any Diffusion Model
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Han Lin
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
DiffM
VGen
69
20
0
15 Apr 2024
Magic Clothing: Controllable Garment-Driven Image Synthesis
Magic Clothing: Controllable Garment-Driven Image Synthesis
Weifeng Chen
Tao Gu
Yuhao Xu
Chengcai Chen
48
16
0
15 Apr 2024
YaART: Yet Another ART Rendering Technology
YaART: Yet Another ART Rendering Technology
Sergey Kastryulin
Artem Konev
Alexander Shishenya
Eugene Lyapustin
Artem Khurshudov
...
Dmitrii Kornilov
Mikhail Romanov
Artem Babenko
Sergei Ovcharenko
Valentin Khrulkov
EGVM
41
1
0
08 Apr 2024
Aligning Diffusion Models by Optimizing Human Utility
Aligning Diffusion Models by Optimizing Human Utility
Shufan Li
Konstantinos Kallidromitis
Akash Gokul
Yusuke Kato
Kazuki Kozuka
107
29
0
06 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale
  Prediction
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Keyu Tian
Yi-Xin Jiang
Zehuan Yuan
Bingyue Peng
Liwei Wang
VGen
42
250
0
03 Apr 2024
On the Scalability of Diffusion-based Text-to-Image Generation
On the Scalability of Diffusion-based Text-to-Image Generation
Hao Li
Yang Zou
Ying Wang
Orchid Majumder
Yusheng Xie
R. Manmatha
Ashwin Swaminathan
Zhuowen Tu
Stefano Ermon
Stefano Soatto
64
20
0
03 Apr 2024
CosmicMan: A Text-to-Image Foundation Model for Humans
CosmicMan: A Text-to-Image Foundation Model for Humans
Shikai Li
Jianglin Fu
Kaiyuan Liu
Wentao Wang
Kwan-Yee Lin
Wayne Wu
DiffM
40
19
0
01 Apr 2024
TextCraftor: Your Text Encoder Can be Image Quality Controller
TextCraftor: Your Text Encoder Can be Image Quality Controller
Yanyu Li
Xian Liu
Anil Kag
Ju Hu
Yerlan Idelbayev
Dhritiman Sagar
Yanzhi Wang
Sergey Tulyakov
Jian Ren
45
15
0
27 Mar 2024
InstructBrush: Learning Attention-based Instruction Optimization for
  Image Editing
InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Ruoyu Zhao
Qingnan Fan
Fei Kou
Shuai Qin
Hong Gu
Wei Wu
Pengcheng Xu
Mingrui Zhu
Nannan Wang
Xinbo Gao
38
4
0
27 Mar 2024
Garment3DGen: 3D Garment Stylization and Texture Generation
Garment3DGen: 3D Garment Stylization and Texture Generation
N. Sarafianos
Tuur Stuyck
Xiaoyu Xiang
Yilei Li
Jovan Popovic
Rakesh Ranjan
3DH
110
17
0
27 Mar 2024
Improving Text-to-Image Consistency via Automatic Prompt Optimization
Improving Text-to-Image Consistency via Automatic Prompt Optimization
Oscar Manas
Pietro Astolfi
Melissa Hall
Candace Ross
Jack Urbanek
Adina Williams
Aishwarya Agrawal
Adriana Romero Soriano
M. Drozdzal
36
27
0
26 Mar 2024
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in
  Text-to-Image Generation
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation
Jingkun An
Yinghao Zhu
Zongjian Li
Haoran Feng
Bohua Chen
Yemin Shi
Chengwei Pan
43
2
0
20 Mar 2024
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion
  Models
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Junlin Han
Filippos Kokkinos
Philip Torr
VGen
80
40
0
18 Mar 2024
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion
  Distillation
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Axel Sauer
Frederic Boesel
Tim Dockhorn
A. Blattmann
Patrick Esser
Robin Rombach
DiffM
46
109
0
18 Mar 2024
Recent Advances in 3D Gaussian Splatting
Recent Advances in 3D Gaussian Splatting
Tong Wu
Yu-Jie Yuan
Ling-Xiao Zhang
Jie Yang
Yan-Pei Cao
Ling-Qi Yan
Lin Gao
3DGS
73
85
0
17 Mar 2024
Reward Guided Latent Consistency Distillation
Reward Guided Latent Consistency Distillation
Jiachen Li
Weixi Feng
Wenhu Chen
William Y. Wang
EGVM
31
11
0
16 Mar 2024
Video Editing via Factorized Diffusion Distillation
Video Editing via Factorized Diffusion Distillation
Uriel Singer
Amit Zohar
Yuval Kirstain
Shelly Sheynin
Adam Polyak
Devi Parikh
Yaniv Taigman
DiffM
VGen
46
12
0
14 Mar 2024
PFStorer: Personalized Face Restoration and Super-Resolution
PFStorer: Personalized Face Restoration and Super-Resolution
Tuomas Varanka
Tapani Toivonen
Soumya Tripathy
Guoying Zhao
Erman Acar
DiffM
39
2
0
13 Mar 2024
DragAnything: Motion Control for Anything using Entity Representation
DragAnything: Motion Control for Anything using Entity Representation
Wejia Wu
Zhuang Li
Yuchao Gu
Rui Zhao
Yefei He
David Junhao Zhang
Mike Zheng Shou
Yan Li
Tingting Gao
Di Zhang
VGen
93
51
0
12 Mar 2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with
  Auto-Generated Data
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Jialu Li
Jaemin Cho
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
MoMe
DiffM
47
8
0
11 Mar 2024
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Xiwei Hu
Rui Wang
Yixiao Fang
Bin-Bin Fu
Pei Cheng
Gang Yu
VLM
59
72
0
08 Mar 2024
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion
Wendi Zheng
Jiayan Teng
Zhuoyi Yang
Weihan Wang
Jidong Chen
Xiaotao Gu
Yuxiao Dong
Ming Ding
Jie Tang
DiffM
32
35
0
08 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
124
1,085
0
05 Mar 2024
AtomoVideo: High Fidelity Image-to-Video Generation
AtomoVideo: High Fidelity Image-to-Video Generation
Litong Gong
Yiran Zhu
Weijie Li
Xiaoyang Kang
Biao Wang
Tiezheng Ge
Bo Zheng
DiffM
VGen
132
12
0
04 Mar 2024
DistriFusion: Distributed Parallel Inference for High-Resolution
  Diffusion Models
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Muyang Li
Tianle Cai
Jiaxin Cao
Qinsheng Zhang
Han Cai
Junjie Bai
Yangqing Jia
Ming-Yu Liu
Kai Li
Song Han
DiffM
37
41
0
29 Feb 2024
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion
  Latent Aligners
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
Yazhou Xing
Yin-Yin He
Zeyue Tian
Xintao Wang
Qifeng Chen
35
52
0
27 Feb 2024
T-HITL Effectively Addresses Problematic Associations in Image
  Generation and Maintains Overall Visual Quality
T-HITL Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality
Susan Epstein
Li Chen
Alessandro Vecchiato
Ankit Jain
14
0
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
66
86
0
27 Feb 2024
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Huizhuo Yuan
Zixiang Chen
Kaixuan Ji
Quanquan Gu
65
24
0
15 Feb 2024
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality
  3D Generation
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation
Luke Melas-Kyriazi
Iro Laina
Christian Rupprecht
Natalia Neverova
Andrea Vedaldi
Oran Gafni
Filippos Kokkinos
3DGS
32
64
0
13 Feb 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
34
23
0
13 Feb 2024
Animated Stickers: Bringing Stickers to Life with Video Diffusion
Animated Stickers: Bringing Stickers to Life with Video Diffusion
David Yan
Winnie Zhang
Luxin Zhang
Anmol Kalia
Dingkang Wang
...
Guan Pang
Ali K. Thabet
Peter Vajda
Amy Bearman
Licheng Yu
VGen
DiffM
62
2
0
08 Feb 2024
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass
  Diffusion Transformers
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers
Katherine Crowson
Stefan Andreas Baumann
Alex Birch
Tanishq Mathew Abraham
Daniel Z. Kaplan
Enrico Shippole
29
48
0
21 Jan 2024
Large-scale Reinforcement Learning for Diffusion Models
Large-scale Reinforcement Learning for Diffusion Models
Yinan Zhang
Eric Tzeng
Yilun Du
Dmitry Kislyuk
VLM
37
31
0
20 Jan 2024
Vlogger: Make Your Dream A Vlog
Vlogger: Make Your Dream A Vlog
Shaobin Zhuang
Kunchang Li
Xinyuan Chen
Yaohui Wang
Ziwei Liu
Yu Qiao
Yali Wang
VGen
DiffM
38
35
0
17 Jan 2024
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for
  Text-to-Image Generation
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Seung Hyun Lee
Yinxiao Li
Junjie Ke
Innfarn Yoo
Han Zhang
...
Junfeng He
Gang Li
Sangpil Kim
Irfan Essa
Feng Yang
EGVM
41
18
0
11 Jan 2024
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video
  Synthesis
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Feng Liang
Bichen Wu
Jialiang Wang
Licheng Yu
Kunpeng Li
...
Ishan Misra
Jia-Bin Huang
Peizhao Zhang
Peter Vajda
Diana Marculescu
VGen
DiffM
40
32
0
29 Dec 2023
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed
  Diffusion Models
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM
3DGS
45
113
0
21 Dec 2023
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis
Bichen Wu
Ching-Yao Chuang
Xiaoyan Wang
Yichen Jia
K. Krishnakumar
Tong Xiao
Feng Liang
Licheng Yu
Peter Vajda
DiffM
VGen
20
22
0
20 Dec 2023
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion
  Models
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models
Angela Castillo
Jonas Kohler
Juan C. Pérez
Juan Pablo Pérez
Albert Pumarola
Guohao Li
Pablo Arbelaez
Ali K. Thabet
32
12
0
19 Dec 2023
MaskINT: Video Editing via Interpolative Non-autoregressive Masked
  Transformers
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Haoyu Ma
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
DiffM
VGen
27
4
0
19 Dec 2023
Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion
  Models
Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
Senmao Li
Taihang Hu
Fahad Shahbaz Khan
Linxuan Li
Shiqi Yang
Yaxing Wang
Ming-Ming Cheng
Jian Yang
DiffM
34
1
0
15 Dec 2023
ControlRoom3D: Room Generation using Semantic Proxy Rooms
ControlRoom3D: Room Generation using Semantic Proxy Rooms
Jonas Schult
Sam S. Tsai
Lukas Höllein
Bichen Wu
Jialiang Wang
...
Zijian He
Peizhao Zhang
Bastian Leibe
Peter Vajda
Ji Hou
35
31
0
08 Dec 2023
GenTron: Diffusion Transformers for Image and Video Generation
GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen
Mengmeng Xu
Jiawei Ren
Yuren Cong
Sen He
Yanping Xie
Animesh Sinha
Ping Luo
Tao Xiang
Juan-Manuel Perez-Rua
VGen
39
38
0
07 Dec 2023
Context Diffusion: In-Context Aware Image Generation
Context Diffusion: In-Context Aware Image Generation
Ivona Najdenkoska
Animesh Sinha
Abhimanyu Dubey
Dhruv Mahajan
Vignesh Ramanathan
Filip Radenovic
DiffM
21
10
0
06 Dec 2023
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Felix Wimbauer
Bichen Wu
Edgar Schoenfeld
Xiaoliang Dai
Ji Hou
...
Jonas Kohler
Christian Rupprecht
Daniel Cremers
Peter Vajda
Jialiang Wang
DiffM
38
58
0
06 Dec 2023
Training on Synthetic Data Beats Real Data in Multimodal Relation
  Extraction
Training on Synthetic Data Beats Real Data in Multimodal Relation Extraction
Zilin Du
Haoxin Li
Xu Guo
Boyang Li
35
1
0
05 Dec 2023
Exploiting Diffusion Prior for Generalizable Dense Prediction
Exploiting Diffusion Prior for Generalizable Dense Prediction
Hsin-Ying Lee
Hung-Yu Tseng
Hsin-Ying Lee
Ming-Hsuan Yang
DiffM
MDE
39
18
0
30 Nov 2023
IMMA: Immunizing text-to-image Models against Malicious Adaptation
IMMA: Immunizing text-to-image Models against Malicious Adaptation
Yijia Zheng
Raymond A. Yeh
53
8
0
30 Nov 2023
Adversarial Diffusion Distillation
Adversarial Diffusion Distillation
Axel Sauer
Dominik Lorenz
A. Blattmann
Robin Rombach
138
332
0
28 Nov 2023
Previous
1234
Next