2304.05977
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
12 April 2023
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
Github (1412★)
Papers citing "ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation"
50 / 271 papers shown
A Taxonomy of Loss Functions for Stochastic Optimal Control
Carles Domingo-Enrich
75
4
0
01 Oct 2024
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Chenyou Fan
Chenjia Bai
Zhao Shan
Haoran He
Yang Zhang
Zhen Wang
104
3
0
30 Sep 2024
High Quality Human Image Animation using Regional Supervision and Motion Blur Condition
Zhongcong Xu
Chaoyue Song
Guoxian Song
Jianfeng Zhang
Jun Hao Liew
...
You Xie
Linjie Luo
Guosheng Lin
Jiashi Feng
Mike Zheng Shou
DiffM
3DH
VGen
114
3
0
29 Sep 2024
ABHINAW: A method for Automatic Evaluation of Typography within AI-Generated Images
Abhinaw Jagtap
Nachiket Tapas
R. G. Brajesh
EGVM
74
0
0
18 Sep 2024
Generalizing Alignment Paradigm of Text-to-Image Generation with Preferences through f-divergence Minimization
Haoyuan Sun
Bo Xia
Yongzhe Chang
Xueqian Wang
EGVM
67
6
0
15 Sep 2024
Explore the Hallucination on Low-level Perception for MLLMs
Yinan Sun
Zicheng Zhang
H. Wu
Xiaohong Liu
Weisi Lin
Guangtao Zhai
Xiongkuo Min
82
2
0
15 Sep 2024
Constrained Diffusion Models via Dual Training
Shervin Khalafi
Dongsheng Ding
Alejandro Ribeiro
113
4
0
27 Aug 2024
Quality Assessment in the Era of Large Models: A Survey
Zicheng Zhang
Yingjie Zhou
Chunyi Li
Baixuan Zhao
Xiaohong Liu
Guangtao Zhai
103
12
0
17 Aug 2024
Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
Zhichao Zhang
Xinyue Li
Wei Sun
Jun Jia
Xiongkuo Min
...
Puyi Wang
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Guangtao Zhai
EGVM
68
5
0
31 Jul 2024
Not All Noises Are Created Equally: Diffusion Noise Selection and Optimization
Zipeng Qi
Lichen Bai
Haoyi Xiong
Zeke Xie
DiffM
120
24
0
19 Jul 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Zhaorun Chen
Yichao Du
Zichen Wen
Yiyang Zhou
Chenhang Cui
...
Jiawei Zhou
Zhuokai Zhao
Rafael Rafailov
Chelsea Finn
Huaxiu Yao
EGVM
MLLM
117
35
0
05 Jul 2024
Magic Insert: Style-Aware Drag-and-Drop
Nataniel Ruiz
Yuanzhen Li
Neal Wadhwa
Yael Pritch
Michael Rubinstein
David E. Jacobs
Shlomi Fruchter
DiffM
96
8
0
02 Jul 2024
Aligning Human Motion Generation with Human Perceptions
Haoru Wang
Wentao Zhu
Luyi Miao
Yishu Xu
Feng Gao
Qi Tian
Yizhou Wang
EGVM
135
4
0
02 Jul 2024
On Discrete Prompt Optimization for Diffusion Models
Ruochen Wang
Ting Liu
Cho-Jui Hsieh
Boqing Gong
DiffM
89
8
0
27 Jun 2024
Aligning Diffusion Models with Noise-Conditioned Perception
Alexander Gambashidze
Anton Kulikov
Yuriy Sosnin
Ilya Makarov
116
5
0
25 Jun 2024
Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation
Katherine M. Collins
Najoung Kim
Yonatan Bitton
Verena Rieser
Shayegan Omidshafiei
...
Gang Li
Adrian Weller
Junfeng He
Deepak Ramachandran
Krishnamurthy Dvijotham
EGVM
84
3
0
24 Jun 2024
EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models
Zhiyu Tan
Xiaomeng Yang
Luozheng Qin
Mengping Yang
Cheng Zhang
Hao Li
106
8
0
24 Jun 2024
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng
Yuxin Cui
Haomiao Tang
Zekun Qi
Runpei Dong
Jing Bai
Chunrui Han
Zheng Ge
Xiangyu Zhang
Shu-Tao Xia
EGVM
194
39
0
24 Jun 2024
GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation
Baiqi Li
Zhiqiu Lin
Deepak Pathak
Jiayao Li
Yixin Fei
...
Tiffany Ling
Xide Xia
Pengchuan Zhang
Graham Neubig
Deva Ramanan
EGVM
138
39
0
19 Jun 2024
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Yulai Zhao
Masatoshi Uehara
Gabriele Scalia
Tommaso Biancalani
Sergey Levine
Ehsan Hajiramezanali
AI4CE
181
7
0
17 Jun 2024
Consistency-diversity-realism Pareto fronts of conditional image generative models
Pietro Astolfi
Marlene Careil
Melissa Hall
Oscar Manas
Matthew Muckley
Jakob Verbeek
Adriana Romero Soriano
M. Drozdzal
97
13
0
14 Jun 2024
From Pixels to Prose: A Large Dataset of Dense Image Captions
Vasu Singla
Kaiyu Yue
Sukriti Paul
Reza Shirkavand
Mayuka Jayawardhana
Alireza Ganjdanesh
Heng Huang
A. Bhatele
Gowthami Somepalli
Tom Goldstein
3DV
VLM
115
27
0
14 Jun 2024
Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Miaosen Zhang
Yixuan Wei
Zhen Xing
Yifei Ma
Zuxuan Wu
...
Zheng Zhang
Qi Dai
Chong Luo
Xin Geng
Baining Guo
VLM
86
1
0
13 Jun 2024
CMC-Bench: Towards a New Paradigm of Visual Signal Compression
Chunyi Li
Xiele Wu
H. Wu
Donghui Feng
Zicheng Zhang
Guo Lu
Xiongkuo Min
Xiaohong Liu
Guangtao Zhai
Weisi Lin
VLM
78
5
0
13 Jun 2024
Batch-Instructed Gradient for Prompt Evolution: Systematic Prompt Optimization for Enhanced Text-to-Image Synthesis
Xinrui Yang
Zhuohan Wang
Anthony Hu
EGVM
73
0
0
13 Jun 2024
PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences
Daiwei Chen
Yi Chen
Aniket Rege
Ramya Korlakai Vinayak
114
23
0
12 Jun 2024
CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models
Hyungjin Chung
Jeongsol Kim
Geon Yeong Park
Hyelin Nam
Jong Chul Ye
DiffM
102
35
0
12 Jun 2024
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference
Jiwoo Hong
Sayak Paul
Noah Lee
Kashif Rasul
James Thorne
Jongheon Jeong
104
18
0
10 Jun 2024
ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
L. Eyring
Shyamgopal Karthik
Karsten Roth
Alexey Dosovitskiy
Zeynep Akata
164
28
0
06 Jun 2024
Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback
Chen Chen
Yuchen Hu
Wen Wu
Helin Wang
Chng Eng Siong
Chao Zhang
93
12
0
02 Jun 2024
Improving GFlowNets for Text-to-Image Diffusion Alignment
Dinghuai Zhang
Yizhe Zhang
Jiatao Gu
Ruixiang Zhang
J. Susskind
Navdeep Jaitly
Shuangfei Zhai
EGVM
138
10
0
02 Jun 2024
Amortizing intractable inference in diffusion models for vision, language, and control
S. Venkatraman
Moksh Jain
Luca Scimeca
Minsu Kim
Marcin Sendera
...
Alexandre Adam
Jarrid Rector-Brooks
Yoshua Bengio
Glen Berseth
Nikolay Malkin
191
32
0
31 May 2024
Information Theoretic Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Massimo Gallo
Pietro Michiardi
176
0
0
31 May 2024
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
Masatoshi Uehara
Yulai Zhao
Ehsan Hajiramezanali
Gabriele Scalia
Gökçen Eraslan
Avantika Lal
Sergey Levine
Tommaso Biancalani
133
16
0
30 May 2024
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback
Jiachen Li
Weixi Feng
Tsu-Jui Fu
Xinyi Wang
Sugato Basu
Wenhu Chen
William Y. Wang
VGen
91
34
0
29 May 2024
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control
Litu Rout
Yujia Chen
Nataniel Ruiz
Abhishek Kumar
Constantine Caramanis
Sanjay Shakkottai
Wen-Sheng Chu
DiffM
98
26
0
27 May 2024
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
C. N. Vasconcelos
Abdullah Rashwan
Austin Waters
Trevor Walker
Keyang Xu
Jimmy Yan
...
Wenlei Zhou
Kevin Swersky
David J. Fleet
Jason Baldridge
Oliver Wang
120
3
0
27 May 2024
Ensembling Diffusion Models via Adaptive Feature Aggregation
Cong Wang
Kuan Tian
Yonghang Guan
Jun Zhang
Zhiwei Jiang
Fei Shen
Xiao Han
127
6
0
27 May 2024
Multi-Player Approaches for Dueling Bandits
Or Raveh
Junya Honda
Masashi Sugiyama
155
1
0
25 May 2024
Product Design Using Generative Adversarial Network: Incorporating Consumer Preference and External Data
Hui Li
Jian Ni
Fangzhu Yang
44
0
0
24 May 2024
Score Distillation via Reparametrized DDIM
Artem Lukoianov
Haitz Sáez de Ocáriz Borde
Kristjan Greenewald
Vitor Campagnolo Guizilini
Timur M. Bagautdinov
Vincent Sitzmann
Justin Solomon
DiffM
105
19
0
24 May 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
145
127
0
23 May 2024
A Survey On Text-to-3D Contents Generation In The Wild
Chenhan Jiang
115
5
0
15 May 2024
A Survey on Personalized Content Synthesis with Diffusion Models
Xu-Lu Zhang
Xiao Wei
Wengyu Zhang
Jinlin Wu
Jiaxin Wu
Zhaoxiang Zhang
Zhen Lei
Qing Li
EGVM
225
22
0
09 May 2024
Multi-modal Learnable Queries for Image Aesthetics Assessment
Zhiwei Xiong
Yunfan Zhang
Zhiqi Shen
Peiran Ren
Han Yu
EGVM
70
1
0
02 May 2024
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Zinan Guo
Yanze Wu
Zhuowei Chen
Lang Chen
Qian He
DiffM
100
66
0
24 Apr 2024
Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation
Xun Wu
Shaohan Huang
Furu Wei
85
10
0
23 Apr 2024
Gradient Guidance for Diffusion Models: An Optimization Perspective
Yingqing Guo
Hui Yuan
Yukang Yang
Minshuo Chen
Mengdi Wang
91
25
0
23 Apr 2024
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
Yuxi Ren
Xin Xia
Yanzuo Lu
Jiacheng Zhang
Jie Wu
Pan Xie
Xing Wang
Xuefeng Xiao
167
79
0
21 Apr 2024
ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis
Aashish Anantha Ramakrishnan
Sharon X. Huang
Dongwon Lee
95
0
0
15 Apr 2024