Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.05977
Cited By
v1
v2
v3
v4 (latest)
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
12 April 2023
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1412★)
Papers citing
"ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation"
50 / 271 papers shown
Title
Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
Qingming Liu
Zhen Liu
Dinghuai Zhang
Kui Jia
43
0
0
18 Jun 2025
Control and Realism: Best of Both Worlds in Layout-to-Image without Training
Bonan li
Yinhan Hu
Songhua Liu
Xinchao Wang
DiffM
48
0
0
18 Jun 2025
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Anirud Aggarwal
Abhinav Shrivastava
M. Gwilliam
56
0
0
18 Jun 2025
Fine-Grained Perturbation Guidance via Attention Head Selection
Donghoon Ahn
Jiwon Kang
Sanghyun Lee
Minjae Kim
Jaewon Min
Wooseok Jang
Saungwu Lee
Sayak Paul
S. Hong
Seungryong Kim
DiffM
AAML
127
0
0
12 Jun 2025
DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision
Xiandong Zou
Ruihao Xia
Hongsong Wang
Pan Zhou
AI4TS
64
0
0
11 Jun 2025
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration
Quan Shi
Carlos E. Jimenez
Shunyu Yao
Nick Haber
Diyi Yang
Karthik Narasimhan
47
0
0
05 Jun 2025
Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences
Yunhong Lu
Qichao Wang
H. Cao
Xiaoyin Xu
Min Zhang
53
0
0
03 Jun 2025
Adaptive Destruction Processes for Diffusion Samplers
Timofei Gritsaev
Nikita Morozov
Kirill Tamogashev
D. Tiapkin
S. Samsonov
A. Naumov
Dmitry Vetrov
Nikolay Malkin
62
0
0
02 Jun 2025
Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models
Taehoon Yoon
Yunhong Min
Kyeongmin Yeo
Minhyuk Sung
82
0
0
02 Jun 2025
Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation
Jinjin Zhang
Qiuyu Huang
Junjie Liu
Xiefan Guo
Di Huang
61
0
0
02 Jun 2025
Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control
Danfeng li
Hui Zhang
Sheng Wang
Jiacheng Li
Zuxuan Wu
DiffM
VLM
38
0
0
31 May 2025
Inference-Time Alignment of Diffusion Models with Evolutionary Algorithms
Purvish Jajal
Nick Eliopoulos
Benjamin Shiue-Hal Chou
George K. Thiruvathukal
James C. Davis
Yung-Hsiang Lu
32
0
0
30 May 2025
Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation
Yucheng Zhou
Jiahao Yuan
Qianning Wang
EGVM
34
0
0
30 May 2025
Rhetorical Text-to-Image Generation via Two-layer Diffusion Policy Optimization
Yuxi Zhang
Yueting Li
Xinyu Du
Sibo Wang
DiffM
EGVM
73
0
0
28 May 2025
D-Fusion: Direct Preference Optimization for Aligning Diffusion Models with Visually Consistent Samples
Zijing Hu
Fengda Zhang
Kun Kuang
63
1
0
28 May 2025
Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape
Ruichen Chen
Keith G. Mills
Liyao Jiang
Chao Gao
Di Niu
VGen
93
0
0
28 May 2025
SageAttention2++: A More Efficient Implementation of SageAttention2
Jintao Zhang
Xiaoming Xu
Jia Wei
Haofeng Huang
Pengle Zhang
Chendong Xiang
Jun Zhu
Jianfei Chen
MQ
VLM
89
7
0
27 May 2025
DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization
Shamil Ayupov
M. Nakhodnov
Anastasia Yaschenko
Andrey Kuznetsov
Aibek Alanov
52
0
0
27 May 2025
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Models
Dar-Yen Chen
Hmrishav Bandyopadhyay
Kai Zou
Yi-Zhe Song
54
0
0
27 May 2025
StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation
Yi Wu
Lingting Zhu
Shengju Qian
Lei Liu
Wandi Qiao
Lequan Yu
Bin Li
72
0
0
26 May 2025
Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning
Ziyi Zhang
Li Shen
Deheng Ye
Yong Luo
Huangxuan Zhao
Lefei Zhang
29
0
0
26 May 2025
TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs
Juntong Wang
Jiarui Wang
Huiyu Duan
Guangtao Zhai
Xiongkuo Min
41
1
0
26 May 2025
Step-level Reward for Free in RL-based T2I Diffusion Model Fine-tuning
Xinyao Liao
Wei Wei
Xiaoye Qu
Yu Cheng
EGVM
62
0
0
25 May 2025
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
H. Zhang
Dexiang Hong
Maoke Yang
Yutao Chen
Zhao Zhang
Jie Shao
Xinglong Wu
Zuxuan Wu
Yu Jiang
DiffM
AI4CE
181
0
0
25 May 2025
Flex-Judge: Think Once, Judge Anywhere
Jongwoo Ko
S. Kim
Sungwoo Cho
Se-Young Yun
ELM
LRM
218
0
0
24 May 2025
Rethinking Direct Preference Optimization in Diffusion Models
Junyong Kang
Seohyun Lim
Kyungjune Baek
Hyunjung Shim
780
0
0
24 May 2025
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation
Wenchao Zhang
Jiahe Tian
Runze He
Jizhong Han
Jiao Dai
Miaomiao Feng
Wei Mi
Xiaodan Zhang
113
0
0
24 May 2025
Scaling Image and Video Generation via Test-Time Evolutionary Search
Haoran He
Jiajun Liang
X. Wang
Pengfei Wan
Di Zhang
Kun Gai
Ling Pan
DiffM
244
0
0
23 May 2025
A Minimalist Method for Fine-tuning Text-to-Image Diffusion Models
Yanting Miao
William Loh
Suraj Kothawade
Pacal Poupart
45
0
0
23 May 2025
InfLVG: Reinforce Inference-Time Consistent Long Video Generation with GRPO
Xueji Fang
Liyuan Ma
Zhiyang Chen
Mingyuan Zhou
Guo-Jun Qi
VGen
234
0
0
23 May 2025
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO
Chengzhuo Tong
Ziyu Guo
Renrui Zhang
Wenyu Shan
Xinyu Wei
Zhenghao Xing
Hongsheng Li
Pheng-Ann Heng
EGVM
OffRL
LRM
116
1
0
22 May 2025
Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation
Cheng Jin
Zhenyu Xiao
Chutao Liu
Yuantao Gu
DiffM
37
2
0
21 May 2025
MMaDA: Multimodal Large Diffusion Language Models
Ling Yang
Ye Tian
Bowen Li
Xinchen Zhang
Ke Shen
Yunhai Tong
Mengdi Wang
VLM
LRM
144
6
0
21 May 2025
Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers
Sucheng Ren
Qihang Yu
Ju He
Alan Yuille
Liang-Chieh Chen
135
0
0
20 May 2025
RLVR-World: Training World Models with Reinforcement Learning
Jialong Wu
Shaofeng Yin
Ningya Feng
Mingsheng Long
OffRL
VGen
87
2
0
20 May 2025
AKRMap: Adaptive Kernel Regression for Trustworthy Visualization of Cross-Modal Embeddings
Yilin Ye
Junchao Huang
Xingchen Zeng
Jiazhi Xia
Wei Zeng
149
0
0
20 May 2025
Improving Compositional Generation with Diffusion Models Using Lift Scores
Chenning Yu
Sicun Gao
DiffM
843
0
0
19 May 2025
Seeing the Unseen: How EMoE Unveils Bias in Text-to-Image Diffusion Models
Lucas Berry
Axel Brando
Wei-Di Chang
Juan Camilo Gamboa Higuera
David Meger
DiffM
58
0
0
19 May 2025
Anti-Inpainting: A Proactive Defense against Malicious Diffusion-based Inpainters under Unknown Conditions
Yimao Guo
Zuomin Qu
Wei Lu
Xiangyang Luo
DiffM
AAML
68
0
0
19 May 2025
LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation
Jiarui Wang
Huiyu Duan
Ziheng Jia
Yu Zhao
Woo Yi Yang
...
Zhongfu Chen
Juntong Wang
Yuke Xing
Guangtao Zhai
Xiongkuo Min
VGen
84
1
0
17 May 2025
DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models
Giulia Bertazzini
Daniele Baracchi
Dasara Shullani
Isao Echizen
Alessandro Piva
129
0
0
16 May 2025
Towards Self-Improvement of Diffusion Models via Group Preference Optimization
Renjie Chen
Wenfeng Lin
Yichen Zhang
Jiangchuan Wei
Boyuan Liu
Chao Feng
Jiao Ran
Mingyu Guo
71
0
0
16 May 2025
CompAlign: Improving Compositional Text-to-Image Generation with a Complex Benchmark and Fine-Grained Feedback
Yixin Wan
Kai-Wei Chang
EGVM
CoGe
102
2
0
16 May 2025
Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models
Fu-Yun Wang
Yunhao Shui
Jingtan Piao
Keqiang Sun
Hongsheng Li
97
4
0
16 May 2025
ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization
Wenhao Shen
Wanqi Yin
Xiaofeng Yang
Cheng Chen
Chaoyue Song
Zhongang Cai
Lei Yang
Hao Wang
Guosheng Lin
147
0
0
15 May 2025
An Exploration of Default Images in Text-to-Image Generation
Hannu Simonen
Atte Kiviniemi
Jonas Oppenlaender
VLM
75
0
0
14 May 2025
DanceGRPO: Unleashing GRPO on Visual Generation
Zeyue Xue
Jie Wu
Yu Gao
Fangyuan Kong
Lingting Zhu
...
Zhiheng Liu
Wei Liu
Qiushan Guo
Weilin Huang
Ping Luo
EGVM
VGen
96
8
0
12 May 2025
Flow-GRPO: Training Flow Matching Models via Online RL
Jie Liu
Gongye Liu
Jiajun Liang
Yongqian Li
Jiaheng Liu
Xinyu Wang
Pengfei Wan
Di Zhang
Wanli Ouyang
AI4CE
227
5
0
08 May 2025
CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion
Yongqian Li
Pencheng Wan
Liang Han
Yaowei Wang
Liqiang Nie
Min Zhang
77
0
0
07 May 2025
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Yibin Wang
Zhimin Li
Yuhang Zang
Chunyu Wang
Qinglin Lu
Cheng Jin
Jinqiao Wang
LRM
144
11
0
06 May 2025
1
2
3
4
5
6
Next