2412.07775
v5 (latest)
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
10 December 2024
Zhen Liu
Tim Z. Xiao
Weiyang Liu
Yoshua Bengio
Dinghuai Zhang
Papers citing
"Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets"
28 / 78 papers shown
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Yuntao Bai
Andy Jones
Kamal Ndousse
Amanda Askell
Anna Chen
...
Jack Clark
Sam McCandlish
C. Olah
Benjamin Mann
Jared Kaplan
307
2,632
0
12 Apr 2022
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
424
1,650
0
07 Apr 2022
Equivariant Diffusion for Molecule Generation in 3D
Emiel Hoogeboom
Victor Garcia Satorras
Clément Vignac
Max Welling
DiffM
175
628
0
31 Mar 2022
GeoDiff: a Geometric Diffusion Model for Molecular Conformation Generation
Minkai Xu
Lantao Yu
Yang Song
Chence Shi
Stefano Ermon
Jian Tang
BDL
DiffM
177
525
0
06 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
1.3K
13,290
0
04 Mar 2022
Bayesian Structure Learning with Generative Flow Networks
T. Deleu
António Góis
Chris C. Emezue
M. Rankawat
Simon Lacoste-Julien
Stefan Bauer
Yoshua Bengio
BDL
117
157
0
28 Feb 2022
Generative Flow Networks for Discrete Probabilistic Modeling
Dinghuai Zhang
Nikolay Malkin
Ziqiang Liu
Alexandra Volokhova
Aaron Courville
Yoshua Bengio
95
110
0
03 Feb 2022
Trajectory balance: Improved credit assignment in GFlowNets
Nikolay Malkin
Moksh Jain
Emmanuel Bengio
Chen Sun
Yoshua Bengio
293
187
0
31 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
652
15,868
0
20 Dec 2021
GFlowNet Foundations
Yoshua Bengio
Salem Lahlou
T. Deleu
J. E. Hu
Mo Tiwari
Emmanuel Bengio
123
241
0
17 Nov 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
849
10,659
0
17 Jun 2021
Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation
Emmanuel Bengio
Moksh Jain
Maksym Korablyov
Doina Precup
Yoshua Bengio
112
340
0
08 Jun 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
511
8,017
0
11 May 2021
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM
SyDa
591
6,609
0
26 Nov 2020
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
364
7,542
0
06 Oct 2020
Learning to summarize from human feedback
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
348
2,195
0
02 Sep 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
1.1K
18,550
0
19 Jun 2020
Chip Placement with Deep Reinforcement Learning
Azalia Mirhoseini
Anna Goldie
M. Yazgan
J. Jiang
Ebrahim M. Songhori
...
Quoc V. Le
James Laudon
Richard Ho
Roger Carpenter
J. Dean
OffRL
92
220
0
22 Apr 2020
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
615
1,776
0
18 Sep 2019
Deep Reinforcement Learning and the Deadly Triad
H. V. Hasselt
Yotam Doron
Florian Strub
Matteo Hessel
Nicolas Sonnerat
Joseph Modayil
OffRL
121
232
0
06 Dec 2018
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
140
398
0
15 Nov 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE
BDL
116
677
0
02 May 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
428
8,489
0
04 Jan 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
715
19,378
0
20 Jul 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
246
3,389
0
12 Jun 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
129
1,352
0
27 Feb 2017
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
2.1K
77,890
0
18 May 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
382
6,831
0
19 Feb 2015