ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.07775
  4. Cited By
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
v1v2v3v4v5 (latest)

Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets

10 December 2024
Zhen Liu
Tim Z. Xiao
Weiyang Liu
Yoshua Bengio
Dinghuai Zhang
ArXiv (abs)PDFHTML

Papers citing "Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets"

28 / 78 papers shown
Title
Training a Helpful and Harmless Assistant with Reinforcement Learning
  from Human Feedback
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Yuntao Bai
Andy Jones
Kamal Ndousse
Amanda Askell
Anna Chen
...
Jack Clark
Sam McCandlish
C. Olah
Benjamin Mann
Jared Kaplan
307
2,632
0
12 Apr 2022
Video Diffusion Models
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffMVGen
424
1,650
0
07 Apr 2022
Equivariant Diffusion for Molecule Generation in 3D
Equivariant Diffusion for Molecule Generation in 3D
Emiel Hoogeboom
Victor Garcia Satorras
Clément Vignac
Max Welling
DiffM
175
628
0
31 Mar 2022
GeoDiff: a Geometric Diffusion Model for Molecular Conformation
  Generation
GeoDiff: a Geometric Diffusion Model for Molecular Conformation Generation
Minkai Xu
Lantao Yu
Yang Song
Chence Shi
Stefano Ermon
Jian Tang
BDLDiffM
177
525
0
06 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLMALM
1.3K
13,290
0
04 Mar 2022
Bayesian Structure Learning with Generative Flow Networks
Bayesian Structure Learning with Generative Flow Networks
T. Deleu
António Góis
Chris C. Emezue
M. Rankawat
Simon Lacoste-Julien
Stefan Bauer
Yoshua Bengio
BDL
117
157
0
28 Feb 2022
Generative Flow Networks for Discrete Probabilistic Modeling
Generative Flow Networks for Discrete Probabilistic Modeling
Dinghuai Zhang
Nikolay Malkin
Ziqiang Liu
Alexandra Volokhova
Aaron Courville
Yoshua Bengio
95
110
0
03 Feb 2022
Trajectory balance: Improved credit assignment in GFlowNets
Trajectory balance: Improved credit assignment in GFlowNets
Nikolay Malkin
Moksh Jain
Emmanuel Bengio
Chen Sun
Yoshua Bengio
293
187
0
31 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
652
15,868
0
20 Dec 2021
GFlowNet Foundations
GFlowNet Foundations
Yoshua Bengio
Salem Lahlou
T. Deleu
J. E. Hu
Mo Tiwari
Emmanuel Bengio
123
241
0
17 Nov 2021
LoRA: Low-Rank Adaptation of Large Language Models
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRLAI4TSAI4CEALMAIMat
849
10,659
0
17 Jun 2021
Flow Network based Generative Models for Non-Iterative Diverse Candidate
  Generation
Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation
Emmanuel Bengio
Moksh Jain
Maksym Korablyov
Doina Precup
Yoshua Bengio
112
340
0
08 Jun 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
511
8,017
0
11 May 2021
Score-Based Generative Modeling through Stochastic Differential
  Equations
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffMSyDa
591
6,609
0
26 Nov 2020
Denoising Diffusion Implicit Models
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLMDiffM
364
7,542
0
06 Oct 2020
Learning to summarize from human feedback
Learning to summarize from human feedback
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
348
2,195
0
02 Sep 2020
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
1.1K
18,550
0
19 Jun 2020
Chip Placement with Deep Reinforcement Learning
Chip Placement with Deep Reinforcement Learning
Azalia Mirhoseini
Anna Goldie
M. Yazgan
J. Jiang
Ebrahim M. Songhori
...
Quoc V. Le
James Laudon
Richard Ho
Roger Carpenter
J. Dean
OffRL
92
220
0
22 Apr 2020
Fine-Tuning Language Models from Human Preferences
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
615
1,776
0
18 Sep 2019
Deep Reinforcement Learning and the Deadly Triad
Deep Reinforcement Learning and the Deadly Triad
H. V. Hasselt
Yotam Doron
Florian Strub
Matteo Hessel
Nicolas Sonnerat
Joseph Modayil
OffRL
121
232
0
06 Dec 2018
Reward learning from human preferences and demonstrations in Atari
Reward learning from human preferences and demonstrations in Atari
Borja Ibarz
Jan Leike
Tobias Pohlen
G. Irving
Shane Legg
Dario Amodei
140
398
0
15 Nov 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial
  and Review
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CEBDL
116
677
0
02 May 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
428
8,489
0
04 Jan 2018
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
715
19,378
0
20 Jul 2017
Deep reinforcement learning from human preferences
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
246
3,389
0
12 Jun 2017
Reinforcement Learning with Deep Energy-Based Policies
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
129
1,352
0
27 Feb 2017
U-Net: Convolutional Networks for Biomedical Image Segmentation
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg3DV
2.1K
77,890
0
18 May 2015
Trust Region Policy Optimization
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
382
6,831
0
19 Feb 2015
Previous
12