2201.13259
Cited By
Trajectory balance: Improved credit assignment in GFlowNets
31 January 2022
Nikolay Malkin
Moksh Jain
Emmanuel Bengio
Chen Sun
Yoshua Bengio
Papers citing
"Trajectory balance: Improved credit assignment in GFlowNets"
27 / 27 papers shown
Energy-based generator matching: A neural sampler for general state space
Dongyeop Woo
Minsu Kim
Minkyu Kim
Kiyoung Seong
SungSoo Ahn
27
0
0
26 May 2025
Discrete Neural Flow Samplers with Locally Equivariant Transformer
Zijing Ou
Ruixiang Zhang
Yingzhen Li
27
0
0
23 May 2025
Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets
Adam Younsi
Abdalgader Abubaker
M. Seddik
Hakim Hacid
Salem Lahlou
LRM
116
0
0
28 Apr 2025
Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching
Aaron J. Havens
Benjamin Kurt Miller
Bing Yan
Carles Domingo-Enrich
Anuroop Sriram
...
Brandon Amos
Brian Karrer
Xiang Fu
Guan-Horng Liu
Ricky T. Q. Chen
DiffM
73
0
0
16 Apr 2025
Process-Supervised LLM Recommenders via Flow-guided Tuning
Chongming Gao
Mengyao Gao
Chenxiao Fan
Shuai Yuan
Wentao Shi
Xiangnan He
85
3
0
10 Mar 2025
Less is More: Adaptive Program Repair with Bug Localization and Preference Learning
Zhenlong Dai
Bingrui Chen
Zhuoluo Zhao
Xiu Tang
Sai Wu
Chang Yao
Zhipeng Gao
Jingyuan Chen
KELM
67
3
0
09 Mar 2025
Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation
Mohit Pandey
G. Subbaraj
Artem Cherkasov
Martin Ester
Emmanuel Bengio
AI4CE
92
1
0
08 Mar 2025
FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling
Emir Ceyani
Han Xie
Baturalp Buyukates
Carl Yang
Salman Avestimehr
FedML
159
0
0
22 Jan 2025
From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training
Julius Berner
Lorenz Richter
Marcin Sendera
Jarrid Rector-Brooks
Nikolay Malkin
OffRL
86
6
0
10 Jan 2025
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
Zhen Liu
Tim Z. Xiao
Weiyang Liu
Yoshua Bengio
Dinghuai Zhang
142
5
0
10 Dec 2024
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
Seanie Lee
Haebin Seong
Dong Bok Lee
Minki Kang
Xiaoyin Chen
Dominik Wagner
Yoshua Bengio
Juho Lee
Sung Ju Hwang
83
5
0
02 Oct 2024
Adaptive teachers for amortized samplers
Minsu Kim
Sanghyeok Choi
Taeyoung Yun
Emmanuel Bengio
Leo Feng
Jarrid Rector-Brooks
Sungsoo Ahn
Jinkyoo Park
Nikolay Malkin
Yoshua Bengio
358
6
0
02 Oct 2024
RetroGFN: Diverse and Feasible Retrosynthesis using GFlowNets
Piotr Gaiński
Michał Koziarski
Krzysztof Maziarz
Marwin H. S. Segler
Jacek Tabor
Marek Śmieja
75
3
0
26 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
106
7
0
03 Jun 2024
Amortizing intractable inference in diffusion models for vision, language, and control
S. Venkatraman
Moksh Jain
Luca Scimeca
Minsu Kim
Marcin Sendera
...
Alexandre Adam
Jarrid Rector-Brooks
Yoshua Bengio
Glen Berseth
Nikolay Malkin
99
28
0
31 May 2024
Learning diverse attacks on large language models for robust red-teaming and safety tuning
Seanie Lee
Minsu Kim
Lynn Cherif
David Dobre
Juho Lee
...
Kenji Kawaguchi
Gauthier Gidel
Yoshua Bengio
Nikolay Malkin
Moksh Jain
AAML
83
15
0
28 May 2024
Ant Colony Sampling with GFlowNets for Combinatorial Optimization
Minsu Kim
Sanghyeok Choi
Hyeon-Seob Kim
Jiwoo Son
Jinkyoo Park
Yoshua Bengio
67
28
0
11 Mar 2024
Improved off-policy training of diffusion samplers
Marcin Sendera
Minsu Kim
Sarthak Mittal
Pablo Lemos
Luca Scimeca
Jarrid Rector-Brooks
Alexandre Adam
Yoshua Bengio
Nikolay Malkin
OffRL
81
20
0
07 Feb 2024
Investigating Generalization Behaviours of Generative Flow Networks
Lazar Atanackovic
Emmanuel Bengio
AI4CE
49
3
0
07 Feb 2024
Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors
Wasu Top Piriyakulkij
Yingheng Wang
Volodymyr Kuleshov
DiffM
53
1
0
05 Jan 2024
MARS: Markov Molecular Sampling for Multi-objective Drug Discovery
Yutong Xie
Chence Shi
Hao Zhou
Yuwei Yang
Weinan Zhang
Yong Yu
Lei Li
57
142
0
18 Mar 2021
Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification
Yunsheng Shi
Zhengjie Huang
Shikun Feng
Hui Zhong
Wenjin Wang
Yu Sun
AI4CE
45
770
0
08 Sep 2020
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
120
294
0
16 Oct 2019
Deep Reinforcement Learning and the Deadly Triad
H. V. Hasselt
Yotam Doron
Florian Strub
Matteo Hessel
Nicolas Sonnerat
Joseph Modayil
OffRL
56
226
0
06 Dec 2018
Junction Tree Variational Autoencoder for Molecular Graph Generation
Wengong Jin
Regina Barzilay
Tommi Jaakkola
284
1,358
0
12 Feb 2018
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
46
1,329
0
27 Feb 2017
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
150
8,805
0
04 Feb 2016