ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2201.13259
  4. Cited By
Trajectory balance: Improved credit assignment in GFlowNets

Trajectory balance: Improved credit assignment in GFlowNets

31 January 2022
Nikolay Malkin
Moksh Jain
Emmanuel Bengio
Chen Sun
Yoshua Bengio
ArXivPDFHTML

Papers citing "Trajectory balance: Improved credit assignment in GFlowNets"

27 / 27 papers shown
Title
Energy-based generator matching: A neural sampler for general state space
Energy-based generator matching: A neural sampler for general state space
Dongyeop Woo
Minsu Kim
Minkyu Kim
Kiyoung Seong
SungSoo Ahn
38
0
0
26 May 2025
Discrete Neural Flow Samplers with Locally Equivariant Transformer
Discrete Neural Flow Samplers with Locally Equivariant Transformer
Zijing Ou
Ruixiang Zhang
Yingzhen Li
34
0
0
23 May 2025
Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets
Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets
Adam Younsi
Abdalgader Abubaker
M. Seddik
Hakim Hacid
Salem Lahlou
LRM
127
0
0
28 Apr 2025
Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching
Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching
Aaron J. Havens
Benjamin Kurt Miller
Bing Yan
Carles Domingo-Enrich
Anuroop Sriram
...
Brandon Amos
Brian Karrer
Xiang Fu
Guan-Horng Liu
Ricky T. Q. Chen
DiffM
85
1
0
16 Apr 2025
Process-Supervised LLM Recommenders via Flow-guided Tuning
Process-Supervised LLM Recommenders via Flow-guided Tuning
Chongming Gao
Mengyao Gao
Chenxiao Fan
Shuai Yuan
Wentao Shi
Xiangnan He
88
3
0
10 Mar 2025
Less is More: Adaptive Program Repair with Bug Localization and Preference Learning
Zhenlong Dai
Bingrui Chen
Zhuoluo Zhao
Xiu Tang
Sai Wu
Chang Yao
Zhipeng Gao
Jingyuan Chen
KELM
74
3
0
09 Mar 2025
Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation
Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation
Mohit Pandey
G. Subbaraj
Artem Cherkasov
Martin Ester
Emmanuel Bengio
AI4CE
95
1
0
08 Mar 2025
FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling
FedGrAINS: Personalized SubGraph Federated Learning with Adaptive Neighbor Sampling
Emir Ceyani
Han Xie
Baturalp Buyukates
Carl Yang
Salman Avestimehr
FedML
180
0
0
22 Jan 2025
From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training
From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training
Julius Berner
Lorenz Richter
Marcin Sendera
Jarrid Rector-Brooks
Nikolay Malkin
OffRL
91
7
0
10 Jan 2025
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
Zhen Liu
Tim Z. Xiao
Weiyang Liu
Yoshua Bengio
Dinghuai Zhang
151
5
0
10 Dec 2024
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
Seanie Lee
Haebin Seong
Dong Bok Lee
Minki Kang
Xiaoyin Chen
Dominik Wagner
Yoshua Bengio
Juho Lee
Sung Ju Hwang
95
5
0
02 Oct 2024
Adaptive teachers for amortized samplers
Adaptive teachers for amortized samplers
Minsu Kim
Sanghyeok Choi
Taeyoung Yun
Emmanuel Bengio
Leo Feng
Jarrid Rector-Brooks
Sungsoo Ahn
Jinkyoo Park
Nikolay Malkin
Yoshua Bengio
382
6
0
02 Oct 2024
RetroGFN: Diverse and Feasible Retrosynthesis using GFlowNets
RetroGFN: Diverse and Feasible Retrosynthesis using GFlowNets
Piotr Gaiñski
Michał Koziarski
Krzysztof Maziarz
Marwin H. S. Segler
Jacek Tabor
Marek Śmieja
80
3
0
26 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
111
7
0
03 Jun 2024
Amortizing intractable inference in diffusion models for vision, language, and control
Amortizing intractable inference in diffusion models for vision, language, and control
S. Venkatraman
Moksh Jain
Luca Scimeca
Minsu Kim
Marcin Sendera
...
Alexandre Adam
Jarrid Rector-Brooks
Yoshua Bengio
Glen Berseth
Nikolay Malkin
102
29
0
31 May 2024
Learning diverse attacks on large language models for robust red-teaming and safety tuning
Learning diverse attacks on large language models for robust red-teaming and safety tuning
Seanie Lee
Minsu Kim
Lynn Cherif
David Dobre
Juho Lee
...
Kenji Kawaguchi
Gauthier Gidel
Yoshua Bengio
Nikolay Malkin
Moksh Jain
AAML
88
15
0
28 May 2024
Ant Colony Sampling with GFlowNets for Combinatorial Optimization
Ant Colony Sampling with GFlowNets for Combinatorial Optimization
Minsu Kim
Sanghyeok Choi
Hyeon-Seob Kim
Jiwoo Son
Jinkyoo Park
Yoshua Bengio
72
28
0
11 Mar 2024
Improved off-policy training of diffusion samplers
Improved off-policy training of diffusion samplers
Marcin Sendera
Minsu Kim
Sarthak Mittal
Pablo Lemos
Luca Scimeca
Jarrid Rector-Brooks
Alexandre Adam
Yoshua Bengio
Nikolay Malkin
OffRL
86
22
0
07 Feb 2024
Investigating Generalization Behaviours of Generative Flow Networks
Investigating Generalization Behaviours of Generative Flow Networks
Lazar Atanackovic
Emmanuel Bengio
AI4CE
54
3
0
07 Feb 2024
Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors
Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors
Wasu Top Piriyakulkij
Yingheng Wang
Volodymyr Kuleshov
DiffM
62
1
0
05 Jan 2024
MARS: Markov Molecular Sampling for Multi-objective Drug Discovery
MARS: Markov Molecular Sampling for Multi-objective Drug Discovery
Yutong Xie
Chence Shi
Hao Zhou
Yuwei Yang
Weinan Zhang
Yong Yu
Lei Li
69
143
0
18 Mar 2021
Masked Label Prediction: Unified Message Passing Model for
  Semi-Supervised Classification
Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification
Yunsheng Shi
Zhengjie Huang
Shikun Feng
Hui Zhong
Wenjin Wang
Yu Sun
AI4CE
51
770
0
08 Sep 2020
Soft Actor-Critic for Discrete Action Settings
Soft Actor-Critic for Discrete Action Settings
Petros Christodoulou
OffRL
130
294
0
16 Oct 2019
Deep Reinforcement Learning and the Deadly Triad
Deep Reinforcement Learning and the Deadly Triad
H. V. Hasselt
Yotam Doron
Florian Strub
Matteo Hessel
Nicolas Sonnerat
Joseph Modayil
OffRL
63
226
0
06 Dec 2018
Junction Tree Variational Autoencoder for Molecular Graph Generation
Junction Tree Variational Autoencoder for Molecular Graph Generation
Wengong Jin
Regina Barzilay
Tommi Jaakkola
291
1,358
0
12 Feb 2018
Reinforcement Learning with Deep Energy-Based Policies
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
62
1,329
0
27 Feb 2017
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
166
8,805
0
04 Feb 2016
1