Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.04363
Cited By
Amortizing intractable inference in large language models
6 October 2023
Marvin Schmitt
Moksh Jain
Daniel Habermann
Younesse Kaddar
Ullrich Kothe
Stefan T. Radev
Nikolay Malkin
AIFin
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Amortizing intractable inference in large language models"
48 / 48 papers shown
Title
Out-of-distribution generalisation is hard: evidence from ARC-like tasks
George Dimitriadis. Spyridon Samothrakis
Spyridon Samothrakis
21
0
0
14 May 2025
Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets
Adam Younsi
Abdalgader Abubaker
M. Seddik
Hakim Hacid
Salem Lahlou
LRM
57
0
0
28 Apr 2025
Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling
Benjamin Lipkin
Benjamin LeBrun
Jacob Hoover Vigly
João Loula
David R. MacIver
...
Ryan Cotterell
Vikash K. Mansinghka
Timothy J. O'Donnell
Alexander K. Lew
Tim Vieira
29
0
0
07 Apr 2025
Learning to Reason for Long-Form Story Generation
Alexander Gurung
Mirella Lapata
ReLM
OffRL
LRM
55
0
0
28 Mar 2025
Less is More: Adaptive Program Repair with Bug Localization and Preference Learning
Zhenlong Dai
Bingrui Chen
Zhuoluo Zhao
Xiu Tang
Sai Wu
Chang Yao
Zhipeng Gao
Jingyuan Chen
KELM
44
2
0
09 Mar 2025
Do GFlowNets Transfer? Case Study on the Game of 24/42
Adesh Gupta
Abhinav Kumar
Mansi Gupta
Paras Chopra
100
0
0
03 Mar 2025
Consistent Amortized Clustering via Generative Flow Networks
Irit Chelly
Roy Uziel
O. Freifeld
Ari Pakman
51
0
0
26 Feb 2025
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
Taeyoung Yun
Dinghuai Zhang
Jinkyoo Park
Ling Pan
DiffM
84
2
0
17 Feb 2025
Scalable Language Models with Posterior Inference of Latent Thought Vectors
Deqian Kong
Minglu Zhao
Dehong Xu
Bo Pang
Shu Wang
...
Zhangzhang Si
Chuan Li
Jianwen Xie
Sirui Xie
Ying Nian Wu
VLM
LRM
BDL
81
5
0
03 Feb 2025
ChemSafetyBench: Benchmarking LLM Safety on Chemistry Domain
Haochen Zhao
Xiangru Tang
Ziran Yang
Xiao Han
Xuanzhi Feng
...
Senhao Cheng
Di Jin
Yilun Zhao
Arman Cohan
Mark B. Gerstein
ELM
83
1
0
23 Nov 2024
Streaming Bayes GFlowNets
Tiago da Silva
Daniel Augusto R. M. A. de Souza
Diego Mesquita
BDL
41
0
0
08 Nov 2024
GFlowNet Fine-tuning for Diverse Correct Solutions in Mathematical Reasoning Tasks
Ryoichi Takase
Masaya Tsunokake
Yuta Tsuchiya
Shota Inuzuka
LRM
43
2
0
26 Oct 2024
Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Shansan Gong
Shivam Agarwal
Yizhe Zhang
Jiacheng Ye
Lin Zheng
...
Peilin Zhao
W. Bi
Jiawei Han
Hao Peng
Lingpeng Kong
AI4CE
75
15
0
23 Oct 2024
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
Timofei Gritsaev
Nikita Morozov
S. Samsonov
D. Tiapkin
16
0
0
20 Oct 2024
A Complexity-Based Theory of Compositionality
Eric Elmoznino
Thomas Jiralerspong
Yoshua Bengio
Guillaume Lajoie
CoGe
61
4
0
18 Oct 2024
Proof Flow: Preliminary Study on Generative Flow Network Language Model Tuning for Formal Reasoning
Matthew Ho
Vincent Zhu
Xiaoyin Chen
Moksh Jain
Nikolay Malkin
Edwin Zhang
LRM
29
2
0
17 Oct 2024
Controllable Generation via Locally Constrained Resampling
Kareem Ahmed
Kai-Wei Chang
Guy Van den Broeck
18
2
0
17 Oct 2024
COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement
Yuxi Xie
Anirudh Goyal
Xiaobao Wu
Xunjian Yin
Xiao Xu
Min-Yen Kan
Liangming Pan
William Yang Wang
LRM
75
1
0
12 Oct 2024
On Divergence Measures for Training GFlowNets
Tiago da Silva
Eliezer de Souza da Silva
Diego Mesquita
BDL
29
1
0
12 Oct 2024
Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction
Jarrid Rector-Brooks
Mohsin Hasan
Zhangzhi Peng
Zachary Quinn
Chenghao Liu
...
Michael Bronstein
Yoshua Bengio
Pranam Chatterjee
Alexander Tong
Avishek Joey Bose
DiffM
47
6
0
10 Oct 2024
Efficient Reinforcement Learning with Large Language Model Priors
Xue Yan
Yan Song
Xidong Feng
Mengyue Yang
Haifeng Zhang
Haitham Bou Ammar
Jun Wang
OffRL
31
3
0
10 Oct 2024
Guaranteed Generation from Large Language Models
Minbeom Kim
Thibaut Thonet
Jos Rozen
Hwaran Lee
Kyomin Jung
Marc Dymetman
38
1
0
09 Oct 2024
Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Rui Hu
Yifan Zhang
Zhuoran Li
Longbo Huang
35
0
0
03 Oct 2024
Adaptive teachers for amortized samplers
Minsu Kim
Sanghyeok Choi
Taeyoung Yun
Emmanuel Bengio
Leo Feng
Jarrid Rector-Brooks
Sungsoo Ahn
Jinkyoo Park
Nikolay Malkin
Yoshua Bengio
137
2
0
02 Oct 2024
Can a Bayesian Oracle Prevent Harm from an Agent?
Yoshua Bengio
Michael K. Cohen
Nikolay Malkin
Matt MacDermott
Damiano Fornasiere
Pietro Greiner
Younesse Kaddar
45
4
0
09 Aug 2024
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models
Sean Welleck
Amanda Bertsch
Matthew Finlayson
Hailey Schoelkopf
Alex Xie
Graham Neubig
Ilia Kulikov
Zaid Harchaoui
33
49
0
24 Jun 2024
Adaptable Logical Control for Large Language Models
Honghua Zhang
Po-Nien Kung
Masahiro Yoshida
Guy Van den Broeck
Nanyun Peng
38
8
0
19 Jun 2024
Improving GFlowNets with Monte Carlo Tree Search
Nikita Morozov
D. Tiapkin
S. Samsonov
Alexey Naumov
Dmitry Vetrov
54
1
0
19 Jun 2024
Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal Examples
Fangxu Yu
Lai Jiang
Haoqiang Kang
Shibo Hao
Lianhui Qin
LRM
AI4CE
98
10
0
09 Jun 2024
Embarrassingly Parallel GFlowNets
Tiago da Silva
Luiz Max Carvalho
Amauri Souza
Samuel Kaski
Diego Mesquita
42
1
0
05 Jun 2024
Bifurcated Generative Flow Networks
Chunhui Li
Cheng-Hao Liu
Dianbo Liu
Qingpeng Cai
Ling Pan
85
2
0
04 Jun 2024
Latent Logic Tree Extraction for Event Sequence Explanation from LLMs
Zitao Song
Chao Yang
Chaojie Wang
Bo An
Shuang Li
52
4
0
03 Jun 2024
Amortizing intractable inference in diffusion models for vision, language, and control
S. Venkatraman
Moksh Jain
Luca Scimeca
Minsu Kim
Marcin Sendera
...
Alexandre Adam
Jarrid Rector-Brooks
Yoshua Bengio
Glen Berseth
Nikolay Malkin
68
24
0
31 May 2024
Learning diverse attacks on large language models for robust red-teaming and safety tuning
Seanie Lee
Minsu Kim
Lynn Cherif
David Dobre
Juho Lee
...
Kenji Kawaguchi
Gauthier Gidel
Yoshua Bengio
Nikolay Malkin
Moksh Jain
AAML
58
12
0
28 May 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
25
1
0
27 May 2024
Pessimistic Backward Policy for GFlowNets
Hyosoon Jang
Yunhui Jang
Minsu Kim
Jinkyoo Park
Sungsoo Ahn
57
4
0
25 May 2024
Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo
Stephen Zhao
Rob Brekelmans
Alireza Makhzani
Roger C. Grosse
32
9
0
26 Apr 2024
BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
Yu Feng
Ben Zhou
Weidong Lin
Dan Roth
66
4
0
18 Apr 2024
Investigating Regularization of Self-Play Language Models
Réda Alami
Abdalgader Abubaker
Mastane Achab
M. Seddik
Salem Lahlou
30
3
0
04 Apr 2024
Discrete Probabilistic Inference as Control in Multi-path Environments
T. Deleu
Padideh Nouri
Nikolay Malkin
Doina Precup
Yoshua Bengio
111
28
0
15 Feb 2024
Learning to Scale Logits for Temperature-Conditional GFlowNets
Minsu Kim
Joohwan Ko
Taeyoung Yun
Dinghuai Zhang
Ling Pan
W. Kim
Jinkyoo Park
Emmanuel Bengio
Yoshua Bengio
AI4CE
29
21
0
04 Oct 2023
GFlowNets and variational inference
Nikolay Malkin
Salem Lahlou
T. Deleu
Xu Ji
J. E. Hu
Katie Everett
Dinghuai Zhang
Yoshua Bengio
BDL
134
77
0
02 Oct 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
307
4,077
0
24 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
308
3,237
0
21 Mar 2022
Trajectory balance: Improved credit assignment in GFlowNets
Nikolay Malkin
Moksh Jain
Emmanuel Bengio
Chen Sun
Yoshua Bengio
145
166
0
31 Jan 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
347
8,457
0
28 Jan 2022
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
413
2,584
0
03 Sep 2019
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
1