Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.17651
Cited By
v1
v2 (latest)
Self-Refine: Iterative Refinement with Self-Feedback
30 March 2023
Aman Madaan
Niket Tandon
Prakhar Gupta
Skyler Hallinan
Luyu Gao
Sarah Wiegreffe
Uri Alon
Nouha Dziri
Shrimai Prabhumoye
Yiming Yang
Shashank Gupta
Bodhisattwa Prasad Majumder
Katherine Hermann
Sean Welleck
Amir Yazdanbakhsh
Peter Clark
ReLM
LRM
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Self-Refine: Iterative Refinement with Self-Feedback"
50 / 431 papers shown
Title
Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning
Debjit Paul
Robert West
Antoine Bosselut
Boi Faltings
ReLM
LRM
135
29
0
21 Feb 2024
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning
Zhaorui Yang
Tianyu Pang
Hao Feng
Han Wang
Wei Chen
Minfeng Zhu
Qian Liu
ALM
101
50
0
21 Feb 2024
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Xueliang Zhao
Xinting Huang
Tingchen Fu
Qintong Li
Shansan Gong
Lemao Liu
Wei Bi
Lingpeng Kong
LRM
82
1
0
21 Feb 2024
Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation
Dongjin Kang
Sunghwan Kim
Taeyoon Kwon
Seungjun Moon
Hyunsouk Cho
Youngjae Yu
Dongha Lee
Jinyoung Yeo
158
20
0
20 Feb 2024
SEE: Strategic Exploration and Exploitation for Cohesive In-Context Prompt Optimization
Wendi Cui
Jiaxin Zhang
Zhuohang Li
Damien Lopez
Damien Lopez
Kamalika Das
Sricharan Kumar
Kumar Sricharan
105
7
0
17 Feb 2024
Retrieve Only When It Needs: Adaptive Retrieval Augmentation for Hallucination Mitigation in Large Language Models
Hanxing Ding
Liang Pang
Zihao Wei
Huawei Shen
Xueqi Cheng
HILM
RALM
152
18
0
16 Feb 2024
Natural Language Reinforcement Learning
Xidong Feng
Bo Liu
Mengyue Yang
Ziyan Wang
Girish A. Koushiks
Yali Du
Ying Wen
Jun Wang
OffRL
106
5
0
11 Feb 2024
Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity
Kaiqu Liang
Zixu Zhang
J. F. Fisac
LLMAG
163
8
0
09 Feb 2024
Unified Hallucination Detection for Multimodal Large Language Models
Xiang Chen
Chenxi Wang
Yida Xue
Ningyu Zhang
Xiaoyan Yang
Qian Li
Yue Shen
Lei Liang
Jinjie Gu
Huajun Chen
HILM
135
45
0
05 Feb 2024
A Survey on Context-Aware Multi-Agent Systems: Techniques, Challenges and Future Directions
Hung Du
Srikanth Thudumu
Rajesh Vasa
K. Mouzakis
LLMAG
133
10
0
03 Feb 2024
YODA: Teacher-Student Progressive Learning for Language Models
Jianqiao Lu
Wanjun Zhong
Yufei Wang
Zhijiang Guo
Qi Zhu
...
Baojun Wang
Yasheng Wang
Lifeng Shang
Xin Jiang
Qun Liu
LRM
106
7
0
28 Jan 2024
Demystifying Chains, Trees, and Graphs of Thoughts
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
408
33
0
25 Jan 2024
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
Mirac Suzgun
Adam Tauman Kalai
KELM
LRM
LLMAG
ReLM
123
78
0
23 Jan 2024
CCA: Collaborative Competitive Agents for Image Editing
Tiankai Hang
Shuyang Gu
Dong Chen
Xin Geng
Baining Guo
170
5
0
23 Jan 2024
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline
Minpeng Liao
Wei Luo
Chengxi Li
Jing Wu
Kai Fan
LRM
126
48
0
16 Jan 2024
JumpCoder: Go Beyond Autoregressive Coder via Online Modification
Mouxiang Chen
Hao Tian
Zhongxi Liu
Xiaoxue Ren
Jianling Sun
SyDa
KELM
103
2
0
15 Jan 2024
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation
Meng Cao
Lei Shu
Lei Yu
Yun Zhu
Nevan Wichers
Yinxiao Liu
Lei Meng
OffRL
ALM
71
7
0
14 Jan 2024
Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
Asma Ghandeharioun
Avi Caciularu
Adam Pearce
Lucas Dixon
Mor Geva
150
114
0
11 Jan 2024
The Critique of Critique
Shichao Sun
Junlong Li
Weizhe Yuan
Ruifeng Yuan
Wenjie Li
Pengfei Liu
ELM
86
0
0
09 Jan 2024
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
Wendi Cui
Jiaxin Zhang
Zhuohang Li
Lopez Damien
Kamalika Das
Sricharan Kumar
Kumar Sricharan
69
2
0
04 Jan 2024
LARP: Language-Agent Role Play for Open-World Games
Ming Yan
Ruihao Li
Hao Zhang
Hao Wang
Zhilan Yang
Ji Yan
LLMAG
LM&Ro
AI4CE
88
17
0
24 Dec 2023
Agents meet OKR: An Object and Key Results Driven Agent System with Hierarchical Self-Collaboration and Self-Evaluation
Yi Zheng
Chongyang Ma
Kanle Shi
Haibin Huang
66
5
0
28 Nov 2023
On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering
Linyong Nan
Ellen Zhang
Weijin Zou
Yilun Zhao
Wenfei Zhou
Arman Cohan
LLMAG
103
14
0
16 Nov 2023
Towards A Unified View of Answer Calibration for Multi-Step Reasoning
Shumin Deng
Ningyu Zhang
Nay Oo
Bryan Hooi
LRM
89
3
0
15 Nov 2023
Predicting Text Preference Via Structured Comparative Reasoning
Jing Nathan Yan
Tianqi Liu
Justin T Chiu
Jiaming Shen
Zhen Qin
...
Charumathi Lakshmanan
Y. Kurzion
Alexander M. Rush
Jialu Liu
Michael Bendersky
LRM
100
7
0
14 Nov 2023
ADaPT: As-Needed Decomposition and Planning with Language Models
Archiki Prasad
Alexander Koller
Mareike Hartmann
Peter Clark
Ashish Sabharwal
Mohit Bansal
Tushar Khot
LM&Ro
101
93
0
08 Nov 2023
Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting
Preethi Lahoti
Nicholas Blumm
Xiao Ma
Raghavendra Kotikalapudi
Sahitya Potluri
...
Hansa Srinivasan
Ben Packer
Ahmad Beirami
Alex Beutel
Jilin Chen
120
32
0
25 Oct 2023
LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers
Theo X. Olausson
Alex Gu
Benjamin Lipkin
Cedegao E. Zhang
Armando Solar-Lezama
Josh Tenenbaum
Roger Levy
LRM
AI4CE
ReLM
194
119
0
23 Oct 2023
Tree Prompting: Efficient Task Adaptation without Fine-Tuning
John X. Morris
Chandan Singh
Alexander M. Rush
Jianfeng Gao
Yuntian Deng
VLM
LRM
94
19
0
21 Oct 2023
AutoMix: Automatically Mixing Language Models
Pranjal Aggarwal
Aman Madaan
Ankit Anand
Srividya Pranavi Potharaju
Swaroop Mishra
...
Karthik Kappaganthu
Yiming Yang
Shyam Upadhyay
Manaal Faruqui
Mausam
206
26
0
19 Oct 2023
The Consensus Game: Language Model Generation via Equilibrium Search
Athul Paul Jacob
Songlin Yang
Gabriele Farina
Jacob Andreas
98
23
0
13 Oct 2023
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules
Hung Le
Hailin Chen
Amrita Saha
Akash Gokul
Doyen Sahoo
Shafiq Joty
LRM
116
47
0
13 Oct 2023
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Tao Li
Gang Li
Zhiwei Deng
Bryan Wang
Yang Li
LM&Ro
LLMAG
137
26
0
12 Oct 2023
Can Large Language Models Really Improve by Self-critiquing Their Own Plans?
Karthik Valmeekam
Matthew Marquez
Subbarao Kambhampati
LRM
109
87
0
12 Oct 2023
Constructive Large Language Models Alignment with Diverse Feedback
Tianshu Yu
Ting-En Lin
Yuchuan Wu
Min Yang
Fei Huang
Yongbin Li
ALM
108
9
0
10 Oct 2023
Crystal: Introspective Reasoners Reinforced with Self-Feedback
Jiacheng Liu
Ramakanth Pasunuru
Hannaneh Hajishirzi
Yejin Choi
Asli Celikyilmaz
LRM
ReLM
79
24
0
07 Oct 2023
Automatic and Human-AI Interactive Text Generation
Yao Dou
Philippe Laban
Claire Gardent
Wei Xu
84
4
0
05 Oct 2023
Reward Model Ensembles Help Mitigate Overoptimization
Thomas Coste
Usman Anwar
Robert Kirk
David M. Krueger
NoLa
ALM
123
139
0
04 Oct 2023
Think before you speak: Training Language Models With Pause Tokens
Sachin Goyal
Ziwei Ji
A. S. Rawat
A. Menon
Sanjiv Kumar
Vaishnavh Nagarajan
LRM
126
122
0
03 Oct 2023
Selenite: Scaffolding Online Sensemaking with Comprehensive Overviews Elicited from Large Language Models
Michael Xieyang Liu
Tongshuang Wu
Tianying Chen
Franklin Mingzhe Li
A. Kittur
Brad A. Myers
LRM
RALM
112
22
0
03 Oct 2023
Enabling Language Models to Implicitly Learn Self-Improvement
Ziqi Wang
Le Hou
Tianjian Lu
Yuexin Wu
Yunxuan Li
Hongkun Yu
Heng Ji
ReLM
LRM
72
6
0
02 Oct 2023
Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context Learning
Mustafa Shukor
Alexandre Ramé
Corentin Dancette
Matthieu Cord
LRM
MLLM
118
22
0
01 Oct 2023
AutoAgents: A Framework for Automatic Agent Generation
Guangyao Chen
Siwei Dong
Yu Shu
Ge Zhang
Jaward Sesay
Börje F. Karlsson
Jie Fu
Yemin Shi
LLMAG
144
130
0
29 Sep 2023
Program Repair with Minimal Edits Using CodeT5
Atsushi Shirafuji
Md. Mostafizer Rahman
Md. Faizul Ibne Amin
Yutaka Watanobe
70
10
0
26 Sep 2023
Calibrating LLM-Based Evaluator
Yuxuan Liu
Tianchi Yang
Shaohan Huang
Zihan Zhang
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
125
33
0
23 Sep 2023
EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning
Rajasekhar Reddy Mekala
Yasaman Razeghi
Sameer Singh
LRM
93
11
0
16 Sep 2023
Cognitive Mirage: A Review of Hallucinations in Large Language Models
Hongbin Ye
Tong Liu
Aijia Zhang
Wei Hua
Weiqiang Jia
HILM
126
81
0
13 Sep 2023
Aligning Large Language Models for Clinical Tasks
Supun Manathunga
Isuru Hettigoda
LM&MA
ELM
AI4MH
92
11
0
06 Sep 2023
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Maciej Besta
Nils Blach
Aleš Kubíček
Robert Gerstenberger
Michal Podstawski
...
Joanna Gajda
Tomasz Lehmann
H. Niewiadomski
Piotr Nyczyk
Torsten Hoefler
LRM
AI4CE
LM&Ro
189
718
0
18 Aug 2023
Better Zero-Shot Reasoning with Role-Play Prompting
Aobo Kong
Shiwan Zhao
Hao Chen
Qicheng Li
Yong Qin
Ruiqi Sun
Xiaoxia Zhou
Enzhi Wang
Xiaohang Dong
ReLM
LLMAG
LRM
104
179
0
15 Aug 2023
Previous
1
2
3
4
5
6
7
8
9
Next