arXiv: 2210.11610

Large Language Models Can Self-Improve

20 October 2022
Jiaxin Huang
S. Gu
Le Hou
Yuexin Wu
Xuezhi Wang
Hongkun Yu
Jiawei Han
    ReLM
    AI4MH
    LRM

Papers citing "Large Language Models Can Self-Improve"

50 / 410 papers shown
Teaching Large Language Models to Reason with Reinforcement Learning
Alex Havrilla
Yuqing Du
Sharath Chandra Raparthy
Christoforos Nalmpantis
Jane Dwivedi-Yu
Maksym Zhuravinskyi
Eric Hambro
Sainbayar Sukhbaatar
Roberta Raileanu
ReLM
LRM
29
67
0
07 Mar 2024
Socratic Reasoning Improves Positive Text Rewriting
Anmol Goel
Nico Daheim
Iryna Gurevych
LRM
39
4
0
05 Mar 2024
Online Training of Large Language Models: Learn while chatting
Juhao Liang
Ziwei Wang
Zhuoheng Ma
Jianquan Li
Zhiyi Zhang
Xiangbo Wu
Benyou Wang
KELM
39
3
0
04 Mar 2024
Distilling Text Style Transfer With Self-Explanation From LLMs
Chiyu Zhang
Honglong Cai
Yuezhang Li
Yuexin Wu
Le Hou
Muhammad Abdul-Mageed
39
10
0
02 Mar 2024
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
Wenqi Zhang
Ke Tang
Hai Wu
Mengna Wang
Yongliang Shen
Guiyang Hou
Zeqi Tan
Peng Li
Y. Zhuang
Weiming Lu
LLMAG
41
36
0
27 Feb 2024
Determinants of LLM-assisted Decision-Making
Eva Eigner
Thorsten Händler
48
45
0
27 Feb 2024
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning
Aivin V. Solatorio
38
18
0
26 Feb 2024
Fine-Grained Self-Endorsement Improves Factuality and Reasoning
Ante Wang
Linfeng Song
Baolin Peng
Ye Tian
Lifeng Jin
Haitao Mi
Jinsong Su
Dong Yu
HILM
LRM
23
6
0
23 Feb 2024
RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models
Jianhao Yan
Yun Luo
Yue Zhang
ALM
LRM
38
6
0
21 Feb 2024
Large Language Models for Data Annotation: A Survey
Zhen Tan
Dawei Li
Song Wang
Alimohammad Beigi
Bohan Jiang
Amrita Bhattacharjee
Mansooreh Karami
Jundong Li
Lu Cheng
Huan Liu
SyDa
56
50
0
21 Feb 2024
A Survey on Knowledge Distillation of Large Language Models
Xiaohan Xu
Ming Li
Chongyang Tao
Tao Shen
Reynold Cheng
Jinyang Li
Can Xu
Dacheng Tao
Tianyi Zhou
KELM
VLM
44
101
0
20 Feb 2024
Learning to Check: Unleashing Potentials for Self-Correction in Large Language Models
Che Zhang
Zhenyang Xiao
Chengcheng Han
Yixin Lian
Yuejian Fang
LRM
25
0
0
20 Feb 2024
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Zhiyuan Li
Hong Liu
Denny Zhou
Tengyu Ma
LRM
AI4CE
28
96
0
20 Feb 2024
Task-Oriented Dialogue with In-Context Learning
Tom Bocklisch
Thomas Werkmeister
Daksh Varshneya
Alan Nichol
37
6
0
19 Feb 2024
SInViG: A Self-Evolving Interactive Visual Agent for Human-Robot Interaction
Jie Xu
Hanbo Zhang
Xinghang Li
Huaping Liu
Xuguang Lan
Tao Kong
LM&Ro
35
3
0
19 Feb 2024
Towards Theoretical Understandings of Self-Consuming Generative Models
Shi Fu
Sen Zhang
Yingjie Wang
Xinmei Tian
Dacheng Tao
36
9
0
19 Feb 2024
Self-seeding and Multi-intent Self-instructing LLMs for Generating Intent-aware Information-Seeking dialogs
Arian Askari
Roxana Petcu
Chuan Meng
Mohammad Aliannejadi
Amin Abolghasemi
Evangelos Kanoulas
Suzan Verberne
21
9
0
18 Feb 2024
RLVF: Learning from Verbal Feedback without Overgeneralization
Moritz Stephan
Alexander Khazatsky
Eric Mitchell
Annie S. Chen
Sheryl Hsu
Archit Sharma
Chelsea Finn
34
11
0
16 Feb 2024
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
Ming Li
Jiuhai Chen
Lichang Chen
Tianyi Zhou
71
17
0
16 Feb 2024
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
Ajay Patel
Colin Raffel
Chris Callison-Burch
SyDa
AI4CE
25
25
0
16 Feb 2024
Chain-of-Thought Reasoning Without Prompting
Xuezhi Wang
Denny Zhou
ReLM
LRM
149
101
0
15 Feb 2024
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Shubham Toshniwal
Ivan Moshkov
Sean Narenthiran
Daria Gitman
Fei Jia
Igor Gitman
28
76
0
15 Feb 2024
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Ming Li
Lichang Chen
Jiuhai Chen
Shwai He
Jiuxiang Gu
Tianyi Zhou
26
50
0
15 Feb 2024
Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping
Haoyu Wang
Guozheng Ma
Ziqiao Meng
Zeyu Qin
Li Shen
...
Liu Liu
Yatao Bian
Tingyang Xu
Xueqian Wang
Peilin Zhao
59
13
0
12 Feb 2024
In-Context Principle Learning from Mistakes
Tianjun Zhang
Aman Madaan
Luyu Gao
Steven Zheng
Swaroop Mishra
Yiming Yang
Niket Tandon
Uri Alon
KELM
ReLM
30
23
0
08 Feb 2024
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
Shoubin Yu
Jaehong Yoon
Mohit Bansal
77
4
0
08 Feb 2024
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
Hao Zhao
Maksym Andriushchenko
Francesco Croce
Nicolas Flammarion
ALM
91
44
0
07 Feb 2024
Understanding the Effect of Noise in LLM Training Data with Algorithmic Chains of Thought
Alex Havrilla
Maia Iyer
19
7
0
06 Feb 2024
Discovery of the Hidden World with Large Language Models
Chenxi Liu
Yongqiang Chen
Tongliang Liu
Mingming Gong
James Cheng
Bo Han
Kun Zhang
CML
65
10
0
06 Feb 2024
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision
Zihan Wang
Yunxuan Li
Yuexin Wu
Liangchen Luo
Le Hou
Hongkun Yu
Jingbo Shang
LRM
40
18
0
05 Feb 2024
Self-Debiasing Large Language Models: Zero-Shot Recognition and Reduction of Stereotypes
Isabel O. Gallegos
Ryan A. Rossi
Joe Barrow
Md Mehrab Tanjim
Tong Yu
Hanieh Deilamsalehy
Ruiyi Zhang
Sungchul Kim
Franck Dernoncourt
24
19
0
03 Feb 2024
LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Subbarao Kambhampati
Karthik Valmeekam
L. Guan
Mudit Verma
Kaya Stechly
Siddhant Bhambri
Lucas Saldyt
Anil Murthy
LRM
92
108
0
02 Feb 2024
RE-GAINS & EnChAnT: Intelligent Tool Manipulation Systems For Enhanced Query Responses
Sahil Girhepuje
Siva Sankar Sajeev
Purvam Jain
Arya Sikder
Adithya Rama Varma
Ryan George
Akshay Govind Srinivasan
Mahendra Kurup
Ashmit Sinha
Sudip Mondal
RALM
29
0
0
28 Jan 2024
Towards Goal-oriented Prompt Engineering for Large Language Models: A Survey
Haochen Li
Jonathan Leung
Zhiqi Shen
LM&MA
LLMAG
LRM
23
0
0
25 Jan 2024
Augmenting Math Word Problems via Iterative Question Composing
Haoxiong Liu
Yifan Zhang
Yifan Luo
Andrew Chi-Chih Yao
SyDa
LRM
34
34
0
17 Jan 2024
Small Language Model Can Self-correct
Haixia Han
Jiaqing Liang
Jie Shi
Qi He
Yanghua Xiao
LRM
SyDa
ReLM
KELM
40
11
0
14 Jan 2024
Open Models, Closed Minds? On Agents Capabilities in Mimicking Human Personalities through Open Large Language Models
Lucio La Cava
Andrea Tagarelli
LLMAG
AI4CE
63
13
0
13 Jan 2024
AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
Shuofei Qiao
Ningyu Zhang
Runnan Fang
Yujie Luo
Wangchunshu Zhou
Yuchen Eleanor Jiang
Chengfei Lv
Huajun Chen
LLMAG
33
32
0
10 Jan 2024
Human-Instruction-Free LLM Self-Alignment with Limited Samples
Hongyi Guo
Yuanshun Yao
Wei Shen
Jiaheng Wei
Xiaoying Zhang
Zhaoran Wang
Yang Liu
95
20
0
06 Jan 2024
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
Wenqi Zhang
Yongliang Shen
Linjuan Wu
Qiuying Peng
Jun Wang
Y. Zhuang
Weiming Lu
LRM
LLMAG
32
49
0
04 Jan 2024
Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges
Qingyao Li
Lingyue Fu
Weiming Zhang
Xianyu Chen
Jingwei Yu
Wei Xia
Weinan Zhang
Ruiming Tang
Yong Yu
AI4Ed
ELM
38
18
0
27 Dec 2023
LLMs with User-defined Prompts as Generic Data Operators for Reliable Data Processing
Luyi Ma
Nikhil Thakurdesai
Jiaoayan Chen
Jianpeng Xu
Evren Körpeoglu
Sushant Kumar
Kannan Achan
AI4CE
19
0
0
26 Dec 2023
Align on the Fly: Adapting Chatbot Behavior to Established Norms
Chunpu Xu
Steffi Chern
Ethan Chern
Ge Zhang
Zekun Wang
Ruibo Liu
Jing Li
Jie Fu
Pengfei Liu
24
20
0
26 Dec 2023
Hazards from Increasingly Accessible Fine-Tuning of Downloadable Foundation Models
Alan Chan
Ben Bucknall
Herbie Bradley
David M. Krueger
14
6
0
22 Dec 2023
Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data
Yiwei Li
Peiwen Yuan
Shaoxiong Feng
Boyuan Pan
Bin Sun
Xinglin Wang
Heda Wang
Kan Li
LRM
27
20
0
20 Dec 2023
GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning
Mehran Kazemi
Hamidreza Alvari
Ankit Anand
Jialin Wu
Xi Chen
Radu Soricut
LRM
ReLM
31
53
0
19 Dec 2023
A Survey of Reasoning with Foundation Models
Jiankai Sun
Chuanyang Zheng
E. Xie
Zhengying Liu
Ruihang Chu
...
Xipeng Qiu
Yi-Chen Guo
Hui Xiong
Qun Liu
Zhenguo Li
ReLM
LRM
AI4CE
24
76
0
17 Dec 2023
Mixed Distillation Helps Smaller Language Model Better Reasoning
Chenglin Li
Qianglong Chen
Liangyue Li
Wang Caiyu
Yicheng Li
Zhang Yin
Yin Zhang
LRM
41
11
0
17 Dec 2023
Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models
Xin Jin
Jonathan Larson
Weiwei Yang
Zhiqiang Lin
ELM
17
20
0
15 Dec 2023
AI capabilities can be significantly improved without expensive retraining
Tom Davidson
Jean-Stanislas Denain
Pablo Villalobos
Guillem Bas
OffRL
VLM
24
26
0
12 Dec 2023