ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.11610
  4. Cited By
Large Language Models Can Self-Improve

Large Language Models Can Self-Improve

20 October 2022
Jiaxin Huang
S. Gu
Le Hou
Yuexin Wu
Xuezhi Wang
Hongkun Yu
Jiawei Han
    ReLM
    AI4MH
    LRM
ArXivPDFHTML

Papers citing "Large Language Models Can Self-Improve"

50 / 410 papers shown
Title
Self-Refined Large Language Model as Automated Reward Function Designer
  for Deep Reinforcement Learning in Robotics
Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics
Jiayang Song
Zhehua Zhou
Jiawei Liu
Chunrong Fang
Zhan Shu
Lei Ma
25
27
0
13 Sep 2023
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Xinyi Wang
John Wieting
J. Clark
CLL
ALM
24
1
0
09 Sep 2023
Retrieving Evidence from EHRs with LLMs: Possibilities and Challenges
Retrieving Evidence from EHRs with LLMs: Possibilities and Challenges
Hiba Ahsan
Denis Jered McInerney
Jisoo Kim
Christopher Potter
Geoffrey S. Young
Silvio Amir
Byron C. Wallace
19
12
0
08 Sep 2023
Cognitive Architectures for Language Agents
Cognitive Architectures for Language Agents
T. Sumers
Shunyu Yao
Karthik Narasimhan
Thomas L. Griffiths
LLMAG
LM&Ro
54
153
0
05 Sep 2023
Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through
  the Lens of Moral Theories?
Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
Jingyan Zhou
Minda Hu
Junan Li
Xiaoying Zhang
Xixin Wu
Irwin King
Helen M. Meng
LRM
42
24
0
29 Aug 2023
Adversarial Fine-Tuning of Language Models: An Iterative Optimisation
  Approach for the Generation and Detection of Problematic Content
Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content
Charles OÑeill
Jack Miller
I. Ciucă
Y. Ting 丁
Thang Bui
25
3
0
26 Aug 2023
ISR-LLM: Iterative Self-Refined Large Language Model for Long-Horizon
  Sequential Task Planning
ISR-LLM: Iterative Self-Refined Large Language Model for Long-Horizon Sequential Task Planning
Zhehua Zhou
Jiayang Song
Kunpeng Yao
Zhan Shu
Lei Ma
14
57
0
26 Aug 2023
Large Language Models Should Ask Clarifying Questions to Increase
  Confidence in Generated Code
Large Language Models Should Ask Clarifying Questions to Increase Confidence in Generated Code
Jiexi Wu
26
3
0
25 Aug 2023
CodeCoT: Tackling Code Syntax Errors in CoT Reasoning for Code
  Generation
CodeCoT: Tackling Code Syntax Errors in CoT Reasoning for Code Generation
Dong Huang
Qi Bu
Yuhao Qing
Heming Cui
LRM
24
16
0
17 Aug 2023
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Anna Rogers
A. Luccioni
53
19
0
14 Aug 2023
Enhancing Phenotype Recognition in Clinical Notes Using Large Language
  Models: PhenoBCBERT and PhenoGPT
Enhancing Phenotype Recognition in Clinical Notes Using Large Language Models: PhenoBCBERT and PhenoGPT
Jing Yang
Cong Liu
Wendy Deng
Dangwei Wu
C. Weng
Yunyun Zhou
Kai Wang
27
20
0
11 Aug 2023
Shepherd: A Critic for Language Model Generation
Shepherd: A Critic for Language Model Generation
Tianlu Wang
Ping Yu
Xiaoqing Ellen Tan
Sean O'Brien
Ramakanth Pasunuru
Jane Dwivedi-Yu
O. Yu. Golovneva
Luke Zettlemoyer
Maryam Fazel-Zarandi
Asli Celikyilmaz
ALM
31
78
0
08 Aug 2023
Automatically Correcting Large Language Models: Surveying the landscape
  of diverse self-correction strategies
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies
Liangming Pan
Michael Stephen Saxon
Wenda Xu
Deepak Nathani
Xinyi Wang
William Yang Wang
KELM
LRM
41
201
0
06 Aug 2023
Scaling Relationship on Learning Mathematical Reasoning with Large
  Language Models
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Zheng Yuan
Hongyi Yuan
Cheng Li
Guanting Dong
Keming Lu
Chuanqi Tan
Chang Zhou
Jingren Zhou
LRM
ALM
27
160
0
03 Aug 2023
The Hitchhiker's Guide to Program Analysis: A Journey with Large
  Language Models
The Hitchhiker's Guide to Program Analysis: A Journey with Large Language Models
Haonan Li
Yu Hao
Yizhuo Zhai
Zhiyun Qian
LLMAG
30
25
0
01 Aug 2023
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for
  Complex Visual Reasoning Tasks
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks
Kousik Rajesh
Mrigank Raman
M. A. Karim
Pranit Chawla
VLM
25
2
0
31 Jul 2023
Mental-LLM: Leveraging Large Language Models for Mental Health
  Prediction via Online Text Data
Mental-LLM: Leveraging Large Language Models for Mental Health Prediction via Online Text Data
Xuhai Xu
Bingsheng Yao
Yu Dong
Saadia Gabriel
Hongfeng Yu
James A. Hendler
Marzyeh Ghassemi
A. Dey
Dakuo Wang
LM&MA
CLL
AI4MH
48
64
0
26 Jul 2023
RLCD: Reinforcement Learning from Contrastive Distillation for Language
  Model Alignment
RLCD: Reinforcement Learning from Contrastive Distillation for Language Model Alignment
Kevin Kaichuang Yang
Dan Klein
Asli Celikyilmaz
Nanyun Peng
Yuandong Tian
ALM
34
30
0
24 Jul 2023
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot
  Classification
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
Neel Guha
Mayee F. Chen
Kush S. Bhatia
Azalia Mirhoseini
Frederic Sala
Christopher Ré
26
4
0
20 Jul 2023
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities
  of Large Language Models
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models
Xiaoxuan Wang
Ziniu Hu
Pan Lu
Yanqiao Zhu
Jieyu Zhang
Satyen Subramaniam
Arjun R. Loomba
Shichang Zhang
Yizhou Sun
Wei Wang
ELM
LRM
30
85
0
20 Jul 2023
Multi-Method Self-Training: Improving Code Generation With Text, And
  Vice Versa
Multi-Method Self-Training: Improving Code Generation With Text, And Vice Versa
Shriyash Upadhyay
Etan Ginsberg
SyDa
LRM
19
0
0
20 Jul 2023
Do Models Explain Themselves? Counterfactual Simulatability of Natural
  Language Explanations
Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations
Yanda Chen
Ruiqi Zhong
Narutatsu Ri
Chen Zhao
He He
Jacob Steinhardt
Zhou Yu
Kathleen McKeown
LRM
26
47
0
17 Jul 2023
Self-Adaptive Large Language Model (LLM)-Based Multiagent Systems
Self-Adaptive Large Language Model (LLM)-Based Multiagent Systems
Nathalia Nascimento
Paulo S. C. Alencar
Donald D. Cowan
LLMAG
22
37
0
12 Jul 2023
Teaching Arithmetic to Small Transformers
Teaching Arithmetic to Small Transformers
Nayoung Lee
Kartik K. Sreenivasan
Jason D. Lee
Kangwook Lee
Dimitris Papailiopoulos
LRM
32
81
0
07 Jul 2023
Self-Consuming Generative Models Go MAD
Self-Consuming Generative Models Go MAD
Sina Alemohammad
Josue Casco-Rodriguez
Lorenzo Luzi
Ahmed Imtiaz Humayun
Hossein Babaei
Daniel LeJeune
Ali Siahkoohi
Richard G. Baraniuk
WIGM
18
140
0
04 Jul 2023
Minimum Levels of Interpretability for Artificial Moral Agents
Minimum Levels of Interpretability for Artificial Moral Agents
Avish Vijayaraghavan
C. Badea
AI4CE
27
5
0
02 Jul 2023
When Foundation Model Meets Federated Learning: Motivations, Challenges, and Future Directions
When Foundation Model Meets Federated Learning: Motivations, Challenges, and Future Directions
Weiming Zhuang
Chen Chen
Lingjuan Lyu
Cheng Chen
Yaochu Jin
Lingjuan Lyu
AIFin
AI4CE
99
85
0
27 Jun 2023
Language models are weak learners
Language models are weak learners
Hariharan Manikandan
Yiding Jiang
J Zico Kolter
38
15
0
25 Jun 2023
Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think"
  Step-by-Step
Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step
Liunian Harold Li
Jack Hessel
Youngjae Yu
Xiang Ren
Kai-Wei Chang
Yejin Choi
LRM
AI4CE
ReLM
22
129
0
24 Jun 2023
Boosting Language Models Reasoning with Chain-of-Knowledge Prompting
Boosting Language Models Reasoning with Chain-of-Knowledge Prompting
J. Wang
Qiushi Sun
Xiang Li
Ming Gao
ReLM
LRM
21
64
0
10 Jun 2023
InstructZero: Efficient Instruction Optimization for Black-Box Large
  Language Models
InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models
Lichang Chen
Jiuhai Chen
Tom Goldstein
Heng-Chiao Huang
Dinesh Manocha
19
42
0
05 Jun 2023
What does the Failure to Reason with "Respectively" in Zero/Few-Shot
  Settings Tell Us about Language Models?
What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?
Ruixiang Cui
Seolhwa Lee
Daniel Hershcovich
Anders Søgaard
30
2
0
31 May 2023
Domain Specialization as the Key to Make Large Language Models
  Disruptive: A Comprehensive Survey
Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey
Chen Ling
Xujiang Zhao
Jiaying Lu
Chengyuan Deng
Can Zheng
...
Chris White
Quanquan Gu
Jian Pei
Carl Yang
Liang Zhao
ALM
25
126
0
30 May 2023
Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning
Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning
Zhanming Jie
Wei Lu
LRM
ReLM
25
15
0
29 May 2023
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain
  Feedback
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback
Shengchao Liu
Jiong Wang
Yijin Yang
Chengpeng Wang
Ling Liu
Hongyu Guo
Chaowei Xiao
LM&MA
KELM
AI4MH
28
39
0
29 May 2023
Training Socially Aligned Language Models on Simulated Social
  Interactions
Training Socially Aligned Language Models on Simulated Social Interactions
Ruibo Liu
Ruixin Yang
Chenyan Jia
Ge Zhang
Denny Zhou
Andrew M. Dai
Diyi Yang
Soroush Vosoughi
ALM
37
45
0
26 May 2023
RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting
RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting
Lei Shu
Liangchen Luo
Jayakumar Hoskere
Yun Zhu
Canoee Liu
Simon Tong
Jindong Chen
Lei Meng
KELM
LRM
32
43
0
25 May 2023
SAIL: Search-Augmented Instruction Learning
SAIL: Search-Augmented Instruction Learning
Hongyin Luo
Yung-Sung Chuang
Yuan Gong
Tianhua Zhang
Yoon Kim
Xixin Wu
D. Fox
Helen Meng
James R. Glass
ALM
LRM
RALM
31
22
0
24 May 2023
Universal Self-Adaptive Prompting
Universal Self-Adaptive Prompting
Xingchen Wan
Ruoxi Sun
Hootan Nakhost
H. Dai
Julian Martin Eisenschlos
Sercan Ö. Arik
Tomas Pfister
LRM
38
9
0
24 May 2023
ALGO: Synthesizing Algorithmic Programs with LLM-Generated Oracle
  Verifiers
ALGO: Synthesizing Algorithmic Programs with LLM-Generated Oracle Verifiers
Kexun Zhang
Danqing Wang
Jingtao Xia
William Yang Wang
Lei Li
33
40
0
24 May 2023
PEARL: Prompting Large Language Models to Plan and Execute Actions Over
  Long Documents
PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents
Simeng Sun
Y. Liu
Shuohang Wang
Chenguang Zhu
Mohit Iyyer
RALM
LRM
ReLM
27
51
0
23 May 2023
Language Model Self-improvement by Reinforcement Learning Contemplation
Language Model Self-improvement by Reinforcement Learning Contemplation
Jing-Cheng Pang
Pengyuan Wang
Kaiyuan Li
Xiong-Hui Chen
Jiacheng Xu
Zongzhang Zhang
Yang Yu
LRM
KELM
13
43
0
23 May 2023
Better Zero-Shot Reasoning with Self-Adaptive Prompting
Better Zero-Shot Reasoning with Self-Adaptive Prompting
Xingchen Wan
Ruoxi Sun
H. Dai
Sercan Ö. Arik
Tomas Pfister
ReLM
OffRL
LRM
18
48
0
23 May 2023
The CoT Collection: Improving Zero-shot and Few-shot Learning of
  Language Models via Chain-of-Thought Fine-Tuning
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
Seungone Kim
Se June Joo
Doyoung Kim
Joel Jang
Seonghyeon Ye
Jamin Shin
Minjoon Seo
ALM
RALM
LRM
23
96
0
23 May 2023
Learning from Mistakes via Cooperative Study Assistant for Large
  Language Models
Learning from Mistakes via Cooperative Study Assistant for Large Language Models
Danqing Wang
Lei Li
32
6
0
23 May 2023
Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering
Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering
Mingda Chen
Xilun Chen
Wen-tau Yih
SyDa
14
6
0
23 May 2023
Learning Interpretable Style Embeddings via Prompting LLMs
Learning Interpretable Style Embeddings via Prompting LLMs
Ajay Patel
D. Rao
Ansh Kothary
Kathleen McKeown
Chris Callison-Burch
37
23
0
22 May 2023
Evaluation of medium-large Language Models at zero-shot closed book
  generative question answering
Evaluation of medium-large Language Models at zero-shot closed book generative question answering
René Peinl
Johannes Wirth
ELM
26
7
0
19 May 2023
Solving NLP Problems through Human-System Collaboration: A
  Discussion-based Approach
Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach
Masahiro Kaneko
Graham Neubig
Naoaki Okazaki
33
6
0
19 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive
  Critiquing
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
36
357
0
19 May 2023
Previous
123456789
Next