ResearchTrend.AI
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance


5 May 2024
Matthew Renze
Erhan Guven
    LRM
    LLMAG
arXiv: 2405.06682

Papers citing "Self-Reflection in LLM Agents: Effects on Problem-Solving Performance"

33 / 33 papers shown
Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL
Songjun Tu
Jiahao Lin
Qichao Zhang
Xiangyu Tian
Linjing Li
Xiangyuan Lan
Dongbin Zhao
OffRL
ReLM
LRM
15
0
0
16 May 2025
SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning
Z. Li
Qingxiu Dong
Jingyuan Ma
Di Zhang
Zhifang Sui
LRM
9
0
0
16 May 2025
Scalable LLM Math Reasoning Acceleration with Low-rank Distillation
Harry Dong
Bilge Acun
Beidi Chen
Yuejie Chi
LRM
34
0
0
08 May 2025
Fast-Slow Thinking for Large Vision-Language Model Reasoning
W. L. Xiao
Leilei Gan
Weilong Dai
Wanggui He
Ziwei Huang
...
Fangxun Shu
Zhelun Yu
Peng Zhang
Hao Jiang
Fei Wu
ReLM
LRM
AI4CE
164
1
0
25 Apr 2025
Dynamic Early Exit in Reasoning Models
Chenxu Yang
Qingyi Si
Yongjie Duan
Zheliang Zhu
Chenyu Zhu
Zheng-Shen Lin
Li Cao
Weiping Wang
ReLM
LRM
32
0
0
22 Apr 2025
Efficient Reasoning Models: A Survey
Sicheng Feng
Gongfan Fang
Xinyin Ma
Xinchao Wang
ReLM
LRM
145
0
0
15 Apr 2025
Reasoning Models Can Be Effective Without Thinking
Wenjie Ma
Jingxuan He
Charlie Snell
Tyler Griggs
Sewon Min
Matei A. Zaharia
ReLM
LRM
52
6
1
14 Apr 2025
Concise Reasoning via Reinforcement Learning
Mehdi Fatemi
Banafsheh Rafiee
Mingjie Tang
Kartik Talamadupula
ReLM
OffRL
LRM
52
4
0
07 Apr 2025
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Yang Sui
Yu-Neng Chuang
Guanchu Wang
Jiamu Zhang
Tianyi Zhang
...
Hongyi Liu
Andrew Wen
Shaochen Zhong
Hanjie Chen
OffRL
ReLM
LRM
75
29
0
20 Mar 2025
How Well do LLMs Compress Their Own Chain-of-Thought? A Token Complexity Approach
Ayeong Lee
Ethan Che
Tianyi Peng
LRM
42
11
0
03 Mar 2025
Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation
Yunhai Feng
Jiaming Han
Z. Yang
Xiangyu Yue
Sergey Levine
Jianlan Luo
LM&Ro
56
2
0
23 Feb 2025
Optimizing Model Selection for Compound AI Systems
Lingjiao Chen
Jared Quincy Davis
Boris Hanin
Peter Bailis
Matei A. Zaharia
James Y. Zou
Ion Stoica
54
1
0
20 Feb 2025
RTBAS: Defending LLM Agents Against Prompt Injection and Privacy Leakage
Peter Yong Zhong
Siyuan Chen
Ruiqi Wang
McKenna McCall
Ben L. Titzer
Heather Miller
Phillip B. Gibbons
LLMAG
90
3
0
17 Feb 2025
On Memory Construction and Retrieval for Personalized Conversational Agents
Zhuoshi Pan
Qianhui Wu
Huiqiang Jiang
Xufang Luo
Hao Cheng
...
Y. Yang
Chin-Yew Lin
H. V. Zhao
Lili Qiu
Jianfeng Gao
RALM
56
3
0
08 Feb 2025
Aligning Instruction Tuning with Pre-training
Yiming Liang
Tianyu Zheng
Xinrun Du
Ge Zhang
J. Liu
...
Zhaoxiang Zhang
Wenhao Huang
Jiajun Zhang
Xiang Yue
Jiajun Zhang
86
1
0
16 Jan 2025
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Xinyu Guan
L. Zhang
Yifei Liu
Ning Shang
Youran Sun
Yi Zhu
Fan Yang
Mao Yang
LRM
SyDa
ReLM
62
78
0
08 Jan 2025
AnalogXpert: Automating Analog Topology Synthesis by Incorporating Circuit Design Expertise into Large Language Models
Haoyi Zhang
Shizhao Sun
Yibo Lin
Runsheng Wang
Jiang Bian
52
0
0
31 Dec 2024
Positive Experience Reflection for Agents in Interactive Text Environments
Philip Lippmann
Matthijs Spaan
Jie-jin Yang
LLMAG
33
0
0
04 Nov 2024
MorphAgent: Empowering Agents through Self-Evolving Profiles and Decentralized Collaboration
Siyuan Lu
Jiaqi Shao
B. Luo
Tao Lin
LM&Ro
LLMAG
AI4CE
29
2
0
19 Oct 2024
MultiTalk: Introspective and Extrospective Dialogue for Human-Environment-LLM Alignment
Venkata Naren Devarakonda
Ali Umut Kaypak
Shuaihang Yuan
P. Krishnamurthy
Yi Fang
Farshad Khorrami
LLMAG
44
0
0
24 Sep 2024
HySem: A context length optimized LLM pipeline for unstructured tabular extraction
Narayanan PP
A. P. N. Iyer
36
0
0
18 Aug 2024
Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy
Zhenyu Guan
Xiangyu Kong
Fangwei Zhong
Yizhou Wang
36
6
0
09 Jul 2024
Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
Aman Singh Thakur
Kartik Choudhary
Venkat Srinik Ramayapally
Sankaran Vaidyanathan
Dieuwke Hupkes
ELM
ALM
61
55
0
18 Jun 2024
A Theoretical Understanding of Self-Correction through In-context Alignment
Yifei Wang
Yuyang Wu
Zeming Wei
Stefanie Jegelka
Yisen Wang
LRM
41
13
0
28 May 2024
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation
Zihao Wang
Anji Liu
Haowei Lin
Jiaqi Li
Xiaojian Ma
Yitao Liang
ReLM
RALM
LRM
91
47
0
08 Mar 2024
Large Language Models: A Survey
Shervin Minaee
Tomáš Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
131
369
0
09 Feb 2024
Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models
Chirag Agarwal
Sree Harsha Tanneru
Himabindu Lakkaraju
LRM
37
35
0
07 Feb 2024
The Benefits of a Concise Chain of Thought on Problem-Solving in Large Language Models
Matthew Renze
Erhan Guven
LRM
41
36
0
11 Jan 2024
Metacognition is all you need? Using Introspection in Generative Agents to Improve Goal-directed Behavior
Jason Toy
Josh MacAdam
Phil Tabor
LLMAG
LRM
AI4CE
53
4
0
09 Jan 2024
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Akari Asai
Zeqiu Wu
Yizhong Wang
Avirup Sil
Hannaneh Hajishirzi
RALM
159
631
0
17 Oct 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
240
2,494
0
06 Oct 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
322
4,077
0
24 May 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
361
8,495
0
28 Jan 2022