Self-Consistency Improves Chain of Thought Reasoning in Language Models

21 March 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM, BDL, LRM, AI4CE

Papers citing "Self-Consistency Improves Chain of Thought Reasoning in Language Models"

50 / 908 papers shown
Thought calibration: Efficient and confident test-time scaling
Menghua Wu
Cai Zhou
Stephen Bates
Tommi Jaakkola
LRM
83
0
0
23 May 2025
SLearnLLM: A Self-Learning Framework for Efficient Domain-Specific Adaptation of Large Language Models
Xiang Liu
Zhaoxiang Liu
Peng Wang
Kohou Wang
Huan Hu
Kai Wang
Shiguo Lian
201
0
0
23 May 2025
Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States
Yang Xiao
Jiashuo Wang
Qiancheng Xu
Changhe Song
Chunpu Xu
Yi Cheng
Wenjie Li
Pengfei Liu
200
0
0
23 May 2025
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
Litao Guo
Xinli Xu
Luozhou Wang
Jiantao Lin
Jinsong Zhou
Zixin Zhang
Bolan Su
Ying-Cong Chen
LLMAG, LRM
86
1
0
23 May 2025
Fast Quiet-STaR: Thinking Without Thought Tokens
Wei Huang
Yizhe Xiong
Xin Ye
Zhijie Deng
Hui Chen
Zijia Lin
Guiguang Ding
LLMAG, LRM, VLM
56
0
0
23 May 2025
Value-Guided Search for Efficient Chain-of-Thought Reasoning
Kaiwen Wang
Jin Peng Zhou
Jonathan D. Chang
Zhaolin Gao
Nathan Kallus
Kianté Brantley
Wen Sun
LRM
90
1
0
23 May 2025
FlashForge: Ultra-Efficient Prefix-Aware Attention for LLM Decoding
Zhibin Wang
Rui Ning
Chao Fang
Zhonghui Zhang
Xi Lin
...
Rong Gu
Kun Yang
Guihai Chen
Sheng Zhong
Chen Tian
58
0
0
23 May 2025
UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification
Poojah Ganesan
Rajat Aayush Jha
Dan Roth
Vivek Gupta
79
0
0
23 May 2025
ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning Models
Razvan-Gabriel Dumitru
Darius Peteleaza
Vikas Yadav
Liangming Pan
ReLM, LRM
115
1
0
22 May 2025
Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning
Junhong Lin
Xinyue Zeng
Jie Zhu
Song Wang
Julian Shun
Jun Wu
Dawei Zhou
LRM
162
1
0
22 May 2025
SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development
Yaxin Du
Yuzhu Cai
Yifan Zhou
Cheng-Yu Wang
Yu Qian
Xianghe Pang
Qian Liu
Yue Hu
Siheng Chen
61
0
0
22 May 2025
Select2Reason: Efficient Instruction-Tuning Data Selection for Long-CoT Reasoning
Cehao Yang
Xueyuan Lin
Chengjin Xu
Xuhui Jiang
Xiaojun Wu
Honghao Liu
Hui Xiong
Jian Guo
LRM
104
0
0
22 May 2025
VLM-R$^3$: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought
Chaoya Jiang
Yongrui Heng
Wei Ye
Han Yang
Haiyang Xu
Ming Yan
Ji Zhang
Fei Huang
Shikun Zhang
LRM
75
0
0
22 May 2025
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
Shivam Agarwal
Zimin Zhang
Lifan Yuan
Jiawei Han
Hao Peng
162
8
0
21 May 2025
Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision
Eric Hanchen Jiang
Haozheng Luo
Shengyuan Pang
Xiaomin Li
Zhenting Qi
...
Zongyu Lin
Xinfeng Li
Hao Xu
Kai-Wei Chang
Ying Nian Wu
LRM
120
0
0
21 May 2025
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
Zhen Zhang
Xuehai He
Weixiang Yan
Ao Shen
Chenyang Zhao
Shuaiqiang Wang
Yelong Shen
Xin Eric Wang
LRM
117
3
0
21 May 2025
Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities
Jinyang Wu
Chonghua Liao
Mingkuan Feng
Shuai Zhang
Zhengqi Wen
Pengpeng Shao
Huazhe Xu
Jianhua Tao
LRM, OffRL
143
3
0
21 May 2025
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models
Yuchen Yan
Jin Jiang
Zhenbang Ren
Yijun Li
Xudong Cai
...
Mengdi Zhang
Jian Shao
Yongliang Shen
Jun Xiao
Yueting Zhuang
OffRL, ALM, LRM
134
0
0
21 May 2025
Conformal Language Model Reasoning with Coherent Factuality
Maxon Rubin-Toles
Maya Gambhir
Keshav Ramji
Aaron Roth
Surbhi Goel
HILM, LRM
79
2
0
21 May 2025
KaFT: Knowledge-aware Fine-tuning for Boosting LLMs' Domain-specific Question-Answering Performance
Qihuang Zhong
Liang Ding
Xiantao Cai
Juhua Liu
Bo Du
Dacheng Tao
100
0
0
21 May 2025
Small Language Models in the Real World: Insights from Industrial Text Classification
Lujun Li
Lama Sleem
Niccolo Gentile
Geoffrey Nichil
Radu State
LLMAG
218
0
0
21 May 2025
ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy
Gengyang Li
Yifeng Gao
Yuming Li
Yunfang Wu
ReLM, OffRL, LRM
130
3
0
21 May 2025
Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning
Jiashu He
Jinxuan Fan
Bowen Jiang
Ignacio Houine
Dan Roth
Alejandro Ribeiro
ReLM, RALM, LRM
100
2
0
21 May 2025
DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data
Yuhang Zhou
Jing Zhu
Shengyi Qian
Zhuokai Zhao
Xiyao Wang
Xiaoyu Liu
Ming Li
Paiheng Xu
Wei Ai
Furong Huang
95
1
0
21 May 2025
Toward Reliable Scientific Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models
Guangzhi Xiong
Eric Xie
Corey Williams
Myles Kim
Amir Hassan Shariatmadari
Sikun Guo
Stefan Bekiranov
Aidong Zhang
HILM, LM&MA
123
0
0
20 May 2025
SHARP: Synthesizing High-quality Aligned Reasoning Problems for Large Reasoning Models Reinforcement Learning
Xiong Jun Wu
Zhenduo Zhang
ZuJie Wen
Zhiqiang Zhang
Wang Ren
...
Xudong Han
Chengfu Tang
Dingnan Jin
Qing Cui
Jun Zhou
LRM
221
1
0
20 May 2025
CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning
Lei Sheng
Shuai-Shuai Xu
LRM
80
0
0
19 May 2025
Prompt Stability Matters: Evaluating and Optimizing Auto-Generated Prompt in General-Purpose Systems
Ke Chen
Yufei Zhou
Xitong Zhang
Haohan Wang
100
1
0
19 May 2025
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
Yicheng Xiao
Lin Song
Yukang Chen
Yingmin Luo
Yuxin Chen
Yukang Gan
Wei Huang
Xiu Li
Xiaojuan Qi
Ying Shan
LRM
107
5
0
19 May 2025
J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization
Austin Xu
Yilun Zhou
Xuan-Phi Nguyen
Caiming Xiong
Shafiq Joty
ELM, LRM
146
0
0
19 May 2025
On the Thinking-Language Modeling Gap in Large Language Models
Chenxi Liu
Yongqiang Chen
Tongliang Liu
James Cheng
Bo Han
Kun Zhang
LRM, AI4CE
81
0
0
19 May 2025
Introspective Growth: Automatically Advancing LLM Expertise in Technology Judgment
Siyang Wu
Honglin Bao
Nadav Kunievsky
James A. Evans
132
0
0
18 May 2025
BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs
Junxiao Yang
Jinzhe Tu
Haoran Liu
Xiaoce Wang
Chujie Zheng
...
Caishun Chen
Tiantian He
Hongning Wang
Yew-Soon Ong
Minlie Huang
LRM
107
0
0
18 May 2025
LAMP: Extracting Locally Linear Decision Surfaces from LLM World Models
Ryan Chen
Youngmin Ko
Zeyu Zhang
Catherine Cho
Sunny Chung
Mauro Giuffré
Dennis L. Shung
Bradly C. Stadie
181
0
0
17 May 2025
Retrospex: Language Agent Meets Offline Reinforcement Learning Critic
Yufei Xiang
Yiqun Shen
Yeqin Zhang
Cam-Tu Nguyen
OffRL, LLMAG, KELM, LRM
232
3
0
17 May 2025
Rethinking Optimal Verification Granularity for Compute-Efficient Test-Time Scaling
Hao Mark Chen
Guanxi Lu
Yasuyuki Okoshi
Zhiwen Mo
Masato Motomura
Hongxiang Fan
LRM
114
0
0
16 May 2025
Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory
Yexiang Liu
Zekun Li
Zhi Fang
Nan Xu
Ran He
Tieniu Tan
LRM
76
0
0
16 May 2025
SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning
Yige Xu
Xu Guo
Zhiwei Zeng
Chunyan Miao
BDL, LRM
146
1
0
16 May 2025
Mining Hidden Thoughts from Texts: Evaluating Continual Pretraining with Synthetic Data for LLM Reasoning
Yoichi Ishibashi
Taro Yano
Masafumi Oyamada
SyDa, LRM
108
2
0
15 May 2025
Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents
Mrinal Rawat
Ambuje Gupta
Rushil Goomer
Alessandro Di Bari
Neha Gupta
Roberto Pieraccini
LLMAG, LRM
102
0
0
15 May 2025
CEC-Zero: Chinese Error Correction Solution Based on LLM
Sophie Zhang
Zhiming Lin
62
0
0
14 May 2025
NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context
Ben Yao
Qiuchi Li
Yazhou Zhang
Siyu Yang
Bohan Zhang
Prayag Tiwari
Jing Qin
113
0
0
13 May 2025
Applying Cognitive Design Patterns to General LLM Agents
R. Wray
James R. Kirk
John E. Laird
LLMAG, AI4TS, AI4CE
120
0
0
11 May 2025
RedTeamLLM: an Agentic AI framework for offensive security
Brian Challita
Pierre Parrend
LLMAG
139
0
0
11 May 2025
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
Hang Gao
Chenhao Zhang
Tie Wang
Junsuo Zhao
Fengge Wu
Changwen Zheng
Huaping Liu
LRM
200
0
0
09 May 2025
VISTA: Generative Visual Imagination for Vision-and-Language Navigation
Yanjia Huang
Mingyang Wu
Renjie Li
Zhengzhong Tu
LM&Ro
115
0
0
09 May 2025
Scalable Chain of Thoughts via Elastic Reasoning
Yuhui Xu
Hanze Dong
Lei Wang
Doyen Sahoo
Junnan Li
Caiming Xiong
OffRL, LRM
125
8
0
08 May 2025
G-FOCUS: Towards a Robust Method for Assessing UI Design Persuasiveness
Jaehyun Jeon
Janghan Yoon
Minsoo Kim
Sumin Shim
Yejin Choi
Hanbin Kim
Youngjae Yu
AAML
161
0
0
08 May 2025
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
Min Chen
Jinglei Cheng
Pingzhi Li
Haoran Wang
Tianlong Chen
Junyu Liu
LRM
131
0
0
08 May 2025
Reasoning Models Don't Always Say What They Think
Yanda Chen
Joe Benton
Ansh Radhakrishnan
Jonathan Uesato
Carson E. Denison
...
Vlad Mikulik
Samuel R. Bowman
Jan Leike
Jared Kaplan
E. Perez
ReLM, LRM
166
51
1
08 May 2025