ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.11171
  4. Cited By
Self-Consistency Improves Chain of Thought Reasoning in Language Models
v1v2v3v4 (latest)

Self-Consistency Improves Chain of Thought Reasoning in Language Models

21 March 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
    ReLMBDLLRMAI4CE
ArXiv (abs)PDFHTML

Papers citing "Self-Consistency Improves Chain of Thought Reasoning in Language Models"

50 / 908 papers shown
Title
Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics
Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics
Dongyoung Kim
S. Park
Huiwon Jang
Jinwoo Shin
Jaehyung Kim
Younggyo Seo
LRM
35
0
0
29 May 2025
Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness
Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness
Yongjin Yang
Euiin Yi
Jongwoo Ko
Kimin Lee
Zhijing Jin
Se-Young Yun
LLMAG
56
0
0
29 May 2025
Decoding Cortical Microcircuits: A Generative Model for Latent Space Exploration and Controlled Synthesis
Decoding Cortical Microcircuits: A Generative Model for Latent Space Exploration and Controlled Synthesis
Xingyu Liu
Yubin Li
Guozhang Chen
27
0
0
29 May 2025
Continuous Chain of Thought Enables Parallel Exploration and Reasoning
Continuous Chain of Thought Enables Parallel Exploration and Reasoning
Halil Alperen Gozeten
M. E. Ildiz
Xuechen Zhang
Hrayr Harutyunyan
A. S. Rawat
Samet Oymak
LRM
69
0
0
29 May 2025
Probability-Consistent Preference Optimization for Enhanced LLM Reasoning
Probability-Consistent Preference Optimization for Enhanced LLM Reasoning
Yunqiao Yang
Houxing Ren
Zimu Lu
Ke Wang
Weikang Shi
A-Long Zhou
Junting Pan
Mingjie Zhan
Hongsheng Li
LRM
52
0
0
29 May 2025
On Learning Verifiers for Chain-of-Thought Reasoning
On Learning Verifiers for Chain-of-Thought Reasoning
Maria-Florina Balcan
Avrim Blum
Zhiyuan Li
Dravyansh Sharma
LRM
53
0
0
28 May 2025
Self-Critique and Refinement for Faithful Natural Language Explanations
Self-Critique and Refinement for Faithful Natural Language Explanations
Yingming Wang
Pepa Atanasova
LRM
124
0
0
28 May 2025
Knowledge Base Construction for Knowledge-Augmented Text-to-SQL
Knowledge Base Construction for Knowledge-Augmented Text-to-SQL
Jinheon Baek
Horst Samulowitz
Oktie Hassanzadeh
D. Subramanian
Sola S. Shirai
A. Gliozzo
D. Bhattacharjya
44
0
0
28 May 2025
Visual Large Language Models Exhibit Human-Level Cognitive Flexibility in the Wisconsin Card Sorting Test
Visual Large Language Models Exhibit Human-Level Cognitive Flexibility in the Wisconsin Card Sorting Test
Guangfu Hao
Frederic Alexandre
S. Yu
LRM
33
0
0
28 May 2025
ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark
ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark
M. Shalyt
Rotem Elimelech
I. Kaminer
32
0
0
28 May 2025
Scaling Reasoning without Attention
Scaling Reasoning without Attention
Xueliang Zhao
Wei Wu
Lingpeng Kong
OffRLReLMLRMVLM
79
0
0
28 May 2025
Scaling Offline RL via Efficient and Expressive Shortcut Models
Scaling Offline RL via Efficient and Expressive Shortcut Models
Nicolas Espinosa-Dice
Yiyi Zhang
Yiding Chen
Bradley Guo
Owen Oertell
Gokul Swamy
Kianté Brantley
Wen Sun
OffRLLRM
71
0
0
28 May 2025
Decomposing Elements of Problem Solving: What "Math" Does RL Teach?
Decomposing Elements of Problem Solving: What "Math" Does RL Teach?
Tian Qin
Core Francisco Park
Mujin Kwun
Aaron Walsman
Eran Malach
Nikhil Anand
Hidenori Tanaka
David Alvarez-Melis
ReLMOffRLLRM
90
0
0
28 May 2025
ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM
ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM
Hoang Pham
Thanh-Do Nguyen
Khac-Hoai Nam Bui
51
0
0
28 May 2025
Step-Wise Formal Verification for LLM-Based Mathematical Problem Solving
Step-Wise Formal Verification for LLM-Based Mathematical Problem Solving
Kuo Zhou
Lu Zhang
LRM
49
0
0
27 May 2025
Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA
Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA
Xiangqing Shen
Fanfan Wang
Rui Xia
RALM
28
0
0
27 May 2025
Pretraining Language Models to Ponder in Continuous Space
Pretraining Language Models to Ponder in Continuous Space
Boyi Zeng
Shixiang Song
Siyuan Huang
Yixuan Wang
He Li
Ziwei He
Xinbing Wang
Zhiyu Li
Zhouhan Lin
LRM
95
0
0
27 May 2025
Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies
Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies
Terrance Liu
Shuyi Wang
Daniel Preotiuc-Pietro
Yash Chandarana
Chirag Gupta
29
1
0
27 May 2025
Can Large Reasoning Models Self-Train?
Can Large Reasoning Models Self-Train?
Sheikh Shafayat
Fahim Tajwar
Ruslan Salakhutdinov
J. Schneider
Andrea Zanette
ReLMOffRLLRM
81
2
0
27 May 2025
MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning
MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning
Zikang Guo
Benfeng Xu
Xiaorui Wang
Zhendong Mao
83
0
0
27 May 2025
VeriTrail: Closed-Domain Hallucination Detection with Traceability
VeriTrail: Closed-Domain Hallucination Detection with Traceability
Dasha Metropolitansky
Jonathan Larson
HILM
59
0
0
27 May 2025
Factual Self-Awareness in Language Models: Representation, Robustness, and Scaling
Factual Self-Awareness in Language Models: Representation, Robustness, and Scaling
Hovhannes Tamoyan
Subhabrata Dutta
Iryna Gurevych
HILMKELM
58
0
0
27 May 2025
Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration
Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration
Sibo Xiao
Zixin Lin
Wenyang Gao
Yue Zhang
Yue Zhang
LLMAG
72
0
0
27 May 2025
Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing
Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing
Raoyuan Zhao
Abdullatif Köksal
Ali Modarressi
Michael A. Hedderich
Hinrich Schutze
49
0
0
27 May 2025
The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
Yiqun Zhang
Hao Li
Chenxu Wang
L. Chen
Qiaosheng Zhang
...
Xinrun Wang
Jia Xu
Lei Bai
Wanli Ouyang
Shuyue Hu
79
0
0
26 May 2025
Estimating LLM Consistency: A User Baseline vs Surrogate Metrics
Estimating LLM Consistency: A User Baseline vs Surrogate Metrics
Xiaoyuan Wu
Weiran Lin
Omer Akgul
Lujo Bauer
HILM
24
0
0
26 May 2025
Faster and Better LLMs via Latency-Aware Test-Time Scaling
Faster and Better LLMs via Latency-Aware Test-Time Scaling
Zili Wang
Tianyu Zhang
Haoli Bai
Lu Hou
Xianzhi Yu
Wulong Liu
Shiming Xiang
Lei Zhu
LRM
91
0
0
26 May 2025
Minimalist Softmax Attention Provably Learns Constrained Boolean Functions
Minimalist Softmax Attention Provably Learns Constrained Boolean Functions
Jerry Yao-Chieh Hu
Xiwen Zhang
Maojiang Su
Zhao Song
Han Liu
MLT
243
1
0
26 May 2025
Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision
Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision
Tej Deep Pala
Panshul Sharma
Amir Zadeh
Chuan Li
Soujanya Poria
LRM
54
0
0
26 May 2025
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
Peijie Dong
Zhenheng Tang
Xiang Liu
Lujun Li
Xiaowen Chu
Bo Li
106
0
0
26 May 2025
Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
Zhihong Pan
Kai Zhang
Yuze Zhao
Yupeng Han
LRM
63
0
0
26 May 2025
An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation
An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation
Shubham Gandhi
Atharva Naik
Yiqing Xie
Carolyn Rose
61
0
0
26 May 2025
Self-Reflective Planning with Knowledge Graphs: Enhancing LLM Reasoning Reliability for Question Answering
Self-Reflective Planning with Knowledge Graphs: Enhancing LLM Reasoning Reliability for Question Answering
J. Zhu
Ye Liu
Meikai Bao
Kai Zhang
Yanghai Zhang
Qi Liu
LRM
49
0
0
26 May 2025
Training-Free Multi-Step Audio Source Separation
Training-Free Multi-Step Audio Source Separation
Yongyi Zang
Jingyi Li
Qiuqiang Kong
239
0
0
26 May 2025
DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning
DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning
Qi Cao
Ruiyi Wang
Ruiyi Zhang
Sai Ashish Somayajula
P. Xie
LRM
100
0
0
26 May 2025
MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search
MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search
Zonglin Yang
Wanhao Liu
Ben Gao
Y. Liu
Wei-Hong Li
Tong Xie
Lidong Bing
Wanli Ouyang
Erik Cambria
Dongzhan Zhou
69
0
0
25 May 2025
To CoT or To Loop? A Formal Comparison Between Chain-of-Thought and Looped Transformers
To CoT or To Loop? A Formal Comparison Between Chain-of-Thought and Looped Transformers
Kevin Xu
Issei Sato
LRM
68
0
0
25 May 2025
A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations
A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations
Lingjun Zhao
Hal Daumé III
168
0
0
25 May 2025
Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering
Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering
Zheng Chu
H. Fan
Jingchang Chen
Qianyu Wang
M. Yang
...
Zhongjie Wang
Hao Li
Guo Tang
Ming Liu
Bing Qin
ReLMLRM
96
0
0
25 May 2025
Optimal Transport-Based Token Weighting scheme for Enhanced Preference Optimization
Optimal Transport-Based Token Weighting scheme for Enhanced Preference Optimization
Meng Li
Guangda Huzhang
Haibo Zhang
Xiting Wang
Anxiang Zeng
42
0
0
24 May 2025
Efficient Long CoT Reasoning in Small Language Models
Efficient Long CoT Reasoning in Small Language Models
Z. Wang
Jinqi Jiang
Tian Qiu
Hui Liu
Xianfeng Tang
Huaxiu Yao
OffRLReLMLRM
90
0
0
24 May 2025
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation
Ruichen Zhang
Rana Muhammad Shahroz Khan
Zhen Tan
Dawei Li
Song Wang
Tianlong Chen
LRM
60
0
0
24 May 2025
Knowledge Retrieval in LLM Gaming: A Shift from Entity-Centric to Goal-Oriented Graphs
Knowledge Retrieval in LLM Gaming: A Shift from Entity-Centric to Goal-Oriented Graphs
Jonathan Leung
Yongjie Wang
Zhiqi Shen
LRM
27
0
0
24 May 2025
Flex-Judge: Think Once, Judge Anywhere
Flex-Judge: Think Once, Judge Anywhere
Jongwoo Ko
S. Kim
Sungwoo Cho
Se-Young Yun
ELMLRM
218
0
0
24 May 2025
How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark
How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark
Minglai Yang
Ethan Huang
Liang Zhang
Mihai Surdeanu
William Yang Wang
Liangming Pan
LRM
59
0
0
24 May 2025
FlashForge: Ultra-Efficient Prefix-Aware Attention for LLM Decoding
FlashForge: Ultra-Efficient Prefix-Aware Attention for LLM Decoding
Zhibin Wang
Rui Ning
Chao Fang
Zhonghui Zhang
Xi Lin
...
Rong Gu
Kun Yang
Guihai Chen
Sheng Zhong
Chen Tian
58
0
0
23 May 2025
Fast Quiet-STaR: Thinking Without Thought Tokens
Wei Huang
Yizhe Xiong
Xin Ye
Zhijie Deng
Hui Chen
Zijia Lin
Guiguang Ding
LLMAGLRMVLM
56
0
0
23 May 2025
Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration
Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration
Jingtong Gao
Ling Pan
Yejing Wang
Rui Zhong
Chi Lu
Qingpeng Cai
Peng Jiang
Xiangyu Zhao
LRM
101
1
0
23 May 2025
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning
Michael Hassid
Gabriel Synnaeve
Yossi Adi
Roy Schwartz
ReLMLRM
113
1
0
23 May 2025
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
Litao Guo
Xinli Xu
Luozhou Wang
Jiantao Lin
Jinsong Zhou
Zixin Zhang
Bolan Su
Ying-Cong Chen
LLMAGLRM
86
1
0
23 May 2025
Previous
123456...171819
Next