2203.11171
Cited By
Self-Consistency Improves Chain of Thought Reasoning in Language Models
21 March 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
Papers citing
"Self-Consistency Improves Chain of Thought Reasoning in Language Models"
50 / 908 papers shown
Title
Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics
Dongyoung Kim
S. Park
Huiwon Jang
Jinwoo Shin
Jaehyung Kim
Younggyo Seo
LRM
35
0
0
29 May 2025
Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness
Yongjin Yang
Euiin Yi
Jongwoo Ko
Kimin Lee
Zhijing Jin
Se-Young Yun
LLMAG
56
0
0
29 May 2025
Decoding Cortical Microcircuits: A Generative Model for Latent Space Exploration and Controlled Synthesis
Xingyu Liu
Yubin Li
Guozhang Chen
27
0
0
29 May 2025
Continuous Chain of Thought Enables Parallel Exploration and Reasoning
Halil Alperen Gozeten
M. E. Ildiz
Xuechen Zhang
Hrayr Harutyunyan
A. S. Rawat
Samet Oymak
LRM
69
0
0
29 May 2025
Probability-Consistent Preference Optimization for Enhanced LLM Reasoning
Yunqiao Yang
Houxing Ren
Zimu Lu
Ke Wang
Weikang Shi
A-Long Zhou
Junting Pan
Mingjie Zhan
Hongsheng Li
LRM
52
0
0
29 May 2025
On Learning Verifiers for Chain-of-Thought Reasoning
Maria-Florina Balcan
Avrim Blum
Zhiyuan Li
Dravyansh Sharma
LRM
53
0
0
28 May 2025
Self-Critique and Refinement for Faithful Natural Language Explanations
Yingming Wang
Pepa Atanasova
LRM
126
0
0
28 May 2025
Knowledge Base Construction for Knowledge-Augmented Text-to-SQL
Jinheon Baek
Horst Samulowitz
Oktie Hassanzadeh
D. Subramanian
Sola S. Shirai
A. Gliozzo
D. Bhattacharjya
44
0
0
28 May 2025
Visual Large Language Models Exhibit Human-Level Cognitive Flexibility in the Wisconsin Card Sorting Test
Guangfu Hao
Frederic Alexandre
S. Yu
LRM
33
0
0
28 May 2025
ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark
M. Shalyt
Rotem Elimelech
I. Kaminer
35
0
0
28 May 2025
Scaling Reasoning without Attention
Xueliang Zhao
Wei Wu
Lingpeng Kong
OffRL
ReLM
LRM
VLM
79
0
0
28 May 2025
Scaling Offline RL via Efficient and Expressive Shortcut Models
Nicolas Espinosa-Dice
Yiyi Zhang
Yiding Chen
Bradley Guo
Owen Oertell
Gokul Swamy
Kianté Brantley
Wen Sun
OffRL
LRM
71
0
0
28 May 2025
Decomposing Elements of Problem Solving: What "Math" Does RL Teach?
Tian Qin
Core Francisco Park
Mujin Kwun
Aaron Walsman
Eran Malach
Nikhil Anand
Hidenori Tanaka
David Alvarez-Melis
ReLM
OffRL
LRM
90
0
0
28 May 2025
ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM
Hoang Pham
Thanh-Do Nguyen
Khac-Hoai Nam Bui
51
0
0
28 May 2025
Step-Wise Formal Verification for LLM-Based Mathematical Problem Solving
Kuo Zhou
Lu Zhang
LRM
57
0
0
27 May 2025
Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA
Xiangqing Shen
Fanfan Wang
Rui Xia
RALM
28
0
0
27 May 2025
Pretraining Language Models to Ponder in Continuous Space
Boyi Zeng
Shixiang Song
Siyuan Huang
Yixuan Wang
He Li
Ziwei He
Xinbing Wang
Zhiyu Li
Zhouhan Lin
LRM
95
0
0
27 May 2025
Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies
Terrance Liu
Shuyi Wang
Daniel Preotiuc-Pietro
Yash Chandarana
Chirag Gupta
29
1
0
27 May 2025
Can Large Reasoning Models Self-Train?
Sheikh Shafayat
Fahim Tajwar
Ruslan Salakhutdinov
J. Schneider
Andrea Zanette
ReLM
OffRL
LRM
81
2
0
27 May 2025
MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning
Zikang Guo
Benfeng Xu
Xiaorui Wang
Zhendong Mao
83
0
0
27 May 2025
VeriTrail: Closed-Domain Hallucination Detection with Traceability
Dasha Metropolitansky
Jonathan Larson
HILM
61
0
0
27 May 2025
Factual Self-Awareness in Language Models: Representation, Robustness, and Scaling
Hovhannes Tamoyan
Subhabrata Dutta
Iryna Gurevych
HILM
KELM
58
0
0
27 May 2025
Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration
Sibo Xiao
Zixin Lin
Wenyang Gao
Hui Chen
Yue Zhang
LLMAG
72
0
0
27 May 2025
Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing
Raoyuan Zhao
Abdullatif Köksal
Ali Modarressi
Michael A. Hedderich
Hinrich Schutze
49
0
0
27 May 2025
The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
Yiqun Zhang
Hao Li
Chenxu Wang
L. Chen
Qiaosheng Zhang
...
Xinrun Wang
Jia Xu
Lei Bai
Wanli Ouyang
Shuyue Hu
79
0
0
26 May 2025
Estimating LLM Consistency: A User Baseline vs Surrogate Metrics
Xiaoyuan Wu
Weiran Lin
Omer Akgul
Lujo Bauer
HILM
24
0
0
26 May 2025
Faster and Better LLMs via Latency-Aware Test-Time Scaling
Zili Wang
Tianyu Zhang
Haoli Bai
Lu Hou
Xianzhi Yu
Wulong Liu
Shiming Xiang
Lei Zhu
LRM
91
0
0
26 May 2025
Minimalist Softmax Attention Provably Learns Constrained Boolean Functions
Jerry Yao-Chieh Hu
Xiwen Zhang
Maojiang Su
Zhao Song
Han Liu
MLT
243
1
0
26 May 2025
Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision
Tej Deep Pala
Panshul Sharma
Amir Zadeh
Chuan Li
Soujanya Poria
LRM
54
0
0
26 May 2025
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
Peijie Dong
Zhenheng Tang
Xiang Liu
Lujun Li
Xiaowen Chu
Bo Li
106
0
0
26 May 2025
Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
Zhihong Pan
Kai Zhang
Yuze Zhao
Yupeng Han
LRM
63
0
0
26 May 2025
An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation
Shubham Gandhi
Atharva Naik
Yiqing Xie
Carolyn Rose
61
0
0
26 May 2025
Self-Reflective Planning with Knowledge Graphs: Enhancing LLM Reasoning Reliability for Question Answering
J. Zhu
Ye Liu
Meikai Bao
Kai Zhang
Yanghai Zhang
Qi Liu
LRM
49
0
0
26 May 2025
Training-Free Multi-Step Audio Source Separation
Yongyi Zang
Jingyi Li
Qiuqiang Kong
239
0
0
26 May 2025
DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning
Qi Cao
Ruiyi Wang
Ruiyi Zhang
Sai Ashish Somayajula
P. Xie
LRM
100
0
0
26 May 2025
MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search
Zonglin Yang
Wanhao Liu
Ben Gao
Y. Liu
Wei-Hong Li
Tong Xie
Lidong Bing
Wanli Ouyang
Erik Cambria
Dongzhan Zhou
69
0
0
25 May 2025
To CoT or To Loop? A Formal Comparison Between Chain-of-Thought and Looped Transformers
Kevin Xu
Issei Sato
LRM
68
0
0
25 May 2025
A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations
Lingjun Zhao
Hal Daumé III
168
0
0
25 May 2025
Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering
Zheng Chu
H. Fan
Jingchang Chen
Qianyu Wang
M. Yang
...
Zhongjie Wang
Hao Li
Guo Tang
Ming Liu
Bing Qin
ReLM
LRM
96
0
0
25 May 2025
Optimal Transport-Based Token Weighting scheme for Enhanced Preference Optimization
Meng Li
Guangda Huzhang
Haibo Zhang
Xiting Wang
Anxiang Zeng
42
0
0
24 May 2025
Efficient Long CoT Reasoning in Small Language Models
Z. Wang
Jinqi Jiang
Tian Qiu
Hui Liu
Xianfeng Tang
Huaxiu Yao
OffRL
ReLM
LRM
92
0
0
24 May 2025
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation
Ruichen Zhang
Rana Muhammad Shahroz Khan
Zhen Tan
Dawei Li
Song Wang
Tianlong Chen
LRM
63
0
0
24 May 2025
Knowledge Retrieval in LLM Gaming: A Shift from Entity-Centric to Goal-Oriented Graphs
Jonathan Leung
Yongjie Wang
Zhiqi Shen
LRM
27
0
0
24 May 2025
Flex-Judge: Think Once, Judge Anywhere
Jongwoo Ko
S. Kim
Sungwoo Cho
Se-Young Yun
ELM
LRM
218
0
0
24 May 2025
How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark
Minglai Yang
Ethan Huang
Liang Zhang
Mihai Surdeanu
William Yang Wang
Liangming Pan
LRM
59
0
0
24 May 2025
FlashForge: Ultra-Efficient Prefix-Aware Attention for LLM Decoding
Zhibin Wang
Rui Ning
Chao Fang
Zhonghui Zhang
Xi Lin
...
Rong Gu
Kun Yang
Guihai Chen
Sheng Zhong
Chen Tian
58
0
0
23 May 2025
Fast Quiet-STaR: Thinking Without Thought Tokens
Wei Huang
Yizhe Xiong
Xin Ye
Zhijie Deng
Hui Chen
Zijia Lin
Guiguang Ding
LLMAG
LRM
VLM
56
0
0
23 May 2025
Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration
Jingtong Gao
Ling Pan
Yejing Wang
Rui Zhong
Chi Lu
Qingpeng Cai
Peng Jiang
Xiangyu Zhao
LRM
101
1
0
23 May 2025
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning
Michael Hassid
Gabriel Synnaeve
Yossi Adi
Roy Schwartz
ReLM
LRM
113
1
0
23 May 2025
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback
Litao Guo
Xinli Xu
Luozhou Wang
Jiantao Lin
Jinsong Zhou
Zixin Zhang
Bolan Su
Ying-Cong Chen
LLMAG
LRM
86
1
0
23 May 2025