Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.11171
Cited By
v1
v2
v3
v4 (latest)
Self-Consistency Improves Chain of Thought Reasoning in Language Models
21 March 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Self-Consistency Improves Chain of Thought Reasoning in Language Models"
50 / 920 papers shown
Title
FiDeLiS: Faithful Reasoning in Large Language Model for Knowledge Graph Question Answering
Yuan Sui
Yufei He
Nian Liu
Xiaoxin He
Kun Wang
Bryan Hooi
LRM
198
11
0
22 May 2024
Rethinking ChatGPT's Success: Usability and Cognitive Behaviors Enabled by Auto-regressive LLMs' Prompting
Xinzhe Li
Ming Liu
87
0
0
17 May 2024
Can formal argumentative reasoning enhance LLMs performances?
Federico Castagna
I. Sassoon
Simon Parsons
LRM
LLMAG
43
2
0
16 May 2024
Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers
Tuo Zhang
Jinyue Yuan
A. Avestimehr
LRM
48
5
0
16 May 2024
LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought
Zhuoxuan Jiang
Haoyuan Peng
Shanshan Feng
Fan Li
Dongsheng Li
KELM
LRM
95
16
0
09 May 2024
Preble: Efficient Distributed Prompt Scheduling for LLM Serving
Vikranth Srivatsa
Zijian He
Reyna Abhyankar
Dongming Li
Yiying Zhang
127
21
0
08 May 2024
Chain of Thoughtlessness? An Analysis of CoT in Planning
Kaya Stechly
Karthik Valmeekam
Subbarao Kambhampati
LRM
LM&Ro
182
52
0
08 May 2024
QuaLLM: An LLM-based Framework to Extract Quantitative Insights from Online Forums
Varun Nagaraj Rao
Eesha Agarwal
Samantha Dalal
D. Calacci
Andrés Monroy-Hernández
103
8
0
08 May 2024
Pose Priors from Language Models
Sanjay Subramanian
Evonne Ng
Lea Müller
Dan Klein
Shiry Ginosar
Trevor Darrell
116
4
0
06 May 2024
ModelShield: Adaptive and Robust Watermark against Model Extraction Attack
Kaiyi Pang
Tao Qi
Chuhan Wu
Minhao Bai
Minghu Jiang
Yongfeng Huang
AAML
WaLM
166
5
0
03 May 2024
Injecting Salesperson's Dialogue Strategies in Large Language Models with Chain-of-Thought Reasoning
Wen-Yu Chang
Yun-Nung Chen
92
7
0
29 Apr 2024
A Framework for Real-time Safeguarding the Text Generation of Large Language Model
Ximing Dong
Dayi Lin
Shaowei Wang
Ahmed E. Hassan
133
1
0
29 Apr 2024
Large Language Model Agent as a Mechanical Designer
Yayati Jadhav
A. Farimani
AI4CE
LLMAG
195
11
0
26 Apr 2024
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Timin Gao
Peixian Chen
Mengdan Zhang
Chaoyou Fu
Yunhang Shen
...
Shengchuan Zhang
Xiawu Zheng
Xing Sun
Liujuan Cao
Rongrong Ji
MLLM
LRM
121
22
0
24 Apr 2024
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Yu Xia
Rui Wang
Xu Liu
Mingyan Li
Tong Yu
Xiang Chen
Julian McAuley
Shuai Li
LRM
128
22
0
24 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
212
61
0
23 Apr 2024
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
Qihuang Zhong
Kang Wang
Ziyang Xu
Juhua Liu
Liang Ding
Bo Du
LRM
AIMat
162
4
0
23 Apr 2024
A Survey of Large Language Models on Generative Graph Analytics: Query, Learning, and Applications
Wenbo Shang
Xin Huang
126
9
0
23 Apr 2024
SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense
Yifan Jiang
Filip Ilievski
Kaixin Ma
LRM
112
30
0
22 Apr 2024
Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks
Avinash Anand
Mohit Gupta
Kritarth Prasad
Navya Singla
Sanjana Sanjeev
Jatin Kumar
A. Shivam
R. Shah
LRM
89
14
0
19 Apr 2024
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Chengwei Qin
Wenhan Xia
Tan Wang
Fangkai Jiao
Yuchen Hu
Bosheng Ding
Ruirui Chen
Shafiq Joty
LRM
129
5
0
19 Apr 2024
BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
Yu Feng
Ben Zhou
Weidong Lin
Dan Roth
188
6
0
18 Apr 2024
Language Model Cascades: Token-level uncertainty and beyond
Neha Gupta
Harikrishna Narasimhan
Wittawat Jitkrittum
A. S. Rawat
A. Menon
Sanjiv Kumar
UQLM
142
56
0
15 Apr 2024
Distilling Reasoning Ability from Large Language Models with Adaptive Thinking
Xiao Chen
Sihang Zhou
K. Liang
Xinwang Liu
ReLM
LRM
113
6
0
14 Apr 2024
When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models
Yanhong Li
Chenghao Yang
Allyson Ettinger
ReLM
LRM
LLMAG
84
11
0
14 Apr 2024
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Ruixin Yang
Dheeraj Rajagopal
S. Hayati
Bin Hu
Dongyeop Kang
LLMAG
134
7
0
14 Apr 2024
JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models
Yingchaojie Feng
Zhizhang Chen
Zhining Kang
Sijia Wang
Haoyu Tian
Wei Zhang
Minfeng Zhu
Wei Chen
116
4
0
12 Apr 2024
Best Practices and Lessons Learned on Synthetic Data for Language Models
Ruibo Liu
Jerry W. Wei
Fangyu Liu
Chenglei Si
Yanzhe Zhang
...
Steven Zheng
Daiyi Peng
Diyi Yang
Denny Zhou
Andrew M. Dai
SyDa
EgoV
126
96
0
11 Apr 2024
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Gaowen Liu
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
231
58
0
02 Apr 2024
DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference
Jinwei Yao
Kaiqi Chen
Kexun Zhang
Jiaxuan You
Binhang Yuan
Zeke Wang
Tao Lin
114
4
0
30 Mar 2024
Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning
Qinhao Zhou
Zihan Zhang
Xiang Xiang
Ke Wang
Yuchuan Wu
Yongbin Li
LLMAG
LRM
83
5
0
29 Mar 2024
BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation
Yuhong He
Yongqi Zhang
Shizhu He
Jun Wan
LRM
85
1
0
28 Mar 2024
PerOS: Personalized Self-Adapting Operating Systems in the Cloud
Hongyu He
51
1
0
26 Mar 2024
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim
Sangyun Chung
Damin Yeom
Youngjoon Yu
Hak Gu Kim
Y. Ro
97
3
0
22 Mar 2024
Improving the Robustness of Large Language Models via Consistency Alignment
Zhao Yukun
Lingyong Yan
Weiwei Sun
Guoliang Xing
Shuaiqiang Wang
Meng Chong
Zhicong Cheng
Zhaochun Ren
Yin Dawei
88
22
0
21 Mar 2024
Empowering Segmentation Ability to Multi-modal Large Language Models
Yuqi Yang
Peng-Tao Jiang
Jing Wang
Hao Zhang
Kai Zhao
Jinwei Chen
Yue Liu
LRM
VLM
86
4
0
21 Mar 2024
LaPuda: LLM-Enabled Policy-Based Query Optimizer for Multi-modal Data
Yifan Wang
Haodi Ma
Daisy Zhe Wang
55
1
0
20 Mar 2024
Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices
Sara Abdali
Richard Anarfi
C. Barberan
Jia He
Erfan Shayegani
PILM
137
31
0
19 Mar 2024
RouterBench: A Benchmark for Multi-LLM Routing System
Qitian Jason Hu
Jacob Bieker
Xiuyu Li
Nan Jiang
Benjamin Keigwin
Gaurav Ranganath
Kurt Keutzer
Shriyash Kaustubh Upadhyay
113
54
0
18 Mar 2024
Can a GPT4-Powered AI Agent Be a Good Enough Performance Attribution Analyst?
Bruno de Melo
Jamiel Sheikh
14
0
0
15 Mar 2024
Large Language Models are Contrastive Reasoners
Liang Yao
ReLM
ELM
LRM
110
3
0
13 Mar 2024
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Bingqian Lin
Yunshuang Nie
Ziming Wei
Jiaqi Chen
Shikui Ma
Jianhua Han
Hang Xu
Xiaojun Chang
Xiaodan Liang
LM&Ro
LRM
141
28
0
12 Mar 2024
Alto: Orchestrating Distributed Compound AI Systems with Nested Ancestry
Keshav Santhanam
Deepti Raghavan
Muhammad Shahir Rahman
Thejas Venkatesh
Neha Kunjal
Maximilien Cura
Houjun Liu
Pratiksha Thaker
Philip Levis
Matei A. Zaharia
82
9
0
07 Mar 2024
ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation
Pittawat Taveekitworachai
Febri Abdullah
Mury F. Dewantoro
Yi Xia
Pratch Suntichaikul
R. Thawonmas
Julian Togelius
Jochen Renz
83
1
0
05 Mar 2024
SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models
Xiang Gao
Jiaxin Zhang
Lalla Mouatadid
Kamalika Das
83
14
0
04 Mar 2024
Self-Consistent Reasoning-based Aspect-Sentiment Quad Prediction with Extract-Then-Assign Strategy
Jieyong Kim
Ryang Heo
Yongsik Seo
SeongKu Kang
Jinyoung Yeo
Dongha Lee
ReLM
LRM
59
8
0
01 Mar 2024
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Saurabh Srivastava
B. AnnaroseM
V. AntoP
Shashank Menon
Ajay Sukumar
T. AdwaithSamod
Alan Philipose
Stevin Prince
Sooraj Thomas
ELM
ReLM
LRM
79
56
0
29 Feb 2024
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Qintong Li
Leyang Cui
Xueliang Zhao
Lingpeng Kong
Wei Bi
LRM
122
62
0
29 Feb 2024
Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
Hanjie Chen
Zhouxiang Fang
Yash Singla
Mark Dredze
ELM
AI4MH
145
43
0
28 Feb 2024
A Neural Rewriting System to Solve Algorithmic Problems
Flavio Petruzzellis
Alberto Testolin
A. Sperduti
NAI
71
0
0
27 Feb 2024
Previous
1
2
3
...
12
13
14
...
17
18
19
Next