arXiv: 2101.02235
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
6 January 2021
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
RALM
Papers citing "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies"
50 / 565 papers shown
Quantifying Uncertainty in Natural Language Explanations of Large Language Models
Sree Harsha Tanneru
Chirag Agarwal
Himabindu Lakkaraju
LRM
68
15
0
06 Nov 2023
Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis
Hongyi Zheng
Abulhair Saparov
AAML
LRM
77
7
0
01 Nov 2023
Learning From Mistakes Makes LLM Better Reasoner
Shengnan An
Zexiong Ma
Zeqi Lin
Nanning Zheng
Jian-Guang Lou
Weizhu Chen
LRM
120
82
0
31 Oct 2023
Defining a New NLP Playground
Sha Li
Chi Han
Pengfei Yu
Carl Edwards
Manling Li
...
Yi R. Fung
Charles Yu
Joel R. Tetreault
Eduard H. Hovy
Heng Ji
123
5
0
31 Oct 2023
Making Large Language Models Better Data Creators
Dong-Ho Lee
Jay Pujara
Mohit Sewak
Ryen W. White
S. Jauhar
ALM
SyDa
44
26
0
31 Oct 2023
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Nan He
Hanyu Lai
Chenyang Zhao
Zirui Cheng
Junting Pan
...
Zhaohui Hou
Zhiyuan Huang
Shaoqing Lu
Ding Liang
Mingjie Zhan
LRM
68
14
0
29 Oct 2023
Knowledge Corpus Error in Question Answering
Yejoon Lee
Philhoon Oh
James Thorne
41
2
0
27 Oct 2023
In-Context Ability Transfer for Question Decomposition in Complex QA
Venktesh V
Sourangshu Bhattacharya
Avishek Anand
LRM
ReLM
101
5
0
26 Oct 2023
Large Language Models are Visual Reasoning Coordinators
Liangyu Chen
Bo Li
Sheng Shen
Jingkang Yang
Chunyuan Li
Kurt Keutzer
Trevor Darrell
Ziwei Liu
VLM
LRM
130
58
0
23 Oct 2023
MCC-KD: Multi-CoT Consistent Knowledge Distillation
Hongzhan Chen
Siyue Wu
Xiaojun Quan
Rui Wang
Ming Yan
Ji Zhang
LRM
87
17
0
23 Oct 2023
AlpaCare: Instruction-tuned Large Language Models for Medical Application
Xinlu Zhang
Chenxin Tian
Xianjun Yang
Lichang Chen
Zekun Li
Linda R. Petzold
LM&MA
118
65
0
23 Oct 2023
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning
Gurusha Juneja
Subhabrata Dutta
Soumen Chakrabarti
Sunny Manchanda
Tanmoy Chakraborty
LRM
ReLM
115
18
0
21 Oct 2023
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Zhaoyang Wang
Shaohan Huang
Yuxuan Liu
Jiahai Wang
Minghui Song
...
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
LRM
122
12
0
20 Oct 2023
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving
Xueliang Zhao
Xinting Huang
Wei Bi
Lingpeng Kong
LRM
91
1
0
19 Oct 2023
Eliminating Reasoning via Inferring with Planning: A New Framework to Guide LLMs' Non-linear Thinking
Yongqi Tong
Yifan Wang
Dawei Li
Sizhe Wang
Zi Lin
Simeng Han
Jingbo Shang
LRM
52
17
0
18 Oct 2023
A Comprehensive Evaluation of Tool-Assisted Generation Strategies
Alon Jacovi
Avi Caciularu
Jonathan Herzig
Roee Aharoni
Bernd Bohnet
Mor Geva
ELM
126
7
0
16 Oct 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Jindong Wang
Xing Xie
Zheng Zhang
Yue Zhang
HILM
KELM
172
202
0
11 Oct 2023
Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
Anni Zou
Zhuosheng Zhang
Hai Zhao
Xiangru Tang
LRM
ReLM
77
3
0
10 Oct 2023
Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models
Huaixiu Steven Zheng
Swaroop Mishra
Xinyun Chen
Heng-Tze Cheng
Ed H. Chi
Quoc V. Le
Denny Zhou
RALM
LRM
96
127
0
09 Oct 2023
FireAct: Toward Language Agent Fine-tuning
Baian Chen
Chang Shu
Ehsan Shareghi
Nigel Collier
Karthik Narasimhan
Shunyu Yao
ALM
LLMAG
177
112
0
09 Oct 2023
Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena
Jiangjie Chen
Siyu Yuan
Rong Ye
Bodhisattwa Prasad Majumder
Kyle Richardson
LLMAG
ELM
120
60
0
09 Oct 2023
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Yile Wang
Peng Li
Maosong Sun
Yang Liu
RALM
KELM
88
50
0
08 Oct 2023
Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models
Song Jiang
Zahra Shakeri
Aaron Chan
Maziar Sanjabi
Hamed Firooz
...
Bugra Akyildiz
Yizhou Sun
Jinchao Li
Qifan Wang
Asli Celikyilmaz
LRM
ReLM
66
8
0
07 Oct 2023
Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning
Murong Yue
Jie Zhao
Min Zhang
Liang Du
Ziyu Yao
LRM
132
71
0
04 Oct 2023
Ask Again, Then Fail: Large Language Models' Vacillations in Judgment
Qiming Xie
Zengzhi Wang
Yi Feng
Rui Xia
AAML
HILM
111
9
0
03 Oct 2023
Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance
Saurabh Srivastava
Chengyue Huang
Weiguo Fan
Ziyu Yao
LLMAG
59
5
0
03 Oct 2023
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Ori Yoran
Tomer Wolfson
Ori Ram
Jonathan Berant
RALM
LRM
120
216
0
02 Oct 2023
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Xi Lin
Xilun Chen
Mingda Chen
Weijia Shi
Maria Lomeli
...
Jacob Kahn
Gergely Szilvasy
Mike Lewis
Luke Zettlemoyer
Scott Yih
RALM
159
157
0
02 Oct 2023
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments
Chak Lam Shek
Xiyang Wu
Wesley A Suttle
Carl E. Busart
Erin Zaroukian
Dinesh Manocha
Pratap Tokekar
Amrit Singh Bedi
LLMAG
127
10
0
30 Sep 2023
UPAR: A Kantian-Inspired Prompting Framework for Enhancing Large Language Model Capabilities
Hejia Geng
Boxun Xu
Peng Li
ELM
LRM
ReLM
69
1
0
30 Sep 2023
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
Qiushi Sun
Zhangyue Yin
Xiang Li
Zhiyong Wu
Xipeng Qiu
Lingpeng Kong
LRM
LLMAG
91
50
0
30 Sep 2023
SocREval: Large Language Models with the Socratic Method for Reference-Free Reasoning Evaluation
Hangfeng He
Hongming Zhang
Dan Roth
LRM
ELM
ReLM
119
15
0
29 Sep 2023
Benchmarking Cognitive Biases in Large Language Models as Evaluators
Ryan Koo
Minhwa Lee
Vipul Raheja
Jong Inn Park
Zae Myung Kim
Dongyeop Kang
ALM
114
87
0
29 Sep 2023
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution
Chrisantha Fernando
Dylan Banarse
Henryk Michalewski
Simon Osindero
Tim Rocktäschel
LLMAG
ReLM
LRM
105
211
0
28 Sep 2023
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
Zheng Chu
Jingchang Chen
Qianglong Chen
Weijiang Yu
Tao He
Haotian Wang
Weihua Peng
Ming-Yuan Liu
Bing Qin
Ting Liu
LRM
AI4CE
131
175
0
27 Sep 2023
Physics of Language Models: Part 3.2, Knowledge Manipulation
Zeyuan Allen-Zhu
Yuanzhi Li
KELM
93
105
0
25 Sep 2023
Large Language Models Are Also Good Prototypical Commonsense Reasoners
Chenin Li
Qianglong Chen
Yin Zhang
Yifei Zhang
Hongxiang Yao
ReLM
LRM
ELM
66
0
0
22 Sep 2023
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs
Justin Chih-Yao Chen
Swarnadeep Saha
Joey Tianyi Zhou
LLMAG
LRM
103
143
0
22 Sep 2023
SCREWS: A Modular Framework for Reasoning with Revisions
K. Shridhar
Harsh Jhamtani
Hao Fang
Benjamin Van Durme
Jason Eisner
Patrick Xia
KELM
LRM
74
14
0
20 Sep 2023
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Tianhua Zhang
Jiaxin Ge
Hongyin Luo
Yung-Sung Chuang
Mingye Gao
Yuan Gong
Xixin Wu
Yoon Kim
Helen M. Meng
James R. Glass
LRM
ReLM
153
16
0
19 Sep 2023
Contrastive Decoding Improves Reasoning in Large Language Models
Sean O'Brien
Mike Lewis
SyDa
LRM
ReLM
102
39
0
17 Sep 2023
EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning
Rajasekhar Reddy Mekala
Yasaman Razeghi
Sameer Singh
LRM
91
11
0
16 Sep 2023
Re-Reading Improves Reasoning in Large Language Models
Xiaohan Xu
Chongyang Tao
Tao Shen
Can Xu
Hongbo Xu
Guodong Long
Jian-Guang Lou
ReLM
LRM
KELM
59
25
0
12 Sep 2023
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Yung-Sung Chuang
Yujia Xie
Hongyin Luo
Yoon Kim
James R. Glass
Pengcheng He
HILM
79
167
0
07 Sep 2023
Are Emergent Abilities in Large Language Models just In-Context Learning?
Sheng Lu
Irina Bigoulaeva
Rachneet Sachdeva
Harish Tayyar Madabushi
Iryna Gurevych
LRM
ELM
ReLM
155
100
0
04 Sep 2023
No Train Still Gain. Unleash Mathematical Reasoning of Large Language Models with Monte Carlo Tree Search Guided by Energy Function
Haotian Xu
LRM
92
14
0
01 Sep 2023
A Human-on-the-Loop Optimization Autoformalism Approach for Sustainability
Ming Jin
Bilgehan Sel
Fnu Hardeep
W. Yin
AI4CE
45
2
0
20 Aug 2023
GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach
Lang Cao
LRM
91
13
0
18 Aug 2023
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Haipeng Luo
Qingfeng Sun
Can Xu
Pu Zhao
Jian-Guang Lou
...
Xiubo Geng
Qingwei Lin
Shifeng Chen
Yansong Tang
Dongmei Zhang
LRM
OSLM
303
468
0
18 Aug 2023
Through the Lens of Core Competency: Survey on Evaluation of Large Language Models
Ziyu Zhuang
Qiguang Chen
Longxuan Ma
Mingda Li
Yi Han
Yushan Qian
Haopeng Bai
Zixian Feng
Weinan Zhang
Ting Liu
ELM
80
13
0
15 Aug 2023