arXiv:2101.02235
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
6 January 2021
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
RALM
Papers citing "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies"
50 / 565 papers shown
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
Alon Jacovi
Yonatan Bitton
Bernd Bohnet
Jonathan Herzig
Or Honovich
Michael Tseng
Michael Collins
Roee Aharoni
Mor Geva
LRM
131
27
0
01 Feb 2024
Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning
Tinghui Zhu
Kai Zhang
Jian Xie
Yu-Chuan Su
LRM
106
17
0
31 Jan 2024
When Large Language Models Meet Vector Databases: A Survey
Zhi Jing
Yongye Su
Yikun Han
Bo Yuan
Haiyun Xu
Chunjiang Liu
Kehai Chen
Min Zhang
142
38
0
30 Jan 2024
PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models
Haochen Tan
Zhijiang Guo
Zhan Shi
Lu Xu
Zhili Liu
...
Xiaoguang Li
Yasheng Wang
Lifeng Shang
Qun Liu
Linqi Song
101
16
0
26 Jan 2024
Towards Goal-oriented Prompt Engineering for Large Language Models: A Survey
Haochen Li
Jonathan Leung
Zhiqi Shen
LM&MA
LLMAG
LRM
69
1
0
25 Jan 2024
Towards Uncertainty-Aware Language Agent
Paul Burgess
Wray Buntine
Ehsan Shareghi
LLMAG
AI4CE
118
7
0
25 Jan 2024
Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning
Yanda Chen
Chandan Singh
Xiaodong Liu
Simiao Zuo
Bin Yu
He He
Jianfeng Gao
LRM
81
14
0
25 Jan 2024
Demystifying Chains, Trees, and Graphs of Thoughts
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
376
33
0
25 Jan 2024
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
Taicheng Guo
Preslav Nakov
Yaqi Wang
Ruidi Chang
Shichao Pei
Nitesh Chawla
Olaf Wiest
Xiangliang Zhang
LLMAG
LM&Ro
AI4CE
LRM
166
334
0
21 Jan 2024
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
Zhen Xiang
Fengqing Jiang
Zidi Xiong
Bhaskar Ramasubramanian
Radha Poovendran
Bo Li
LRM
SILM
103
50
0
20 Jan 2024
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning
Yiwei Li
Peiwen Yuan
Shaoxiong Feng
Boyuan Pan
Xinglin Wang
Bin Sun
Heda Wang
Kan Li
LRM
73
38
0
19 Jan 2024
Large Language Models are Null-Shot Learners
Pittawat Taveekitworachai
Febri Abdullah
R. Thawonmas
LRM
46
2
0
16 Jan 2024
PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities
S. Sravanthi
Meet Doshi
Tankala Pavan Kalyan
Rudra Murthy
Pushpak Bhattacharyya
Raj Dabre
75
29
0
13 Jan 2024
The Unreasonable Effectiveness of Easy Training Data for Hard Tasks
Peter Hase
Mohit Bansal
Peter Clark
Sarah Wiegreffe
158
35
0
12 Jan 2024
SH2: Self-Highlighted Hesitation Helps You Decode More Truthfully
Jushi Kai
Hai Hu
Zhouhan Lin
HILM
63
11
0
11 Jan 2024
The Impact of Reasoning Step Length on Large Language Models
Mingyu Jin
Qinkai Yu
Dong Shu
Haiyan Zhao
Wenyue Hua
Yanda Meng
Yongfeng Zhang
Jundong Li
ReLM
LRM
182
113
0
10 Jan 2024
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Ke Yang
Jiateng Liu
John Wu
Chaoqi Yang
Yi R. Fung
...
Xu Cao
Xingyao Wang
Yiquan Wang
Chenhui Xu
Chengxiang Zhai
LLMAG
ELM
151
87
0
01 Jan 2024
ConfusionPrompt: Practical Private Inference for Online Large Language Models
Peihua Mai
Ran Yan
Rui Ye
Youjia Yang
Yinchuan Li
Yan Pang
71
2
0
30 Dec 2023
Task Contamination: Language Models May Not Be Few-Shot Anymore
Changmao Li
Jeffrey Flanigan
175
104
0
26 Dec 2023
Towards a Unified Multimodal Reasoning Framework
Abhinav Arun
Dipendra Singh Mal
Mehul Soni
Tomohiro Sawada
LRM
37
0
0
22 Dec 2023
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao
Yun Xiong
Xinyu Gao
Kangxiang Jia
Jinliu Pan
Yuxi Bi
Yi Dai
Jiawei Sun
Meng Wang
Haofen Wang
3DV
RALM
323
1,845
1
18 Dec 2023
Mixed Distillation Helps Smaller Language Model Better Reasoning
Chenglin Li
Qianglong Chen
Liangyue Li
Wang Caiyu
Yicheng Li
Zhang Yin
Yin Zhang
LRM
79
15
0
17 Dec 2023
PathFinder: Guided Search over Multi-Step Reasoning Paths
O. Yu. Golovneva
Sean O'Brien
Ramakanth Pasunuru
Tianlu Wang
Luke Zettlemoyer
Maryam Fazel-Zarandi
Asli Celikyilmaz
LRM
92
8
0
08 Dec 2023
A Study on the Calibration of In-context Learning
Hanlin Zhang
Yi-Fan Zhang
Yaodong Yu
Dhruv Madeka
Dean Phillips Foster
Eric Xing
Hima Lakkaraju
Sham Kakade
109
16
0
07 Dec 2023
Competition-Level Problems are Effective LLM Evaluators
Yiming Huang
Zheng-Wen Lin
Xiao Liu
Yeyun Gong
Shuai Lu
...
Yaobo Liang
Yelong Shen
Chen Lin
Nan Duan
Weizhu Chen
ELM
LRM
91
29
0
04 Dec 2023
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
Zhangyue Yin
Qiushi Sun
Cheng Chang
Qipeng Guo
Junqi Dai
Xuanjing Huang
Xipeng Qiu
LRM
87
59
0
04 Dec 2023
The Efficiency Spectrum of Large Language Models: An Algorithmic Survey
Tianyu Ding
Tianyi Chen
Haidong Zhu
Jiachen Jiang
Yiqi Zhong
Jinxin Zhou
Guangzhi Wang
Zhihui Zhu
Ilya Zharkov
Luming Liang
126
24
0
01 Dec 2023
IAG: Induction-Augmented Generation Framework for Answering Reasoning Questions
Zhebin Zhang
Xinyu Zhang
Yuanhang Ren
Saijiang Shi
Meng Han
Yongkang Wu
Ruofei Lai
Bo Zhao
RALM
LRM
48
16
0
30 Nov 2023
Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension
Akira Kawabata
Saku Sugawara
ELM
62
7
0
30 Nov 2023
Towards Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage and Sharing in LLMs
Yunxin Li
Baotian Hu
Wei Wang
Xiaochun Cao
Min Zhang
77
5
0
27 Nov 2023
Fully Authentic Visual Question Answering Dataset from Online Communities
Chongyan Chen
Mengchen Liu
Noel Codella
Yunsheng Li
Lu Yuan
Danna Gurari
116
5
0
27 Nov 2023
Physical Reasoning and Object Planning for Household Embodied Agents
Ayush Agrawal
Raghav Prabhakar
Anirudh Goyal
Dianbo Liu
LM&Ro
LRM
34
2
0
22 Nov 2023
AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations
Zhicheng YANG
Yinya Huang
Jing Xiong
Liang Feng
Xiaodan Liang
Yiwei Wang
Jing Tang
LRM
87
2
0
22 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Zhuosheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAG
LM&Ro
LRM
153
61
0
20 Nov 2023
Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination?
Bangzheng Li
Ben Zhou
Fei Wang
Xingyu Fu
Dan Roth
Muhao Chen
HILM
LRM
104
22
0
16 Nov 2023
What if you said that differently?: How Explanation Formats Affect Human Feedback Efficacy and User Perception
Chaitanya Malaviya
Subin Lee
Dan Roth
Mark Yatskar
80
2
0
16 Nov 2023
Effective Large Language Model Adaptation for Improved Grounding and Citation Generation
Xi Ye
Ruoxi Sun
Sercan O. Arik
Tomas Pfister
HILM
112
30
0
16 Nov 2023
Contrastive Chain-of-Thought Prompting
Yew Ken Chia
Guizhen Chen
Anh Tuan Luu
Soujanya Poria
Lidong Bing
LRM
AI4CE
122
34
0
15 Nov 2023
How Well Do Large Language Models Truly Ground?
Hyunji Lee
Se June Joo
Chaeeun Kim
Joel Jang
Doyoung Kim
Kyoung-Woon On
Minjoon Seo
HILM
97
8
0
15 Nov 2023
Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers
Yuxia Wang
Revanth Gangi Reddy
Zain Muhammad Mujahid
Arnav Arora
Aleksandr Rubashevskii
...
Nadav Borenstein
Aditya Pillai
Isabelle Augenstein
Iryna Gurevych
Preslav Nakov
HILM
130
42
0
15 Nov 2023
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
Chang Gao
Haiyun Jiang
Deng Cai
Shuming Shi
Wai Lam
LRM
77
7
0
15 Nov 2023
Plum: Prompt Learning using Metaheuristic
Boyao Wang
Shuo Xing
Shizhe Diao
Wenhe Sun
Xiang Liu
Kashun Shum
Renjie Pi
Jipeng Zhang
Tong Zhang
VLM
OffRL
LRM
80
6
0
14 Nov 2023
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster
Hongxuan Zhang
Zhining Liu
Yao Zhao
Jiaqi Zheng
Chenyi Zhuang
Jinjie Gu
Guihai Chen
LRM
MLLM
50
1
0
14 Nov 2023
Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration
Zhenran Xu
Senbao Shi
Baotian Hu
Jindi Yu
Dongfang Li
Min Zhang
Yuxiang Wu
LRM
LLMAG
ALM
118
28
0
14 Nov 2023
The ART of LLM Refinement: Ask, Refine, and Trust
Kumar Shridhar
Koustuv Sinha
Andrew Cohen
Tianlu Wang
Ping Yu
Ramakanth Pasunuru
Mrinmaya Sachan
Jason Weston
Asli Celikyilmaz
LLMAG
ReLM
LRM
76
27
0
14 Nov 2023
Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning
Yue Yu
Jiaming Shen
Tianqi Liu
Zhen Qin
Jing Nathan Yan
Jialu Liu
Chao Zhang
Michael Bendersky
113
7
0
13 Nov 2023
Large Language Models are In-context Teachers for Knowledge Reasoning
Jiachen Zhao
Zonghai Yao
Zhichao Yang
Hong-ye Yu
ReLM
LRM
59
2
0
12 Nov 2023
Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications
Zhangyin Feng
Weitao Ma
Weijiang Yu
Lei Huang
Haotian Wang
Qianglong Chen
Weihua Peng
Xiaocheng Feng
Bing Qin
Ting Liu
KELM
80
40
0
10 Nov 2023
Agent Lumos: Unified and Modular Training for Open-Source Language Agents
Da Yin
Faeze Brahman
Abhilasha Ravichander
Khyathi Chandu
Kai-Wei Chang
Yejin Choi
Bill Yuchen Lin
LLMAG
124
44
0
09 Nov 2023
Prompt Sketching for Large Language Models
Luca Beurer-Kellner
Mark Niklas Muller
Marc Fischer
Martin Vechev
KELM
LRM
78
5
0
08 Nov 2023