ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.02235
  4. Cited By
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit
  Reasoning Strategies

Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies

6 January 2021
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
    RALM
ArXiv (abs)PDFHTML

Papers citing "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies"

50 / 565 papers shown
Title
PORT: Preference Optimization on Reasoning Traces
PORT: Preference Optimization on Reasoning Traces
Salem Lahlou
Abdalgader Abubaker
Hakim Hacid
LRM
120
5
0
23 Jun 2024
Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop
  Question Answering
Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering
Zhengliang Shi
Shuo Zhang
Weiwei Sun
Shen Gao
Fajie Yuan
Zhumin Chen
Zhaochun Ren
RALM
117
38
0
21 Jun 2024
Large Language Models are Skeptics: False Negative Problem of
  Input-conflicting Hallucination
Large Language Models are Skeptics: False Negative Problem of Input-conflicting Hallucination
Jongyoon Song
Sangwon Yu
Sungroh Yoon
HILM
65
4
0
20 Jun 2024
Think-then-Act: A Dual-Angle Evaluated Retrieval-Augmented Generation
Think-then-Act: A Dual-Angle Evaluated Retrieval-Augmented Generation
Yige Shen
Hao Jiang
Hua Qu
Jihong Zhao
RALMLRM
82
1
0
18 Jun 2024
DetectBench: Can Large Language Model Detect and Piece Together Implicit
  Evidence?
DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Zhouhong Gu
Lin Zhang
Xiaoxuan Zhu
Jiangjie Chen
Wenhao Huang
...
Shusen Wang
Zheyu Ye
Yan Gao
Hongwei Feng
Yanghua Xiao
RALM
78
2
0
18 Jun 2024
Low-Redundant Optimization for Large Language Model Alignment
Low-Redundant Optimization for Large Language Model Alignment
Zhipeng Chen
Kun Zhou
Wayne Xin Zhao
Jingyuan Wang
Ji-Rong Wen
83
3
0
18 Jun 2024
A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning
A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning
Lijie Hu
Liang Liu
Shu Yang
Xin Chen
Hongru Xiao
Mengdi Li
Pan Zhou
Muhammad Asif Ali
Di Wang
LRM
148
7
0
18 Jun 2024
On the Role of Entity and Event Level Conceptualization in Generalizable
  Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
Weiqi Wang
Tianqing Fang
Haochen Shi
Baixuan Xu
Wenxuan Ding
...
Wei Fan
Jiaxin Bai
Haoran Li
Xin Liu
Yangqiu Song
LRM
111
3
0
16 Jun 2024
Demonstration Notebook: Finding the Most Suited In-Context Learning
  Example from Interactions
Demonstration Notebook: Finding the Most Suited In-Context Learning Example from Interactions
Yiming Tang
Bin Dong
66
0
0
16 Jun 2024
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning
Joongwon Kim
Bhargavi Paranjape
Tushar Khot
Hannaneh Hajishirzi
LM&RoELMLLMAGLRM
85
9
0
10 Jun 2024
Measuring Retrieval Complexity in Question Answering Systems
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo
Nicolaas Paul Jedema
Siddhant Garg
Leonardo F. R. Ribeiro
Alessandro Moschitti
82
1
0
05 Jun 2024
Break the Chain: Large Language Models Can be Shortcut Reasoners
Break the Chain: Large Language Models Can be Shortcut Reasoners
Mengru Ding
Hanmeng Liu
Zhizhang Fu
Jian Song
Wenbo Xie
Yue Zhang
KELMLRM
75
12
0
04 Jun 2024
ACCORD: Closing the Commonsense Measurability Gap
ACCORD: Closing the Commonsense Measurability Gap
François Roewer-Després
Jinyue Feng
Zining Zhu
Frank Rudzicz
LRM
130
0
0
04 Jun 2024
A Survey of Useful LLM Evaluation
A Survey of Useful LLM Evaluation
Ji-Lun Peng
Sijia Cheng
Egil Diau
Yung-Yu Shih
Po-Heng Chen
Yen-Ting Lin
Yun-Nung Chen
LLMAGELM
88
16
0
03 Jun 2024
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective
  Rationales
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Tianyang Xu
Shujin Wu
Shizhe Diao
Xiaoze Liu
Xingyao Wang
Yangyi Chen
Jing Gao
LRM
102
43
0
31 May 2024
Preemptive Answer "Attacks" on Chain-of-Thought Reasoning
Preemptive Answer "Attacks" on Chain-of-Thought Reasoning
Rongwu Xu
Zehan Qi
Wei Xu
LRMSILM
92
9
0
31 May 2024
A Multi-Source Retrieval Question Answering Framework Based on RAG
A Multi-Source Retrieval Question Answering Framework Based on RAG
Ridong Wu
Shuhong Chen
Xiangbiao Su
Yuankai Zhu
Yifei Liao
Jianming Wu
72
4
0
29 May 2024
Evaluating the External and Parametric Knowledge Fusion of Large
  Language Models
Evaluating the External and Parametric Knowledge Fusion of Large Language Models
Hao Zhang
Yuyang Zhang
Xiaoguang Li
Wenxuan Shi
Haonan Xu
...
Yasheng Wang
Lifeng Shang
Qun Liu
Yong Liu
Ruiming Tang
KELM
97
5
0
29 May 2024
PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework
PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework
Eshaan Agarwal
Vivek Dani
T. Ganu
A. Nambi
LLMAG
65
0
0
28 May 2024
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by
  Self-Contrast
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast
Chufan Shi
Cheng Yang
Xinyu Zhu
Jiahao Wang
Taiqiang Wu
Siheng Li
Deng Cai
Yujiu Yang
Yu Meng
MoE
83
14
0
23 May 2024
FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
Jiajie Jin
Yutao Zhu
Xinyu Yang
Chenghao Zhang
Zhicheng Dou
Chenghao Zhang
Tong Zhao
Zhao Yang
Zhicheng Dou
Ji-Rong Wen
VLM
167
72
0
22 May 2024
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer
  Selection in Large Language Models
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
Zhangyue Yin
Qiushi Sun
Qipeng Guo
Zhiyuan Zeng
Xiaonan Li
...
Qinyuan Cheng
Ding Wang
Xiaofeng Mou
Xipeng Qiu
XuanJing Huang
LRM
96
4
0
21 May 2024
Quantifying In-Context Reasoning Effects and Memorization Effects in
  LLMs
Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs
Siyu Lou
Yuntian Chen
Xiaodan Liang
Liang Lin
Quanshi Zhang
147
2
0
20 May 2024
OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
Yuxia Wang
Minghan Wang
Hasan Iqbal
Georgi Georgiev
Jiahui Geng
Preslav Nakov
HILM
107
18
0
09 May 2024
MIDGARD: Self-Consistency Using Minimum Description Length for
  Structured Commonsense Reasoning
MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning
Inderjeet Nair
Lu Wang
LRM
49
1
0
08 May 2024
Chain of Thoughtlessness? An Analysis of CoT in Planning
Chain of Thoughtlessness? An Analysis of CoT in Planning
Kaya Stechly
Karthik Valmeekam
Subbarao Kambhampati
LRMLM&Ro
182
52
0
08 May 2024
Optimizing Language Model's Reasoning Abilities with Weak Supervision
Optimizing Language Model's Reasoning Abilities with Weak Supervision
Yongqi Tong
Sizhe Wang
Dawei Li
Yifan Wang
Simeng Han
Zi Lin
Chengsong Huang
Jiaxin Huang
Jingbo Shang
LRMReLM
99
10
0
07 May 2024
Mothman at SemEval-2024 Task 9: An Iterative System for Chain-of-Thought
  Prompt Optimization
Mothman at SemEval-2024 Task 9: An Iterative System for Chain-of-Thought Prompt Optimization
Alvin Po-Chun Chen
Ray Groshan
Sean von Bayern
ReLMLRM
77
1
0
03 May 2024
Argumentative Large Language Models for Explainable and Contestable Claim Verification
Argumentative Large Language Models for Explainable and Contestable Claim Verification
Gabriel Freedman
Adam Dejl
Deniz Gorur
Xiang Yin
Antonio Rago
Francesca Toni
73
7
0
03 May 2024
General Purpose Verification for Chain of Thought Prompting
General Purpose Verification for Chain of Thought Prompting
Robert Vacareanu
Anurag Pratik
Evangelia Spiliopoulou
Zheng Qi
Giovanni Paolini
Neha Ann John
Jie Ma
Yassine Benajiba
Miguel Ballesteros
LRM
44
10
0
30 Apr 2024
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Olivia Wiles
Chuhan Zhang
Isabela Albuquerque
Ivana Kajić
Su Wang
...
Jordi Pont-Tuset
Aida Nematzadeh
Anant Nawalgaria
Jordi Pont-Tuset
Aida Nematzadeh
EGVM
261
22
0
25 Apr 2024
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems
Qihuang Zhong
Kang Wang
Ziyang Xu
Juhua Liu
Liang Ding
Bo Du
LRMAIMat
162
4
0
23 Apr 2024
LLMs Know What They Need: Leveraging a Missing Information Guided
  Framework to Empower Retrieval-Augmented Generation
LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation
Keheng Wang
Feiyu Duan
Peiguang Li
Sirui Wang
Xunliang Cai
RALM
84
7
0
22 Apr 2024
Information Re-Organization Improves Reasoning in Large Language Models
Information Re-Organization Improves Reasoning in Large Language Models
Xiaoxia Cheng
Zeqi Tan
Wei Xue
Weiming Lu
LRM
64
2
0
22 Apr 2024
The Landscape of Emerging AI Agent Architectures for Reasoning,
  Planning, and Tool Calling: A Survey
The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
Tula Masterman
Sandi Besen
Mason Sawtell
Alex Chao
LM&RoLLMAG
114
58
0
17 Apr 2024
Uncertainty-Based Abstention in LLMs Improves Safety and Reduces
  Hallucinations
Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations
Christian Tomani
Kamalika Chaudhuri
Ivan Evtimov
Daniel Cremers
Mark Ibrahim
107
15
0
16 Apr 2024
Entropy Guided Extrapolative Decoding to Improve Factuality in Large
  Language Models
Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models
Souvik Das
Lifeng Jin
Linfeng Song
Haitao Mi
Baolin Peng
Dong Yu
HILM
96
2
0
14 Apr 2024
Distilling Reasoning Ability from Large Language Models with Adaptive
  Thinking
Distilling Reasoning Ability from Large Language Models with Adaptive Thinking
Xiao Chen
Sihang Zhou
K. Liang
Xinwang Liu
ReLMLRM
115
6
0
14 Apr 2024
Exploring Concept Depth: How Large Language Models Acquire Knowledge at
  Different Layers?
Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
Mingyu Jin
Qinkai Yu
Jingyuan Huang
Qingcheng Zeng
Zhenting Wang
...
Yanda Meng
Kaize Ding
Fan Yang
Jundong Li
Yongfeng Zhang
100
21
0
10 Apr 2024
Groundedness in Retrieval-augmented Long-form Generation: An Empirical
  Study
Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study
Alessandro Stolfo
RALMHILM
69
6
0
10 Apr 2024
Improving Language Model Reasoning with Self-motivated Learning
Improving Language Model Reasoning with Self-motivated Learning
Yunlong Feng
Yang Xu
Libo Qin
Yasheng Wang
Wanxiang Che
LRMReLM
73
7
0
10 Apr 2024
A Survey on the Integration of Generative AI for Critical Thinking in
  Mobile Networks
A Survey on the Integration of Generative AI for Critical Thinking in Mobile Networks
Athanasios Karapantelakis
Alexandros Nikou
Ajay Kattepur
Jean Martins
Leonid Mokrushin
S. Mohalik
Marin Orlic
Aneta Vulgarakis Feljan
89
2
0
10 Apr 2024
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step
  Reasoning with Large Language Models
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models
Shibo Hao
Yi Gu
Haotian Luo
Tianyang Liu
Xiyan Shao
...
Haodi Ma
Adithya Samavedhi
Qiyue Gao
Zhen Wang
Zhiting Hu
LRMELM
153
1
0
08 Apr 2024
Navigating the Landscape of Hint Generation Research: From the Past to
  the Future
Navigating the Landscape of Hint Generation Research: From the Past to the Future
Anubhav Jangra
Jamshid Mozafari
Adam Jatowt
Smaranda Muresan
74
2
0
06 Apr 2024
Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning
  Skills in Large Language Models
Untangle the KNOT: Interweaving Conflicting Knowledge and Reasoning Skills in Large Language Models
Yantao Liu
Zijun Yao
Xin Lv
Yuchen Fan
S. Cao
Jifan Yu
Lei Hou
Juanzi Li
105
3
0
04 Apr 2024
Advancing LLM Reasoning Generalists with Preference Trees
Advancing LLM Reasoning Generalists with Preference Trees
Lifan Yuan
Ganqu Cui
Hanbin Wang
Ning Ding
Xingyao Wang
...
Zhenghao Liu
Bowen Zhou
Hao Peng
Zhiyuan Liu
Maosong Sun
LRM
138
123
0
02 Apr 2024
VLM-Social-Nav: Socially Aware Robot Navigation through Scoring using
  Vision-Language Models
VLM-Social-Nav: Socially Aware Robot Navigation through Scoring using Vision-Language Models
Daeun Song
Jing Liang
Amirreza Payandeh
Xuesu Xiao
Dinesh Manocha
90
16
0
30 Mar 2024
Conceptual and Unbiased Reasoning in Language Models
Conceptual and Unbiased Reasoning in Language Models
Ben Zhou
Hongming Zhang
Sihao Chen
Dian Yu
Hongwei Wang
Baolin Peng
Dan Roth
Dong Yu
ReLMLRMELM
100
16
0
30 Mar 2024
Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to
  Boost for Reasoning
Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning
Yongqi Tong
Dawei Li
Sizhe Wang
Yujia Wang
Fei Teng
Jingbo Shang
LRM
118
59
0
29 Mar 2024
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner
Yuxuan Yao
Han Wu
Zhijiang Guo
Biyan Zhou
Jiahui Gao
Sichun Luo
Hanxu Hou
Xiaojin Fu
Linqi Song
LLMAGLRM
128
10
0
28 Mar 2024
Previous
123456...101112
Next