Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies

6 January 2021
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
    RALM

Papers citing "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies"

50 / 565 papers shown
Policy Guided Tree Search for Enhanced LLM Reasoning
Yang Li
LRM
196
0
0
04 Feb 2025
SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains
Ran Xu
Hui Liu
Sreyashi Nag
Zhenwei Dai
Yaochen Xie
...
Chen Luo
Yang Li
Joyce C. Ho
Carl Yang
Qi He
RALM
179
11
0
28 Jan 2025
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
Yafu Li
Zhilin Wang
Tingchen Fu
Ganqu Cui
Sen Yang
Yu Cheng
101
4
0
21 Jan 2025
Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting
Dong-Hai Zhu
Yu-Jie Xiong
Jia-Chen Zhang
Xi-Jiong Xie
Chun-Ming Xia
ReLM, LRM
63
0
0
08 Jan 2025
Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks
Shengbin Yue
Siyuan Wang
Wei Chen
Xuanjing Huang
Zhongyu Wei
LLMAG
163
11
0
03 Jan 2025
Nash CoT: Multi-Path Inference with Preference Equilibrium
Ziqi Zhang
Cunxiang Wang
Xiong Xiao
Yue Zhang
Donglin Wang
LRM
101
2
0
31 Dec 2024
Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria
Joonwon Jang
Jaehee Kim
Wonbin Kweon
Seonghyeon Lee
Hwanjo Yu
LRM
186
2
0
30 Dec 2024
ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty
Qing Zong
Zhaoxiang Wang
Tianshi Zheng
Xiyu Ren
Yangqiu Song
155
3
0
28 Dec 2024
The Power of Adaptation: Boosting In-Context Learning through Adaptive Prompting
Shuzhang Cai
Twumasi Mensah-Boateng
Xander Kuksov
Jing Yuan
Shaojie Tang
63
0
0
23 Dec 2024
GAMEBoT: Transparent Assessment of LLM Reasoning in Games
Wenye Lin
Jonathan Roberts
Yunhan Yang
Samuel Albanie
Zongqing Lu
Kai Han
LRM, ELM
135
1
0
18 Dec 2024
A Survey of Calibration Process for Black-Box LLMs
Liangru Xie
Hui Liu
Jingying Zeng
Xianfeng Tang
Yan Han
Chen Luo
Jing Huang
Zhen Li
Suhang Wang
Qi He
142
4
0
17 Dec 2024
C3oT: Generating Shorter Chain-of-Thought without Compromising Effectiveness
Yu Kang
Xianghui Sun
Liangyu Chen
Wei Zou
LRM
197
55
0
16 Dec 2024
Enhancing the Reasoning Capabilities of Small Language Models via Solution Guidance Fine-Tuning
Jing Bi
Yuting Wu
Weiwei Xing
Zhenjie Wei
ReLM, OffRL, LRM
138
4
0
13 Dec 2024
AutoReason: Automatic Few-Shot Reasoning Decomposition
Arda Sevinc
A. Gumus
ReLM, LRM
102
0
0
09 Dec 2024
A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions
Ola Shorinwa
Zhiting Mei
Justin Lidard
Allen Z. Ren
Anirudha Majumdar
HILM, LRM
153
19
0
07 Dec 2024
Chain-of-Thought in Large Language Models: Decoding, Projection, and Activation
H. Yang
Qianghua Zhao
Lei Li
AI4CE, LRM
106
3
0
05 Dec 2024
RARE: Retrieval-Augmented Reasoning Enhancement for Large Language Models
Hieu Tran
Zonghai Yao
Junda Wang
Yifan Zhang
Zhichao Yang
Hong-ye Yu
LRM
179
7
0
03 Dec 2024
Enhancing Zero-shot Chain of Thought Prompting via Uncertainty-Guided Strategy Selection
Shanu Kumar
Saish Mendke
Karody Lubna Abdul Rahman
Santosh Kurasa
Parag Agrawal
Sandipan Dandapat
LLMAG, LRM
140
3
0
30 Nov 2024
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
Jinyang Wu
Mingkuan Feng
Shuai Zhang
Feihu Che
Zengqi Wen
J. Tao
Jianhua Tao
LRM, ReLM
220
19
0
27 Nov 2024
Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Yutao Hou
Yajing Luo
Zhiwen Ruan
Hongru Wang
Weifeng Ge
Yuxiao Chen
Guanhua Chen
ELM
80
0
0
15 Nov 2024
MLAN: Language-Based Instruction Tuning Preserves and Transfers Knowledge in Multimodal Language Models
Jianhong Tu
Zhuohao Ni
Nicholas Crispino
Zihao Yu
Michael Bendersky
...
Ruoxi Jia
Xin Liu
Lingjuan Lyu
Dawn Song
Chenguang Wang
VLM, MLLM
94
0
0
15 Nov 2024
EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning
Kiran Purohit
Venktesh V
Raghuram Devalla
Krishna Mohan Yerragorla
Sourangshu Bhattacharya
Avishek Anand
LRM
89
3
0
06 Nov 2024
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models
Jianyi Zhang
Da-Cheng Juan
Cyrus Rashtchian
Chun-Sung Ferng
Heinrich Jiang
Yiran Chen
86
4
0
01 Nov 2024
OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
Junda Wu
Xintong Li
Ruoyu Wang
Yu Xia
Yuxin Xiong
...
Xiang Chen
Branislav Kveton
Lina Yao
Jingbo Shang
Julian McAuley
OffRL, LRM
80
1
0
31 Oct 2024
Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs
Houman Mehrafarin
Arash Eshghi
Ioannis Konstas
LRM
36
0
0
26 Oct 2024
LanFL: Differentially Private Federated Learning with Large Language Models using Synthetic Samples
Huiyu Wu
Diego Klabjan
FedML
162
1
0
24 Oct 2024
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
Zongmeng Zhang
Yufeng Shi
Jinhua Zhu
Wengang Zhou
Xiang Qi
Peng Zhang
Haoyang Li
RALM, HILM
39
0
0
22 Oct 2024
ToW: Thoughts of Words Improve Reasoning in Large Language Models
Zhikun Xu
Ming shen
Jacob Dineen
Zhaonan Li
Xiao Ye
Shijie Lu
Aswin Rrv
Chitta Baral
Ben Zhou
LRM
454
1
0
21 Oct 2024
LocateBench: Evaluating the Locating Ability of Vision Language Models
Ting-Rui Chiang
Joshua Robinson
Xinyan Velocity Yu
Dani Yogatama
VLM, ELM
70
0
0
17 Oct 2024
RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Xinze Li
Sen Mei
Zhenghao Liu
Yukun Yan
Shuo Wang
...
Haotian Chen
Ge Yu
Zhiyuan Liu
Maosong Sun
Chenyan Xiong
110
12
0
17 Oct 2024
"Let's Argue Both Sides": Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities
Kaveh Eskandari Miandoab
Vasanth Sarathy
LRM, ReLM
33
1
0
16 Oct 2024
FLARE: Faithful Logic-Aided Reasoning and Exploration
Erik Arakelyan
Pasquale Minervini
Pat Verga
Patrick Lewis
Isabelle Augenstein
ReLM, LRM
214
2
0
14 Oct 2024
Effective Self-Mining of In-Context Examples for Unsupervised Machine Translation with LLMs
Abdellah El Mekki
Muhammad Abdul-Mageed
LRM
79
1
0
14 Oct 2024
WILT: A Multi-Turn, Memorization-Robust Inductive Logic Benchmark for LLMs
Eryk Banatt
Jonathan Cheng
Skanda Vaidyanath
Tiffany Hwu
LRM
39
3
0
14 Oct 2024
Beyond Graphs: Can Large Language Models Comprehend Hypergraphs?
Yifan Feng
Chengwu Yang
Xingliang Hou
S. Du
Shihui Ying
Zongze Wu
Yue Gao
111
4
0
14 Oct 2024
CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order Reasoning On Device
Yicheng Fu
R. Anantha
Jianpeng Cheng
LRM, LLMAG
90
4
0
12 Oct 2024
Mentor-KD: Making Small Language Models Better Multi-step Reasoners
Hojae Lee
Junho Kim
SangKeun Lee
LRM
65
3
0
11 Oct 2024
Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models
Sitao Cheng
Liangming Pan
Xunjian Yin
Xinyi Wang
William Yang Wang
KELM
83
4
0
10 Oct 2024
Dialectical Behavior Therapy Approach to LLM Prompting
Oxana Vitman
Nika Amaglobeli
Paul Plachinda
LRM
19
0
0
10 Oct 2024
AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories
Yifan Song
Weimin Xiong
Xiutian Zhao
Dawei Zhu
Wenhao Wu
Ke Wang
Cheng Li
Wei Peng
Sujian Li
LLMAG
64
11
0
10 Oct 2024
Rationale-Aware Answer Verification by Pairwise Self-Evaluation
Akira Kawabata
Saku Sugawara
LRM
119
5
0
07 Oct 2024
Mirror-Consistency: Harnessing Inconsistency in Majority Voting
Siyuan Huang
Zhiyuan Ma
Jintao Du
Changhua Meng
Weiqiang Wang
Zhouhan Lin
LRM
70
4
0
07 Oct 2024
Accelerating Inference of Networks in the Frequency Domain
Chenqiu Zhao
Guanfang Dong
Anup Basu
122
20
0
06 Oct 2024
ECon: On the Detection and Resolution of Evidence Conflicts
Cheng Jiayang
Chunkit Chan
Qianqian Zhuang
Lin Qiu
Tianhang Zhang
Tengxiao Liu
Yangqiu Song
Yue Zhang
Pengfei Liu
Zheng Zhang
106
5
0
05 Oct 2024
Gamified crowd-sourcing of high-quality data for visual fine-tuning
Shashank Yadav
Rohan Tomar
Garvit Jain
Chirag Ahooja
Shubham Chaudhary
Charles Elkan
77
0
0
05 Oct 2024
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
Murong Yue
Wenlin Yao
Haitao Mi
Dian Yu
Ziyu Yao
Dong Yu
LRM
72
7
0
04 Oct 2024
Understanding Reasoning in Chain-of-Thought from the Hopfieldian View
Lijie Hu
Liang Liu
Shu Yang
Xin Chen
Zhen Tan
Muhammad Asif Ali
Mengdi Li
Di Wang
LRM
140
5
0
04 Oct 2024
ALR$^2$: A Retrieve-then-Reason Framework for Long-context Question Answering
Huayang Li
Pat Verga
Priyanka Sen
Bowen Yang
Vijay Viswanathan
Patrick Lewis
Taro Watanabe
Yixuan Su
RALM, LRM
95
8
0
04 Oct 2024
UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation
Zixuan Li
Jing Xiong
Fanghua Ye
Chuanyang Zheng
Xun Wu
...
Xiaodan Liang
Chengming Li
Zhenan Sun
Lingpeng Kong
Ngai Wong
RALM, UQLM
102
2
0
03 Oct 2024
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement
Xiangyu Peng
Congying Xia
Xinyi Yang
Caiming Xiong
Chien-Sheng Wu
Chen Xing
LRM
142
8
0
03 Oct 2024