Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.03350
Cited By
Measuring and Narrowing the Compositionality Gap in Language Models
7 October 2022
Ofir Press
Muru Zhang
Sewon Min
Ludwig Schmidt
Noah A. Smith
M. Lewis
ReLM
KELM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Measuring and Narrowing the Compositionality Gap in Language Models"
50 / 97 papers shown
Title
Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering
Zheng Chu
H. Fan
Jingchang Chen
Qianyu Wang
M. Yang
...
Zhongjie Wang
Hao Li
Guo Tang
Ming Liu
Bing Qin
ReLM
LRM
66
0
0
25 May 2025
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Minki Kang
Jongwon Jeong
Seanie Lee
Jaewoong Cho
Sung Ju Hwang
LRM
148
1
0
23 May 2025
Curriculum Guided Reinforcement Learning for Efficient Multi Hop Retrieval Augmented Generation
Yuelyu Ji
Rui Meng
Zhuochun Li
Daqing He
118
0
0
23 May 2025
T
2
^2
2
: An Adaptive Test-Time Scaling Strategy for Contextual Question Answering
Zhengyi Zhao
Shubo Zhang
Zezhong Wang
Huimin Wang
Yutian Zhao
Bin Liang
Yefeng Zheng
Binyang Li
Kam-Fai Wong
X. Wu
LRM
58
0
0
23 May 2025
Two-way Evidence self-Alignment based Dual-Gated Reasoning Enhancement
Kexin Zhang
Junlan Chen
Daifeng Li
Yuxuan Zhang
Yangyang Feng
Bowen Deng
Weixu Chen
LRM
54
0
0
22 May 2025
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
Shuang Sun
Huatong Song
Yuhao Wang
Ruiyang Ren
Jinhao Jiang
...
Wayne Xin Zhao
Zheng Liu
Lei Fang
Zhongyuan Wang
Ji-Rong Wen
LRM
46
4
0
22 May 2025
GAP: Graph-Assisted Prompts for Dialogue-based Medication Recommendation
Jialun Zhong
Yanzeng Li
Sen Hu
Yang Zhang
Teng Xu
Lei Zou
LM&MA
54
0
0
19 May 2025
Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs
Yaorui Shi
Shihan Li
Chang Wu
Zhiyuan Liu
Sihang Li
Hengxing Cai
An Zhang
Xiang Wang
ReLM
LRM
95
0
0
16 May 2025
SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning
Cheng Wen
Tingwei Guo
Shuaijiang Zhao
Wei Zou
Xiangang Li
OffRL
AuLLM
LRM
83
5
0
22 Apr 2025
CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models
Feiyang Li
Peng Fang
Zhan Shi
Arijit Khan
Fang Wang
Dan Feng
Weihao Wang
Xin Zhang
Yongjian Cui
ReLM
LRM
73
1
0
18 Apr 2025
LLM-as-a-Judge: Reassessing the Performance of LLMs in Extractive QA
Xanh Ho
Jiahao Huang
Florian Boudin
Akiko Aizawa
ELM
75
0
0
16 Apr 2025
ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search
Yize Zhang
Tianshu Wang
Sirui Chen
Kun Wang
Xingyu Zeng
Hongyu Lin
Jia Zheng
Le Sun
Chaochao Lu
LRM
53
0
0
15 Apr 2025
Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models
Yuxiang Lin
Jingdong Sun
Zhi-Qi Cheng
Jue Wang
Haomin Liang
Zebang Cheng
Yifei Dong
Jun-Yan He
Xiaojiang Peng
Xian-Sheng Hua
109
0
0
10 Apr 2025
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
M. Ben-Chen
Tianpeng Li
Haoze Sun
Yijie Zhou
Chenzheng Zhu
...
Xin Wu
Haofen Wang
Jeff Z. Pan
Wen Zhang
Ningyu Zhang
ReLM
OffRL
AI4TS
LRM
113
16
0
25 Mar 2025
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Bowen Jin
Hansi Zeng
Zhenrui Yue
Dong Wang
Sercan O. Arik
Dong Wang
Hamed Zamani
Jiawei Han
RALM
ReLM
KELM
OffRL
AI4TS
LRM
137
77
0
12 Mar 2025
ReAgent: Reversible Multi-Agent Reasoning for Knowledge-Enhanced Multi-Hop QA
Zhao Xinjie
Fan Gao
Rui Yang
Yingjian Chen
Yuyang Wang
Ying Zhu
Jiacheng Tang
Irene Li
Y. Matsuo
Irene Li
KELM
LRM
72
1
0
10 Mar 2025
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Huatong Song
Jinhao Jiang
Yingqian Min
Jie Chen
Zhongfu Chen
Wayne Xin Zhao
Lei Fang
Ji-Rong Wen
AI4TS
LRM
KELM
139
29
0
07 Mar 2025
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction
Lu Dai
Yijie Xu
Jinhui Ye
Hao Liu
Hui Xiong
3DV
RALM
139
2
0
03 Mar 2025
Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models
Zhenyu Pan
Haozheng Luo
Manling Li
Han Liu
LRM
84
16
0
24 Feb 2025
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation
Abdelrahman Abdallah
Bhawna Piryani
Jamshid Mozafari
Mohammed Ali
Adam Jatowt
206
1
0
21 Feb 2025
RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation
Pengcheng Jiang
Lang Cao
Ruike Zhu
Minhao Jiang
Yunyi Zhang
Jimeng Sun
Jiawei Han
RALM
167
1
0
16 Feb 2025
Bag of Tricks for Inference-time Computation of LLM Reasoning
Fan Liu
Wenshuo Chao
Naiqiang Tan
Hao Liu
OffRL
LRM
104
3
0
11 Feb 2025
Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method
Peter Baile Chen
Yi Zhang
Michael Cafarella
Dan Roth
RALM
AIFin
223
3
0
30 Jan 2025
Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation
Takyoung Kim
Kyungjae Lee
Y. Jang
Ji Yong Cho
Gangwoo Kim
Minseok Cho
Moontae Lee
198
1
0
28 Jan 2025
Chain-of-Retrieval Augmented Generation
Liang Wang
Haonan Chen
Nan Yang
Xiaolong Huang
Zhicheng Dou
Furu Wei
RALM
LRM
ReLM
3DV
114
7
0
24 Jan 2025
Can ChatGPT Overcome Behavioral Biases in the Financial Sector? Classify-and-Rethink: Multi-Step Zero-Shot Reasoning in the Gold Investment
Shuoling Liu
Gaoguo Jia
Yuhang Jiang
Liyuan Chen
Qiang Yang
AIFin
LRM
126
0
0
17 Jan 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Wu
Chenyang Yang
109
34
0
10 Jan 2025
AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
Amy Xin
Jinxin Liu
Zijun Yao
Zhicheng Li
S. Cao
Lei Hou
Juanzi Li
LRM
123
2
0
25 Nov 2024
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Tanmay Parekh
Pradyot Prakash
Alexander Radovic
Akshay Shekher
Denis Savenkov
LRM
280
2
0
30 Oct 2024
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning
Mingyang Chen
Haoze Sun
Tianpeng Li
Fan Yang
Hao Liang
Keer Lu
Tengjiao Wang
Wentao Zhang
Guosheng Dong
Weipeng Chen
LRM
85
5
0
16 Oct 2024
RuleRAG: Rule-Guided Retrieval-Augmented Generation with Language Models for Question Answering
Zhongwu Chen
Chengjin Xu
Dingmin Wang
Zhen Huang
Yong Dou
Xuhui Jiang
Jian Guo
RALM
378
1
0
15 Oct 2024
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning
Hyun Ryu
Gyeongman Kim
Hyemin S. Lee
Eunho Yang
LRM
75
6
0
10 Oct 2024
Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context
Sangwon Yu
Ik-hwan Kim
Jongyoon Song
Saehyung Lee
Junsung Park
Sungroh Yoon
LRM
101
1
0
09 Oct 2024
Geometric Signatures of Compositionality Across a Language Model's Lifetime
Jin Hwa Lee
Thomas Jiralerspong
Lei Yu
Yoshua Bengio
Emily Cheng
CoGe
116
3
0
02 Oct 2024
Can Models Learn Skill Composition from Examples?
Haoyu Zhao
Simran Kaur
Dingli Yu
Anirudh Goyal
Sanjeev Arora
CoGe
MoE
71
5
0
29 Sep 2024
Pandora's Box or Aladdin's Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language Models
Jinyang Wu
Feihu Che
Chuyuan Zhang
Mingkuan Feng
Shuai Zhang
Pengpeng Shao
Jianhua Tao
109
5
0
24 Aug 2024
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
Bernal Jiménez Gutiérrez
Yiheng Shu
Yu Gu
Michihiro Yasunaga
Yu-Chuan Su
RALM
CLL
92
39
0
23 May 2024
FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
Jiajie Jin
Yutao Zhu
Xinyu Yang
Chenghao Zhang
Zhicheng Dou
Chenghao Zhang
Tong Zhao
Zhao Yang
Zhicheng Dou
Ji-Rong Wen
VLM
120
58
0
22 May 2024
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Yu Xia
Rui Wang
Xu Liu
Mingyan Li
Tong Yu
Xiang Chen
Julian McAuley
Shuai Li
LRM
75
21
0
24 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
135
50
0
23 Apr 2024
Demystifying Chains, Trees, and Graphs of Thoughts
Maciej Besta
Florim Memedi
Zhenyu Zhang
Robert Gerstenberger
Guangyuan Piao
...
Aleš Kubíček
H. Niewiadomski
Aidan O'Mahony
Onur Mutlu
Torsten Hoefler
AI4CE
LRM
177
26
0
25 Jan 2024
AutoMix: Automatically Mixing Language Models
Pranjal Aggarwal
Aman Madaan
Ankit Anand
Srividya Pranavi Potharaju
Swaroop Mishra
...
Karthik Kappaganthu
Yiming Yang
Shyam Upadhyay
Manaal Faruqui
Mausam
83
22
0
19 Oct 2023
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
Haoxuan You
Rui Sun
Zhecan Wang
Long Chen
Gengyu Wang
Hammad A. Ayyubi
Kai-Wei Chang
Shih-Fu Chang
VLM
MLLM
LRM
91
43
0
24 May 2023
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
362
2,709
0
06 Oct 2022
Decomposed Prompting: A Modular Approach for Solving Complex Tasks
Tushar Khot
H. Trivedi
Matthew Finlayson
Yao Fu
Kyle Richardson
Peter Clark
Ashish Sabharwal
ReLM
LRM
93
437
0
05 Oct 2022
RealTime QA: What's the Answer Right Now?
Jungo Kasai
Keisuke Sakaguchi
Yoichi Takahashi
Ronan Le Bras
Akari Asai
Xinyan Velocity Yu
Dragomir R. Radev
Noah A. Smith
Yejin Choi
Kentaro Inui
KELM
107
178
0
27 Jul 2022
Emergent Abilities of Large Language Models
Jason W. Wei
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
...
Tatsunori Hashimoto
Oriol Vinyals
Percy Liang
J. Dean
W. Fedus
ELM
ReLM
LRM
191
2,428
0
15 Jun 2022
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Aarohi Srivastava
Abhinav Rastogi
Abhishek Rao
Abu Awal Md Shoeb
Abubakar Abid
...
Zhuoye Zhao
Zijian Wang
Zijie J. Wang
Zirui Wang
Ziyi Wu
ELM
117
1,739
0
09 Jun 2022
Is a Question Decomposition Unit All We Need?
Pruthvi H. Patel
Swaroop Mishra
Mihir Parmar
Chitta Baral
ReLM
170
51
0
25 May 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
475
4,077
0
24 May 2022
1
2
Next