Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.09198
Cited By
Never Lost in the Middle: Improving Large Language Models via Attention Strengthening Question Answering
15 November 2023
Junqing He
Kunhao Pan
Xiaoqun Dong
Zhuoyang Song
LiuYiBo LiuYiBo
Yuxin Liang
Hao Wang
Qianguosun Qianguosun
Enming Zhang
Zejian Xie
Jiaxing Zhang
KELM
RALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Never Lost in the Middle: Improving Large Language Models via Attention Strengthening Question Answering"
9 / 9 papers shown
Title
SEGMENT+: Long Text Processing with Short-Context Language Models
Wei Shi
Shuang Li
Kerun Yu
Jinglei Chen
Zujie Liang
...
Feng Wei
Bo Zheng
Jiaqing Liang
Jiangjie Chen
Yanghua Xiao
RALM
VLM
57
2
0
09 Oct 2024
Eliminating Position Bias of Language Models: A Mechanistic Approach
Ziqi Wang
Hanlin Zhang
Xiner Li
Kuan-Hao Huang
Chi Han
Shuiwang Ji
Sham Kakade
Hao Peng
Heng Ji
57
12
0
01 Jul 2024
Mitigate Position Bias in Large Language Models via Scaling a Single Dimension
Yijiong Yu
Huiqiang Jiang
Xufang Luo
Qianhui Wu
Chin-Yew Lin
Dongsheng Li
Yuqing Yang
Yongfeng Huang
L. Qiu
50
9
0
04 Jun 2024
Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models
Longze Chen
Ziqiang Liu
Wanwei He
Yunshui Li
Run Luo
Min Yang
42
9
0
28 May 2024
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
Zhenyu (Allen) Zhang
Runjin Chen
Shiwei Liu
Zhewei Yao
Olatunji Ruwase
Beidi Chen
Xiaoxia Wu
Zhangyang Wang
34
26
0
05 Mar 2024
Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation
Yuxin Liang
Zhuoyang Song
Hao Wang
Jiaxing Zhang
HILM
43
30
0
27 Jan 2024
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Wenhai Wang
Jiangwei Xie
ChuanYang Hu
Haoming Zou
Jianan Fan
...
Lewei Lu
Xizhou Zhu
Xiaogang Wang
Yu Qiao
Jifeng Dai
36
125
0
14 Dec 2023
Your Transformer May Not be as Powerful as You Expect
Shengjie Luo
Shanda Li
Shuxin Zheng
Tie-Yan Liu
Liwei Wang
Di He
70
51
0
26 May 2022
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Ofir Press
Noah A. Smith
M. Lewis
253
698
0
27 Aug 2021
1