2502.11276
The Rotary Position Embedding May Cause Dimension Inefficiency in Attention Heads for Long-Distance Retrieval
16 February 2025
Ting-Rui Chiang
Dani Yogatama
Papers citing "The Rotary Position Embedding May Cause Dimension Inefficiency in Attention Heads for Long-Distance Retrieval"
3 / 3 papers shown
Title
Qwen Technical Report
Jinze Bai, Shuai Bai, Yunfei Chu, Zeyu Cui, Kai Dang, ..., Zhenru Zhang, Chang Zhou, Jingren Zhou, Xiaohuan Zhou, Tianhang Zhu
OSLM
123
1,709
0
28 Sep 2023
RoFormer: Enhanced Transformer with Rotary Position Embedding
Jianlin Su, Yu Lu, Shengfeng Pan, Ahmed Murtadha, Bo Wen, Yunfeng Liu
86
2,307
0
20 Apr 2021
Attention Is All You Need
Ashish Vaswani, Noam M. Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan Gomez, Lukasz Kaiser, Illia Polosukhin
3DV
266
129,831
0
12 Jun 2017