Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.12522
Cited By
BiTA: Bi-Directional Tuning for Lossless Acceleration in Large Language Models
23 January 2024
Feng-Huei Lin
Hanling Yi
Hongbin Li
Yifan Yang
Xiaotian Yu
Guangming Lu
Rong Xiao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BiTA: Bi-Directional Tuning for Lossless Acceleration in Large Language Models"
10 / 10 papers shown
Title
Accelerating LLM Inference with Staged Speculative Decoding
Benjamin Spector
Christal Re
31
103
0
08 Aug 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM
OSLM
ELM
220
4,085
0
09 Jun 2023
Fast Inference from Transformers via Speculative Decoding
Yaniv Leviathan
Matan Kalman
Yossi Matias
LRM
69
663
0
30 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
266
2,348
0
09 Nov 2022
Directed Acyclic Transformer for Non-Autoregressive Machine Translation
Fei Huang
Hao Zhou
Yang Liu
Hanguang Li
Minlie Huang
AI4CE
54
61
0
16 May 2022
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Yisheng Xiao
Lijun Wu
Junliang Guo
Juntao Li
Hao Fei
Tao Qin
Tie-Yan Liu
3DV
MedIm
AI4CE
47
85
0
20 Apr 2022
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
445
3,952
0
18 Apr 2021
GPT Understands, Too
Xiao Liu
Yanan Zheng
Zhengxiao Du
Ming Ding
Yujie Qian
Zhilin Yang
Jie Tang
VLM
130
1,161
0
18 Mar 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Xiang Lisa Li
Percy Liang
164
4,167
0
01 Jan 2021
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
97
1,652
0
27 Aug 2018
1