Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.16158
Cited By
The Feasibility of Implementing Large-Scale Transformers on Multi-FPGA Platforms
24 April 2024
Yu Gao
Juan Camilo Vega
Paul Chow
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Feasibility of Implementing Large-Scale Transformers on Multi-FPGA Platforms"
4 / 4 papers shown
Title
FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs
Shulin Zeng
Jun Liu
Guohao Dai
Xinhao Yang
Tianyu Fu
...
Zehao Wang
Ruoyu Zhang
Kairui Wen
Xuefei Ning
Yu Wang
62
55
0
08 Jan 2024
DFX: A Low-latency Multi-FPGA Appliance for Accelerating Transformer-based Text Generation
Seongmin Hong
Seungjae Moon
Junsoo Kim
Sungjae Lee
Minsub Kim
Dongsoo Lee
Joo-Young Kim
72
76
0
22 Sep 2022
I-BERT: Integer-only BERT Quantization
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
105
341
0
05 Jan 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
1