Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.10418
Cited By
DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
17 November 2023
Chenyu Jiang
Zhen Jia
Shuai Zheng
Yida Wang
Chuan Wu
MoE
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines"
4 / 4 papers shown
Title
PipeWeaver: Addressing Data Dynamicity in Large Multimodal Model Training with Dynamic Interleaved Pipeline
Zhenliang Xue
Hanpeng Hu
Xing Chen
Yimin Jiang
Yixin Song
Zeyu Mi
Yibo Zhu
Daxin Jiang
Yubin Xia
Haibo Chen
49
0
0
19 Apr 2025
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs
Yujia Zhai
Chengquan Jiang
Leyuan Wang
Xiaoying Jia
Shang Zhang
Zizhong Chen
Xin Liu
Yibo Zhu
62
48
0
06 Oct 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
215
1,661
0
15 Oct 2021
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
205
3,513
0
10 Jun 2015
1