Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.09869
Cited By
Transformer with Memory Replay
19 May 2022
R. Liu
Barzan Mozafari
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Transformer with Memory Replay"
4 / 4 papers shown
Title
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers
R. Liu
Young Jin Kim
Alexandre Muzio
Hany Awadalla
MoE
47
22
0
28 May 2022
Dynamic Experience Replay
Jieliang Luo
Hui Li
118
24
0
04 Mar 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,959
0
20 Apr 2018
Efficient Per-Example Gradient Computations
Ian Goodfellow
186
74
0
07 Oct 2015
1