Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.02123
Cited By
Eva: A General Vectorized Approximation Framework for Second-order Optimization
4 August 2023
Lin Zhang
S. Shi
Bo-wen Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Eva: A General Vectorized Approximation Framework for Second-order Optimization"
4 / 4 papers shown
Title
Inverse-Free Fast Natural Gradient Descent Method for Deep Learning
Xinwei Ou
Ce Zhu
Xiaolin Huang
Yipeng Liu
ODL
40
0
0
06 Mar 2024
Gradient Descent on Neurons and its Link to Approximate Second-Order Optimization
Frederik Benzing
ODL
40
23
0
28 Jan 2022
Accelerating Distributed K-FAC with Smart Parallelism of Computing and Communication Tasks
S. Shi
Lin Zhang
Bo-wen Li
37
9
0
14 Jul 2021
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
245
1,821
0
17 Sep 2019
1