LR-CNN: Lightweight Row-centric Convolutional Neural Network Training for Memory Reduction

21 January 2024

Papers citing "LR-CNN: Lightweight Row-centric Convolutional Neural Network Training for Memory Reduction"

4 / 4 papers shown

Title
ZeRO++: Extremely Efficient Collective Communication for Giant Model Training Guanhua Wang Heyang Qin S. A. Jacobs Connor Holmes Samyam Rajbhandari Olatunji Ruwase Feng Yan Lei Yang Yuxiong He VLM 59 57 0 16 Jun 2023
Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction Georgii Sergeevich Novikov Daniel Bershatsky Julia Gusak Alex Shonenkov Denis Dimitrov Ivan V. Oseledets MQ 26 17 0 01 Feb 2022
ZeRO-Offload: Democratizing Billion-Scale Model Training Jie Ren Samyam Rajbhandari Reza Yazdani Aminabadi Olatunji Ruwase Shuangyang Yang Minjia Zhang Dong Li Yuxiong He MoE 177 414 0 18 Jan 2021
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism M. Shoeybi M. Patwary Raul Puri P. LeGresley Jared Casper Bryan Catanzaro MoE 245 1,821 0 17 Sep 2019