
v1v2 (latest)
DropCompute: simple and more robust distributed synchronous training via compute variance reduction
Papers citing "DropCompute: simple and more robust distributed synchronous training via compute variance reduction"
2 / 2 papers shown
Title |
---|