Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design

Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design

Papers citing "Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design"

50 / 57 papers shown
Title
Learning Curve Theory
Learning Curve Theory
Marcus Hutter
202
63
0
08 Feb 2021