Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.04497
Cited By
The Fine-Grained Complexity of Gradient Computation for Training Large Language Models
7 February 2024
Josh Alman
Zhao-quan Song
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Fine-Grained Complexity of Gradient Computation for Training Large Language Models"
6 / 6 papers shown
Title
Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation
Yang Cao
Zhao-quan Song
Chiwun Yang
VGen
46
2
0
01 Feb 2025
HSR-Enhanced Sparse Attention Acceleration
Bo Chen
Yingyu Liang
Zhizhou Sha
Zhenmei Shi
Zhao-quan Song
93
18
0
14 Oct 2024
Outlier-Efficient Hopfield Layers for Large Transformer-Based Models
Jerry Yao-Chieh Hu
Pei-Hsuan Chang
Haozheng Luo
Hong-Yu Chen
Weijian Li
Wei-Po Wang
Han Liu
39
25
0
04 Apr 2024
Uniform Memory Retrieval with Larger Capacity for Modern Hopfield Models
Dennis Wu
Jerry Yao-Chieh Hu
Teng-Yun Hsiao
Han Liu
40
28
0
04 Apr 2024
Dynamic Tensor Product Regression
Aravind Reddy
Zhao-quan Song
Licheng Zhang
42
20
0
08 Oct 2022
On The Computational Complexity of Self-Attention
Feyza Duman Keles
Pruthuvi Maheshakya Wijewardena
C. Hegde
68
108
0
11 Sep 2022
1