ResearchTrend.AI
CoSA: Scheduling by Constrained Optimization for Spatial Accelerators

5 May 2021
Qijing Huang
Minwoo Kang
Grace Dinh
Thomas Norell
Aravind Kalaiah
J. Demmel
J. Wawrzynek
Y. Shao

Papers citing "CoSA: Scheduling by Constrained Optimization for Spatial Accelerators"

9 / 9 papers shown
Blockbuster, Part 1: Block-level AI Operator Fusion
Ofer Dekel
21
0
0
29 Apr 2025
SALSA: Simulated Annealing based Loop-Ordering Scheduler for DNN Accelerators
Victor J. B. Jung
Arne Symons
L. Mei
Marian Verhelst
Luca Benini
18
3
0
20 Apr 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
36
101
0
27 Feb 2023
Demystifying Map Space Exploration for NPUs
Sheng-Chun Kao
A. Parashar
Po-An Tsai
T. Krishna
38
11
0
07 Oct 2022
Sparseloop: An Analytical Approach To Sparse Tensor Accelerator Modeling
Yannan Nellie Wu
Po-An Tsai
A. Parashar
Vivienne Sze
J. Emer
25
57
0
12 May 2022
Communication Bounds for Convolutional Neural Networks
An Chen
J. Demmel
Grace Dinh
Mason Haberle
Olga Holtz
9
4
0
18 Apr 2022
DNNFuser: Generative Pre-Trained Transformer as a Generalized Mapper for Layer Fusion in DNN Accelerators
Sheng-Chun Kao
Xiaoyu Huang
T. Krishna
AI4CE
35
9
0
26 Jan 2022
FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks
Sheng-Chun Kao
Suvinay Subramanian
Gaurav Agrawal
Amir Yazdanbakhsh
T. Krishna
38
57
0
13 Jul 2021
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
297
10,220
0
16 Nov 2016