2009.02010
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
4 September 2020
Sheng-Chun Kao
Geonhwa Jeong
T. Krishna
Papers citing
"ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning"
18 / 18 papers shown
Title
AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations
Jamin Seo
Akshat Ramachandran
Yu-Chuan Chuang
Anirudh Itagi
Tushar Krishna
AI4CE
40
0
0
20 Jan 2025
SCAR: Scheduling Multi-Model AI Workloads on Heterogeneous Multi-Chiplet Module Accelerators
Mohanad Odema
Luke Chen
Hyoukjun Kwon
Mohammad Abdullah Al Faruque
36
4
0
01 May 2024
Workload-Aware Hardware Accelerator Mining for Distributed Deep Learning Training
Muhammad Adnan
Amar Phanishayee
Janardhan Kulkarni
Prashant J. Nair
Divyat Mahajan
45
0
0
23 Apr 2024
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
36
102
0
27 Feb 2023
Expediting Distributed DNN Training with Device Topology-Aware Graph Deployment
Shiwei Zhang
Xiaodong Yi
Lansong Diao
Chuan Wu
Siyu Wang
W. Lin
GNN
22
5
0
13 Feb 2023
FedGPO: Heterogeneity-Aware Global Parameter Optimization for Efficient Federated Learning
Young Geun Kim
Carole-Jean Wu
FedML
24
5
0
30 Nov 2022
Multi-Agent Reinforcement Learning for Microprocessor Design Space Exploration
Srivatsan Krishnan
Natasha Jaques
Shayegan Omidshafiei
Dan Zhang
Izzeddin Gur
Vijay Janapa Reddi
Aleksandra Faust
34
2
0
29 Nov 2022
Demystifying Map Space Exploration for NPUs
Sheng-Chun Kao
A. Parashar
Po-An Tsai
T. Krishna
38
11
0
07 Oct 2022
Special Session: Towards an Agile Design Methodology for Efficient, Reliable, and Secure ML Systems
Shail Dave
Alberto Marchisio
Muhammad Abdullah Hanif
Amira Guesmi
Aviral Shrivastava
Ihsen Alouani
Muhammad Shafique
34
13
0
18 Apr 2022
EF-Train: Enable Efficient On-device CNN Training on FPGA Through Data Reshaping for Online Adaptation or Personalization
Yue Tang
Xinyi Zhang
Peipei Zhou
Jingtong Hu
21
17
0
18 Feb 2022
DiGamma: Domain-aware Genetic Algorithm for HW-Mapping Co-optimization for DNN Accelerators
Sheng-Chun Kao
Michael Pellauer
A. Parashar
T. Krishna
27
29
0
26 Jan 2022
DNNFuser: Generative Pre-Trained Transformer as a Generalized Mapper for Layer Fusion in DNN Accelerators
Sheng-Chun Kao
Xiaoyu Huang
T. Krishna
AI4CE
35
9
0
26 Jan 2022
Data-Driven Offline Optimization For Architecting Hardware Accelerators
Aviral Kumar
Amir Yazdanbakhsh
Milad Hashemi
Kevin Swersky
Sergey Levine
27
36
0
20 Oct 2021
Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning
Rahul Bera
Konstantinos Kanellopoulos
Anant V. Nori
Taha Shahroodi
S. Subramoney
O. Mutlu
32
80
0
24 Sep 2021
AutoFL: Enabling Heterogeneity-Aware Energy Efficient Federated Learning
Young Geun Kim
Carole-Jean Wu
26
85
0
16 Jul 2021
NAAS: Neural Accelerator Architecture Search
Yujun Lin
Mengtian Yang
Song Han
34
60
0
27 May 2021
A Survey of Machine Learning for Computer Architecture and Systems
Nan Wu
Yuan Xie
AI4TS
AI4CE
20
145
0
16 Feb 2021
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
718
6,748
0
26 Sep 2016