Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network
Compilation

Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation

23 January 2020

Prannoy Pilligundla

Amir Yazdanbakhsh

H. Esmaeilzadeh

Papers citing "Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation"

18 / 18 papers shown

Title
Tilus: A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving Yaoyao Ding Bohan Hou X. Zhang Allan Lin Tianqi Chen Cody Yu Hao Yida Wang Gennady Pekhimenko 50 0 0 17 Apr 2025
Data-efficient Performance Modeling via Pre-training Chunting Liu Riyadh Baghdadi 43 0 0 24 Jan 2025
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation Mufei Li Viraj Shitole Eli Chien Changhai Man Zhaodong Wang Srinivas Sridharan Ying Zhang Tushar Krishna P. Li 37 0 0 04 Nov 2024
Target-independent XLA optimization using Reinforcement Learning Milan Ganai Haichen Li Theodore Enns Yida Wang Randy Huang 34 0 0 28 Aug 2023
HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks Zining Zhang Bingsheng He Zhenjie Zhang 14 5 0 21 Nov 2022
ALT: Boosting Deep Learning Performance by Breaking the Wall between Graph and Operator Level Optimizations Zhiying Xu Jiafan Xu H. Peng Wei Wang Xiaoliang Wang ... Haipeng Dai Yixu Xu Hao Cheng Kun Wang Guihai Chen 20 0 0 22 Oct 2022
HW-Aware Initialization of DNN Auto-Tuning to Improve Exploration Time and Robustness D. Rieber Moritz Reiber Oliver Bringmann Holger Fröning 18 4 0 31 May 2022
Tensor Program Optimization with Probabilistic Programs Junru Shao Xiyou Zhou Siyuan Feng Bohan Hou Ruihang Lai Hongyi Jin Wuwei Lin Masahiro Masuda Cody Hao Yu Tianqi Chen 34 29 0 26 May 2022
A Semi-Decoupled Approach to Fast and Optimal Hardware-Software Co-Design of Neural Accelerators Bingqian Lu Zheyu Yan Yiyu Shi Shaolei Ren 23 2 0 25 Mar 2022
Shisha: Online scheduling of CNN pipelines on heterogeneous architectures Pirah Noor Soomro M. Abduljabbar J. Castrillón Miquel Pericàs 24 1 0 23 Feb 2022
Benchmarking of DL Libraries and Models on Mobile Devices Qiyang Zhang Xiang Li Xiangying Che Xiao Ma Ao Zhou Mengwei Xu Shangguang Wang Yun Ma Xuanzhe Liu 25 48 0 14 Feb 2022
Moses: Efficient Exploitation of Cross-device Transferable Features for Tensor Program Optimization Zhihe Zhao Xian Shuai Yang Bai Neiwen Ling Nan Guan Zhenyu Yan Guoliang Xing 20 6 0 15 Jan 2022
Transfer-Tuning: Reusing Auto-Schedules for Efficient Tensor Program Code Generation Perry Gibson José Cano 21 12 0 14 Jan 2022
Spatial Sharing of GPU for Autotuning DNN models Aditya Dhakal Junguk Cho Sameer G. Kulkarni K. Ramakrishnan P. Sharma 13 8 0 08 Aug 2020
A Learned Performance Model for Tensor Processing Units Samuel J. Kaufman P. Phothilimthana Yanqi Zhou Charith Mendis Sudip Roy Amit Sabne Mike Burrows 21 8 0 03 Aug 2020
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices Byung Hoon Ahn Jinwon Lee J. Lin Hsin-Pai Cheng Jilei Hou H. Esmaeilzadeh 73 54 0 04 Mar 2020
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks Ahmed T. Elthakeb Prannoy Pilligundla Fatemehsadat Mireshghallah Amir Yazdanbakhsh H. Esmaeilzadeh MQ 55 68 0 05 Nov 2018
Neural Architecture Search with Reinforcement Learning Barret Zoph Quoc V. Le 271 5,329 0 05 Nov 2016