ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.08743
  4. Cited By
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network
  Compilation

Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation

23 January 2020
Byung Hoon Ahn
Prannoy Pilligundla
Amir Yazdanbakhsh
H. Esmaeilzadeh
    ODL
ArXivPDFHTML

Papers citing "Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation"

18 / 18 papers shown
Title
Tilus: A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving
Tilus: A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving
Yaoyao Ding
Bohan Hou
X. Zhang
Allan Lin
Tianqi Chen
Cody Yu Hao
Yida Wang
Gennady Pekhimenko
50
0
0
17 Apr 2025
Data-efficient Performance Modeling via Pre-training
Data-efficient Performance Modeling via Pre-training
Chunting Liu
Riyadh Baghdadi
43
0
0
24 Jan 2025
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation
LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation
Mufei Li
Viraj Shitole
Eli Chien
Changhai Man
Zhaodong Wang
Srinivas Sridharan
Ying Zhang
Tushar Krishna
P. Li
37
0
0
04 Nov 2024
Target-independent XLA optimization using Reinforcement Learning
Target-independent XLA optimization using Reinforcement Learning
Milan Ganai
Haichen Li
Theodore Enns
Yida Wang
Randy Huang
34
0
0
28 Aug 2023
HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler
  for Neural Networks
HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks
Zining Zhang
Bingsheng He
Zhenjie Zhang
14
5
0
21 Nov 2022
ALT: Boosting Deep Learning Performance by Breaking the Wall between
  Graph and Operator Level Optimizations
ALT: Boosting Deep Learning Performance by Breaking the Wall between Graph and Operator Level Optimizations
Zhiying Xu
Jiafan Xu
H. Peng
Wei Wang
Xiaoliang Wang
...
Haipeng Dai
Yixu Xu
Hao Cheng
Kun Wang
Guihai Chen
20
0
0
22 Oct 2022
HW-Aware Initialization of DNN Auto-Tuning to Improve Exploration Time
  and Robustness
HW-Aware Initialization of DNN Auto-Tuning to Improve Exploration Time and Robustness
D. Rieber
Moritz Reiber
Oliver Bringmann
Holger Fröning
18
4
0
31 May 2022
Tensor Program Optimization with Probabilistic Programs
Tensor Program Optimization with Probabilistic Programs
Junru Shao
Xiyou Zhou
Siyuan Feng
Bohan Hou
Ruihang Lai
Hongyi Jin
Wuwei Lin
Masahiro Masuda
Cody Hao Yu
Tianqi Chen
34
29
0
26 May 2022
A Semi-Decoupled Approach to Fast and Optimal Hardware-Software
  Co-Design of Neural Accelerators
A Semi-Decoupled Approach to Fast and Optimal Hardware-Software Co-Design of Neural Accelerators
Bingqian Lu
Zheyu Yan
Yiyu Shi
Shaolei Ren
23
2
0
25 Mar 2022
Shisha: Online scheduling of CNN pipelines on heterogeneous
  architectures
Shisha: Online scheduling of CNN pipelines on heterogeneous architectures
Pirah Noor Soomro
M. Abduljabbar
J. Castrillón
Miquel Pericàs
24
1
0
23 Feb 2022
Benchmarking of DL Libraries and Models on Mobile Devices
Benchmarking of DL Libraries and Models on Mobile Devices
Qiyang Zhang
Xiang Li
Xiangying Che
Xiao Ma
Ao Zhou
Mengwei Xu
Shangguang Wang
Yun Ma
Xuanzhe Liu
25
48
0
14 Feb 2022
Moses: Efficient Exploitation of Cross-device Transferable Features for
  Tensor Program Optimization
Moses: Efficient Exploitation of Cross-device Transferable Features for Tensor Program Optimization
Zhihe Zhao
Xian Shuai
Yang Bai
Neiwen Ling
Nan Guan
Zhenyu Yan
Guoliang Xing
20
6
0
15 Jan 2022
Transfer-Tuning: Reusing Auto-Schedules for Efficient Tensor Program
  Code Generation
Transfer-Tuning: Reusing Auto-Schedules for Efficient Tensor Program Code Generation
Perry Gibson
José Cano
21
12
0
14 Jan 2022
Spatial Sharing of GPU for Autotuning DNN models
Spatial Sharing of GPU for Autotuning DNN models
Aditya Dhakal
Junguk Cho
Sameer G. Kulkarni
K. Ramakrishnan
P. Sharma
13
8
0
08 Aug 2020
A Learned Performance Model for Tensor Processing Units
A Learned Performance Model for Tensor Processing Units
Samuel J. Kaufman
P. Phothilimthana
Yanqi Zhou
Charith Mendis
Sudip Roy
Amit Sabne
Mike Burrows
21
8
0
03 Aug 2020
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural
  Networks for Edge Devices
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices
Byung Hoon Ahn
Jinwon Lee
J. Lin
Hsin-Pai Cheng
Jilei Hou
H. Esmaeilzadeh
73
54
0
04 Mar 2020
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural
  Networks
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
55
68
0
05 Nov 2018
Neural Architecture Search with Reinforcement Learning
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
271
5,329
0
05 Nov 2016
1