Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.13045
Cited By
QADAM: Quantization-Aware DNN Accelerator Modeling for Pareto-Optimality
20 May 2022
A. Inci
Siri Garudanagiri Virupaksha
Aman Jain
Venkata Vivek Thallam
Ruizhou Ding
Diana Marculescu
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"QADAM: Quantization-Aware DNN Accelerator Modeling for Pareto-Optimality"
20 / 20 papers shown
Title
QAPPA: Quantization-Aware Power, Performance, and Area Modeling of DNN Accelerators
A. Inci
Siri Garudanagiri Virupaksha
Aman Jain
Venkata Vivek Thallam
Ruizhou Ding
Diana Marculescu
MQ
30
5
0
17 May 2022
Rethinking Co-design of Neural Architectures and Hardware Accelerators
Yanqi Zhou
Xuanyi Dong
Berkin Akin
Mingxing Tan
Daiyi Peng
Tianjian Meng
Amir Yazdanbakhsh
Da Huang
Ravi Narayanaswami
James Laudon
117
26
0
17 Feb 2021
DeepNVM++: Cross-Layer Modeling and Optimization Framework of Non-Volatile Memories for Deep Learning
A. Inci
Mehmet Meric Isgenc
Diana Marculescu
50
20
0
08 Dec 2020
The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems
A. Inci
Evgeny Bolotin
Yaosheng Fu
Gal Dalal
Shie Mannor
D. Nellans
Diana Marculescu
AI4CE
41
13
0
08 Dec 2020
Accelerator-aware Neural Network Design using AutoML
Suyog Gupta
Berkin Akin
56
66
0
05 Mar 2020
Co-Exploration of Neural Architectures and Heterogeneous ASIC Accelerator Designs Targeting Multiple Tasks
Lei Yang
Zheyu Yan
Meng Li
Hyoukjun Kwon
Liangzhen Lai
T. Krishna
Vikas Chandra
Weiwen Jiang
Yiyu Shi
69
115
0
10 Feb 2020
EfficientDet: Scalable and Efficient Object Detection
Mingxing Tan
Ruoming Pang
Quoc V. Le
92
5,024
0
20 Nov 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DV
MedIm
131
18,058
0
28 May 2019
Towards Efficient Model Compression via Learned Global Ranking
Ting-Wu Chin
Ruizhou Ding
Cha Zhang
Diana Marculescu
48
171
0
28 Apr 2019
SCALE-Sim: Systolic CNN Accelerator Simulator
A. Samajdar
Yuhao Zhu
P. Whatmough
Matthew Mattina
Tushar Krishna
89
137
0
16 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.6K
94,511
0
11 Oct 2018
Hardware-Aware Machine Learning: Modeling and Optimization
Diana Marculescu
Dimitrios Stamoulis
E. Cai
49
45
0
14 Sep 2018
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
Yang He
Guoliang Kang
Xuanyi Dong
Yanwei Fu
Yi Yang
AAML
VLM
60
963
0
21 Aug 2018
NeuralPower: Predict and Deploy Energy-Efficient Convolutional Neural Networks
E. Cai
Da-Cheng Juan
Dimitrios Stamoulis
Diana Marculescu
34
132
0
15 Oct 2017
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
A. Parashar
Minsoo Rhu
Anurag Mukkara
A. Puglielli
Rangharajan Venkatesan
Brucek Khailany
J. Emer
S. Keckler
W. Dally
70
1,125
0
23 May 2017
In-Datacenter Performance Analysis of a Tensor Processing Unit
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
...
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
223
4,626
0
16 Apr 2017
EIE: Efficient Inference Engine on Compressed Deep Neural Network
Song Han
Xingyu Liu
Huizi Mao
Jing Pu
A. Pedram
M. Horowitz
W. Dally
118
2,455
0
04 Feb 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.1K
193,426
0
10 Dec 2015
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han
Huizi Mao
W. Dally
3DGS
243
8,821
0
01 Oct 2015
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
1.5K
100,213
0
04 Sep 2014
1