Title
Solving Fourier ptychographic imaging problems via neural network modeling and TensorFlow Shaowei Jiang K. Guo Jun Liao G. Zheng 57 96 0 09 Mar 2018
High-Accuracy Low-Precision Training Christopher De Sa Megan Leszczynski Jian Zhang Alana Marzoev Christopher R. Aberger K. Olukotun Christopher Ré 80 109 0 09 Mar 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine Renzo Andri Lukas Cavigelli D. Rossi Luca Benini MQ 85 19 0 05 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches Md. Zahangir Alom T. Taha C. Yakopcic Stefan Westberg P. Sidike Mst Shamima Nasrin B. Van Essen A. Awwal V. Asari VLM 133 882 0 03 Mar 2018
Trustless Machine Learning Contracts; Evaluating and Exchanging Machine Learning Models on the Ethereum Blockchain A. Besir Kurtulmus Kenny Daniel SyDa 73 109 0 27 Feb 2018
A High GOPs/Slice Time Series Classifier for Portable and Embedded Biomedical Applications H. Soleimani Aliasghar Makhlooghpour Wilten Nicola Claudia Clopath E. Drakakis 20 2 0 27 Feb 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis Tal Ben-Nun Torsten Hoefler GNN 85 713 0 26 Feb 2018
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence Jinglan Liu Jiaxin Zhang Yukun Ding Xiaowei Xu Meng Jiang Yiyu Shi 76 4 0 26 Feb 2018
BigDataBench: A Scalable and Unified Big Data and AI Benchmark Suite Wanling Gao Jianfeng Zhan Lei Wang Chunjie Luo Daoyi Zheng ... Hainan Ye Haoning Tang Zheng Cao Shujie Zhang Jiahui Dai 77 35 0 23 Feb 2018
SparCML: High-Performance Sparse Communication for Machine Learning Cédric Renggli Saleh Ashkboos Mehdi Aghagolzadeh Dan Alistarh Torsten Hoefler 91 127 0 22 Feb 2018
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning Hyeontaek Lim D. Andersen M. Kaminsky 134 70 0 21 Feb 2018
Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative Refinement Jason D. Lee Elman Mansimov Kyunghyun Cho DiffM BDL 97 456 0 19 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets Fabian Schuiki Michael Schaffner Frank K. Gürkaynak Luca Benini 66 70 0 19 Feb 2018
Deep neural decoders for near term fault-tolerant experiments C. Chamberland Pooya Ronagh 51 84 0 18 Feb 2018
Massivizing Computer Systems: a Vision to Understand, Design, and Engineer Computer Ecosystems through and beyond Modern Distributed Systems Alexandru Iosup Alexandru Uta L. Versluis George Andreadis Erwin Van Eyk T. Hegeman Sacheendra Talluri V. V. Beek L. Toader GNN 49 28 0 15 Feb 2018
Security Analysis and Enhancement of Model Compressed Deep Learning Systems under Adversarial Attacks Qi Liu Tao Liu Zihao Liu Yanzhi Wang Yier Jin Wujie Wen AAML 68 48 0 14 Feb 2018
Field-Programmable Deep Neural Network (DNN) Learning and Inference accelerator: a concept L. Franca-Neto 27 1 0 14 Feb 2018
Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions Nicolas Vasilache O. Zinenko Theodoros Theodoridis Priya Goyal Zach DeVito William S. Moses Sven Verdoolaege Andrew Adams Albert Cohen 126 438 0 13 Feb 2018
Training and Inference with Integers in Deep Neural Networks Shuang Wu Guoqi Li F. Chen Luping Shi MQ 78 391 0 13 Feb 2018
TVM: An Automated End-to-End Optimizing Compiler for Deep Learning Tianqi Chen T. Moreau Ziheng Jiang Lianmin Zheng Eddie Q. Yan ... Leyuan Wang Yuwei Hu Luis Ceze Carlos Guestrin Arvind Krishnamurthy 223 374 0 12 Feb 2018
ThUnderVolt: Enabling Aggressive Voltage Underscaling and Timing Error Resilience for Energy Efficient Deep Neural Network Accelerators Jeff Zhang Kartheek Rangineni Zahra Ghodsi S. Garg 81 118 0 11 Feb 2018
Analyzing and Mitigating the Impact of Permanent Faults on a Systolic Array Based Neural Network Accelerator Jeff Zhang Tianyu Gu K. Basu S. Garg 39 135 0 11 Feb 2018
Recent Advances in Efficient Computation of Deep Convolutional Neural Networks Jian Cheng Peisong Wang Gang Li Qinghao Hu Hanqing Lu 49 3 0 03 Feb 2018
VIBNN: Hardware Acceleration of Bayesian Neural Networks R. Cai Ao Ren Ning Liu Caiwen Ding Luhao Wang Xuehai Qian Massoud Pedram Yanzhi Wang BDL 87 87 0 02 Feb 2018
On Scale-out Deep Learning Training for Cloud and HPC Srinivas Sridharan K. Vaidyanathan Dhiraj D. Kalamkar Dipankar Das Mikhail E. Smorkalov ... Dheevatsa Mudigere Naveen Mellempudi Sasikanth Avancha Bharat Kaul Pradeep Dubey BDL 70 30 0 24 Jan 2018
Flexible Deep Neural Network Processing Hokchhay Tann S. Hashemi Sherief Reda AI4CE 30 8 0 23 Jan 2018
In-RDBMS Hardware Acceleration of Advanced Analytics Divya Mahajan Joo-Young Kim Jacob Sacks A. Ardalan Arun Kumar H. Esmaeilzadeh 69 47 0 08 Jan 2018
DeepPicar: A Low-cost Deep Neural Network-based Autonomous Car Michael Bechtel Elise McEllhiney Minje Kim H. Yun 85 103 0 19 Dec 2017
TensorFlow-Serving: Flexible, High-Performance ML Serving Christopher Olston Noah Fiedel Kiril Gorovoy Jeremiah Harmsen Li Lao Fangwei Li Vinu Rajashekhar Sukriti Ramesh Jordan Soyke 55 312 0 17 Dec 2017
A Berkeley View of Systems Challenges for AI Ion Stoica Basel Alomair Raluca A. Popa D. Patterson Michael W. Mahoney ... Joseph E. Gonzalez Ken Goldberg A. Ghodsi David Culler Pieter Abbeel 87 201 0 15 Dec 2017
Deep Learning for IoT Big Data and Streaming Analytics: A Survey M. Mohammadi Ala I. Al-Fuqaha Sameh Sorour Mohsen Guizani 114 1,062 0 09 Dec 2017
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm David Silver Thomas Hubert Julian Schrittwieser Ioannis Antonoglou Matthew Lai ... D. Kumaran T. Graepel Timothy Lillicrap Karen Simonyan Demis Hassabis 210 1,784 0 05 Dec 2017
Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks Hardik Sharma Jongse Park Naveen Suda Liangzhen Lai Benson Chau Joo-Young Kim Vikas Chandra H. Esmaeilzadeh MQ 70 494 0 05 Dec 2017
NEURAghe: Exploiting CPU-FPGA Synergies for Efficient and Flexible CNN Inference Acceleration on Zynq SoCs Paolo Meloni Alessandro Capotondi Gianfranco Deriu Michele Brian Francesco Conti D. Rossi L. Raffo Luca Benini 62 51 0 04 Dec 2017
Structured Deep Neural Network Pruning via Matrix Pivoting Ranko Sredojevic Shaoyi Cheng Lazar Supic R. Naous Vladimir M. Stojanović 59 7 0 01 Dec 2017
Machine Learning and Manycore Systems Design: A Serendipitous Symbiosis R. Kim J. Doppa P. Pande Diana Marculescu R. Marculescu 45 27 0 30 Nov 2017
TensorFlow Distributions Joshua V. Dillon I. Langmore Dustin Tran E. Brevdo Srinivas Vasudevan David A. Moore Brian Patton Alexander A. Alemi Matt Hoffman Rif A. Saurous GP 120 352 0 28 Nov 2017
Recurrent Segmentation for Variable Computational Budgets Lane T. McIntosh Niru Maheswaranathan David Sussillo Jonathon Shlens SSeg VOS 89 20 0 28 Nov 2017
A Manifesto for Future Generation Cloud Computing: Research Directions for the Next Decade Rajkumar Buyya Satish Narayana G. Casale R. Calheiros Yogesh L. Simmhan ... Wanlei Zhou Hai Jin W. Gentzsch Albert Y. Zomaya Haiying Shen AI4TS AILaw 75 143 0 24 Nov 2017
Deep supervised learning using local errors Hesham Mostafa V. Ramesh Gert Cauwenberghs 68 115 0 17 Nov 2017
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler Yu Ji Youhui Zhang Wenguang Chen Yuan Xie 102 56 0 15 Nov 2017
Chipmunk: A Systolically Scalable 0.9 mm², 3.08 Gop/s/mW @ 1.2 mW Accelerator for Near-Sensor Recurrent Neural Network Inference Francesco Conti Lukas Cavigelli G. Paulin Igor Susmelj Luca Benini 46 42 0 15 Nov 2017
Deep Rewiring: Training very sparse deep networks G. Bellec David Kappel Wolfgang Maass Robert Legenstein BDL 204 279 0 14 Nov 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation Moritz B. Milde Daniel Neil Alessandro Aimar T. Delbruck Giacomo Indiveri MQ 73 10 0 13 Nov 2017
DLVM: A modern compiler infrastructure for deep learning systems Richard Wei Lane Schwartz Vikram S. Adve 65 58 0 08 Nov 2017
Block-Sparse Recurrent Neural Networks Sharan Narang Eric Undersander G. Diamos 62 139 0 08 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks Sanchari Sen Shubham Jain Swagath Venkataramani A. Raghunathan 55 30 0 07 Nov 2017
Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks Urs Koster T. Webb Xin Eric Wang Marcel Nassar Arjun K. Bansal ... Luke Hornof A. Khosrowshahi Carey Kloss Ruby J. Pai N. Rao MQ 57 262 0 06 Nov 2017
Don't Decay the Learning Rate, Increase the Batch Size Samuel L. Smith Pieter-Jan Kindermans Chris Ying Quoc V. Le ODL 130 996 0 01 Nov 2017
HPC Cloud for Scientific and Business Applications: Taxonomy, Vision, and Research Challenges M. Netto R. Calheiros Eduardo Rodrigues R. L. F. Cunha Rajkumar Buyya 90 74 0 24 Oct 2017