Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.04760
Cited By
In-Datacenter Performance Analysis of a Tensor Processing Unit
16 April 2017
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
Raminder Bajwa
Sarah Bates
Suresh Bhatia
Nan Boden
Al Borchers
Rick Boyle
Pierre-luc Cantin
Clifford Chao
Chris Clark
Jeremy Coriell
Mike Daley
Matt Dau
Jeffrey Dean
Ben Gelb
Taraneh Ghaemmaghami
Rajendra Gottipati
William Gulland
Robert Hagmann
C. Richard Ho
Doug Hogberg
John Hu
R. Hundt
Dan Hurt
Julian Ibarz
A. Jaffey
Alek Jaworski
Alexander Kaplan
Harshit Khaitan
Andy Koch
Naveen Kumar
Steve Lacy
James Laudon
James Law
Diemthu Le
Chris Leary
Zhuyuan Liu
Kyle Lucke
Alan Lundin
Gordon MacKean
Adriana Maggiore
Maire Mahony
Kieran Miller
R. Nagarajan
Ravi Narayanaswami
Ray Ni
Kathy Nix
Thomas Norrie
Mark Omernick
Narayana Penukonda
Andy Phelps
Jonathan Ross
Matt Ross
Amir Salek
Emad Samadiani
Chris Severn
Gregory Sizikov
Matthew Snelham
Jed Souter
Dan Steinberg
Andy Swing
Mercedes Tan
Gregory Thorson
Bo Tian
Horia Toma
Erick Tuttle
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
Re-assign community
ArXiv
PDF
HTML
Papers citing
"In-Datacenter Performance Analysis of a Tensor Processing Unit"
50 / 1,165 papers shown
Title
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
29
874
0
03 Mar 2018
Trustless Machine Learning Contracts; Evaluating and Exchanging Machine Learning Models on the Ethereum Blockchain
A. Krizhevsky
Geoffrey E. Hinton
SyDa
19
109
0
27 Feb 2018
A High GOPs/Slice Time Series Classifier for Portable and Embedded Biomedical Applications
H. Soleimani
Aliasghar
Makhlooghpour
Wilten Nicola
Claudia Clopath
E. Drakakis
11
2
0
27 Feb 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
33
703
0
26 Feb 2018
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence
Jinglan Liu
Jiaxin Zhang
Yukun Ding
Xiaowei Xu
Meng Jiang
Yiyu Shi
36
4
0
26 Feb 2018
BigDataBench: A Scalable and Unified Big Data and AI Benchmark Suite
Wanling Gao
Jianfeng Zhan
Lei Wang
Chunjie Luo
Daoyi Zheng
...
Hainan Ye
Haoning Tang
Zheng Cao
Shujie Zhang
Jiahui Dai
11
34
0
23 Feb 2018
SparCML: High-Performance Sparse Communication for Machine Learning
Cédric Renggli
Saleh Ashkboos
Mehdi Aghagolzadeh
Dan Alistarh
Torsten Hoefler
29
126
0
22 Feb 2018
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning
Hyeontaek Lim
D. Andersen
M. Kaminsky
21
70
0
21 Feb 2018
Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative Refinement
Jason D. Lee
Elman Mansimov
Kyunghyun Cho
DiffM
BDL
42
455
0
19 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets
Fabian Schuiki
Michael Schaffner
Frank K. Gürkaynak
Luca Benini
31
70
0
19 Feb 2018
Deep neural decoders for near term fault-tolerant experiments
C. Chamberland
Pooya Ronagh
24
82
0
18 Feb 2018
Massivizing Computer Systems: a Vision to Understand, Design, and Engineer Computer Ecosystems through and beyond Modern Distributed Systems
Alexandru Iosup
Alexandru Uta
L. Versluis
George Andreadis
Erwin Van Eyk
T. Hegeman
Sacheendra Talluri
V. V. Beek
L. Toader
GNN
26
27
0
15 Feb 2018
Security Analysis and Enhancement of Model Compressed Deep Learning Systems under Adversarial Attacks
Qi Liu
Tao Liu
Zihao Liu
Yanzhi Wang
Yier Jin
Wujie Wen
AAML
32
48
0
14 Feb 2018
Field-Programmable Deep Neural Network (DNN) Learning and Inference accelerator: a concept
L. Franca-Neto
15
1
0
14 Feb 2018
Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions
Nicolas Vasilache
O. Zinenko
Theodoros Theodoridis
Priya Goyal
Zach DeVito
William S. Moses
Sven Verdoolaege
Andrew Adams
Albert Cohen
40
432
0
13 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
41
389
0
13 Feb 2018
TVM: An Automated End-to-End Optimizing Compiler for Deep Learning
Tianqi Chen
T. Moreau
Ziheng Jiang
Lianmin Zheng
Eddie Q. Yan
...
Leyuan Wang
Yuwei Hu
Luis Ceze
Carlos Guestrin
Arvind Krishnamurthy
55
374
0
12 Feb 2018
ThUnderVolt: Enabling Aggressive Voltage Underscaling and Timing Error Resilience for Energy Efficient Deep Neural Network Accelerators
Jeff Zhang
Kartheek Rangineni
Zahra Ghodsi
S. Garg
36
118
0
11 Feb 2018
Analyzing and Mitigating the Impact of Permanent Faults on a Systolic Array Based Neural Network Accelerator
Jeff Zhang
Tianyu Gu
K. Basu
S. Garg
14
134
0
11 Feb 2018
Recent Advances in Efficient Computation of Deep Convolutional Neural Networks
Jian Cheng
Peisong Wang
Gang Li
Qinghao Hu
Hanqing Lu
32
3
0
03 Feb 2018
VIBNN: Hardware Acceleration of Bayesian Neural Networks
R. Cai
Ao Ren
Ning Liu
Caiwen Ding
Luhao Wang
Xuehai Qian
Massoud Pedram
Yanzhi Wang
BDL
46
87
0
02 Feb 2018
On Scale-out Deep Learning Training for Cloud and HPC
Srinivas Sridharan
K. Vaidyanathan
Dhiraj D. Kalamkar
Dipankar Das
Mikhail E. Smorkalov
...
Dheevatsa Mudigere
Naveen Mellempudi
Sasikanth Avancha
Bharat Kaul
Pradeep Dubey
BDL
26
30
0
24 Jan 2018
Flexible Deep Neural Network Processing
Hokchhay Tann
S. Hashemi
Sherief Reda
AI4CE
13
8
0
23 Jan 2018
In-RDBMS Hardware Acceleration of Advanced Analytics
Divya Mahajan
Joo-Young Kim
Jacob Sacks
A. Ardalan
Arun Kumar
H. Esmaeilzadeh
24
46
0
08 Jan 2018
DeepPicar: A Low-cost Deep Neural Network-based Autonomous Car
Michael Bechtel
Elise McEllhiney
Minje Kim
H. Yun
30
103
0
19 Dec 2017
TensorFlow-Serving: Flexible, High-Performance ML Serving
Christopher Olston
Noah Fiedel
Kiril Gorovoy
Jeremiah Harmsen
Li Lao
Fangwei Li
Vinu Rajashekhar
Sukriti Ramesh
Jordan Soyke
18
303
0
17 Dec 2017
A Berkeley View of Systems Challenges for AI
Ion Stoica
D. Song
Raluca A. Popa
D. Patterson
Michael W. Mahoney
...
Joseph E. Gonzalez
Ken Goldberg
A. Ghodsi
David Culler
Pieter Abbeel
24
199
0
15 Dec 2017
Deep Learning for IoT Big Data and Streaming Analytics: A Survey
M. Mohammadi
Ala I. Al-Fuqaha
Sameh Sorour
Mohsen Guizani
38
1,051
0
09 Dec 2017
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
42
1,741
0
05 Dec 2017
Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks
Hardik Sharma
Jongse Park
Naveen Suda
Liangzhen Lai
Benson Chau
Joo-Young Kim
Vikas Chandra
H. Esmaeilzadeh
MQ
32
488
0
05 Dec 2017
NEURAghe: Exploiting CPU-FPGA Synergies for Efficient and Flexible CNN Inference Acceleration on Zynq SoCs
Paolo Meloni
Alessandro Capotondi
Gianfranco Deriu
Michele Brian
Francesco Conti
D. Rossi
L. Raffo
Luca Benini
19
51
0
04 Dec 2017
Structured Deep Neural Network Pruning via Matrix Pivoting
Ranko Sredojevic
Shaoyi Cheng
Lazar Supic
R. Naous
Vladimir M. Stojanović
13
7
0
01 Dec 2017
Machine Learning and Manycore Systems Design: A Serendipitous Symbiosis
R. Kim
J. Doppa
P. Pande
Diana Marculescu
R. Marculescu
26
26
0
30 Nov 2017
TensorFlow Distributions
Joshua V. Dillon
I. Langmore
Dustin Tran
E. Brevdo
Srinivas Vasudevan
David A. Moore
Brian Patton
Alexander A. Alemi
Matt Hoffman
Rif A. Saurous
GP
46
346
0
28 Nov 2017
Recurrent Segmentation for Variable Computational Budgets
Lane T. McIntosh
Niru Maheswaranathan
David Sussillo
Jonathon Shlens
SSeg
VOS
27
20
0
28 Nov 2017
A Manifesto for Future Generation Cloud Computing: Research Directions for the Next Decade
Rajkumar Buyya
Satish Narayana
G. Casale
R. Calheiros
Yogesh L. Simmhan
...
Wanlei Zhou
Hai Jin
W. Gentzsch
Albert Y. Zomaya
Haiying Shen
AI4TS
AILaw
28
137
0
24 Nov 2017
Deep supervised learning using local errors
Hesham Mostafa
V. Ramesh
Gert Cauwenberghs
41
113
0
17 Nov 2017
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler
Yu Ji
Youhui Zhang
Wenguang Chen
Yuan Xie
27
56
0
15 Nov 2017
Chipmunk: A Systolically Scalable 0.9 mm
2
{}^2
2
, 3.08 Gop/s/mW @ 1.2 mW Accelerator for Near-Sensor Recurrent Neural Network Inference
Francesco Conti
Lukas Cavigelli
G. Paulin
Igor Susmelj
Luca Benini
11
42
0
15 Nov 2017
Deep Rewiring: Training very sparse deep networks
G. Bellec
David Kappel
Wolfgang Maass
Robert Legenstein
BDL
29
275
0
14 Nov 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation
Moritz B. Milde
Daniel Neil
Alessandro Aimar
T. Delbruck
Giacomo Indiveri
MQ
34
9
0
13 Nov 2017
DLVM: A modern compiler infrastructure for deep learning systems
Richard Wei
Lane Schwartz
Vikram S. Adve
11
58
0
08 Nov 2017
Block-Sparse Recurrent Neural Networks
Sharan Narang
Eric Undersander
G. Diamos
17
136
0
08 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks
Sanchari Sen
Shubham Jain
Swagath Venkataramani
A. Raghunathan
24
30
0
07 Nov 2017
Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks
Urs Koster
T. Webb
Xin Eric Wang
Marcel Nassar
Arjun K. Bansal
...
Luke Hornof
A. Khosrowshahi
Carey Kloss
Ruby J. Pai
N. Rao
MQ
14
261
0
06 Nov 2017
Don't Decay the Learning Rate, Increase the Batch Size
Samuel L. Smith
Pieter-Jan Kindermans
Chris Ying
Quoc V. Le
ODL
36
981
0
01 Nov 2017
HPC Cloud for Scientific and Business Applications: Taxonomy, Vision, and Research Challenges
M. Netto
R. Calheiros
Eduardo Rodrigues
R. L. F. Cunha
Rajkumar Buyya
63
72
0
24 Oct 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
D. Loroch
Norbert Wehn
Franz-Josef Pfreundt
J. Keuper
MQ
33
23
0
13 Oct 2017
A Comparative Taxonomy and Survey of Public Cloud Infrastructure Vendors
Dimitrios Sikeridis
I. Papapanagiotou
B. Rimal
M. Devetsikiotis
24
26
0
04 Oct 2017
Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design
Zhourui Song
Zhenyu Liu
Dongsheng Wang
31
41
0
22 Sep 2017
Previous
1
2
3
...
22
23
24
Next