1704.04760
Cited By
In-Datacenter Performance Analysis of a Tensor Processing Unit
16 April 2017
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
Raminder Bajwa
Sarah Bates
Suresh Bhatia
Nan Boden
Al Borchers
Rick Boyle
Pierre-luc Cantin
Clifford Chao
Chris Clark
Jeremy Coriell
Mike Daley
Matt Dau
Jeffrey Dean
Ben Gelb
Taraneh Ghaemmaghami
Rajendra Gottipati
William Gulland
Robert Hagmann
C. Richard Ho
Doug Hogberg
John Hu
R. Hundt
Dan Hurt
Julian Ibarz
A. Jaffey
Alek Jaworski
Alexander Kaplan
Harshit Khaitan
Andy Koch
Naveen Kumar
Steve Lacy
James Laudon
James Law
Diemthu Le
Chris Leary
Zhuyuan Liu
Kyle Lucke
Alan Lundin
Gordon MacKean
Adriana Maggiore
Maire Mahony
Kieran Miller
R. Nagarajan
Ravi Narayanaswami
Ray Ni
Kathy Nix
Thomas Norrie
Mark Omernick
Narayana Penukonda
Andy Phelps
Jonathan Ross
Matt Ross
Amir Salek
Emad Samadiani
Chris Severn
Gregory Sizikov
Matthew Snelham
Jed Souter
Dan Steinberg
Andy Swing
Mercedes Tan
Gregory Thorson
Bo Tian
Horia Toma
Erick Tuttle
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
Papers citing "In-Datacenter Performance Analysis of a Tensor Processing Unit"
50 / 1,165 papers shown
Title
Pushing the boundaries of parallel Deep Learning -- A practical approach
Paolo Viviani
M. Drocco
Marco Aldinucci
OOD
36
0
0
25 Jun 2018
The Temporal Singularity: time-accelerated simulated civilizations and their implications
G. Spigler
3DGS
AI4CE
9
1
0
22 Jun 2018
Inference of Quantized Neural Networks on Heterogeneous All-Programmable Devices
Thomas B. Preußer
Giulio Gambardella
Nicholas J. Fraser
Michaela Blott
MQ
32
41
0
21 Jun 2018
Rethinking Machine Learning Development and Deployment for Edge Devices
Liangzhen Lai
Naveen Suda
11
10
0
20 Jun 2018
Forest Packing: Fast, Parallel Decision Forests
J. Browne
Tyler M. Tomita
Disa Mhembere
Randal C. Burns
Joshua T. Vogelstein
17
16
0
19 Jun 2018
Continuous-variable quantum neural networks
N. Killoran
T. Bromley
J. M. Arrazola
Maria Schuld
N. Quesada
S. Lloyd
GNN
9
351
0
18 Jun 2018
Partitioning Compute Units in CNN Acceleration for Statistical Memory Traffic Shaping
Daejin Jung
Sunjung Lee
Wonjong Rhee
Jung Ho Ahn
22
8
0
18 Jun 2018
Resource-Efficient Neural Architect
Yanqi Zhou
S. Ebrahimi
Sercan Ö. Arik
Haonan Yu
Hairong Liu
G. Diamos
22
64
0
12 Jun 2018
Energy-Constrained Compression for Deep Neural Networks via Weighted Sparse Projection and Layer Input Masking
Haichuan Yang
Yuhao Zhu
Ji Liu
CVBM
19
36
0
12 Jun 2018
Smallify: Learning Network Size while Training
Guillaume Leclerc
Manasi Vartak
Raul Castro Fernandez
Tim Kraska
Samuel Madden
14
13
0
10 Jun 2018
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark
Cody Coleman
Daniel Kang
Deepak Narayanan
Luigi Nardi
Tian Zhao
Jian Zhang
Peter Bailis
K. Olukotun
Christopher Ré
Matei A. Zaharia
13
117
0
04 Jun 2018
Between Progress and Potential Impact of AI: the Neglected Dimensions
Fernando Martínez-Plumed
S. Avin
Miles Brundage
Allan Dafoe
Seán Ó hÉigeartaigh
José Hernández-Orallo
33
3
0
02 Jun 2018
Training LSTM Networks with Resistive Cross-Point Devices
Tayfun Gokmen
Malte J. Rasch
W. Haensch
8
45
0
01 Jun 2018
Interpreting Deep Learning: The Machine Learning Rorschach Test?
Adam S. Charles
AAML
HAI
AI4CE
24
9
0
01 Jun 2018
Channel Gating Neural Networks
Weizhe Hua
Yuan Zhou
Christopher De Sa
Zhiru Zhang
G. E. Suh
15
180
0
29 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
27
44
0
22 May 2018
Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit
Seyed Hamed Fatemi Langroudi
Tej Pandit
Dhireesha Kudithipudi
MQ
19
41
0
22 May 2018
Scanner: Efficient Video Analysis at Scale
Alex Poms
Will Crichton
Pat Hanrahan
Kayvon Fatahalian
24
57
0
18 May 2018
Hu-Fu: Hardware and Software Collaborative Attack Framework against Neural Networks
Wenshuo Li
Jincheng Yu
Xuefei Ning
Pengjun Wang
Qi Wei
Yu Wang
Huazhong Yang
AAML
39
61
0
14 May 2018
An O(N) Sorting Algorithm: Machine Learning Sort
Hanqing Zhao
Yuehan Luo
13
2
0
11 May 2018
GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks
Amir Yazdanbakhsh
Hajar Falahati
Philip J. Wolfe
K. Samadi
N. Kim
H. Esmaeilzadeh
30
71
0
10 May 2018
Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks
Charles Eckert
Xiaowei Wang
Jingcheng Wang
Arun K. Subramaniyan
R. Iyer
D. Sylvester
D. Blaauw
R. Das
MQ
13
334
0
09 May 2018
Understanding Reuse, Performance, and Hardware Cost of DNN Dataflows: A Data-Centric Approach Using MAESTRO
Hyoukjun Kwon
Prasanth Chatarasi
Michael Pellauer
A. Parashar
Vivek Sarkar
T. Krishna
19
10
0
04 May 2018
Dynamic Control Flow in Large-Scale Machine Learning
Yuan Yu
Martín Abadi
P. Barham
E. Brevdo
M. Burrows
...
Michael Isard
M. Kudlur
R. Monga
D. Murray
Xiaoqiang Zheng
AI4CE
30
106
0
04 May 2018
Ultra Power-Efficient CNN Domain Specific Accelerator with 9.3TOPS/Watt for Mobile and Embedded Applications
Baohua Sun
L. Yang
Patrick Dong
Wenhan Zhang
Jason Dong
Charles Young
39
34
0
30 Apr 2018
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation
Mengzhao Chen
Orhan Firat
Ankur Bapna
Melvin Johnson
Wolfgang Macherey
...
Niki Parmar
M. Schuster
Zhifeng Chen
Yonghui Wu
Macduff Hughes
AIMat
21
457
0
26 Apr 2018
Value-aware Quantization for Training and Inference of Neural Networks
Eunhyeok Park
S. Yoo
Peter Vajda
MQ
14
158
0
20 Apr 2018
Co-Design of Deep Neural Nets and Neural Net Accelerators for Embedded Vision Applications
K. Kwon
Alon Amid
A. Gholami
Bichen Wu
Krste Asanović
Kurt Keutzer
3DV
OOD
24
22
0
20 Apr 2018
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition
Kartik Hegde
Jiyong Yu
R. Agrawal
Mengjia Yan
Michael Pellauer
Christopher W. Fletcher
22
165
0
18 Apr 2018
Mage: Online Interference-Aware Scheduling in Multi-Scale Heterogeneous Systems
Francisco Romero
Christina Delimitrou
29
2
0
17 Apr 2018
DPRed: Making Typical Activation and Weight Values Matter In Deep Learning Computing
A. Delmas
Sayeh Sharify
Patrick Judd
Kevin Siu
Milos Nikolic
Andreas Moshovos
MQ
24
3
0
17 Apr 2018
Training DNNs with Hybrid Block Floating Point
M. Drumond
Tao R. Lin
Martin Jaggi
Babak Falsafi
25
95
0
04 Apr 2018
Euphrates: Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision
Yuhao Zhu
A. Samajdar
Matthew Mattina
P. Whatmough
32
87
0
29 Mar 2018
Structured Weight Matrices-Based Hardware Accelerators in Deep Neural Networks: FPGAs and ASICs
Caiwen Ding
Ao Ren
Geng Yuan
Xiaolong Ma
Jiayu Li
Ning Liu
Bo Yuan
Yanzhi Wang
20
23
0
28 Mar 2018
Latency and Throughput Characterization of Convolutional Neural Networks for Mobile Computer Vision
Jussi Hanhirova
Teemu Kämäräinen
S. Seppälä
M. Siekkinen
V. Hirvisalo
Antti Ylä-Jääski
26
90
0
26 Mar 2018
SqueezeNext: Hardware-Aware Neural Network Design
A. Gholami
K. Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter H. Jin
Sicheng Zhao
Kurt Keutzer
22
295
0
23 Mar 2018
A neural network memory prefetcher using semantic locality
L. Peled
U. Weiser
Yoav Etsion
6
41
0
19 Mar 2018
EVA²: Exploiting Temporal Redundancy in Live Computer Vision
Mark Buckler
Philip Bedoukian
Suren Jayasuriya
Adrian Sampson
39
75
0
16 Mar 2018
TBD: Benchmarking and Analyzing Deep Neural Network Training
Hongyu Zhu
Mohamed Akrout
Bojian Zheng
Andrew Pelegris
Amar Phanishayee
Bianca Schroeder
Gennady Pekhimenko
25
80
0
16 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
19
184
0
15 Mar 2018
Deep k-Nearest Neighbors: Towards Confident, Interpretable and Robust Deep Learning
Nicolas Papernot
Patrick McDaniel
OOD
AAML
13
503
0
13 Mar 2018
CuLDA_CGS: Solving Large-scale LDA Problems on GPUs
Xiaolong Xie
Yun Liang
Xiuhong Li
Wei Tan
16
8
0
13 Mar 2018
Flipout: Efficient Pseudo-Independent Weight Perturbations on Mini-Batches
Yeming Wen
Paul Vicol
Jimmy Ba
Dustin Tran
Roger C. Grosse
BDL
22
307
0
12 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
45
1,306
0
12 Mar 2018
NVIDIA Tensor Core Programmability, Performance & Precision
Stefano Markidis
Steven W. D. Chien
Erwin Laure
Ivy Bo Peng
Jeffrey S. Vetter
6
366
0
11 Mar 2018
Efficient FPGA Implementation of Conjugate Gradient Methods for Laplacian System using HLS
Sahithi Rampalli
N. Sehgal
Ishita Bindlish
Tanya Tyagi
Pawan Kumar
27
4
0
10 Mar 2018
Bit-Tactical: Exploiting Ineffectual Computations in Convolutional Neural Networks: Which, Why, and How
A. Delmas
Patrick Judd
Dylan Malone Stuart
Zissis Poulos
Mostafa Mahmoud
Sayeh Sharify
Milos Nikolic
Andreas Moshovos
24
24
0
09 Mar 2018
Solving Fourier ptychographic imaging problems via neural network modeling and TensorFlow
Shaowei Jiang
K. Guo
Jun Liao
G. Zheng
16
95
0
09 Mar 2018
High-Accuracy Low-Precision Training
Christopher De Sa
Megan Leszczynski
Jian Zhang
Alana Marzoev
Christopher R. Aberger
K. Olukotun
Christopher Ré
21
109
0
09 Mar 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine
Renzo Andri
Lukas Cavigelli
D. Rossi
Luca Benini
MQ
24
19
0
05 Mar 2018