Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1704.04760
Cited By
In-Datacenter Performance Analysis of a Tensor Processing Unit
16 April 2017
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
Raminder Bajwa
Sarah Bates
Suresh Bhatia
Nan Boden
Al Borchers
Rick Boyle
Pierre-luc Cantin
Clifford Chao
Chris Clark
Jeremy Coriell
Mike Daley
Matt Dau
Jeffrey Dean
Ben Gelb
Taraneh Ghaemmaghami
Rajendra Gottipati
William Gulland
Robert Hagmann
C. Richard Ho
Doug Hogberg
John Hu
R. Hundt
Dan Hurt
Julian Ibarz
A. Jaffey
Alek Jaworski
Alexander Kaplan
Harshit Khaitan
Andy Koch
Naveen Kumar
Steve Lacy
James Laudon
James Law
Diemthu Le
Chris Leary
Zhuyuan Liu
Kyle Lucke
Alan Lundin
Gordon MacKean
Adriana Maggiore
Maire Mahony
Kieran Miller
R. Nagarajan
Ravi Narayanaswami
Ray Ni
Kathy Nix
Thomas Norrie
Mark Omernick
Narayana Penukonda
Andy Phelps
Jonathan Ross
Matt Ross
Amir Salek
Emad Samadiani
Chris Severn
Gregory Sizikov
Matthew Snelham
Jed Souter
Dan Steinberg
Andy Swing
Mercedes Tan
Gregory Thorson
Bo Tian
Horia Toma
Erick Tuttle
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
Re-assign community
ArXiv
PDF
HTML
Papers citing
"In-Datacenter Performance Analysis of a Tensor Processing Unit"
50 / 1,165 papers shown
Title
Phantom: A High-Performance Computational Core for Sparse Convolutional Neural Networks
Mahmood Azhar Qureshi
Arslan Munir
35
0
0
09 Nov 2021
Confidential Machine Learning Computation in Untrusted Environments: A Systems Security Perspective
Kha Dinh Duy
Taehyun Noh
Siwon Huh
Hojoon Lee
56
9
0
05 Nov 2021
Dynamic Data Augmentation with Gating Networks for Time Series Recognition
Daisuke Oba
Shinnosuke Matsuo
Brian Kenji Iwana
AI4TS
26
1
0
05 Nov 2021
Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples
Kanghyun Choi
Deokki Hong
Noseong Park
Youngsok Kim
Jinho Lee
MQ
32
65
0
04 Nov 2021
GNNear: Accelerating Full-Batch Training of Graph Neural Networks with Near-Memory Processing
Zhe Zhou
Cong Li
Xuechao Wei
Xiaoyang Wang
Guangyu Sun
GNN
22
24
0
01 Nov 2021
Collage: Seamless Integration of Deep Learning Backends with Automatic Placement
Byungsoo Jeon
Sunghyun Park
Peiyuan Liao
Sheng Xu
Tianqi Chen
Zhihao Jia
VLM
47
4
0
01 Nov 2021
Sustainable AI: Environmental Implications, Challenges and Opportunities
Carole-Jean Wu
Ramya Raghavendra
Udit Gupta
Bilge Acun
Newsha Ardalani
...
Maximilian Balandat
Joe Spisak
R. Jain
Michael G. Rabbat
K. Hazelwood
52
389
0
30 Oct 2021
Cross-attention conformer for context modeling in speech enhancement for ASR
A. Narayanan
Chung-Cheng Chiu
Tom O'Malley
Quan Wang
Yanzhang He
32
14
0
30 Oct 2021
NxMTransformer: Semi-Structured Sparsification for Natural Language Understanding via ADMM
Connor Holmes
Minjia Zhang
Yuxiong He
Bo Wu
37
18
0
28 Oct 2021
MERCURY: Accelerating DNN Training By Exploiting Input Similarity
Vahid Janfaza
Kevin Weston
Moein Razavi
Shantanu Mandal
Farabi Mahmud
Alex Hilty
A. Muzahid
41
5
0
28 Oct 2021
Applications and Techniques for Fast Machine Learning in Science
A. Deiana
Nhan Tran
Joshua C. Agar
Michaela Blott
G. D. Guglielmo
...
Ashish Sharma
S. Summers
Pietro Vischia
J. Vlimant
Olivia Weng
19
71
0
25 Oct 2021
Physical Side-Channel Attacks on Embedded Neural Networks: A Survey
M. M. Real
Ruben Salvador
AAML
25
32
0
21 Oct 2021
A Data-Centric Optimization Framework for Machine Learning
Oliver Rausch
Tal Ben-Nun
Nikoli Dryden
Andrei Ivanov
Shigang Li
Torsten Hoefler
AI4CE
22
16
0
20 Oct 2021
Data-Driven Offline Optimization For Architecting Hardware Accelerators
Aviral Kumar
Amir Yazdanbakhsh
Milad Hashemi
Kevin Swersky
Sergey Levine
38
36
0
20 Oct 2021
Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model Hubs
Kaichao You
Yong Liu
Ziyang Zhang
Jianmin Wang
Michael I. Jordan
Mingsheng Long
122
32
0
20 Oct 2021
When in Doubt, Summon the Titans: Efficient Inference with Large Models
A. S. Rawat
Manzil Zaheer
A. Menon
Amr Ahmed
Sanjiv Kumar
25
7
0
19 Oct 2021
Energon: Towards Efficient Acceleration of Transformers Using Dynamic Sparse Attention
Zhe Zhou
Junling Liu
Zhenyu Gu
Guangyu Sun
64
43
0
18 Oct 2021
Characterizing and Improving the Resilience of Accelerators in Autonomous Robots
Deval Shah
Zihui Xue
Karthik Pattabiraman
Tor M. Aamodt
26
1
0
17 Oct 2021
Exploring Deep Neural Networks on Edge TPU
Seyedehfaezeh Hosseininoorbin
S. Layeghy
Branislav Kusy
Raja Jurdak
Marius Portmann
29
9
0
17 Oct 2021
Bandwidth Utilization Side-Channel on ML Inference Accelerators
Sarbartha Banerjee
Shijia Wei
Prakash Ramrakhyani
Mohit Tiwari
31
3
0
14 Oct 2021
SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems
Harrison Lee
Raghav Gupta
Abhinav Rastogi
Yuan Cao
Bin Zhang
Yonghui Wu
74
33
0
13 Oct 2021
An In-depth Summary of Recent Artificial Intelligence Applications in Drug Design
Yi Zhang
AI4CE
50
5
0
10 Oct 2021
Pyxis: An Open-Source Performance Dataset of Sparse Accelerators
Linghao Song
Yuze Chi
Jason Cong
21
0
0
08 Oct 2021
Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators
Yangjie Zhou
Mengtian Yang
Cong Guo
Jingwen Leng
Yun Liang
Quan Chen
Minyi Guo
Yuhao Zhu
34
34
0
08 Oct 2021
Input Length Matters: Improving RNN-T and MWER Training for Long-form Telephony Speech Recognition
Zhiyun Lu
Yanwei Pan
Thibault Doutre
Parisa Haghani
Liangliang Cao
Rohit Prabhavalkar
Chuxu Zhang
Trevor Strohman
AuLLM
83
14
0
08 Oct 2021
MAPA: Multi-Accelerator Pattern Allocation Policy for Multi-Tenant GPU Servers
K. Ranganath
Joshua D. Suetterlein
Joseph Manzano
Shuaiwen Leon Song
Daniel Wong
41
15
0
07 Oct 2021
RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU
Geonhwa Jeong
Eric Qin
A. Samajdar
C. Hughes
S. Subramoney
Hyesoon Kim
T. Krishna
65
18
0
05 Oct 2021
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
21
8
0
03 Oct 2021
SECDA: Efficient Hardware/Software Co-Design of FPGA-based DNN Accelerators for Edge Inference
Jude Haris
Perry Gibson
José Cano
Nicolas Bohm Agostini
David Kaeli
44
19
0
01 Oct 2021
Neural Network Verification in Control
M. Everett
AAML
37
16
0
30 Sep 2021
Accelerating Fully Connected Neural Network on Optical Network-on-Chip (ONoC)
Fei Dai
Yawen Chen
Haibo Zhang
Zhiyi Huang
16
5
0
30 Sep 2021
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
25
82
0
29 Sep 2021
LIBRA: Enabling Workload-aware Multi-dimensional Network Topology Optimization for Distributed Training of Large AI Models
William Won
Saeed Rashidi
Sudarshan Srinivasan
T. Krishna
AI4CE
26
8
0
24 Sep 2021
Towards Energy-Efficient and Secure Edge AI: A Cross-Layer Framework
Mohamed Bennai
Alberto Marchisio
Rachmad Vidya Wicaksana Putra
Muhammad Abdullah Hanif
49
34
0
20 Sep 2021
On the Noise Stability and Robustness of Adversarially Trained Networks on NVM Crossbars
Chun Tao
Deboleena Roy
I. Chakraborty
Kaushik Roy
AAML
43
2
0
19 Sep 2021
Exploiting Activation based Gradient Output Sparsity to Accelerate Backpropagation in CNNs
Anup Sarma
Sonali Singh
Huaipan Jiang
Ashutosh Pattnaik
Asit K. Mishra
N. Vijaykrishnan
M. Kandemir
Chita R. Das
16
5
0
16 Sep 2021
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators
Geonhwa Jeong
Gokcen Kestor
Prasanth Chatarasi
A. Parashar
Po-An Tsai
S. Rajamanickam
R. Gioiosa
T. Krishna
35
13
0
15 Sep 2021
2-in-1 Accelerator: Enabling Random Precision Switch for Winning Both Adversarial Robustness and Efficiency
Yonggan Fu
Yang Zhao
Qixuan Yu
Chaojian Li
Yingyan Lin
AAML
57
12
0
11 Sep 2021
Bootstrapped Meta-Learning
Sebastian Flennerhag
Yannick Schroecker
Tom Zahavy
Hado van Hasselt
David Silver
Satinder Singh
43
59
0
09 Sep 2021
Revisiting 3D ResNets for Video Recognition
Xianzhi Du
Yeqing Li
Huayu Chen
Rui Qian
Jing Li
Irwan Bello
56
17
0
03 Sep 2021
On the Accuracy of Analog Neural Network Inference Accelerators
T. Xiao
Ben Feinberg
C. Bennett
V. Prabhakar
Prashant Saxena
V. Agrawal
S. Agarwal
M. Marinella
30
34
0
03 Sep 2021
Evaluating the Single-Shot MultiBox Detector and YOLO Deep Learning Models for the Detection of Tomatoes in a Greenhouse
S. Magalhães
Luís Castro
Germano Moreira
F. Santos
Mário Cunha
Jorge Dias
A. Moreira
20
119
0
02 Sep 2021
Multi-model Machine Learning Inference Serving with GPU Spatial Partitioning
S. Choi
Sunho Lee
Yeonjae Kim
Jongse Park
Youngjin Kwon
Jaehyuk Huh
30
21
0
01 Sep 2021
Effective Sequence-to-Sequence Dialogue State Tracking
Jeffrey Zhao
Mahdis Mahdieh
Ye Zhang
Yuan Cao
Yonghui Wu
28
42
0
31 Aug 2021
Edge-Cloud Collaborated Object Detection via Difficult-Case Discriminator
Zhiqiang Cao
Zhijun Li
Pan Heng
Yongrui Chen
Daqi Xie
Jie Liu
30
12
0
29 Aug 2021
Power-Based Attacks on Spatial DNN Accelerators
Ge Li
Mohit Tiwari
Michael Orshansky
38
8
0
28 Aug 2021
Design and Scaffolded Training of an Efficient DNN Operator for Computer Vision on the Edge
Vinod Ganesan
Pratyush Kumar
45
2
0
25 Aug 2021
Towards Memory-Efficient Neural Networks via Multi-Level in situ Generation
Jiaqi Gu
Hanqing Zhu
Chenghao Feng
Mingjie Liu
Zixuan Jiang
Ray T. Chen
David Z. Pan
22
4
0
25 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLM
MLLM
51
782
0
24 Aug 2021
DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices
Stephan Patrick Baller
Anshul Jindal
Mohak Chadha
Michael Gerndt
22
71
0
21 Aug 2021
Previous
1
2
3
...
8
9
10
...
22
23
24
Next