2309.07181
The Grand Illusion: The Myth of Software Portability and Implications for ML Progress
12 September 2023
Fraser Mince
Dzung Dinh
Jonas Kgomo
Neil Thompson
Sara Hooker
Papers citing
"The Grand Illusion: The Myth of Software Portability and Implications for ML Progress"
15 / 15 papers shown
OPT: Open Pre-trained Transformer Language Models
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
...
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
292
3,634
0
02 May 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
426
6,202
0
05 Apr 2022
Pathways: Asynchronous Distributed Dataflow for ML
P. Barham
Aakanksha Chowdhery
J. Dean
Sanjay Ghemawat
Steven Hand
...
Parker Schuh
Ryan Sepassi
Laurent El Shafey
C. A. Thekkath
Yonghui Wu
GNN
MoE
98
129
0
23 Mar 2022
The De-democratization of AI: Deep Learning and the Compute Divide in Artificial Intelligence Research
N. Ahmed
Muntasir Wahed
49
110
0
22 Oct 2020
The Hardware Lottery
Sara Hooker
63
209
0
14 Sep 2020
Daydream: Accurately Estimating the Efficacy of Optimizations for DNN Training
Hongyu Zhu
Amar Phanishayee
Gennady Pekhimenko
98
50
0
05 Jun 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
534
4,773
0
23 Jan 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
384
42,299
0
03 Dec 2019
MLPerf Inference Benchmark
Vijayarāghava Reḍḍī
C. Cheng
David Kanter
Pete H Mattson
Guenther Schmuelling
...
Bing Yu
George Y. Yuan
Aaron Zhong
P. Zhang
Yuchen Zhou
86
497
0
06 Nov 2019
Chainer: A Deep Learning Framework for Accelerating the Research Cycle
Seiya Tokui
Ryosuke Okuta
Takuya Akiba
Yusuke Niitani
Toru Ogawa
Shunta Saito
Shuji Suzuki
Kota Uenishi
Brian K. Vogel
Hiroyuki Yamazaki Vincent
BDL
AI4CE
44
130
0
01 Aug 2019
Dynamic Routing Between Capsules
S. Sabour
Nicholas Frosst
Geoffrey E. Hinton
157
4,589
0
26 Oct 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
83
1,139
0
06 Sep 2017
In-Datacenter Performance Analysis of a Tensor Processing Unit
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
...
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
221
4,626
0
16 Apr 2017
Theano: A Python framework for fast computation of mathematical expressions
The Theano Development Team
Rami Al-Rfou
Guillaume Alain
Amjad Almahairi
Christof Angermüller
...
Kelvin Xu
Lijun Xue
Li Yao
Saizheng Zhang
Ying Zhang
180
2,339
0
09 May 2016
MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems
Tianqi Chen
Mu Li
Yutian Li
Min Lin
Naiyan Wang
Minjie Wang
Tianjun Xiao
Bing Xu
Chiyuan Zhang
Zheng Zhang
186
2,244
0
03 Dec 2015