2212.01005
Cited By
AGO: Boosting Mobile AI Inference Performance by Removing Constraints on Graph Optimization
2 December 2022
Zhiying Xu
H. Peng
Wei Wang
GNN
Papers citing "AGO: Boosting Mobile AI Inference Performance by Removing Constraints on Graph Optimization"
25 / 25 papers shown
Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning
Chengfei Lv
Chaoyue Niu
Renjie Gu
Xiaotang Jiang
Zhaode Wang
...
Guohuan Xu
Leilei Gan
Shaojie Tang
Fan Wu
Guihai Chen
MoE
LRM
23
38
0
30 May 2022
Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance
Jiarong Xing
Leyuan Wang
Shang Zhang
Jack H Chen
Ang Chen
Yibo Zhu
46
43
0
25 Oct 2021
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
Sachin Mehta
Mohammad Rastegari
ViT
271
1,257
0
05 Oct 2021
Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics
Prajjwal Bhargava
Aleksandr Drozd
Anna Rogers
114
105
0
04 Oct 2021
DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion
Wei Niu
Jiexiong Guan
Yanzhi Wang
G. Agrawal
Bin Ren
AI4CE
43
150
0
30 Aug 2021
Tuna: A Static Analysis Approach to Optimizing Deep Neural Networks
Yao Wang
Xingyu Zhou
Yanming Wang
Rui Li
Yong Wu
Vin Sharma
57
8
0
29 Apr 2021
A Deep Learning Based Cost Model for Automatic Code Optimization
Riyadh Baghdadi
Massinissa Merouani
Mohamed-Hicham Leghettas
K. Abdous
T. Arbaoui
K. Benatchba
Saman P. Amarasinghe
58
71
0
11 Apr 2021
Equality Saturation for Tensor Graph Superoptimization
Yichen Yang
Mangpo Phitchaya Phothilimthana
Y. Wang
Max Willsey
Sudip Roy
Jacques Pienaar
72
84
0
05 Jan 2021
IOS: Inter-Operator Scheduler for CNN Acceleration
Yaoyao Ding
Ligeng Zhu
Zhihao Jia
Gennady Pekhimenko
Song Han
44
73
0
02 Nov 2020
FusionStitching: Boosting Memory Intensive Computations for Deep Learning Workloads
Zhen Zheng
Pengzhan Zhao
Guoping Long
Feiwen Zhu
Kai Zhu
Wenyi Zhao
Lansong Diao
Jun Yang
Wei Lin
56
31
0
23 Sep 2020
SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud
Stefanos Laskaridis
Stylianos I. Venieris
Mario Almeida
Ilias Leontiadis
Nicholas D. Lane
57
270
0
14 Aug 2020
Ansor: Generating High-Performance Tensor Programs for Deep Learning
Lianmin Zheng
Chengfan Jia
Minmin Sun
Zhao Wu
Cody Hao Yu
...
Jun Yang
Danyang Zhuo
Koushik Sen
Joseph E. Gonzalez
Ion Stoica
124
394
0
11 Jun 2020
Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices
Byung Hoon Ahn
Jinwon Lee
J. Lin
Hsin-Pai Cheng
Jilei Hou
H. Esmaeilzadeh
87
55
0
04 Mar 2020
MNN: A Universal and Efficient Inference Engine
Xiaotang Jiang
Huan Wang
Yiliu Chen
Ziqi Wu
Lichuan Wang
...
Zongyang Cui
Yuezhi Cai
Tianhang Yu
Chengfei Lv
Zhihua Wu
54
154
0
27 Feb 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
326
42,282
0
03 Dec 2019
Well-Read Students Learn Better: On the Importance of Pre-training Compact Models
Iulia Turc
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
57
224
0
23 Aug 2019
Relay: A New IR for Machine Learning Frameworks
Jared Roesch
Steven Lyubomirsky
Logan Weber
Josh Pollock
Marisa Kirisame
Tianqi Chen
Zachary Tatlock
51
105
0
26 Sep 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
112
3,004
0
31 Jul 2018
ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design
Ningning Ma
Xiangyu Zhang
Haitao Zheng
Jian Sun
145
4,969
0
30 Jul 2018
Learning to Optimize Tensor Programs
Tianqi Chen
Lianmin Zheng
Eddie Q. Yan
Ziheng Jiang
T. Moreau
Luis Ceze
Carlos Guestrin
Arvind Krishnamurthy
63
401
0
21 May 2018
Tiramisu: A Polyhedral Compiler for Expressing Fast and Portable Code
Riyadh Baghdadi
Jessica Ray
Malek Ben Romdhane
Emanuele Del Sozzo
Abdurrahman Akkas
Yunming Zhang
Patricia Suriana
Shoaib Kamil
Saman P. Amarasinghe
36
257
0
27 Apr 2018
Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions
Nicolas Vasilache
O. Zinenko
Theodoros Theodoridis
Priya Goyal
Zach DeVito
William S. Moses
Sven Verdoolaege
Andrew Adams
Albert Cohen
69
434
0
13 Feb 2018
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
1.1K
20,781
0
17 Apr 2017
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Zhiwen Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zhang
GNN
AI4CE
377
18,331
0
27 May 2016
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
F. Iandola
Song Han
Matthew W. Moskewicz
Khalid Ashraf
W. Dally
Kurt Keutzer
132
7,464
0
24 Feb 2016