ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.11054
  4. Cited By
MLIR: A Compiler Infrastructure for the End of Moore's Law

MLIR: A Compiler Infrastructure for the End of Moore's Law

25 February 2020
Chris Lattner
M. Amini
Uday Bondhugula
Albert Cohen
Andy Davis
J. Pienaar
River Riddle
T. Shpeisman
Nicolas Vasilache
O. Zinenko
    VLM
ArXivPDFHTML

Papers citing "MLIR: A Compiler Infrastructure for the End of Moore's Law"

27 / 27 papers shown
Title
OODTE: A Differential Testing Engine for the ONNX Optimizer
OODTE: A Differential Testing Engine for the ONNX Optimizer
Nikolaos Louloudakis
Ajitha Rajan
46
0
0
03 May 2025
Rulebook: bringing co-routines to reinforcement learning environments
Rulebook: bringing co-routines to reinforcement learning environments
Massimo Fioravanti
Samuele Pasini
Giovanni Agosta
33
0
0
28 Apr 2025
Neuromorphic Intermediate Representation: A Unified Instruction Set for
  Interoperable Brain-Inspired Computing
Neuromorphic Intermediate Representation: A Unified Instruction Set for Interoperable Brain-Inspired Computing
Jens Egholm Pedersen
Steven Abreu
Matthias Jobst
Gregor Lenz
Vittorio Fra
...
Gianvito Urgese
Sadasivan Shankar
Terrence C. Stewart
Jason K. Eshraghian
Sadique Sheik
34
30
0
24 Nov 2023
SimplePIM: A Software Framework for Productive and Efficient
  Processing-in-Memory
SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory
Jinfan Chen
Juan Gómez Luna
I. E. Hajj
Yu-Yin Guo
Onur Mutlu
37
19
0
03 Oct 2023
On the Tool Manipulation Capability of Open-source Large Language Models
On the Tool Manipulation Capability of Open-source Large Language Models
Qiantong Xu
Fenglu Hong
Yangqiu Song
Changran Hu
Zheng Chen
Jian Zhang
LLMAG
35
69
0
25 May 2023
Auto-Parallelizing Large Models with Rhino: A Systematic Approach on
  Production AI Platform
Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform
Shiwei Zhang
Lansong Diao
Siyu Wang
Zongyan Cao
Yiliang Gu
Chang Si
Ziji Shi
Zhen Zheng
Chuan Wu
W. Lin
AI4CE
32
4
0
16 Feb 2023
Python FPGA Programming with Data-Centric Multi-Level Design
Python FPGA Programming with Data-Centric Multi-Level Design
Johannes de Fine Licht
T. De Matteis
Tal Ben-Nun
Andreas Kuster
Oliver Rausch
Manuel Burger
Carl-Johannes Johnsen
Torsten Hoefler
26
1
0
28 Dec 2022
On Physics-Informed Neural Networks for Quantum Computers
On Physics-Informed Neural Networks for Quantum Computers
Stefano Markidis
PINN
37
18
0
28 Sep 2022
Optimizing DNN Compilation for Distributed Training with Joint OP and
  Tensor Fusion
Optimizing DNN Compilation for Distributed Training with Joint OP and Tensor Fusion
Xiaodong Yi
Shiwei Zhang
Lansong Diao
Chuan Wu
Zhen Zheng
Shiqing Fan
Siyu Wang
Jun Yang
W. Lin
49
4
0
26 Sep 2022
Special Session: Towards an Agile Design Methodology for Efficient,
  Reliable, and Secure ML Systems
Special Session: Towards an Agile Design Methodology for Efficient, Reliable, and Secure ML Systems
Shail Dave
Alberto Marchisio
Muhammad Abdullah Hanif
Amira Guesmi
Aviral Shrivastava
Ihsen Alouani
Mohamed Bennai
39
13
0
18 Apr 2022
Query Processing on Tensor Computation Runtimes
Query Processing on Tensor Computation Runtimes
Dong He
Supun Nakandala
Dalitso Banda
Rathijit Sen
Karla Saur
Kwanghyun Park
Carlo Curino
Jesús Camacho-Rodríguez
Konstantinos Karanasos
Matteo Interlandi
32
36
0
03 Mar 2022
Memory Planning for Deep Neural Networks
Memory Planning for Deep Neural Networks
Maksim Levental
35
4
0
23 Feb 2022
Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation
Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation
Jiawei Liu
Yuxiang Wei
Sen Yang
Yinlin Deng
Lingming Zhang
41
41
0
21 Feb 2022
Compiler-Driven Simulation of Reconfigurable Hardware Accelerators
Compiler-Driven Simulation of Reconfigurable Hardware Accelerators
Zhijing Li
Yuwei Ye
S. Neuendorffer
Adrian Sampson
39
3
0
01 Feb 2022
Lifting C Semantics for Dataflow Optimization
Lifting C Semantics for Dataflow Optimization
A. Calotoiu
Tal Ben-Nun
Grzegorz Kwa'sniewski
Johannes de Fine Licht
Timo Schneider
Philipp Schaad
Torsten Hoefler
27
6
0
22 Dec 2021
DNNFusion: Accelerating Deep Neural Networks Execution with Advanced
  Operator Fusion
DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion
Wei Niu
Jiexiong Guan
Yanzhi Wang
G. Agrawal
Bin Ren
AI4CE
32
147
0
30 Aug 2021
High Performance GPU Code Generation for Matrix-Matrix Multiplication
  using MLIR: Some Early Results
High Performance GPU Code Generation for Matrix-Matrix Multiplication using MLIR: Some Early Results
Navdeep Katel
Vivek Khandelwal
Uday Bondhugula
8
7
0
23 Aug 2021
Automated Backend-Aware Post-Training Quantization
Automated Backend-Aware Post-Training Quantization
Ziheng Jiang
Animesh Jain
An Liu
Josh Fromm
Chengqian Ma
Tianqi Chen
Luis Ceze
MQ
37
2
0
27 Mar 2021
DISC: A Dynamic Shape Compiler for Machine Learning Workloads
DISC: A Dynamic Shape Compiler for Machine Learning Workloads
Kai Zhu
Wenyi Zhao
Zhen Zheng
Tianyou Guo
Pengzhan Zhao
...
Junjie Bai
Jun Yang
Xiaoyong Liu
Lansong Diao
Wei Lin
35
27
0
09 Mar 2021
tf.data: A Machine Learning Data Processing Framework
tf.data: A Machine Learning Data Processing Framework
D. Murray
Jiří Šimša
Ana Klimovic
Ihor Indyk
PINN
AI4CE
LMTD
47
87
0
28 Jan 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
150
678
0
24 Jan 2021
Larq Compute Engine: Design, Benchmark, and Deploy State-of-the-Art
  Binarized Neural Networks
Larq Compute Engine: Design, Benchmark, and Deploy State-of-the-Art Binarized Neural Networks
T. Bannink
Arash Bakhtiari
Adam Hillier
Lukas Geiger
T. D. Bruin
Leon Overweel
J. Neeven
K. Helwegen
3DV
MQ
13
36
0
18 Nov 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML
  Models: A Survey and Insights
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
64
82
0
02 Jul 2020
Data Movement Is All You Need: A Case Study on Optimizing Transformers
Data Movement Is All You Need: A Case Study on Optimizing Transformers
A. Ivanov
Nikoli Dryden
Tal Ben-Nun
Shigang Li
Torsten Hoefler
36
131
0
30 Jun 2020
Dynamic Tensor Rematerialization
Dynamic Tensor Rematerialization
Marisa Kirisame
Steven Lyubomirsky
Altan Haan
Jennifer Brennan
Mike He
Jared Roesch
Tianqi Chen
Zachary Tatlock
29
93
0
17 Jun 2020
ProTuner: Tuning Programs with Monte Carlo Tree Search
ProTuner: Tuning Programs with Monte Carlo Tree Search
Ameer Haj-Ali
Hasan Genç
Qijing Huang
William S. Moses
J. Wawrzynek
Krste Asanović
Ion Stoica
36
21
0
27 May 2020
Q-EEGNet: an Energy-Efficient 8-bit Quantized Parallel EEGNet
  Implementation for Edge Motor-Imagery Brain--Machine Interfaces
Q-EEGNet: an Energy-Efficient 8-bit Quantized Parallel EEGNet Implementation for Edge Motor-Imagery Brain--Machine Interfaces
Tibor Schneider
Xiaying Wang
Michael Hersche
Lukas Cavigelli
Luca Benini
19
24
0
24 Apr 2020
1