Title
CiMNet: Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware Souvik Kundu Anthony Sarah Vinay Joshi O. J. Omer S. Subramoney 66 0 0 19 Feb 2024
Accelerating Sparse DNNs Based on Tiled GEMM Cong Guo Fengchen Xue Jingwen Leng Yuxian Qiu Yue Guan Weihao Cui Quan Chen Minyi Guo 66 11 0 16 Feb 2024
A Precision-Optimized Fixed-Point Near-Memory Digital Processing Unit for Analog In-Memory Computing Elena Ferro A. Vasilopoulos Corey Lammie Manuel Le Gallo Luca Benini I. Boybat Abu Sebastian 27 3 0 12 Feb 2024
Let Your Graph Do the Talking: Encoding Structured Data for LLMs Bryan Perozzi Bahare Fatemi Dustin Zelle Anton Tsitsulin Mehran Kazemi Rami Al-Rfou Jonathan J. Halcrow GNN 82 69 0 08 Feb 2024
Training DNN Models over Heterogeneous Clusters with Optimal Performance Chengyi Nie Jessica Maghakian Zhenhua Liu 35 0 0 07 Feb 2024
Expediting In-Network Federated Learning by Voting-Based Consensus Model Compression Xiaoxin Su Yipeng Zhou Laizhong Cui Song Guo FedML 53 3 0 06 Feb 2024
HEANA: A Hybrid Time-Amplitude Analog Optical Accelerator with Flexible Dataflows for Energy-Efficient CNN Inference Sairam Sri Vatsavai Venkata Sai Praneeth Karempudi Ishan G. Thakkar 60 0 0 05 Feb 2024
A Comparative Analysis of Microrings Based Incoherent Photonic GEMM Accelerators Sairam Sri Vatsavai Venkata Sai Praneeth Karempudi Oluwaseun Adewunmi Alo Ishan G. Thakkar 50 2 0 05 Feb 2024
ClipFormer: Key-Value Clipping of Transformers on Memristive Crossbars for Write Noise Mitigation Abhiroop Bhattacharjee Abhishek Moitra Priyadarshini Panda CLIP 69 6 0 04 Feb 2024
Data-Oblivious ML Accelerators using Hardware Security Extensions Hossam ElAtali John Z. Jekel Lachlan J. Gunn N. Asokan 54 0 0 29 Jan 2024
Digital-analog hybrid matrix multiplication processor for optical neural networks Xiansong Meng Deming Kong K. Kim Qiuchi Li Po Dong Ingemar J. Cox Christina Lioma Hao Hu 28 0 0 26 Jan 2024
PartIR: Composing SPMD Partitioning Strategies for Machine Learning Sami Alabed Daniel Belov Bart Chrzaszcz Juliana Franco Dominik Grewe ... Michael Schaarschmidt Timur Sitdikov Agnieszka Swietlik Dimitrios Vytiniotis Joel Wee 94 3 0 20 Jan 2024
Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization K. Balaskas Andreas Karatzas Christos Sad K. Siozios Iraklis Anagnostopoulos Georgios Zervakis Jörg Henkel MQ 72 11 0 23 Dec 2023
Attention, Distillation, and Tabularization: Towards Practical Neural Network-Based Prefetching Pengmiao Zhang Neelesh Gupta Rajgopal Kannan Viktor K. Prasanna 69 3 0 23 Dec 2023
Experimental demonstration of magnetic tunnel junction-based computational random-access memory Yang Lv Brandon R. Zink Robert P. Bloom Husrev Cilasun Pravin Khanal ... Ali T. Habiboglu Weigang Wang S. Sapatnekar Ulya R. Karpuzcu Jian-Ping Wang 23 9 0 21 Dec 2023
Muchisim: A Simulation Framework for Design Exploration of Multi-Chip Manycore Systems Marcelo Orenes-Vera Esin Tureci M. Martonosi D. Wentzlaff 54 9 0 15 Dec 2023
Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding Talfan Evans Shreya Pathak Hamza Merzic Jonathan Schwarz Ryutaro Tanno Olivier J. Hénaff 80 17 0 08 Dec 2023
Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor Collections Marcel Wagenlander Guo Li Bo Zhao Kai Zou Peter R. Pietzuch 96 7 0 08 Dec 2023
On The Fairness Impacts of Hardware Selection in Machine Learning Sree Harsha Nelaturu Nishaanth Kanna Ravichandran Cuong Tran Sara Hooker Ferdinando Fioretto 83 3 0 06 Dec 2023
The Landscape of Modern Machine Learning: A Review of Machine, Distributed and Federated Learning Omer Subasi Oceane Bel Joseph Manzano Kevin J. Barker FedML OOD PINN 91 2 0 05 Dec 2023
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models Yushi Hu Otilia Stretcu Chun-Ta Lu Krishnamurthy Viswanathan Kenji Hata Enming Luo Ranjay Krishna Ariel Fuxman VLM LRM MLLM 126 37 0 05 Dec 2023
Using Large Language Models to Accelerate Communication for Users with Severe Motor Impairments Shanqing Cai Subhashini Venugopalan Katie Seaver Xiang Xiao Katrin Tomanek ... Daniel E Vance Blair Casey Steve M. Gleason Philip Q. Nelson Michael P. Brenner 63 7 0 03 Dec 2023
Monitor Placement for Fault Localization in Deep Neural Network Accelerators Wei-Kai Liu 50 0 0 28 Nov 2023
Tascade: Hardware Support for Atomic-free, Asynchronous and Efficient Reduction Trees Marcelo Orenes-Vera Esin Tureci D. Wentzlaff M. Martonosi 37 2 0 27 Nov 2023
Learning to Skip for Language Modeling Dewen Zeng Nan Du Tao Wang Yuanzhong Xu Tao Lei Zhifeng Chen Claire Cui 68 12 0 26 Nov 2023
Large Language Models in Law: A Survey Jinqi Lai Wensheng Gan Jiayang Wu Zhenlian Qi Philip S. Yu ELM AILaw 115 91 0 26 Nov 2023
Locally Optimal Descent for Dynamic Stepsize Scheduling Gilad Yehudai Alon Cohen Amit Daniely Yoel Drori Tomer Koren Mariano Schain 91 0 0 23 Nov 2023
REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints Francesco Corti Balz Maag Joachim Schauer U. Pferschy O. Saukh 102 2 0 22 Nov 2023
Fast Inner-Product Algorithms and Architectures for Deep Neural Network Accelerators Trevor E. Pogue N. Nicolici 53 3 0 20 Nov 2023
Tensor-Aware Energy Accounting Timur Babakol Yu David Liu 38 4 0 19 Nov 2023
DLAS: An Exploration and Assessment of the Deep Learning Acceleration Stack Perry Gibson José Cano Elliot J. Crowley Amos Storkey Michael F. P. O'Boyle 70 1 0 15 Nov 2023
Harnessing Manycore Processors with Distributed Memory for Accelerated Training of Sparse and Recurrent Models Jan Finkbeiner Thomas Gmeinder M. Pupilli A. Titterton Emre Neftci 83 3 0 07 Nov 2023
Practical Performance Guarantees for Pipelined DNN Inference Aaron Archer Matthew Fahrbach Kuikui Liu Prakash Prabhu 45 0 0 07 Nov 2023
Remaining useful life prediction of Lithium-ion batteries using spatio-temporal multimodal attention networks Sungho Suh D. Mittal Hymalai Bello Bo Zhou M. Jha P. Lukowicz 32 6 0 29 Oct 2023
Restoring the Broken Covenant Between Compilers and Deep Learning Accelerators Sean Kinzer Soroush Ghodrati R. Mahapatra Byung Hoon Ahn Edwin Mascarenhas Xiaolong Li J. Matai Liang Zhang H. Esmaeilzadeh 38 2 0 27 Oct 2023
GEVO-ML: Optimizing Machine Learning Code with Evolutionary Computation Jhe-Yu Liou Stephanie Forrest Carole-Jean Wu VLM 58 0 0 16 Oct 2023
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models Wenqi Jiang Marco Zeller R. Waleffe Torsten Hoefler Gustavo Alonso 128 19 0 15 Oct 2023
Ultima: Robust and Tail-Optimal AllReduce for Distributed Deep Learning in the Cloud Ertza Warraich Omer Shabtai Khalid Manaa S. Vargaftik Y. Piasetzky Matty Kadosh Lalith Suresh Muhammad Shahbaz 37 1 0 10 Oct 2023
Accelerating Machine Learning Primitives on Commodity Hardware R. Snytsar 38 0 0 08 Oct 2023
mlirSynth: Automatic, Retargetable Program Raising in Multi-Level IR using Program Synthesis Alexander Brauckmann Elizabeth Polgreen Tobias Grosser Michael F. P. O'Boyle 32 2 0 06 Oct 2023
MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems Samuel Hsia Alicia Golden Bilge Acun Newsha Ardalani Zach DeVito Gu-Yeon Wei David Brooks Carole-Jean Wu MoE 122 9 0 04 Oct 2023
Photonic Accelerators for Image Segmentation in Autonomous Driving and Defect Detection Lakshmi Nair David Widemann Brad Turcott Nick Moore Alexandra Wleklinski D. Bunandar Ioannis Papavasileiou Shihu Wang Eric Logan 78 0 0 28 Sep 2023
Transformer-VQ: Linear-Time Transformers via Vector Quantization Albert Mohwald 107 17 0 28 Sep 2023
Small-scale proxies for large-scale Transformer training instabilities Mitchell Wortsman Peter J. Liu Lechao Xiao Katie Everett A. Alemi ... Jascha Narain Sohl-Dickstein Kelvin Xu Jaehoon Lee Justin Gilmer Simon Kornblith 111 99 0 25 Sep 2023
Probabilistic Weight Fixing: Large-scale training of neural network weight uncertainties for quantization Christopher Subia-Waud S. Dasmahapatra UQCV MQ 56 1 0 24 Sep 2023
Efficient N:M Sparse DNN Training Using Algorithm, Architecture, and Dataflow Co-Design Chao Fang Wei Sun Aojun Zhou Zhongfeng Wang 69 14 0 22 Sep 2023
A Machine Learning-oriented Survey on Tiny Machine Learning Luigi Capogrosso Federico Cunico D. Cheng Franco Fummi Marco Cristani SyDa MU 106 44 0 21 Sep 2023
Logic Design of Neural Networks for High-Throughput and Low-Power Applications Kangwei Xu Grace Li Zhang Ulf Schlichtmann Bing Li 50 3 0 19 Sep 2023
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models Guanlong Zhao Yongqiang Wang Jason W. Pelecanos Yu Zhang Hank Liao Yiling Huang Han Lu Quan Wang 83 4 0 14 Sep 2023
Autotuning Apache TVM-based Scientific Applications Using Bayesian Optimization Xingfu Wu P. Paramasivam Valerie Taylor 60 4 0 13 Sep 2023