The Description Length of Deep Learning Models

20 February 2018

Papers citing "The Description Length of Deep Learning Models"

30 / 30 papers shown

Title
Identifying Causal Direction via Variational Bayesian Compression Quang-Duy Tran Bao Duong Phuoc Nguyen Thin Nguyen CML 34 0 0 12 May 2025
Minimum Description Length of a Spectrum Variational Autoencoder: A Theory Canlin Zhang Xiuwen Liu 45 0 0 01 Apr 2025
A Complexity-Based Theory of Compositionality Eric Elmoznino Thomas Jiralerspong Yoshua Bengio Guillaume Lajoie CoGe 64 4 0 18 Oct 2024
In-context learning and Occam's razor Eric Elmoznino Tom Marty Tejas Kasetty Léo Gagnon Sarthak Mittal Mahan Fathi Dhanya Sridhar Guillaume Lajoie 42 1 0 17 Oct 2024
Implications of Annotation Artifacts in Edge Probing Test Datasets Sagnik Ray Choudhury Jushaan Kalra 16 0 0 20 Oct 2023
Bridging Information-Theoretic and Geometric Compression in Language Models Emily Cheng Corentin Kervadec Marco Baroni 34 17 0 20 Oct 2023
Language Modeling Is Compression Grégoire Delétang Anian Ruoss Paul-Ambroise Duquenne Elliot Catt Tim Genewein ... Wenliang Kevin Li Matthew Aitchison Laurent Orseau Marcus Hutter J. Veness AI4CE 37 131 0 19 Sep 2023
What is the best recipe for character-level encoder-only modelling? Kris Cao 36 2 0 09 May 2023
Mathematical Challenges in Deep Learning V. Nia Guojun Zhang I. Kobyzev Michael R. Metel Xinlin Li ... S. Hemati M. Asgharian Linglong Kong Wulong Liu Boxing Chen AI4CE VLM 37 1 0 24 Mar 2023
Efficient and Effective Methods for Mixed Precision Neural Network Quantization for Faster, Energy-efficient Inference Deepika Bablani J. McKinstry S. K. Esser R. Appuswamy D. Modha MQ 23 4 0 30 Jan 2023
Self-Adaptive In-Context Learning: An Information Compression Perspective for In-Context Example Selection and Ordering Zhiyong Wu Yaoxiang Wang Jiacheng Ye Lingpeng Kong 41 119 0 20 Dec 2022
Weight Fixing Networks Christopher Subia-Waud S. Dasmahapatra MQ 19 2 0 24 Oct 2022
Sequential Learning Of Neural Networks for Prequential MDL J. Bornschein Yazhe Li Marcus Hutter AI4TS 27 6 0 14 Oct 2022
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data Ching-Yun Ko Pin-Yu Chen Jeet Mohapatra Payel Das Lucani E. Daniel 30 3 0 06 Oct 2022
Conformal Methods for Quantifying Uncertainty in Spatiotemporal Data: A Survey S. Sun AI4CE 38 10 0 08 Sep 2022
Minimum Description Length Control Theodore H. Moskovitz Ta-Chu Kao M. Sahani M. Botvinick 26 1 0 17 Jul 2022
Sparse Double Descent: Where Network Pruning Aggravates Overfitting Zhengqi He Zeke Xie Quanzhi Zhu Zengchang Qin 77 27 0 17 Jun 2022
On the Power-Law Hessian Spectrums in Deep Learning Zeke Xie Qian-Yuan Tang Yunfeng Cai Mingming Sun P. Li ODL 42 9 0 31 Jan 2022
Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks Tolga Birdal Aaron Lou Leonidas J. Guibas Umut cSimcsekli 30 61 0 25 Nov 2021
Representation Edit Distance as a Measure of Novelty J. Alspector 30 6 0 04 Nov 2021
Causal Direction of Data Collection Matters: Implications of Causal and Anticausal Learning for NLP Zhijing Jin Julius von Kügelgen Jingwei Ni Tejas Vaidhya Ayush Kaushal Mrinmaya Sachan Bernhard Schoelkopf CML 38 30 0 07 Oct 2021
A Bayesian Framework for Information-Theoretic Probing Tiago Pimentel Ryan Cotterell 28 24 0 08 Sep 2021
What Is Considered Complete for Visual Recognition? Lingxi Xie Xiaopeng Zhang Longhui Wei Jianlong Chang Qi Tian VLM 23 4 0 28 May 2021
Revisiting Self-Supervised Monocular Depth Estimation Ue-Hwan Kim Jong-Hwan Kim SSL MDE 36 7 0 23 Mar 2021
Small Data, Big Decisions: Model Selection in the Small-Data Regime J. Bornschein Francesco Visin Simon Osindero 15 36 0 26 Sep 2020
Evaluating representations by the complexity of learning low-loss predictors William F. Whitney M. Song David Brandfonbrener Jaan Altosaar Kyunghyun Cho 25 23 0 15 Sep 2020
Information-Theoretic Probing with Minimum Description Length Elena Voita Ivan Titov 21 270 0 27 Mar 2020
oLMpics -- On what Language Model Pre-training Captures Alon Talmor Yanai Elazar Yoav Goldberg Jonathan Berant LRM 31 300 0 31 Dec 2019
Weight Agnostic Neural Networks Adam Gaier David R Ha OOD 35 239 0 11 Jun 2019
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers Baihan Lin 16 2 0 27 Feb 2019