Visualizing the Loss Landscape of Neural Nets

28 December 2017

Hao Li

Papers citing "Visualizing the Loss Landscape of Neural Nets"

50 / 1,039 papers shown

Title
CRAFT: Contextual Re-Activation of Filters for face recognition Training Aman Bhatta Domingo Mery Haiyu Wu Kevin W. Bowyer CVBM 20 2 0 29 Nov 2023
Critical Influence of Overparameterization on Sharpness-aware Minimization Sungbin Shin Dongyeop Lee Maksym Andriushchenko Namhoon Lee AAML 44 1 0 29 Nov 2023
Digital Twin-Enhanced Deep Reinforcement Learning for Resource Management in Networks Slicing Zhengming Zhang Yongming Huang Cheng Zhang Qingbi Zheng Luxi Yang Xiaohu You 24 12 0 28 Nov 2023
In Search of a Data Transformation That Accelerates Neural Field Training Junwon Seo Sangyoon Lee Kwang In Kim Jaeho Lee 44 3 0 28 Nov 2023
Should We Learn Most Likely Functions or Parameters? Shikai Qiu Tim G. J. Rudner Sanyam Kapoor Andrew Gordon Wilson 13 5 0 27 Nov 2023
I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization Mingliang Xu Jiawei Hu Mingbao Lin Yonghong Tian Rongrong Ji MQ 30 10 0 16 Nov 2023
Using Stochastic Gradient Descent to Smooth Nonconvex Functions: Analysis of Implicit Graduated Optimization with Optimal Noise Scheduling Naoki Sato Hideaki Iiduka 22 3 0 15 Nov 2023
Estimating Post-Synaptic Effects for Online Training of Feed-Forward SNNs Thomas M. Summe Clemens J. S. Schaefer Siddharth Joshi 27 1 0 07 Nov 2023
Analysis of NaN Divergence in Training Monocular Depth Estimation Model Bum Jun Kim Hyeonah Jang Sang Woo Kim 29 0 0 07 Nov 2023
Signal Processing Meets SGD: From Momentum to Filter Zhipeng Yao Guisong Chang Jiaqi Zhang Qi Zhang Dazhou Li Yu Zhang ODL 29 0 0 06 Nov 2023
Optimal Budgeted Rejection Sampling for Generative Models Alexandre Verine Muni Sreenivas Pydi Benjamin Négrevergne Y. Chevaleyre 21 3 0 01 Nov 2023
Solutions to Elliptic and Parabolic Problems via Finite Difference Based Unsupervised Small Linear Convolutional Neural Networks A. Celaya Keegan L. A. Kirk David T. Fuentes Beatrice Riviere 11 1 0 01 Nov 2023
A Path to Simpler Models Starts With Noise Lesia Semenova Harry Chen Ronald E. Parr Cynthia Rudin 41 15 0 30 Oct 2023
Balance, Imbalance, and Rebalance: Understanding Robust Overfitting from a Minimax Game Perspective Yifei Wang Liangchen Li Jiansheng Yang Zhouchen Lin Yisen Wang 31 11 0 30 Oct 2023
Power-Enhanced Residual Network for Function Approximation and Physics-Informed Inverse Problems A. Noorizadegan D. Young Benny Y. C. Hon C. S. Chen PINN 17 7 0 24 Oct 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model Kaiyan Zhang Ning Ding Biqing Qi Xuekai Zhu Xinwei Long Bowen Zhou 46 4 0 24 Oct 2023
Revisiting Deep Ensemble for Out-of-Distribution Detection: A Loss Landscape Perspective Kun Fang Qinghua Tao Xiaolin Huang Jie-jin Yang OODD 48 2 0 22 Oct 2023
Training Dynamics of Deep Network Linear Regions Ahmed Imtiaz Humayun Randall Balestriero Richard Baraniuk 36 3 0 19 Oct 2023
Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression Adam Block Dylan J. Foster Akshay Krishnamurthy Max Simchowitz Cyril Zhang 30 4 0 17 Oct 2023
Domain Generalization Using Large Pretrained Models with Mixture-of-Adapters Gyuseong Lee Wooseok Jang Jin Hyeon Kim Jaewoo Jung Seungryong Kim MoE OOD 30 2 0 17 Oct 2023
RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets Zhicheng Cai Xiaohan Ding Qiu Shen Xun Cao 35 18 0 16 Oct 2023
"Reading Between the Heat": Co-Teaching Body Thermal Signatures for Non-intrusive Stress Detection Yi Xiao Harshit Sharma Zhongyang Zhang D. Bergen-Cico Tauhidur Rahman Asif Salekin 11 2 0 15 Oct 2023
A Symmetry-Aware Exploration of Bayesian Neural Network Posteriors Olivier Laurent Emanuel Aldea Gianni Franchi BDL UQCV 22 5 0 12 Oct 2023
Parameter Efficient Multi-task Model Fusion with Partial Linearization Anke Tang Li Shen Yong Luo Yibing Zhan Han Hu Bo Du Yixin Chen Dacheng Tao MoMe 26 30 0 07 Oct 2023
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models Yi-Lin Sung Jaehong Yoon Mohit Bansal VLM 23 14 0 04 Oct 2023
Spline-based neural network interatomic potentials: blending classical and machine learning models Joshua A Vita D. Trinkle 14 2 0 04 Oct 2023
Active Learning on Neural Networks through Interactive Generation of Digit Patterns and Visual Representation D. H. Jeong Jin-Hee Cho Feng Chen A. Jøsang Soo-Yeon Ji 11 0 0 02 Oct 2023
Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification M. Milling Andreas Triantafyllopoulos Iosif Tsangko Simon Rampp F. Schlüter 27 3 0 28 Sep 2023
Deep Model Fusion: A Survey Weishi Li Yong Peng Miao Zhang Liang Ding Han Hu Li Shen FedML MoMe 33 52 0 27 Sep 2023
Neuro-Visualizer: An Auto-encoder-based Loss Landscape Visualization Method Mohannad Elhamod Anuj Karpatne 24 1 0 26 Sep 2023
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control Nate Rahn P. DÓro Harley Wiltzer Pierre-Luc Bacon Marc G. Bellemare 27 3 0 26 Sep 2023
Are Large Language Models Really Robust to Word-Level Perturbations? Haoyu Wang Guozheng Ma Cong Yu Ning Gui Linrui Zhang ... Sen Zhang Li Shen Xueqian Wang Peilin Zhao Dacheng Tao KELM 26 22 0 20 Sep 2023
Gradient constrained sharpness-aware prompt learning for vision-language models Liangchen Liu Nannan Wang Dawei Zhou Xinbo Gao Decheng Liu Xi Yang Tongliang Liu VLM 33 2 0 14 Sep 2023
Investigating the Impact of Action Representations in Policy Gradient Algorithms Jan Schneider-Barnes Pierre Schumacher Daniel Haeufle Bernhard Scholkopf Le Chen OffRL 16 1 0 13 Sep 2023
Exploring Flat Minima for Domain Generalization with Large Learning Rates Jian Zhang Lei Qi Yinghuan Shi Yang Gao 41 2 0 12 Sep 2023
Active Neural Mapping Zike Yan Haoxiang Yang H. Zha 8 20 0 30 Aug 2023
Deep Video Codec Control for Vision Models Christoph Reich Biplob K. Debnath Deep Patel Tim Prangemeier Daniel Cremers S. Chakradhar 26 1 0 30 Aug 2023
On-the-Fly Guidance Training for Medical Image Registration Yuelin Xin Yicheng Chen Shengxiang Ji Kun Han Xiaohui Xie OOD 35 1 0 29 Aug 2023
Neural Network Training Strategy to Enhance Anomaly Detection Performance: A Perspective on Reconstruction Loss Amplification Yeonghyeon Park Sungho Kang Myung Jin Kim Hyeonho Jeong H. Park Hyeong Seok Kim Juneho Yi 18 3 0 28 Aug 2023
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors Chengkun Wei Wenlong Meng Zhikun Zhang M. Chen Ming-Hui Zhao Wenjing Fang Lei Wang Zihui Zhang Wenzhi Chen AAML 29 8 0 26 Aug 2023
Synergistic Fusion of Graph and Transformer Features for Enhanced Molecular Property Prediction M. V. Sai Prakash N. Siddartha Reddy Ganesh Parab V. Varun Vishal Vaddina Saisubramaniam Gopalakrishnan AI4CE 28 3 0 25 Aug 2023
FedSOL: Stabilized Orthogonal Learning with Proximal Restrictions in Federated Learning Gihun Lee Minchan Jeong Sangmook Kim Jaehoon Oh Se-Young Yun FedML 26 8 0 24 Aug 2023
Adversarial Collaborative Filtering for Free Huiyuan Chen Xiaoting Li Vivian Lai Chin-Chia Michael Yeh Yujie Fan Yan Zheng Mahashweta Das Hao Yang AAML 20 6 0 20 Aug 2023
Latent State Models of Training Dynamics Michael Y. Hu Angelica Chen Naomi Saphra Kyunghyun Cho 35 7 0 18 Aug 2023
Towards Understanding the Generalizability of Delayed Stochastic Gradient Descent Xiaoge Deng Li Shen Shengwei Li Tao Sun Dongsheng Li Dacheng Tao 28 3 0 18 Aug 2023
Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation Shengcao Cao Mengtian Li James Hays Deva Ramanan Yu-xiong Wang Liangyan Gui VLM 26 11 0 17 Aug 2023
Membrane Potential Batch Normalization for Spiking Neural Networks Yu-Zhu Guo Yuhan Zhang Y. Chen Weihang Peng Xiaode Liu Liwen Zhang Xuhui Huang Zhe Ma AAML 32 37 0 16 Aug 2023
Unified Data-Free Compression: Pruning and Quantization without Fine-Tuning Shipeng Bai Jun Chen Xintian Shen Yixuan Qian Yong Liu MQ 24 12 0 14 Aug 2023
Spatially Varying Nanophotonic Neural Networks Kaixuan Wei Xiao Li Johannes E. Froech Praneeth Chakravarthula James E. M. Whitehead Ethan Tseng A. Majumdar Felix Heide 17 11 0 07 Aug 2023
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy Shibo Jie Haoqing Wang Zhiwei Deng 19 31 0 31 Jul 2023