ResearchTrend.AI
Visualizing the Loss Landscape of Neural Nets

28 December 2017
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
arXiv: 1712.09913

Papers citing "Visualizing the Loss Landscape of Neural Nets"

50 / 1,039 papers shown
Title
CRAFT: Contextual Re-Activation of Filters for face recognition Training
Aman Bhatta
Domingo Mery
Haiyu Wu
Kevin W. Bowyer
CVBM
20
2
0
29 Nov 2023
Critical Influence of Overparameterization on Sharpness-aware Minimization
Sungbin Shin
Dongyeop Lee
Maksym Andriushchenko
Namhoon Lee
AAML
44
1
0
29 Nov 2023
Digital Twin-Enhanced Deep Reinforcement Learning for Resource Management in Networks Slicing
Zhengming Zhang
Yongming Huang
Cheng Zhang
Qingbi Zheng
Luxi Yang
Xiaohu You
24
12
0
28 Nov 2023
In Search of a Data Transformation That Accelerates Neural Field Training
Junwon Seo
Sangyoon Lee
Kwang In Kim
Jaeho Lee
44
3
0
28 Nov 2023
Should We Learn Most Likely Functions or Parameters?
Shikai Qiu
Tim G. J. Rudner
Sanyam Kapoor
Andrew Gordon Wilson
13
5
0
27 Nov 2023
I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization
Mingliang Xu
Jiawei Hu
Mingbao Lin
Yonghong Tian
Rongrong Ji
MQ
30
10
0
16 Nov 2023
Using Stochastic Gradient Descent to Smooth Nonconvex Functions: Analysis of Implicit Graduated Optimization with Optimal Noise Scheduling
Naoki Sato
Hideaki Iiduka
22
3
0
15 Nov 2023
Estimating Post-Synaptic Effects for Online Training of Feed-Forward SNNs
Thomas M. Summe
Clemens J. S. Schaefer
Siddharth Joshi
27
1
0
07 Nov 2023
Analysis of NaN Divergence in Training Monocular Depth Estimation Model
Bum Jun Kim
Hyeonah Jang
Sang Woo Kim
29
0
0
07 Nov 2023
Signal Processing Meets SGD: From Momentum to Filter
Zhipeng Yao
Guisong Chang
Jiaqi Zhang
Qi Zhang
Dazhou Li
Yu Zhang
ODL
29
0
0
06 Nov 2023
Optimal Budgeted Rejection Sampling for Generative Models
Alexandre Verine
Muni Sreenivas Pydi
Benjamin Négrevergne
Y. Chevaleyre
21
3
0
01 Nov 2023
Solutions to Elliptic and Parabolic Problems via Finite Difference Based Unsupervised Small Linear Convolutional Neural Networks
A. Celaya
Keegan L. A. Kirk
David T. Fuentes
Beatrice Riviere
11
1
0
01 Nov 2023
A Path to Simpler Models Starts With Noise
Lesia Semenova
Harry Chen
Ronald E. Parr
Cynthia Rudin
41
15
0
30 Oct 2023
Balance, Imbalance, and Rebalance: Understanding Robust Overfitting from a Minimax Game Perspective
Yifei Wang
Liangchen Li
Jiansheng Yang
Zhouchen Lin
Yisen Wang
31
11
0
30 Oct 2023
Power-Enhanced Residual Network for Function Approximation and Physics-Informed Inverse Problems
A. Noorizadegan
D. Young
Benny Y. C. Hon
C. S. Chen
PINN
17
7
0
24 Oct 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
Kaiyan Zhang
Ning Ding
Biqing Qi
Xuekai Zhu
Xinwei Long
Bowen Zhou
46
4
0
24 Oct 2023
Revisiting Deep Ensemble for Out-of-Distribution Detection: A Loss Landscape Perspective
Kun Fang
Qinghua Tao
Xiaolin Huang
Jie-jin Yang
OODD
48
2
0
22 Oct 2023
Training Dynamics of Deep Network Linear Regions
Ahmed Imtiaz Humayun
Randall Balestriero
Richard Baraniuk
36
3
0
19 Oct 2023
Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression
Adam Block
Dylan J. Foster
Akshay Krishnamurthy
Max Simchowitz
Cyril Zhang
30
4
0
17 Oct 2023
Domain Generalization Using Large Pretrained Models with Mixture-of-Adapters
Gyuseong Lee
Wooseok Jang
Jin Hyeon Kim
Jaewoo Jung
Seungryong Kim
MoE
OOD
30
2
0
17 Oct 2023
RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets
Zhicheng Cai
Xiaohan Ding
Qiu Shen
Xun Cao
35
18
0
16 Oct 2023
"Reading Between the Heat": Co-Teaching Body Thermal Signatures for Non-intrusive Stress Detection
Yi Xiao
Harshit Sharma
Zhongyang Zhang
D. Bergen-Cico
Tauhidur Rahman
Asif Salekin
11
2
0
15 Oct 2023
A Symmetry-Aware Exploration of Bayesian Neural Network Posteriors
Olivier Laurent
Emanuel Aldea
Gianni Franchi
BDL
UQCV
22
5
0
12 Oct 2023
Parameter Efficient Multi-task Model Fusion with Partial Linearization
Anke Tang
Li Shen
Yong Luo
Yibing Zhan
Han Hu
Bo Du
Yixin Chen
Dacheng Tao
MoMe
26
30
0
07 Oct 2023
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
VLM
23
14
0
04 Oct 2023
Spline-based neural network interatomic potentials: blending classical and machine learning models
Joshua A Vita
D. Trinkle
14
2
0
04 Oct 2023
Active Learning on Neural Networks through Interactive Generation of Digit Patterns and Visual Representation
D. H. Jeong
Jin-Hee Cho
Feng Chen
A. Jøsang
Soo-Yeon Ji
11
0
0
02 Oct 2023
Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification
M. Milling
Andreas Triantafyllopoulos
Iosif Tsangko
Simon Rampp
F. Schlüter
27
3
0
28 Sep 2023
Deep Model Fusion: A Survey
Weishi Li
Yong Peng
Miao Zhang
Liang Ding
Han Hu
Li Shen
FedML
MoMe
33
52
0
27 Sep 2023
Neuro-Visualizer: An Auto-encoder-based Loss Landscape Visualization Method
Mohannad Elhamod
Anuj Karpatne
24
1
0
26 Sep 2023
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Nate Rahn
P. D'Oro
Harley Wiltzer
Pierre-Luc Bacon
Marc G. Bellemare
27
3
0
26 Sep 2023
Are Large Language Models Really Robust to Word-Level Perturbations?
Haoyu Wang
Guozheng Ma
Cong Yu
Ning Gui
Linrui Zhang
...
Sen Zhang
Li Shen
Xueqian Wang
Peilin Zhao
Dacheng Tao
KELM
26
22
0
20 Sep 2023
Gradient constrained sharpness-aware prompt learning for vision-language models
Liangchen Liu
Nannan Wang
Dawei Zhou
Xinbo Gao
Decheng Liu
Xi Yang
Tongliang Liu
VLM
33
2
0
14 Sep 2023
Investigating the Impact of Action Representations in Policy Gradient Algorithms
Jan Schneider-Barnes
Pierre Schumacher
Daniel Haeufle
Bernhard Schölkopf
Le Chen
OffRL
16
1
0
13 Sep 2023
Exploring Flat Minima for Domain Generalization with Large Learning Rates
Jian Zhang
Lei Qi
Yinghuan Shi
Yang Gao
41
2
0
12 Sep 2023
Active Neural Mapping
Zike Yan
Haoxiang Yang
H. Zha
8
20
0
30 Aug 2023
Deep Video Codec Control for Vision Models
Christoph Reich
Biplob K. Debnath
Deep Patel
Tim Prangemeier
Daniel Cremers
S. Chakradhar
26
1
0
30 Aug 2023
On-the-Fly Guidance Training for Medical Image Registration
Yuelin Xin
Yicheng Chen
Shengxiang Ji
Kun Han
Xiaohui Xie
OOD
35
1
0
29 Aug 2023
Neural Network Training Strategy to Enhance Anomaly Detection Performance: A Perspective on Reconstruction Loss Amplification
Yeonghyeon Park
Sungho Kang
Myung Jin Kim
Hyeonho Jeong
H. Park
Hyeong Seok Kim
Juneho Yi
18
3
0
28 Aug 2023
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors
Chengkun Wei
Wenlong Meng
Zhikun Zhang
M. Chen
Ming-Hui Zhao
Wenjing Fang
Lei Wang
Zihui Zhang
Wenzhi Chen
AAML
29
8
0
26 Aug 2023
Synergistic Fusion of Graph and Transformer Features for Enhanced Molecular Property Prediction
M. V. Sai Prakash
N. Siddartha Reddy
Ganesh Parab
V. Varun
Vishal Vaddina
Saisubramaniam Gopalakrishnan
AI4CE
28
3
0
25 Aug 2023
FedSOL: Stabilized Orthogonal Learning with Proximal Restrictions in Federated Learning
Gihun Lee
Minchan Jeong
Sangmook Kim
Jaehoon Oh
Se-Young Yun
FedML
26
8
0
24 Aug 2023
Adversarial Collaborative Filtering for Free
Huiyuan Chen
Xiaoting Li
Vivian Lai
Chin-Chia Michael Yeh
Yujie Fan
Yan Zheng
Mahashweta Das
Hao Yang
AAML
20
6
0
20 Aug 2023
Latent State Models of Training Dynamics
Michael Y. Hu
Angelica Chen
Naomi Saphra
Kyunghyun Cho
35
7
0
18 Aug 2023
Towards Understanding the Generalizability of Delayed Stochastic Gradient Descent
Xiaoge Deng
Li Shen
Shengwei Li
Tao Sun
Dongsheng Li
Dacheng Tao
28
3
0
18 Aug 2023
Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation
Shengcao Cao
Mengtian Li
James Hays
Deva Ramanan
Yu-xiong Wang
Liangyan Gui
VLM
26
11
0
17 Aug 2023
Membrane Potential Batch Normalization for Spiking Neural Networks
Yu-Zhu Guo
Yuhan Zhang
Y. Chen
Weihang Peng
Xiaode Liu
Liwen Zhang
Xuhui Huang
Zhe Ma
AAML
32
37
0
16 Aug 2023
Unified Data-Free Compression: Pruning and Quantization without Fine-Tuning
Shipeng Bai
Jun Chen
Xintian Shen
Yixuan Qian
Yong Liu
MQ
24
12
0
14 Aug 2023
Spatially Varying Nanophotonic Neural Networks
Kaixuan Wei
Xiao Li
Johannes E. Froech
Praneeth Chakravarthula
James E. M. Whitehead
Ethan Tseng
A. Majumdar
Felix Heide
17
11
0
07 Aug 2023
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy
Shibo Jie
Haoqing Wang
Zhiwei Deng
19
31
0
31 Jul 2023