ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.09913
  4. Cited By
Visualizing the Loss Landscape of Neural Nets

Visualizing the Loss Landscape of Neural Nets

28 December 2017
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
ArXivPDFHTML

Papers citing "Visualizing the Loss Landscape of Neural Nets"

50 / 1,039 papers shown
Title
Spherical and Hyperbolic Toric Topology-Based Codes On Graph Embedding
  for Ising MRF Models: Classical and Quantum Topology Machine Learning
Spherical and Hyperbolic Toric Topology-Based Codes On Graph Embedding for Ising MRF Models: Classical and Quantum Topology Machine Learning
V. Usatyuk
Sergey Egorov
Denis Sapozhnikov
29
3
0
28 Jul 2023
How to Scale Your EMA
How to Scale Your EMA
Dan Busbridge
Jason Ramapuram
Pierre Ablin
Tatiana Likhomanenko
Eeshan Gunesh Dhekane
Xavier Suau
Russ Webb
30
17
0
25 Jul 2023
The instabilities of large learning rate training: a loss landscape view
The instabilities of large learning rate training: a loss landscape view
Lawrence Wang
Stephen J. Roberts
8
2
0
22 Jul 2023
FedDefender: Client-Side Attack-Tolerant Federated Learning
FedDefender: Client-Side Attack-Tolerant Federated Learning
Sungwon Park
Sungwon Han
Fangzhao Wu
Sundong Kim
Bin Zhu
Xing Xie
Meeyoung Cha
FedML
AAML
25
20
0
18 Jul 2023
Sharpness-Aware Graph Collaborative Filtering
Sharpness-Aware Graph Collaborative Filtering
Huiyuan Chen
Chin-Chia Michael Yeh
Yujie Fan
Yan Zheng
Junpeng Wang
Vivian Lai
Mahashweta Das
Hao Yang
31
5
0
18 Jul 2023
DOT: A Distillation-Oriented Trainer
DOT: A Distillation-Oriented Trainer
Borui Zhao
Quan Cui
Renjie Song
Jiajun Liang
24
4
0
17 Jul 2023
Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
Jun Chen
Shipeng Bai
Tianxin Huang
Mengmeng Wang
Guanzhong Tian
Y. Liu
MQ
38
18
0
02 Jul 2023
Navigating Noise: A Study of How Noise Influences Generalisation and
  Calibration of Neural Networks
Navigating Noise: A Study of How Noise Influences Generalisation and Calibration of Neural Networks
Martin Ferianc
Ondrej Bohdal
Timothy M. Hospedales
Miguel R. D. Rodrigues
15
4
0
30 Jun 2023
Systematic Investigation of Sparse Perturbed Sharpness-Aware
  Minimization Optimizer
Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer
Peng Mi
Li Shen
Tianhe Ren
Yiyi Zhou
Tianshuo Xu
Xiaoshuai Sun
Tongliang Liu
Rongrong Ji
Dacheng Tao
AAML
37
2
0
30 Jun 2023
Weight Compander: A Simple Weight Reparameterization for Regularization
Weight Compander: A Simple Weight Reparameterization for Regularization
Rinor Cakaj
Jens Mehnert
B. Yang
21
1
0
29 Jun 2023
Predicting Grokking Long Before it Happens: A look into the loss
  landscape of models which grok
Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok
Pascal Junior Tikeng Notsawo
Hattie Zhou
Mohammad Pezeshki
Irina Rish
G. Dumas
25
23
0
23 Jun 2023
No Wrong Turns: The Simple Geometry Of Neural Networks Optimization
  Paths
No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths
Charles Guille-Escuret
Hiroki Naganuma
Kilian Fatras
Ioannis Mitliagkas
16
3
0
20 Jun 2023
FSAR: Federated Skeleton-based Action Recognition with Adaptive Topology
  Structure and Knowledge Distillation
FSAR: Federated Skeleton-based Action Recognition with Adaptive Topology Structure and Knowledge Distillation
Jingwen Guo
Hong Liu
Shitong Sun
Tianyu Guo
Hao Fei
Chenyang Si
41
3
0
19 Jun 2023
Learnable Weight Initialization for Volumetric Medical Image
  Segmentation
Learnable Weight Initialization for Volumetric Medical Image Segmentation
Shahina Kunhimon
Abdelrahman M. Shaker
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
31
1
0
15 Jun 2023
The Split Matters: Flat Minima Methods for Improving the Performance of
  GNNs
The Split Matters: Flat Minima Methods for Improving the Performance of GNNs
N. Lell
A. Scherp
43
1
0
15 Jun 2023
Lookaround Optimizer: $k$ steps around, 1 step average
Lookaround Optimizer: kkk steps around, 1 step average
Jiangtao Zhang
Shunyu Liu
Mingli Song
Tongtian Zhu
Zhenxing Xu
Mingli Song
MoMe
34
6
0
13 Jun 2023
Riemannian Laplace approximations for Bayesian neural networks
Riemannian Laplace approximations for Bayesian neural networks
Federico Bergamin
Pablo Moreno-Muñoz
Søren Hauberg
Georgios Arvanitidis
BDL
38
6
0
12 Jun 2023
FalconNet: Factorization for the Light-weight ConvNets
FalconNet: Factorization for the Light-weight ConvNets
Zhicheng Cai
Qiu Shen
26
11
0
10 Jun 2023
Differentially Private Sharpness-Aware Training
Differentially Private Sharpness-Aware Training
Jinseong Park
Hoki Kim
Yujin Choi
Jaewook Lee
27
8
0
09 Jun 2023
Connectional-Style-Guided Contextual Representation Learning for Brain
  Disease Diagnosis
Connectional-Style-Guided Contextual Representation Learning for Brain Disease Diagnosis
Gongshu Wang
Ning Jiang
Yunxiao Ma
Tiantian Liu
Duanduan Chen
Jinglong Wu
Guoqi Li
Dong Liang
Tianyi Yan
MedIm
28
2
0
08 Jun 2023
Boosting Adversarial Transferability by Achieving Flat Local Maxima
Boosting Adversarial Transferability by Achieving Flat Local Maxima
Zhijin Ge
Hongying Liu
Xiaosen Wang
Fanhua Shang
Yuanyuan Liu
AAML
11
40
0
08 Jun 2023
Machine learning in and out of equilibrium
Machine learning in and out of equilibrium
Shishir Adhikari
Alkan Kabakcciouglu
A. Strang
Deniz Yuret
M. Hinczewski
22
4
0
06 Jun 2023
Discovering Novel Biological Traits From Images Using Phylogeny-Guided
  Neural Networks
Discovering Novel Biological Traits From Images Using Phylogeny-Guided Neural Networks
Mohannad Elhamod
Mridul Khurana
Harish Babu Manogaran
Josef C. Uyeda
M. Balk
...
Wei-Lun Chao
Chuck Stewart
Daniel Rubenstein
T. Berger-Wolf
Anuj Karpatne
14
7
0
05 Jun 2023
Decentralized SGD and Average-direction SAM are Asymptotically
  Equivalent
Decentralized SGD and Average-direction SAM are Asymptotically Equivalent
Tongtian Zhu
Fengxiang He
Kaixuan Chen
Mingli Song
Dacheng Tao
34
15
0
05 Jun 2023
Enhance Diffusion to Improve Robust Generalization
Enhance Diffusion to Improve Robust Generalization
Jianhui Sun
Sanchit Sinha
Aidong Zhang
32
4
0
05 Jun 2023
ReContrast: Domain-Specific Anomaly Detection via Contrastive
  Reconstruction
ReContrast: Domain-Specific Anomaly Detection via Contrastive Reconstruction
Jia Guo
Shuai Lu
Lize Jia
Weihang Zhang
Huiqi Li
24
23
0
05 Jun 2023
Lottery Tickets in Evolutionary Optimization: On Sparse
  Backpropagation-Free Trainability
Lottery Tickets in Evolutionary Optimization: On Sparse Backpropagation-Free Trainability
R. T. Lange
Henning Sprekeler
19
2
0
31 May 2023
Generalization Ability of Wide Residual Networks
Generalization Ability of Wide Residual Networks
Jianfa Lai
Zixiong Yu
Songtao Tian
Qian Lin
31
4
0
29 May 2023
SANE: The phases of gradient descent through Sharpness Adjusted Number
  of Effective parameters
SANE: The phases of gradient descent through Sharpness Adjusted Number of Effective parameters
Lawrence Wang
Stephen J. Roberts
24
0
0
29 May 2023
Exploring Weight Balancing on Long-Tailed Recognition Problem
Exploring Weight Balancing on Long-Tailed Recognition Problem
Naoya Hasegawa
Issei Sato
27
6
0
26 May 2023
Stochastic Modified Equations and Dynamics of Dropout Algorithm
Stochastic Modified Equations and Dynamics of Dropout Algorithm
Zhongwang Zhang
Yuqing Li
Tao Luo
Z. Xu
19
6
0
25 May 2023
Sharpness-Aware Minimization Revisited: Weighted Sharpness as a
  Regularization Term
Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term
Yun Yue
Jiadi Jiang
Zhiling Ye
Ni Gao
Yongchao Liu
Kecheng Zhang
MLAU
ODL
30
11
0
25 May 2023
RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models
RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models
David Qiu
David Rim
Shaojin Ding
Oleg Rybakov
Yanzhang He
MQ
35
4
0
24 May 2023
Self-Evolution Learning for Discriminative Language Model Pretraining
Self-Evolution Learning for Discriminative Language Model Pretraining
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
37
12
0
24 May 2023
Transferring Learning Trajectories of Neural Networks
Transferring Learning Trajectories of Neural Networks
Daiki Chijiwa
31
2
0
23 May 2023
Self-Evolution Learning for Mixup: Enhance Data Augmentation on Few-Shot
  Text Classification Tasks
Self-Evolution Learning for Mixup: Enhance Data Augmentation on Few-Shot Text Classification Tasks
Haoqi Zheng
Qihuang Zhong
Liang Ding
Zhiliang Tian
Xin-Yi Niu
Dongsheng Li
Dacheng Tao
VLM
40
6
0
22 May 2023
Subspace-Configurable Networks
Subspace-Configurable Networks
Dong Wang
O. Saukh
Xiaoxi He
Lothar Thiele
OOD
35
0
0
22 May 2023
Evolutionary Algorithms in the Light of SGD: Limit Equivalence, Minima
  Flatness, and Transfer Learning
Evolutionary Algorithms in the Light of SGD: Limit Equivalence, Minima Flatness, and Transfer Learning
Andrei Kucharavy
R. Guerraoui
Ljiljana Dolamic
32
1
0
20 May 2023
Loss Spike in Training Neural Networks
Loss Spike in Training Neural Networks
Zhongwang Zhang
Z. Xu
36
4
0
20 May 2023
Annealing Self-Distillation Rectification Improves Adversarial Training
Annealing Self-Distillation Rectification Improves Adversarial Training
Yuehua Wu
Hung-Jui Wang
Shang-Tse Chen
AAML
24
3
0
20 May 2023
DisCo: Distilled Student Models Co-training for Semi-supervised Text
  Mining
DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining
Weifeng Jiang
Qianren Mao
Chenghua Lin
Jianxin Li
Ting Deng
Weiyi Yang
Zihan Wang
18
2
0
20 May 2023
Machine learning for phase-resolved reconstruction of nonlinear ocean
  wave surface elevations from sparse remote sensing data
Machine learning for phase-resolved reconstruction of nonlinear ocean wave surface elevations from sparse remote sensing data
Svenja Ehlers
Marco Klein
Alexander Heinlein
Mathies Wedler
Nicolas Desmars
Norbert Hoffmann
M. Stender
AI4Cl
40
2
0
18 May 2023
Physics Inspired Approaches To Understanding Gaussian Processes
Physics Inspired Approaches To Understanding Gaussian Processes
Maximilian P. Niroomand
L. Dicks
Edward O. Pyzer-Knapp
D. Wales
36
1
0
18 May 2023
Understanding the Initial Condensation of Convolutional Neural Networks
Understanding the Initial Condensation of Convolutional Neural Networks
Zhangchen Zhou
Hanxu Zhou
Yuqing Li
Zhi-Qin John Xu
MLT
AI4CE
26
5
0
17 May 2023
Privacy-Preserving Ensemble Infused Enhanced Deep Neural Network
  Framework for Edge Cloud Convergence
Privacy-Preserving Ensemble Infused Enhanced Deep Neural Network Framework for Edge Cloud Convergence
Veronika Stephanie
I. Khalil
Mohammad Saidur Rahman
Mohammed Atiquzzaman
FedML
13
10
0
16 May 2023
Understanding and Improving Model Averaging in Federated Learning on
  Heterogeneous Data
Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data
Tailin Zhou
Zehong Lin
Jinchao Zhang
Danny H. K. Tsang
MoMe
FedML
38
12
0
13 May 2023
Explainable Parallel RCNN with Novel Feature Representation for Time
  Series Forecasting
Explainable Parallel RCNN with Novel Feature Representation for Time Series Forecasting
Jimeng Shi
Rukmangadh Myana
Vitalii Stebliankin
Azam Shirali
Giri Narasimhan
AI4TS
30
6
0
08 May 2023
HiFi: High-Information Attention Heads Hold for Parameter-Efficient
  Model Adaptation
HiFi: High-Information Attention Heads Hold for Parameter-Efficient Model Adaptation
Anchun Gui
Han Xiao
19
4
0
08 May 2023
Interpreting Training Aspects of Deep-Learned Error-Correcting Codes
Interpreting Training Aspects of Deep-Learned Error-Correcting Codes
Natasha Devroye
A. Mulgund
R. Shekhar
Gyorgy Turán
M. vZefran
Y. Zhou
26
3
0
07 May 2023
Simulation and Prediction of Countercurrent Spontaneous Imbibition at
  Early and Late Times Using Physics-Informed Neural Networks
Simulation and Prediction of Countercurrent Spontaneous Imbibition at Early and Late Times Using Physics-Informed Neural Networks
J. Abbasi
P. Andersen
PINN
27
4
0
06 May 2023
Previous
123...678...192021
Next