Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.01412
Cited By
Sharpness-Aware Minimization for Efficiently Improving Generalization
3 October 2020
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sharpness-Aware Minimization for Efficiently Improving Generalization"
50 / 867 papers shown
Title
REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation
Skyler Seto
B. Theobald
Federico Danieli
Navdeep Jaitly
Dan Busbridge
TTA
OOD
45
6
0
07 Sep 2023
Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNN
Kin Wai Lau
L. Po
Yasar Abbas Ur Rehman
VLM
32
200
0
04 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
Fengxiang Bie
Yibo Yang
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
33
19
0
02 Sep 2023
On the Implicit Bias of Adam
M. D. Cattaneo
Jason M. Klusowski
Boris Shigida
36
18
0
31 Aug 2023
Incorporating Neuro-Inspired Adaptability for Continual Learning in Artificial Intelligence
Liyuan Wang
Xingxing Zhang
Qian Li
Mingtian Zhang
Hang Su
Jun Zhu
Yi Zhong
34
50
0
29 Aug 2023
FedSOL: Stabilized Orthogonal Learning with Proximal Restrictions in Federated Learning
Gihun Lee
Minchan Jeong
Sangmook Kim
Jaehoon Oh
Se-Young Yun
FedML
26
8
0
24 Aug 2023
Bias-Aware Minimisation: Understanding and Mitigating Estimator Bias in Private SGD
Moritz Knolle
R. Dorfman
Alexander Ziller
Daniel Rueckert
Georgios Kaissis
20
2
0
23 Aug 2023
Adversarial Collaborative Filtering for Free
Huiyuan Chen
Xiaoting Li
Vivian Lai
Chin-Chia Michael Yeh
Yujie Fan
Yan Zheng
Mahashweta Das
Hao Yang
AAML
23
6
0
20 Aug 2023
GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement
Xiaohan Zhang
Xingyu Li
Waqas Sultani
Chen Chen
S. Wshah
32
12
0
18 Aug 2023
DFedADMM: Dual Constraints Controlled Model Inconsistency for Decentralized Federated Learning
Qinglun Li
Li Shen
Guang-Ming Li
Quanjun Yin
Dacheng Tao
FedML
31
7
0
16 Aug 2023
ImbSAM: A Closer Look at Sharpness-Aware Minimization in Class-Imbalanced Recognition
Yixuan Zhou
Yi Qu
Xing Xu
Hengtao Shen
33
16
0
15 Aug 2023
EFaR 2023: Efficient Face Recognition Competition
J. Kolf
Fadi Boutros
Jurek Elliesen
Markus Theuerkauf
Naser Damer
...
D. Nunes
Ahmad Hassanpour
Pankaj Khatiwada
A. Toor
Bian Yang
CVBM
MQ
35
13
0
08 Aug 2023
G-Mix: A Generalized Mixup Learning Framework Towards Flat Minima
Xingyu Li
Bo Tang
AAML
17
0
0
07 Aug 2023
Meta-Tsallis-Entropy Minimization: A New Self-Training Approach for Domain Adaptation on Text Classification
Menglong Lu
Zhen Huang
Zhiliang Tian
Yunxiang Zhao
Xuanyu Fei
Dongsheng Li
OOD
34
5
0
04 Aug 2023
Frustratingly Easy Model Generalization by Dummy Risk Minimization
Juncheng Wang
Jindong Wang
Xixu Hu
Shujun Wang
Xingxu Xie
16
1
0
04 Aug 2023
Baby Llama: knowledge distillation from an ensemble of teachers trained on a small dataset with no performance penalty
I. Timiryasov
J. Tastet
21
47
0
03 Aug 2023
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy
Shibo Jie
Haoqing Wang
Zhiwei Deng
21
31
0
31 Jul 2023
Lookbehind-SAM: k steps back, 1 step forward
Gonçalo Mordido
Pranshu Malviya
A. Baratin
Sarath Chandar
AAML
45
1
0
31 Jul 2023
Validating polyp and instrument segmentation methods in colonoscopy through Medico 2020 and MedAI 2021 Challenges
Debesh Jha
Vanshali Sharma
Debapriya Banik
Debayan Bhattacharya
K. Roy
...
Sharib Ali
Michael A. Riegler
P. Halvorsen
Thomas de Lange
Ulas Bagci
30
1
0
30 Jul 2023
The instabilities of large learning rate training: a loss landscape view
Lawrence Wang
Stephen J. Roberts
8
2
0
22 Jul 2023
Improving Transferability of Adversarial Examples via Bayesian Attacks
Qizhang Li
Yiwen Guo
Xiaochen Yang
W. Zuo
Hao Chen
AAML
BDL
36
2
0
21 Jul 2023
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization
Kaiyue Wen
Zhiyuan Li
Tengyu Ma
FAtt
38
26
0
20 Jul 2023
Flatness-Aware Minimization for Domain Generalization
Xingxuan Zhang
Renzhe Xu
Han Yu
Yancheng Dong
Pengfei Tian
Peng Cu
32
20
0
20 Jul 2023
A Holistic Assessment of the Reliability of Machine Learning Systems
Anthony Corso
David Karamadian
Romeo Valentin
Mary Cooper
Mykel J. Kochenderfer
30
6
0
20 Jul 2023
FedSoup: Improving Generalization and Personalization in Federated Learning via Selective Model Interpolation
Minghui Chen
Meirui Jiang
Qianming Dou
Zehua Wang
Xiaoxiao Li
FedML
35
16
0
20 Jul 2023
IncDSI: Incrementally Updatable Document Retrieval
Varsha Kishore
Chao-gang Wan
Justin Lovelace
Yoav Artzi
Kilian Q. Weinberger
30
9
0
19 Jul 2023
Promoting Exploration in Memory-Augmented Adam using Critical Momenta
Pranshu Malviya
Gonçalo Mordido
A. Baratin
Reza Babanezhad Harikandeh
Jerry Huang
Simon Lacoste-Julien
Razvan Pascanu
Sarath Chandar
ODL
33
1
0
18 Jul 2023
Sharpness-Aware Graph Collaborative Filtering
Huiyuan Chen
Chin-Chia Michael Yeh
Yujie Fan
Yan Zheng
Junpeng Wang
Vivian Lai
Mahashweta Das
Hao Yang
31
5
0
18 Jul 2023
An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration
Hiroki Naganuma
Ryuichiro Hataya
Kotaro Yoshida
Ioannis Mitliagkas
OODD
95
1
0
17 Jul 2023
Why Does Little Robustness Help? Understanding and Improving Adversarial Transferability from Surrogate Training
Yechao Zhang
Shengshan Hu
Leo Yu Zhang
Junyu Shi
Minghui Li
Xiaogeng Liu
Wei Wan
Hai Jin
AAML
22
21
0
15 Jul 2023
Variance-reduced accelerated methods for decentralized stochastic double-regularized nonconvex strongly-concave minimax problems
Gabriel Mancino-Ball
Yangyang Xu
20
8
0
14 Jul 2023
Layer-wise Linear Mode Connectivity
Linara Adilova
Maksym Andriushchenko
Michael Kamp
Asja Fischer
Martin Jaggi
FedML
FAtt
MoMe
33
15
0
13 Jul 2023
No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models
Jean Kaddour
Oscar Key
Piotr Nawrot
Pasquale Minervini
Matt J. Kusner
22
41
0
12 Jul 2023
GNP Attack: Transferable Adversarial Examples via Gradient Norm Penalty
Tao Wu
Tie-Mei Luo
D. Wunsch
AAML
30
9
0
09 Jul 2023
Benchmarking Test-Time Adaptation against Distribution Shifts in Image Classification
Yongcan Yu
Lijun Sheng
Ran He
Jian Liang
OOD
VLM
TTA
35
15
0
06 Jul 2023
FAM: Relative Flatness Aware Minimization
Linara Adilova
Amr Abourayya
Jianning Li
Amin Dada
Henning Petzka
Jan Egger
Jens Kleesiek
Michael Kamp
ODL
29
1
0
05 Jul 2023
Sparsity-aware generalization theory for deep neural networks
Ramchandran Muthukumar
Jeremias Sulam
MLT
24
4
0
01 Jul 2023
Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer
Peng Mi
Li Shen
Tianhe Ren
Yiyi Zhou
Tianshuo Xu
Xiaoshuai Sun
Tongliang Liu
Rongrong Ji
Dacheng Tao
AAML
37
2
0
30 Jun 2023
Efficient Backdoor Removal Through Natural Gradient Fine-tuning
Nazmul Karim
Abdullah Al Arafat
Umar Khalid
Zhishan Guo
Naznin Rahnavard
AAML
28
1
0
30 Jun 2023
Adaptive Sharpness-Aware Pruning for Robust Sparse Networks
Anna Bair
Hongxu Yin
Maying Shen
Pavlo Molchanov
J. Álvarez
40
10
0
25 Jun 2023
Enhancing Adversarial Training via Reweighting Optimization Trajectory
Tianjin Huang
Shiwei Liu
Tianlong Chen
Meng Fang
Lijuan Shen
Vlaod Menkovski
Lu Yin
Yulong Pei
Mykola Pechenizkiy
AAML
30
4
0
25 Jun 2023
G-TRACER: Expected Sharpness Optimization
John R. Williams
Stephen J. Roberts
35
0
0
24 Jun 2023
The Inductive Bias of Flatness Regularization for Deep Matrix Factorization
Khashayar Gatmiry
Zhiyuan Li
Ching-Yao Chuang
Sashank J. Reddi
Tengyu Ma
Stefanie Jegelka
ODL
25
11
0
22 Jun 2023
FFCV: Accelerating Training by Removing Data Bottlenecks
Guillaume Leclerc
Andrew Ilyas
Logan Engstrom
Sung Min Park
Hadi Salman
A. Madry
29
67
0
21 Jun 2023
Training Transformers with 4-bit Integers
Haocheng Xi
Changhao Li
Jianfei Chen
Jun Zhu
MQ
25
47
0
21 Jun 2023
Adversarial Training Should Be Cast as a Non-Zero-Sum Game
Alexander Robey
Fabian Latorre
George J. Pappas
Hamed Hassani
V. Cevher
AAML
66
12
0
19 Jun 2023
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning
Hojoon Lee
Hanseul Cho
Hyunseung Kim
Daehoon Gwak
Joonkee Kim
Jaegul Choo
Se-Young Yun
Chulhee Yun
OffRL
82
26
0
19 Jun 2023
Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima
Dongkuk Si
Chulhee Yun
28
15
0
16 Jun 2023
Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization
Ramnath Kumar
Kushal Majmundar
Dheeraj M. Nagaraj
A. Suggala
ODL
29
6
0
15 Jun 2023
The Split Matters: Flat Minima Methods for Improving the Performance of GNNs
N. Lell
A. Scherp
43
1
0
15 Jun 2023
Previous
1
2
3
...
8
9
10
...
16
17
18
Next