2206.03996
Cited By
v1
v2
v3
v4 (latest)
Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning
8 June 2022
Momin Abbas
Quan-Wu Xiao
Lisha Chen
Pin-Yu Chen
Tianyi Chen
Github (31★)
Papers citing
"Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning"
50 / 61 papers shown
Title
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
Wenjie Wang
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
161
0
0
27 Apr 2025
Imperative Learning: A Self-supervised Neuro-Symbolic Learning Framework for Robot Autonomy
Chen Wang
Kaiyi Ji
Junyi Geng
Zhongqiang Ren
Taimeng Fu
...
Yi Du
Qihang Li
Yue Yang
Xiao Lin
Zhipeng Zhao
SSL
156
10
0
28 Jan 2025
Rethinking Meta-Learning from a Learning Lens
Wenwen Qiang
Jingyao Wang
Chuxiong Sun
Hui Xiong
Jiangmeng Li
157
3
0
13 Sep 2024
Learning to Learn from APIs: Black-Box Data-Free Meta-Learning
Zixuan Hu
Li Shen
Zhenyi Wang
Baoyuan Wu
Chun Yuan
Dacheng Tao
124
8
0
28 May 2023
Improving Multi-task Learning via Seeking Task-based Flat Regions
Hoang Phan
Lam C. Tran
Ngoc N. Tran
Nhat Ho
Tuan Truong
Qi Lei
Dinh Q. Phung
Trung Le
207
11
0
24 Nov 2022
Is Bayesian Model-Agnostic Meta Learning Better than Model-Agnostic Meta Learning, Provably?
Lisha Chen
Tianyi Chen
BDL
70
16
0
06 Mar 2022
Sharpness-Aware Minimization Improves Language Model Generalization
Dara Bahri
H. Mobahi
Yi Tay
162
103
0
16 Oct 2021
Efficient Sharpness-aware Minimization for Improved Training of Neural Networks
Jiawei Du
Hanshu Yan
Jiashi Feng
Qiufeng Wang
Liangli Zhen
Rick Siow Mong Goh
Vincent Y. F. Tan
AAML
163
135
0
07 Oct 2021
When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations
Xiangning Chen
Cho-Jui Hsieh
Boqing Gong
ViT
97
329
0
03 Jun 2021
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
444
2,694
0
04 May 2021
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
Ren Wang
Kaidi Xu
Sijia Liu
Pin-Yu Chen
Tsui-Wei Weng
Chuang Gan
Meng Wang
AAML
90
47
0
20 Feb 2021
Generalization Bounds for Meta-Learning via PAC-Bayes and Uniform Stability
Alec Farid
Anirudha Majumdar
72
36
0
12 Feb 2021
A Single-Timescale Method for Stochastic Bilevel Optimization
Tianyi Chen
Yuejiao Sun
Quan-Wu Xiao
W. Yin
71
79
0
09 Feb 2021
mT5: A massively multilingual pre-trained text-to-text transformer
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
148
2,561
0
22 Oct 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
684
41,563
0
22 Oct 2020
Sharpness-Aware Minimization for Efficiently Improving Generalization
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
AAML
199
1,359
0
03 Oct 2020
Yet Meta Learning Can Adapt Fast, It Can Also Break Easily
Han Xu
Yaxin Li
Xiaorui Liu
Hui Liu
Jiliang Tang
AAML
119
10
0
02 Sep 2020
Tracking by Instance Detection: A Meta-Learning Approach
Guangting Wang
Chong Luo
Xiaoyan Sun
Zhiwei Xiong
Wenjun Zeng
77
148
0
02 Apr 2020
PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees
Jonas Rothfuss
Vincent Fortuin
Martin Josifoski
Andreas Krause
UQCV
86
127
0
13 Feb 2020
Meta-Learning without Memorization
Mingzhang Yin
George Tucker
Mingyuan Zhou
Sergey Levine
Chelsea Finn
VLM
50
188
0
09 Dec 2019
Fantastic Generalization Measures and Where to Find Them
Yiding Jiang
Behnam Neyshabur
H. Mobahi
Dilip Krishnan
Samy Bengio
AI4CE
145
611
0
04 Dec 2019
Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
Risto Vuorio
Shao-Hua Sun
Hexiang Hu
Joseph J. Lim
93
218
0
30 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
506
20,376
0
23 Oct 2019
Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML
Aniruddh Raghu
M. Raghu
Samy Bengio
Oriol Vinyals
315
647
0
19 Sep 2019
Torchmeta: A Meta-Learning library for PyTorch
T. Deleu
Tobias Würfl
Mandana Samiei
Joseph Paul Cohen
Yoshua Bengio
OffRL
74
85
0
14 Sep 2019
Meta-Learning with Implicit Gradients
Aravind Rajeswaran
Chelsea Finn
Sham Kakade
Sergey Levine
119
858
0
10 Sep 2019
On the Convergence Theory of Gradient-Based Model-Agnostic Meta-Learning Algorithms
Alireza Fallah
Aryan Mokhtari
Asuman Ozdaglar
93
225
0
27 Aug 2019
Boosting Few-Shot Visual Learning with Self-Supervision
Spyros Gidaris
Andrei Bursuc
N. Komodakis
P. Pérez
Matthieu Cord
SSL
92
405
0
12 Jun 2019
Improved Training Speed, Accuracy, and Data Utilization Through Loss Function Optimization
Santiago Gonzalez
Risto Miikkulainen
82
76
0
27 May 2019
Task2Vec: Task Embedding for Meta-Learning
Alessandro Achille
Michael Lam
Rahul Tewari
Avinash Ravichandran
Subhransu Maji
Charless C. Fowlkes
Stefano Soatto
Pietro Perona
SSL
80
316
0
10 Feb 2019
Meta-Curvature
Eunbyung Park
Junier B. Oliva
BDL
75
124
0
09 Feb 2019
How to train your MAML
Antreas Antoniou
Harrison Edwards
Amos Storkey
74
778
0
22 Oct 2018
Meta-Learning: A Survey
Joaquin Vanschoren
FedML
OOD
76
761
0
08 Oct 2018
Unsupervised Learning via Meta-Learning
Kyle Hsu
Sergey Levine
Chelsea Finn
SSL
OffRL
85
230
0
04 Oct 2018
Bayesian Model-Agnostic Meta-Learning
Taesup Kim
Jaesik Yoon
Ousmane Amadou Dia
Sungwoong Kim
Yoshua Bengio
Sungjin Ahn
UQCV
BDL
301
503
0
11 Jun 2018
Training Medical Image Analysis Systems like Radiologists
Gabriel Maicas
A. Bradley
Jacinto C. Nascimento
Ian Reid
G. Carneiro
65
55
0
28 May 2018
Meta-learning with differentiable closed-form solvers
Luca Bertinetto
João F. Henriques
Philip Torr
Andrea Vedaldi
ODL
100
931
0
21 May 2018
Averaging Weights Leads to Wider Optima and Better Generalization
Pavel Izmailov
Dmitrii Podoprikhin
T. Garipov
Dmitry Vetrov
A. Wilson
FedML
MoMe
143
1,673
0
14 Mar 2018
On First-Order Meta-Learning Algorithms
Alex Nichol
Joshua Achiam
John Schulman
248
2,239
0
08 Mar 2018
Natural Language to Structured Query Generation via Meta-Learning
Po-Sen Huang
Chenglong Wang
Rishabh Singh
Wen-tau Yih
Xiaodong He
73
123
0
02 Mar 2018
Recasting Gradient-Based Meta-Learning as Hierarchical Bayes
Erin Grant
Chelsea Finn
Sergey Levine
Trevor Darrell
Thomas Griffiths
BDL
98
510
0
26 Jan 2018
Visualizing the Loss Landscape of Neural Nets
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
266
1,901
0
28 Dec 2017
Learning to Compare: Relation Network for Few-Shot Learning
Flood Sung
Yongxin Yang
Li Zhang
Tao Xiang
Philip Torr
Timothy M. Hospedales
314
4,054
0
16 Nov 2017
Few-Shot Learning with Graph Neural Networks
Victor Garcia Satorras
Joan Bruna
GNN
178
1,240
0
10 Nov 2017
Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm
Chelsea Finn
Sergey Levine
SSL
117
223
0
31 Oct 2017
Learning to Generalize: Meta-Learning for Domain Generalization
Da Li
Yongxin Yang
Yi-Zhe Song
Timothy M. Hospedales
OOD
102
1,430
0
10 Oct 2017
SMASH: One-Shot Model Architecture Search through HyperNetworks
Andrew Brock
Theodore Lim
J. Ritchie
Nick Weston
167
765
0
17 Aug 2017
Meta-SGD: Learning to Learn Quickly for Few-Shot Learning
Zhenguo Li
Fengwei Zhou
Fei Chen
Hang Li
101
1,121
0
31 Jul 2017
Computing Nonvacuous Generalization Bounds for Deep (Stochastic) Neural Networks with Many More Parameters than Training Data
Gintare Karolina Dziugaite
Daniel M. Roy
121
820
0
31 Mar 2017
Prototypical Networks for Few-shot Learning
Jake C. Snell
Kevin Swersky
R. Zemel
305
8,164
0
15 Mar 2017