ResearchTrend.AI
Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning

8 June 2022
Momin Abbas
Quan-Wu Xiao
Lisha Chen
Pin-Yu Chen
Tianyi Chen

Papers citing "Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning"

50 / 61 papers shown
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
Wenjie Wang
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
161
0
0
27 Apr 2025
Imperative Learning: A Self-supervised Neuro-Symbolic Learning Framework for Robot Autonomy
Chen Wang
Kaiyi Ji
Junyi Geng
Zhongqiang Ren
Taimeng Fu
...
Yi Du
Qihang Li
Yue Yang
Xiao Lin
Zhipeng Zhao
SSL
156
10
0
28 Jan 2025
Rethinking Meta-Learning from a Learning Lens
Wenwen Qiang
Jingyao Wang
Chuxiong Sun
Hui Xiong
Jiangmeng Li
157
3
0
13 Sep 2024
Learning to Learn from APIs: Black-Box Data-Free Meta-Learning
Zixuan Hu
Li Shen
Zhenyi Wang
Baoyuan Wu
Chun Yuan
Dacheng Tao
120
8
0
28 May 2023
Improving Multi-task Learning via Seeking Task-based Flat Regions
Hoang Phan
Lam C. Tran
Ngoc N. Tran
Nhat Ho
Tuan Truong
Qi Lei
Nhat Ho
Dinh Q. Phung
Trung Le
207
11
0
24 Nov 2022
Is Bayesian Model-Agnostic Meta Learning Better than Model-Agnostic Meta Learning, Provably?
Lisha Chen
Tianyi Chen
BDL
70
16
0
06 Mar 2022
Sharpness-Aware Minimization Improves Language Model Generalization
Dara Bahri
H. Mobahi
Yi Tay
162
103
0
16 Oct 2021
Efficient Sharpness-aware Minimization for Improved Training of Neural Networks
Jiawei Du
Hanshu Yan
Jiashi Feng
Qiufeng Wang
Liangli Zhen
Rick Siow Mong Goh
Vincent Y. F. Tan
AAML
163
135
0
07 Oct 2021
When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations
Xiangning Chen
Cho-Jui Hsieh
Boqing Gong
ViT
97
329
0
03 Jun 2021
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
444
2,694
0
04 May 2021
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
Ren Wang
Kaidi Xu
Sijia Liu
Pin-Yu Chen
Tsui-Wei Weng
Chuang Gan
Meng Wang
AAML
90
47
0
20 Feb 2021
Generalization Bounds for Meta-Learning via PAC-Bayes and Uniform Stability
Alec Farid
Anirudha Majumdar
72
36
0
12 Feb 2021
A Single-Timescale Method for Stochastic Bilevel Optimization
Tianyi Chen
Yuejiao Sun
Quan-Wu Xiao
W. Yin
71
79
0
09 Feb 2021
mT5: A massively multilingual pre-trained text-to-text transformer
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
148
2,561
0
22 Oct 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
684
41,563
0
22 Oct 2020
Sharpness-Aware Minimization for Efficiently Improving Generalization
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
AAML
199
1,359
0
03 Oct 2020
Yet Meta Learning Can Adapt Fast, It Can Also Break Easily
Han Xu
Yaxin Li
Xiaorui Liu
Hui Liu
Jiliang Tang
AAML
119
10
0
02 Sep 2020
Tracking by Instance Detection: A Meta-Learning Approach
Guangting Wang
Chong Luo
Xiaoyan Sun
Zhiwei Xiong
Wenjun Zeng
77
148
0
02 Apr 2020
PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees
Jonas Rothfuss
Vincent Fortuin
Martin Josifoski
Andreas Krause
UQCV
86
127
0
13 Feb 2020
Meta-Learning without Memorization
Mingzhang Yin
George Tucker
Mingyuan Zhou
Sergey Levine
Chelsea Finn
VLM
50
188
0
09 Dec 2019
Fantastic Generalization Measures and Where to Find Them
Yiding Jiang
Behnam Neyshabur
H. Mobahi
Dilip Krishnan
Samy Bengio
AI4CE
145
611
0
04 Dec 2019
Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
Risto Vuorio
Shao-Hua Sun
Hexiang Hu
Joseph J. Lim
93
218
0
30 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
506
20,376
0
23 Oct 2019
Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML
Aniruddh Raghu
M. Raghu
Samy Bengio
Oriol Vinyals
315
647
0
19 Sep 2019
Torchmeta: A Meta-Learning library for PyTorch
T. Deleu
Tobias Würfl
Mandana Samiei
Joseph Paul Cohen
Yoshua Bengio
OffRL
74
85
0
14 Sep 2019
Meta-Learning with Implicit Gradients
Aravind Rajeswaran
Chelsea Finn
Sham Kakade
Sergey Levine
119
858
0
10 Sep 2019
On the Convergence Theory of Gradient-Based Model-Agnostic Meta-Learning Algorithms
Alireza Fallah
Aryan Mokhtari
Asuman Ozdaglar
93
225
0
27 Aug 2019
Boosting Few-Shot Visual Learning with Self-Supervision
Spyros Gidaris
Andrei Bursuc
N. Komodakis
P. Pérez
Matthieu Cord
SSL
92
405
0
12 Jun 2019
Improved Training Speed, Accuracy, and Data Utilization Through Loss Function Optimization
Santiago Gonzalez
Risto Miikkulainen
82
76
0
27 May 2019
Task2Vec: Task Embedding for Meta-Learning
Alessandro Achille
Michael Lam
Rahul Tewari
Avinash Ravichandran
Subhransu Maji
Charless C. Fowlkes
Stefano Soatto
Pietro Perona
SSL
80
316
0
10 Feb 2019
Meta-Curvature
Eunbyung Park
Junier B. Oliva
BDL
75
124
0
09 Feb 2019
How to train your MAML
Antreas Antoniou
Harrison Edwards
Amos Storkey
74
778
0
22 Oct 2018
Meta-Learning: A Survey
Joaquin Vanschoren
FedML OOD
76
761
0
08 Oct 2018
Unsupervised Learning via Meta-Learning
Kyle Hsu
Sergey Levine
Chelsea Finn
SSL OffRL
85
230
0
04 Oct 2018
Bayesian Model-Agnostic Meta-Learning
Taesup Kim
Jaesik Yoon
Ousmane Amadou Dia
Sungwoong Kim
Yoshua Bengio
Sungjin Ahn
UQCV BDL
301
503
0
11 Jun 2018
Training Medical Image Analysis Systems like Radiologists
Gabriel Maicas
A. Bradley
Jacinto C. Nascimento
Ian Reid
G. Carneiro
65
55
0
28 May 2018
Meta-learning with differentiable closed-form solvers
Luca Bertinetto
João F. Henriques
Philip Torr
Andrea Vedaldi
ODL
100
931
0
21 May 2018
Averaging Weights Leads to Wider Optima and Better Generalization
Pavel Izmailov
Dmitrii Podoprikhin
T. Garipov
Dmitry Vetrov
A. Wilson
FedML MoMe
143
1,673
0
14 Mar 2018
On First-Order Meta-Learning Algorithms
Alex Nichol
Joshua Achiam
John Schulman
246
2,239
0
08 Mar 2018
Natural Language to Structured Query Generation via Meta-Learning
Po-Sen Huang
Chenglong Wang
Rishabh Singh
Wen-tau Yih
Xiaodong He
73
123
0
02 Mar 2018
Recasting Gradient-Based Meta-Learning as Hierarchical Bayes
Erin Grant
Chelsea Finn
Sergey Levine
Trevor Darrell
Thomas Griffiths
BDL
98
510
0
26 Jan 2018
Visualizing the Loss Landscape of Neural Nets
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
266
1,901
0
28 Dec 2017
Learning to Compare: Relation Network for Few-Shot Learning
Flood Sung
Yongxin Yang
Li Zhang
Tao Xiang
Philip Torr
Timothy M. Hospedales
314
4,054
0
16 Nov 2017
Few-Shot Learning with Graph Neural Networks
Victor Garcia Satorras
Joan Bruna
GNN
176
1,240
0
10 Nov 2017
Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm
Chelsea Finn
Sergey Levine
SSL
117
223
0
31 Oct 2017
Learning to Generalize: Meta-Learning for Domain Generalization
Da Li
Yongxin Yang
Yi-Zhe Song
Timothy M. Hospedales
OOD
102
1,430
0
10 Oct 2017
SMASH: One-Shot Model Architecture Search through HyperNetworks
Andrew Brock
Theodore Lim
J. Ritchie
Nick Weston
167
765
0
17 Aug 2017
Meta-SGD: Learning to Learn Quickly for Few-Shot Learning
Zhenguo Li
Fengwei Zhou
Fei Chen
Hang Li
101
1,121
0
31 Jul 2017
Computing Nonvacuous Generalization Bounds for Deep (Stochastic) Neural Networks with Many More Parameters than Training Data
Gintare Karolina Dziugaite
Daniel M. Roy
121
820
0
31 Mar 2017
Prototypical Networks for Few-shot Learning
Jake C. Snell
Kevin Swersky
R. Zemel
305
8,164
0
15 Mar 2017