Fast Adaptive Federated Bilevel Optimization

2 November 2022
Feihu Huang
    FedML
arXiv: 2211.01122

Papers citing "Fast Adaptive Federated Bilevel Optimization"

32 / 32 papers shown
Stochastic Controlled Averaging for Federated Learning with Communication Compression
Xinmeng Huang
Ping Li
Xiaoyun Li
93
207
0
16 Aug 2023
FedNest: Federated Bilevel, Minimax, and Compositional Optimization
Davoud Ataee Tarzanagh
Mingchen Li
Christos Thrampoulidis
Samet Oymak
FedML
84
74
0
04 May 2022
Local Stochastic Bilevel Optimization with Momentum-Based Variance Reduction
Junyi Li
Feihu Huang
Heng Huang
FedML
76
27
0
03 May 2022
Maximizing Communication Efficiency for Large-scale Training via 0/1 Adam
Yucheng Lu
Conglong Li
Minjia Zhang
Christopher De Sa
Yuxiong He
OffRL, AI4CE
56
21
0
12 Feb 2022
A Fully Single Loop Algorithm for Bilevel Optimization without Hessian Inverse
Junyi Li
Bin Gu
Heng Huang
72
74
0
09 Dec 2021
A Novel Convergence Analysis for Algorithms of the Adam Family
Zhishuai Guo
Yi Tian Xu
W. Yin
Rong Jin
Tianbao Yang
64
49
0
07 Dec 2021
Sharp Bounds for Federated Averaging (Local SGD) and Continuous Perspective
Margalit Glasgow
Honglin Yuan
Tengyu Ma
FedML
55
44
0
05 Nov 2021
Toward Communication Efficient Adaptive Gradient Method
Xiangyi Chen
Xiaoyun Li
P. Li
FedML
61
42
0
10 Sep 2021
Enhanced Bilevel Optimization via Bregman Distance
Feihu Huang
Junyi Li
Shangqian Gao
Heng Huang
61
33
0
26 Jul 2021
STEM: A Stochastic Two-Sided Momentum Algorithm Achieving Near-Optimal Sample and Communication Complexities for Federated Learning
Prashant Khanduri
Pranay Sharma
Haibo Yang
Mingyi Hong
Jia Liu
K. Rajawat
P. Varshney
FedML
50
63
0
19 Jun 2021
SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients
Feihu Huang
Junyi Li
Heng Huang
ODL
55
42
0
15 Jun 2021
Provably Faster Algorithms for Bilevel Optimization
Junjie Yang
Kaiyi Ji
Yingbin Liang
90
135
0
08 Jun 2021
Local Stochastic Gradient Descent Ascent: Convergence Analysis and Communication Efficiency
Yuyang Deng
M. Mahdavi
88
61
0
25 Feb 2021
A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum
Prashant Khanduri
Siliang Zeng
Mingyi Hong
Hoi-To Wai
Zhaoran Wang
Zhuoran Yang
65
131
0
15 Feb 2021
AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients
Juntang Zhuang
Tommy M. Tang
Yifan Ding
S. Tatikonda
Nicha Dvornek
X. Papademetris
James S. Duncan
ODL
162
517
0
15 Oct 2020
FedCluster: Boosting the Convergence of Federated Learning via Cluster-Cycling
Cheng Chen
Ziyi Chen
Yi Zhou
B. Kailkhura
FedML
61
61
0
22 Sep 2020
APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Hanlin Tang
Shaoduo Gan
Samyam Rajbhandari
Xiangru Lian
Ji Liu
Yuxiong He
Ce Zhang
52
8
0
26 Aug 2020
Mime: Mimicking Centralized Stochastic Algorithms in Federated Learning
Sai Praneeth Karimireddy
Martin Jaggi
Satyen Kale
M. Mohri
Sashank J. Reddi
Sebastian U. Stich
A. Suresh
FedML
133
217
0
08 Aug 2020
Federated Accelerated Stochastic Gradient Descent
Honglin Yuan
Tengyu Ma
FedML
64
180
0
16 Jun 2020
Adaptive Federated Optimization
Sashank J. Reddi
Zachary B. Charles
Manzil Zaheer
Zachary Garrett
Keith Rush
Jakub Konecný
Sanjiv Kumar
H. B. McMahan
FedML
177
1,437
0
29 Feb 2020
On the Convergence of FedAvg on Non-IID Data
Xiang Li
Kaixuan Huang
Wenhao Yang
Shusen Wang
Zhihua Zhang
FedML
142
2,334
0
04 Jul 2019
Momentum-Based Variance Reduction in Non-Convex SGD
Ashok Cutkosky
Francesco Orabona
ODL
86
407
0
24 May 2019
On the Convergence of Adam and Beyond
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
96
2,499
0
19 Apr 2019
Error Feedback Fixes SignSGD and other Gradient Compression Schemes
Sai Praneeth Karimireddy
Quentin Rebjock
Sebastian U. Stich
Martin Jaggi
66
502
0
28 Jan 2019
Truncated Back-propagation for Bilevel Optimization
Amirreza Shaban
Ching-An Cheng
Nathan Hatch
Byron Boots
101
266
0
25 Oct 2018
On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization
Xiangyi Chen
Sijia Liu
Ruoyu Sun
Mingyi Hong
58
323
0
08 Aug 2018
DARTS: Differentiable Architecture Search
Hanxiao Liu
Karen Simonyan
Yiming Yang
199
4,355
0
24 Jun 2018
Bilevel Programming for Hyperparameter Optimization and Meta-Learning
Luca Franceschi
P. Frasconi
Saverio Salzo
Riccardo Grazzi
Massimiliano Pontil
176
728
0
13 Jun 2018
Local SGD Converges Fast and Communicates Little
Sebastian U. Stich
FedML
176
1,063
0
24 May 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
823
11,909
0
09 Mar 2017
Communication-Efficient Learning of Deep Networks from Decentralized Data
H. B. McMahan
Eider Moore
Daniel Ramage
S. Hampson
Blaise Agüera y Arcas
FedML
406
17,486
0
17 Feb 2016
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.9K
150,115
0
22 Dec 2014