ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2201.07427
26
29

Lifted Primal-Dual Method for Bilinearly Coupled Smooth Minimax Optimization

19 January 2022
K. K. Thekumparampil
Niao He
Sewoong Oh
ArXivPDFHTML
Abstract

We study the bilinearly coupled minimax problem: min⁡xmax⁡yf(x)+y⊤Ax−h(y)\min_{x} \max_{y} f(x) + y^\top A x - h(y)minx​maxy​f(x)+y⊤Ax−h(y), where fff and hhh are both strongly convex smooth functions and admit first-order gradient oracles. Surprisingly, no known first-order algorithms have hitherto achieved the lower complexity bound of Ω((Lxμx+∥A∥μxμy+Lyμy)log⁡(1ε))\Omega((\sqrt{\frac{L_x}{\mu_x}} + \frac{\|A\|}{\sqrt{\mu_x \mu_y}} + \sqrt{\frac{L_y}{\mu_y}}) \log(\frac1{\varepsilon}))Ω((μx​Lx​​​+μx​μy​​∥A∥​+μy​Ly​​​)log(ε1​)) for solving this problem up to an ε\varepsilonε primal-dual gap in the general parameter regime, where Lx,Ly,μx,μyL_x, L_y,\mu_x,\mu_yLx​,Ly​,μx​,μy​ are the corresponding smoothness and strongly convexity constants. We close this gap by devising the first optimal algorithm, the Lifted Primal-Dual (LPD) method. Our method lifts the objective into an extended form that allows both the smooth terms and the bilinear term to be handled optimally and seamlessly with the same primal-dual framework. Besides optimality, our method yields a desirably simple single-loop algorithm that uses only one gradient oracle call per iteration. Moreover, when fff is just convex, the same algorithm applied to a smoothed objective achieves the nearly optimal iteration complexity. We also provide a direct single-loop algorithm, using the LPD method, that achieves the iteration complexity of O(Lxε+∥A∥μyε+Lyε)O(\sqrt{\frac{L_x}{\varepsilon}} + \frac{\|A\|}{\sqrt{\mu_y \varepsilon}} + \sqrt{\frac{L_y}{\varepsilon}})O(εLx​​​+μy​ε​∥A∥​+εLy​​​). Numerical experiments on quadratic minimax problems and policy evaluation problems further demonstrate the fast convergence of our algorithm in practice.

View on arXiv
Comments on this paper