2206.10414
Cited By
A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences
21 June 2022
Han Shen
Tianyi Chen
Papers citing
"A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences"
17 / 17 papers shown
Title
On Penalty-based Bilevel Gradient Descent Method
Han Shen
Quan-Wu Xiao
Tianyi Chen
60
51
0
08 Jan 2025
Single-Timescale Multi-Sequence Stochastic Approximation Without Fixed Point Smoothness: Theories and Applications
Yue Huang
Zhaoxian Wu
Shiqian Ma
Qing Ling
31
1
0
17 Oct 2024
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
Han Shen
Pin-Yu Chen
Payel Das
Tianyi Chen
ALM
26
11
0
09 Oct 2024
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Yanjie Dong
Haijun Zhang
Gang Wang
Shisheng Cui
Xiping Hu
43
1
0
13 Aug 2024
A Primal-Dual-Assisted Penalty Approach to Bilevel Optimization with Coupled Constraints
Liuyuan Jiang
Quan-Wu Xiao
Victor M. Tenorio
Fernando Real-Rojas
Antonio G. Marques
Tianyi Chen
48
1
0
14 Jun 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Sihan Zeng
Thinh T. Doan
54
5
0
15 May 2024
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
Han Shen
Zhuoran Yang
Tianyi Chen
OffRL
40
14
0
10 Feb 2024
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda
Shalabh Bhatnagar
30
1
0
02 Feb 2024
Federated Multi-Sequence Stochastic Approximation with Local Hypergradient Estimation
Davoud Ataee Tarzanagh
Mingchen Li
Pranay Sharma
Samet Oymak
26
0
0
02 Jun 2023
Alternating Implicit Projected SGD and Its Efficient Variants for Equality-constrained Bilevel Optimization
Quan-Wu Xiao
Han Shen
W. Yin
Tianyi Chen
16
8
0
14 Nov 2022
A framework for bilevel optimization that enables stochastic and global variance reduction algorithms
Mathieu Dagréou
Pierre Ablin
Samuel Vaiter
Thomas Moreau
139
96
0
31 Jan 2022
Solving Stochastic Compositional Optimization is Nearly as Easy as Solving Stochastic Optimization
Tianyi Chen
Yuejiao Sun
W. Yin
48
81
0
25 Aug 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
90
146
0
04 May 2020
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
Harshat Kumar
Alec Koppel
Alejandro Ribeiro
102
79
0
18 Oct 2019
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation
Gang Wang
Bingcong Li
G. Giannakis
21
28
0
10 Sep 2019
Bilevel Programming for Hyperparameter Optimization and Meta-Learning
Luca Franceschi
P. Frasconi
Saverio Salzo
Riccardo Grazzi
Massimiliano Pontil
110
716
0
13 Jun 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
338
11,684
0
09 Mar 2017