ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.10414
  4. Cited By
A Single-Timescale Analysis For Stochastic Approximation With Multiple
  Coupled Sequences

A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences

21 June 2022
Han Shen
Tianyi Chen
ArXivPDFHTML

Papers citing "A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences"

17 / 17 papers shown
Title
On Penalty-based Bilevel Gradient Descent Method
On Penalty-based Bilevel Gradient Descent Method
Han Shen
Quan-Wu Xiao
Tianyi Chen
60
51
0
08 Jan 2025
Single-Timescale Multi-Sequence Stochastic Approximation Without Fixed
  Point Smoothness: Theories and Applications
Single-Timescale Multi-Sequence Stochastic Approximation Without Fixed Point Smoothness: Theories and Applications
Yue Huang
Zhaoxian Wu
Shiqian Ma
Qing Ling
31
1
0
17 Oct 2024
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
Han Shen
Pin-Yu Chen
Payel Das
Tianyi Chen
ALM
26
11
0
09 Oct 2024
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Yanjie Dong
Haijun Zhang
Gang Wang
Shisheng Cui
Xiping Hu
45
1
0
13 Aug 2024
A Primal-Dual-Assisted Penalty Approach to Bilevel Optimization with
  Coupled Constraints
A Primal-Dual-Assisted Penalty Approach to Bilevel Optimization with Coupled Constraints
Liuyuan Jiang
Quan-Wu Xiao
Victor M. Tenorio
Fernando Real-Rojas
Antonio G. Marques
Tianyi Chen
48
1
0
14 Jun 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Sihan Zeng
Thinh T. Doan
54
5
0
15 May 2024
Principled Penalty-based Methods for Bilevel Reinforcement Learning and
  RLHF
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
Han Shen
Zhuoran Yang
Tianyi Chen
OffRL
40
14
0
10 Feb 2024
Two-Timescale Critic-Actor for Average Reward MDPs with Function
  Approximation
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda
Shalabh Bhatnagar
32
1
0
02 Feb 2024
Federated Multi-Sequence Stochastic Approximation with Local
  Hypergradient Estimation
Federated Multi-Sequence Stochastic Approximation with Local Hypergradient Estimation
Davoud Ataee Tarzanagh
Mingchen Li
Pranay Sharma
Samet Oymak
26
0
0
02 Jun 2023
Alternating Implicit Projected SGD and Its Efficient Variants for
  Equality-constrained Bilevel Optimization
Alternating Implicit Projected SGD and Its Efficient Variants for Equality-constrained Bilevel Optimization
Quan-Wu Xiao
Han Shen
W. Yin
Tianyi Chen
18
8
0
14 Nov 2022
A framework for bilevel optimization that enables stochastic and global
  variance reduction algorithms
A framework for bilevel optimization that enables stochastic and global variance reduction algorithms
Mathieu Dagréou
Pierre Ablin
Samuel Vaiter
Thomas Moreau
139
96
0
31 Jan 2022
Solving Stochastic Compositional Optimization is Nearly as Easy as
  Solving Stochastic Optimization
Solving Stochastic Compositional Optimization is Nearly as Easy as Solving Stochastic Optimization
Tianyi Chen
Yuejiao Sun
W. Yin
48
81
0
25 Aug 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
90
146
0
04 May 2020
On the Sample Complexity of Actor-Critic Method for Reinforcement
  Learning with Function Approximation
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
Harshat Kumar
Alec Koppel
Alejandro Ribeiro
102
79
0
18 Oct 2019
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased
  Stochastic Approximation
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation
Gang Wang
Bingcong Li
G. Giannakis
23
28
0
10 Sep 2019
Bilevel Programming for Hyperparameter Optimization and Meta-Learning
Bilevel Programming for Hyperparameter Optimization and Meta-Learning
Luca Franceschi
P. Frasconi
Saverio Salzo
Riccardo Grazzi
Massimiliano Pontil
110
716
0
13 Jun 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
347
11,684
0
09 Mar 2017
1