ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.10328
  4. Cited By
Scalable Bilinear $π$ Learning Using State and Action Features

Scalable Bilinear πππ Learning Using State and Action Features

27 April 2018
Yichen Chen
Lihong Li
Mengdi Wang
ArXivPDFHTML

Papers citing "Scalable Bilinear $π$ Learning Using State and Action Features"

13 / 13 papers shown
Title
A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance
A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance
Axel Friedrich Wolter
Tobias Sutter
OffRL
37
0
0
07 May 2025
Offline Primal-Dual Reinforcement Learning for Linear MDPs
Offline Primal-Dual Reinforcement Learning for Linear MDPs
Germano Gabbianelli
Gergely Neu
Nneka Okolo
Matteo Papini
OffRL
38
7
0
22 May 2023
Efficient Global Planning in Large MDPs via Stochastic Primal-Dual
  Optimization
Efficient Global Planning in Large MDPs via Stochastic Primal-Dual Optimization
Gergely Neu
Nneka Okolo
45
6
0
21 Oct 2022
Efficient Performance Bounds for Primal-Dual Reinforcement Learning from
  Demonstrations
Efficient Performance Bounds for Primal-Dual Reinforcement Learning from Demonstrations
Angeliki Kamoutsi
G. Banjac
John Lygeros
OffRL
31
7
0
28 Dec 2021
Near Optimal Policy Optimization via REPS
Near Optimal Policy Optimization via REPS
Aldo Pacchiano
Jonathan Lee
Peter L. Bartlett
Ofir Nachum
23
3
0
17 Mar 2021
A Provably Efficient Sample Collection Strategy for Reinforcement
  Learning
A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
OffRL
27
16
0
13 Jul 2020
Efficient Planning in Large MDPs with Weak Linear Function Approximation
Efficient Planning in Large MDPs with Weak Linear Function Approximation
R. Shariff
Csaba Szepesvári
39
22
0
13 Jul 2020
The Landscape of the Proximal Point Method for Nonconvex-Nonconcave
  Minimax Optimization
The Landscape of the Proximal Point Method for Nonconvex-Nonconcave Minimax Optimization
Benjamin Grimmer
Haihao Lu
Pratik Worah
Vahab Mirrokni
42
9
0
15 Jun 2020
Reinforcement Learning via Fenchel-Rockafellar Duality
Reinforcement Learning via Fenchel-Rockafellar Duality
Ofir Nachum
Bo Dai
OffRL
16
118
0
07 Jan 2020
Feature-Based Q-Learning for Two-Player Stochastic Games
Feature-Based Q-Learning for Two-Player Stochastic Games
Zeyu Jia
Lin F. Yang
Mengdi Wang
27
45
0
02 Jun 2019
A Kernel Loss for Solving the Bellman Equation
A Kernel Loss for Solving the Bellman Equation
Yihao Feng
Lihong Li
Qiang Liu
30
70
0
25 May 2019
Unknown mixing times in apprenticeship and reinforcement learning
Unknown mixing times in apprenticeship and reinforcement learning
Tom Zahavy
Alon Cohen
Haim Kaplan
Yishay Mansour
OffRL
17
11
0
23 May 2019
State Aggregation Learning from Markov Transition Data
State Aggregation Learning from Markov Transition Data
Shiqi Wang
Yizheng Chen
Ahmed Abdou
18
54
0
06 Nov 2018
1