2010.11151
Logistic Q-Learning
21 October 2020
Joan Bas-Serrano
Sebastian Curi
Andreas Krause
Gergely Neu
Papers citing "Logistic Q-Learning" (16 / 16 papers shown)
Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees
Yongtao Wu
Luca Viano
Yihang Chen
Zhenyu Zhu
Kimon Antonakopoulos
Quanquan Gu
V. Cevher
54
0
0
18 Feb 2025
The Central Role of the Loss Function in Reinforcement Learning
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
59
7
0
19 Sep 2024
Offline RL via Feature-Occupancy Gradient Ascent
Gergely Neu
Nneka Okolo
OffRL
34
0
0
22 May 2024
Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm
Titouan Renard
Andreas Schlaginhaufen
Tingting Ni
Maryam Kamgarpour
51
1
0
25 Mar 2024
Stability of Q-Learning Through Design and Optimism
Sean P. Meyn
23
10
0
05 Jul 2023
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning
Outongyi Lv
Bingxin Zhou
OffRL
44
0
0
05 Jul 2023
Coherent Soft Imitation Learning
Joe Watson
Sandy H. Huang
Nicholas Heess
32
11
0
25 May 2023
Offline Primal-Dual Reinforcement Learning for Linear MDPs
Germano Gabbianelli
Gergely Neu
Nneka Okolo
Matteo Papini
OffRL
29
7
0
22 May 2023
Efficient Global Planning in Large MDPs via Stochastic Primal-Dual Optimization
Gergely Neu
Nneka Okolo
32
6
0
21 Oct 2022
Sufficient Exploration for Convex Q-learning
F. Lu
P. Mehta
Sean P. Meyn
Gergely Neu
OffRL
16
1
0
17 Oct 2022
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
D. Tiapkin
Denis Belomestny
Eric Moulines
A. Naumov
S. Samsonov
Yunhao Tang
Michal Valko
Pierre Menard
24
16
0
16 May 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
D. Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Efficient Performance Bounds for Primal-Dual Reinforcement Learning from Demonstrations
Angeliki Kamoutsi
G. Banjac
John Lygeros
OffRL
26
7
0
28 Dec 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
21
80
0
01 Sep 2021
Near Optimal Policy Optimization via REPS
Aldo Pacchiano
Jonathan Lee
Peter L. Bartlett
Ofir Nachum
18
3
0
17 Mar 2021