2210.07006
Sustainable Online Reinforcement Learning for Auto-bidding
13 October 2022
Zhiyu Mou
Yusen Huo
Rongquan Bai
Mingzhou Xie
Chuan Yu
Jian Xu
Bo Zheng
OffRL
OnRL
Papers citing "Sustainable Online Reinforcement Learning for Auto-bidding"
31 / 31 papers shown
Title
Generative Auto-Bidding with Value-Guided Explorations
Jingtong Gao
Yewen Li
Shuai Mao
Peng Jiang
Nan Jiang
...
Fei Pan
Peng Jiang
Kun Gai
Bo An
Xiangyu Zhao
OffRL
110
0
0
20 Apr 2025
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
181
86
0
22 Sep 2021
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble
Seunghyun Lee
Younggyo Seo
Kimin Lee
Pieter Abbeel
Jinwoo Shin
OffRL
OnRL
45
187
0
01 Jul 2021
Offline RL Without Off-Policy Evaluation
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
OffRL
65
167
0
16 Jun 2021
Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search
Ziyu Guan
Hongchang Wu
Qingyu Cao
Hao Liu
Wei Zhao
Sheng Li
Cai Xu
Guang Qiu
Jian Xu
Bo Zheng
32
16
0
08 Jun 2021
Neural Auction: End-to-End Learning of Auction Mechanisms for E-Commerce Advertising
Xiangyu Liu
Chuan Yu
Zhilin Zhang
Zhenzhe Zheng
Yu Rong
...
Dagui Chen
Jian Xu
Fan Wu
Guihai Chen
Xiaoqiang Zhu
32
64
0
07 Jun 2021
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
Xianyuan Zhan
Haoran Xu
Yueying Zhang
Xiangyu Zhu
Honglei Yin
Yu Zheng
OffRL
AI4CE
80
68
0
23 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
177
81
0
01 Feb 2021
Inverse Constrained Reinforcement Learning
Usman Anwar
Shehryar Malik
Alireza Aghasi
Ali Ahmed
51
59
0
19 Nov 2020
Conservative Safety Critics for Exploration
Homanga Bharadhwaj
Aviral Kumar
Nicholas Rhinehart
Sergey Levine
Florian Shkurti
Animesh Garg
OffRL
74
139
0
27 Oct 2020
COLD: Towards the Next Generation of Pre-Ranking System
Zhe Wang
Liqin Zhao
Biye Jiang
Guorui Zhou
Xiaoqiang Zhu
Kun Gai
24
55
0
31 Jul 2020
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Xiaotian Hao
Zhaoqing Peng
Yi-An Ma
Guanlong Wang
Junqi Jin
...
Zhenzhe Zheng
Chuan Yu
Han Li
Jian Xu
Kun Gai
27
26
0
29 Jun 2020
Critic Regularized Regression
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
...
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
OffRL
128
320
0
26 Jun 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
77
601
0
16 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
104
1,780
0
08 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
477
1,994
0
04 May 2020
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Noah Y. Siegel
Jost Tobias Springenberg
Felix Berkenkamp
A. Abdolmaleki
Michael Neunert
Thomas Lampe
Roland Hafner
Nicolas Heess
Martin Riedmiller
OffRL
54
283
0
19 Feb 2020
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
43
154
0
15 Nov 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
113
548
0
01 Oct 2019
Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning
D. Scobee
S. Shankar Sastry
18
64
0
12 Sep 2019
Sim2real transfer learning for 3D human pose estimation: motion to the rescue
Carl Doersch
Andrew Zisserman
3DH
31
155
0
04 Jul 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
79
1,044
0
03 Jun 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
157
1,586
0
07 Dec 2018
Learning-based Model Predictive Control for Safe Exploration
Torsten Koller
Felix Berkenkamp
M. Turchetta
Andreas Krause
45
376
0
22 Mar 2018
Budget Constrained Bidding by Model-free Reinforcement Learning in Display Advertising
Di Wu
Xiujun Chen
Xun Yang
Hao Wang
Qing Tan
Xiaoxun Zhang
Jian Xu
Kun Gai
27
127
0
23 Feb 2018
Safe Exploration in Continuous Action Spaces
Gal Dalal
Krishnamurthy Dvijotham
Matej Vecerík
Todd Hester
Cosmin Paduraru
Yuval Tassa
44
438
0
26 Jan 2018
Safe Policy Improvement with Baseline Bootstrapping
Romain Laroche
P. Trichelair
Rémi Tachet des Combes
OffRL
48
198
0
19 Dec 2017
Safe Model-based Reinforcement Learning with Stability Guarantees
Felix Berkenkamp
M. Turchetta
Angela P. Schoellig
Andreas Krause
129
845
0
23 May 2017
Real-Time Bidding by Reinforcement Learning in Display Advertising
Han Cai
Kan Ren
Weinan Zhang
Kleanthis Malialis
Jun Wang
Yong Yu
Defeng Guo
33
246
0
10 Jan 2017
Safe Exploration in Finite Markov Decision Processes with Gaussian Processes
M. Turchetta
Felix Berkenkamp
Andreas Krause
61
186
0
15 Jun 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
210
13,174
0
09 Sep 2015