Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.05240
Cited By
ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning
10 June 2022
Haozhe Jasper Wang
Chao Du
Panyan Fang
Shuo Yuan
Xu-Jiang He
Liang Wang
Bo Zheng
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning"
13 / 13 papers shown
Title
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
Haozhe Wang
Chao Qu
Zuming Huang
Wei Chu
Fangzhen Lin
Wenhu Chen
OffRL
ReLM
SyDa
LRM
VLM
127
26
0
10 Apr 2025
Auto-bidding in real-time auctions via Oracle Imitation Learning (OIL)
Alberto Silvio Chiappa
Briti Gangopadhyay
Zhao Wang
Shingo Takamatsu
125
1
0
16 Dec 2024
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
L. Zintgraf
K. Shiarlis
Maximilian Igl
Sebastian Schulze
Y. Gal
Katja Hofmann
Shimon Whiteson
OffRL
55
277
0
18 Oct 2019
Lyapunov-based Safe Policy Optimization for Continuous Control
Yinlam Chow
Ofir Nachum
Aleksandra Faust
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
70
246
0
28 Jan 2019
Deep Variational Reinforcement Learning for POMDPs
Maximilian Igl
L. Zintgraf
T. Le
Frank Wood
Shimon Whiteson
BDL
OffRL
58
261
0
06 Jun 2018
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
83
540
0
28 May 2018
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
110
1,322
0
30 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
106
2,437
0
15 May 2017
Real-Time Bidding by Reinforcement Learning in Display Advertising
Han Cai
Kan Ren
Weinan Zhang
Kleanthis Malialis
Jun Wang
Yong Yu
Defeng Guo
61
246
0
10 Jan 2017
Variational Inference: A Review for Statisticians
David M. Blei
A. Kucukelbir
Jon D. McAuliffe
BDL
269
4,787
0
04 Jan 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
318
13,237
0
09 Sep 2015
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
452
16,933
0
20 Dec 2013
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
125
12,227
0
19 Dec 2013
1