ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.05240
  4. Cited By
ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement
  Learning

ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning

10 June 2022
Haozhe Jasper Wang
Chao Du
Panyan Fang
Shuo Yuan
Xu-Jiang He
Liang Wang
Bo Zheng
ArXivPDFHTML

Papers citing "ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning"

13 / 13 papers shown
Title
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning
Haozhe Wang
Chao Qu
Zuming Huang
Wei Chu
Fangzhen Lin
Wenhu Chen
OffRL
ReLM
SyDa
LRM
VLM
127
26
0
10 Apr 2025
Auto-bidding in real-time auctions via Oracle Imitation Learning (OIL)
Auto-bidding in real-time auctions via Oracle Imitation Learning (OIL)
Alberto Silvio Chiappa
Briti Gangopadhyay
Zhao Wang
Shingo Takamatsu
125
1
0
16 Dec 2024
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
L. Zintgraf
K. Shiarlis
Maximilian Igl
Sebastian Schulze
Y. Gal
Katja Hofmann
Shimon Whiteson
OffRL
55
277
0
18 Oct 2019
Lyapunov-based Safe Policy Optimization for Continuous Control
Lyapunov-based Safe Policy Optimization for Continuous Control
Yinlam Chow
Ofir Nachum
Aleksandra Faust
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
70
246
0
28 Jan 2019
Deep Variational Reinforcement Learning for POMDPs
Deep Variational Reinforcement Learning for POMDPs
Maximilian Igl
L. Zintgraf
T. Le
Frank Wood
Shimon Whiteson
BDL
OffRL
58
261
0
06 Jun 2018
Reward Constrained Policy Optimization
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
83
540
0
28 May 2018
Constrained Policy Optimization
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
110
1,322
0
30 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
106
2,437
0
15 May 2017
Real-Time Bidding by Reinforcement Learning in Display Advertising
Real-Time Bidding by Reinforcement Learning in Display Advertising
Han Cai
Kan Ren
Weinan Zhang
Kleanthis Malialis
Jun Wang
Yong Yu
Defeng Guo
61
246
0
10 Jan 2017
Variational Inference: A Review for Statisticians
Variational Inference: A Review for Statisticians
David M. Blei
A. Kucukelbir
Jon D. McAuliffe
BDL
269
4,787
0
04 Jan 2016
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
318
13,237
0
09 Sep 2015
Auto-Encoding Variational Bayes
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
452
16,933
0
20 Dec 2013
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
125
12,227
0
19 Dec 2013
1