Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation

20 March 2024
Do June Min
Verónica Pérez-Rosas
Kenneth Resnicow
Rada Mihalcea
    OffRL

Papers citing "Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation"

13 / 13 papers shown
EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning
Lingxiao Kong
Cong Yang
Susanne Neufang
Oya Beyan
Zeyd Boukhers
OffRL
57
0
0
05 May 2025
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
82
246
0
03 Oct 2022
Why is constrained neural language generation particularly challenging?
Cristina Garbacea
Qiaozhu Mei
96
15
0
11 Jun 2022
Quark: Controllable Text Generation with Reinforced Unlearning
Ximing Lu
Sean Welleck
Jack Hessel
Liwei Jiang
Lianhui Qin
Peter West
Prithviraj Ammanabrolu
Yejin Choi
MU
99
216
0
26 May 2022
Automated Quality Assessment of Cognitive Behavioral Therapy Sessions Through Highly Contextualized Language Representations
Nikolaos Flemotomos
Víctor R. Martínez
Zhuohao Chen
Torrey A. Creed
David C. Atkins
Shrikanth Narayanan
41
31
0
23 Feb 2021
DORB: Dynamically Optimizing Multiple Rewards with Bandits
Ramakanth Pasunuru
Han Guo
Joey Tianyi Zhou
OffRL
44
7
0
15 Nov 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
385
20,114
0
23 Oct 2019
Multi-Reward Reinforced Summarization with Saliency and Entailment
Ramakanth Pasunuru
Joey Tianyi Zhou
52
201
0
17 Apr 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
446
18,931
0
20 Jul 2017
Automated Curriculum Learning for Neural Networks
Alex Graves
Marc G. Bellemare
Jacob Menick
Rémi Munos
Koray Kavukcuoglu
72
523
0
10 Apr 2017
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
270
1,331
0
05 Jun 2016
A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit
Giuseppe Burtini
Jason L. Loeppky
Ramon Lawrence
58
119
0
02 Oct 2015
Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System
Michael Kearns
Diane Litman
Satinder Singh
M. Walker
OffRL
78
426
0
03 Jun 2011