ResearchTrend.AI
A Mixture-of-Expert Approach to RL-based Dialogue Management

31 May 2022
Yinlam Chow
Azamat Tulepbergenov
Ofir Nachum
Moonkyung Ryu
Mohammad Ghavamzadeh
Craig Boutilier
    MoE

Papers citing "A Mixture-of-Expert Approach to RL-based Dialogue Management"

17 / 17 papers shown
Statistical Advantages of Perturbing Cosine Router in Mixture of Experts
Huy Le Nguyen
Pedram Akbarian
Trang Pham
Trang Nguyen
Shujian Zhang
Nhat Ho
MoE
51
2
0
23 May 2024
Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts
Huy Nguyen
Nhat Ho
Alessandro Rinaldo
55
3
0
22 May 2024
On Least Square Estimation in Softmax Gating Mixture of Experts
Huy Nguyen
Nhat Ho
Alessandro Rinaldo
51
13
0
05 Feb 2024
CompeteSMoE -- Effective Training of Sparse Mixture of Experts via Competition
Quang-Cuong Pham
Giang Do
Huy Nguyen
TrungTin Nguyen
Chenghao Liu
...
Binh T. Nguyen
Savitha Ramasamy
Xiaoli Li
Steven C. H. Hoi
Nhat Ho
25
17
0
04 Feb 2024
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape
Timothy R. McIntosh
Teo Susnjak
Tong Liu
Paul Watters
Malka N. Halgamuge
94
46
0
18 Dec 2023
A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts
Huy Nguyen
Pedram Akbarian
TrungTin Nguyen
Nhat Ho
32
10
0
22 Oct 2023
Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts
Huy Nguyen
Pedram Akbarian
Fanqi Yan
Nhat Ho
MoE
41
16
0
25 Sep 2023
AI Text-to-Behavior: A Study In Steerability
David A. Noever
Samuel Hyams
LLMSV
60
8
0
07 Aug 2023
Leveraging Large Language Models in Conversational Recommender Systems
Luke Friedman
Sameer Ahuja
David Allen
Zhenning Tan
Hakim Sidahmed
...
Ajay Patel
Harsh Lara
Brian Chu
Zexiang Chen
Manoj Kumar Tiwari
32
102
0
13 May 2023
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management
Dhawal Gupta
Yinlam Chow
Aza Tulepbergenov
Mohammad Ghavamzadeh
Craig Boutilier
OffRL
19
3
0
21 Feb 2023
Trust in Language Grounding: a new AI challenge for human-robot teams
David M. Bossens
C. Evers
36
1
0
05 Sep 2022
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Wenze Chen
Shiyu Huang
Yuan Chiang
Tim Pearce
Wei-Wei Tu
Tingling Chen
Jun Zhu
23
5
0
12 Jul 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
843
0
12 Oct 2021
How BPE Affects Memorization in Transformers
Eugene Kharitonov
Marco Baroni
Dieuwke Hupkes
163
32
0
06 Oct 2021
Predictive Coding for Locally-Linear Control
Rui Shu
Tung D. Nguyen
Yinlam Chow
Tu Pham
Khoat Than
Mohammad Ghavamzadeh
Stefano Ermon
Hung Bui
OffRL
BDL
34
24
0
02 Mar 2020
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
292
1,610
0
18 Sep 2019
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
214
1,327
0
05 Jun 2016