ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.10658
  4. Cited By
Learning to Coordinate Multiple Reinforcement Learning Agents for
  Diverse Query Reformulation

Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation

27 September 2018
Rodrigo Nogueira
Jannis Bulian
Massimiliano Ciaramita
ArXivPDFHTML

Papers citing "Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation"

31 / 31 papers shown
Title
Large scale distributed neural network training through online
  distillation
Large scale distributed neural network training through online distillation
Rohan Anil
Gabriel Pereyra
Alexandre Passos
Róbert Ormándi
George E. Dahl
Geoffrey E. Hinton
FedML
317
407
0
09 Apr 2018
Analyzing Language Learned by an Active Question Answering Agent
Analyzing Language Learned by an Active Question Answering Agent
Christian Buck
Jannis Bulian
Massimiliano Ciaramita
Wojciech Gajewski
Andrea Gesmundo
N. Houlsby
Wei Wang
LLMAG
28
5
0
23 Jan 2018
Improving Exploration in Evolution Strategies for Deep Reinforcement
  Learning via a Population of Novelty-Seeking Agents
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents
Edoardo Conti
Vashisht Madhavan
F. Such
Joel Lehman
Kenneth O. Stanley
Jeff Clune
61
347
0
18 Dec 2017
Evidence Aggregation for Answer Re-Ranking in Open-Domain Question
  Answering
Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering
Shuohang Wang
Mo Yu
Jing Jiang
Wei Zhang
Xiaoxiao Guo
Shiyu Chang
Zhiguo Wang
Tim Klinger
Gerald Tesauro
Murray Campbell
RALM
69
161
0
14 Nov 2017
A Deep Reinforcement Learning Chatbot
A Deep Reinforcement Learning Chatbot
Iulian Serban
Chinnadhurai Sankar
M. Germain
Saizheng Zhang
Zhouhan Lin
...
Dendi Suhubdy
Vincent Michalski
A. Nguyen
Joelle Pineau
Yoshua Bengio
75
235
0
07 Sep 2017
R$^3$: Reinforced Reader-Ranker for Open-Domain Question Answering
R3^33: Reinforced Reader-Ranker for Open-Domain Question Answering
Shuohang Wang
Mo Yu
Xiaoxiao Guo
Zhiguo Wang
Tim Klinger
Wei Zhang
Shiyu Chang
Gerald Tesauro
Bowen Zhou
Jing Jiang
RALM
45
64
0
31 Aug 2017
Ask the Right Questions: Active Question Reformulation with
  Reinforcement Learning
Ask the Right Questions: Active Question Reformulation with Reinforcement Learning
Christian Buck
Jannis Bulian
Massimiliano Ciaramita
Wojciech Gajewski
Andrea Gesmundo
N. Houlsby
Wei Wang
59
167
0
22 May 2017
A Deep Reinforced Model for Abstractive Summarization
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
189
1,556
0
11 May 2017
Reinforced Mnemonic Reader for Machine Reading Comprehension
Reinforced Mnemonic Reader for Machine Reading Comprehension
Minghao Hu
Yuxing Peng
Zhen Huang
Xipeng Qiu
Furu Wei
Ming Zhou
RALM
AIMat
49
69
0
08 May 2017
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
Matthew Dunn
Levent Sagun
Mike Higgins
V. U. Güney
Volkan Cirik
Kyunghyun Cho
RALM
84
455
0
18 Apr 2017
PACRR: A Position-Aware Neural IR Model for Relevance Matching
PACRR: A Position-Aware Neural IR Model for Relevance Matching
Kai Hui
Andrew Yates
K. Berberich
Gerard de Melo
43
155
0
12 Apr 2017
Outrageously Large Neural Networks: The Sparsely-Gated
  Mixture-of-Experts Layer
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
235
2,635
0
23 Jan 2017
A Simple, Fast Diverse Decoding Algorithm for Neural Generation
A Simple, Fast Diverse Decoding Algorithm for Neural Generation
Jiwei Li
Will Monroe
Dan Jurafsky
67
240
0
25 Nov 2016
Bidirectional Attention Flow for Machine Comprehension
Bidirectional Attention Flow for Machine Comprehension
Minjoon Seo
Aniruddha Kembhavi
Ali Farhadi
Hannaneh Hajishirzi
131
2,089
0
05 Nov 2016
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence
  Models
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models
Ashwin K. Vijayakumar
Michael Cogswell
Ramprasaath R. Selvaraju
Q. Sun
Stefan Lee
David J. Crandall
Dhruv Batra
89
554
0
07 Oct 2016
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
880
6,781
0
26 Sep 2016
An Actor-Critic Algorithm for Sequence Prediction
An Actor-Critic Algorithm for Sequence Prediction
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
126
639
0
24 Jul 2016
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
204
5,073
0
05 Jun 2016
Deep Exploration via Bootstrapped DQN
Deep Exploration via Bootstrapped DQN
Ian Osband
Charles Blundell
Alexander Pritzel
Benjamin Van Roy
110
1,305
0
15 Feb 2016
End-to-End Goal-Driven Web Navigation
End-to-End Goal-Driven Web Navigation
Rodrigo Nogueira
Kyunghyun Cho
LLMAG
59
35
0
06 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
189
8,833
0
04 Feb 2016
Sequence Level Training with Recurrent Neural Networks
Sequence Level Training with Recurrent Neural Networks
MarcÁurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
98
1,614
0
20 Nov 2015
Policy Distillation
Policy Distillation
Andrei A. Rusu
Sergio Gomez Colmenarejo
Çağlar Gülçehre
Guillaume Desjardins
J. Kirkpatrick
Razvan Pascanu
Volodymyr Mnih
Koray Kavukcuoglu
R. Hadsell
79
690
0
19 Nov 2015
Distilling the Knowledge in a Neural Network
Distilling the Knowledge in a Neural Network
Geoffrey E. Hinton
Oriol Vinyals
J. Dean
FedML
320
19,609
0
09 Mar 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.6K
149,842
0
22 Dec 2014
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
387
20,528
0
10 Sep 2014
Convolutional Neural Networks for Sentence Classification
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
607
13,416
0
25 Aug 2014
Learning Phrase Representations using RNN Encoder-Decoder for
  Statistical Machine Translation
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
934
23,310
0
03 Jun 2014
Efficient Estimation of Word Representations in Vector Space
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
635
31,469
0
16 Jan 2013
The Arcade Learning Environment: An Evaluation Platform for General
  Agents
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
109
3,002
0
19 Jul 2012
The Divergence of Reinforcement Learning Algorithms with Value-Iteration
  and Function Approximation
The Divergence of Reinforcement Learning Algorithms with Value-Iteration and Function Approximation
Michael Fairbank
Eduardo Alonso
56
33
0
22 Jul 2011
1