Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.05958
Cited By
Can Neural Machine Translation be Improved with User Feedback?
16 April 2018
Julia Kreutzer
Shahram Khadivi
E. Matusov
Stefan Riezler
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Can Neural Machine Translation be Improved with User Feedback?"
28 / 28 papers shown
Title
2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization
Mengyang Li
Zhong Zhang
64
1
0
10 Apr 2025
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
Miguel Moura Ramos
Tomás Almeida
Daniel Vareta
Filipe Azevedo
Sweta Agrawal
Patrick Fernandes
André F. T. Martins
97
4
0
08 Nov 2024
Learning from Chunk-based Feedback in Neural Machine Translation
Pavel Petrushkov
Shahram Khadivi
E. Matusov
42
19
0
19 Jun 2018
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning
Julia Kreutzer
Joshua Uyheng
Stefan Riezler
62
86
0
27 May 2018
A Reinforcement Learning Approach to Interactive-Predictive Neural Machine Translation
Tsz Kin Lam
Julia Kreutzer
Stefan Riezler
70
32
0
03 May 2018
Improving a Neural Semantic Parser by Counterfactual Learning from Human Bandit Feedback
Carolin (Haas) Lawrence
Stefan Riezler
OffRL
218
57
0
03 May 2018
Counterfactual Learning for Machine Translation: Degeneracies and Solutions
Carolin (Haas) Lawrence
Pratik Gajane
Stefan Riezler
OffRL
CML
53
8
0
23 Nov 2017
Counterfactual Learning from Bandit Feedback under Deterministic Logging: A Case Study in Statistical Machine Translation
Carolin (Haas) Lawrence
Artem Sokolov
Stefan Riezler
OffRL
65
34
0
28 Jul 2017
A Shared Task on Bandit Learning for Machine Translation
Artem Sokolov
Julia Kreutzer
Kellen Sunderland
Pavel Danchenko
Witold Szymaniak
Hagen Fürstenau
Stefan Riezler
64
16
0
27 Jul 2017
Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback
Khanh Nguyen
Hal Daumé
Jordan L. Boyd-Graber
65
138
0
24 Jul 2017
Do Convolutional Networks need to be Deep for Text Classification ?
Hoa T. Le
Christophe Cerisara
Alexandre Denis
VLM
54
105
0
13 Jul 2017
Bandit Structured Prediction for Neural Sequence-to-Sequence Learning
Julia Kreutzer
Artem Sokolov
Stefan Riezler
64
49
0
21 Apr 2017
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
897
6,790
0
26 Sep 2016
An Actor-Critic Algorithm for Sequence Prediction
Dzmitry Bahdanau
Philemon Brakel
Kelvin Xu
Anirudh Goyal
Ryan J. Lowe
Joelle Pineau
Aaron Courville
Yoshua Bengio
130
639
0
24 Jul 2016
Edinburgh Neural Machine Translation Systems for WMT 16
Rico Sennrich
Barry Haddow
Alexandra Birch
67
524
0
09 Jun 2016
Stochastic Structured Prediction under Bandit Feedback
Artem Sokolov
Julia Kreutzer
Christopher Lo
Stefan Riezler
56
31
0
02 Jun 2016
Minimum Risk Training for Neural Machine Translation
Shiqi Shen
Yong Cheng
Zhongjun He
W. He
Hua Wu
Maosong Sun
Yang Liu
114
469
0
08 Dec 2015
Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich
Barry Haddow
Alexandra Birch
248
2,722
0
20 Nov 2015
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
202
623
0
11 Nov 2015
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
221
7,745
0
31 Aug 2015
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
821
9,318
0
06 Jun 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.8K
150,115
0
22 Dec 2014
On Using Very Large Target Vocabulary for Neural Machine Translation
Sébastien Jean
Kyunghyun Cho
Roland Memisevic
Yoshua Bengio
155
1,011
0
05 Dec 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
558
27,311
0
01 Sep 2014
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
626
13,427
0
25 Aug 2014
Counterfactual Reasoning and Learning Systems
Léon Bottou
J. Peters
J. Q. Candela
Denis Xavier Charles
D. M. Chickering
Elon Portugaly
Dipankar Ray
Patrice Y. Simard
Edward Snelson
CML
OffRL
385
783
0
11 Sep 2012
Doubly Robust Policy Evaluation and Learning
Miroslav Dudík
John Langford
Lihong Li
OffRL
339
697
0
23 Mar 2011
Learning from Logged Implicit Exploration Data
Alexander L. Strehl
John Langford
Sham Kakade
Lihong Li
OffRL
181
255
0
27 Feb 2010
1