ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.04956
  4. Cited By
Classical Structured Prediction Losses for Sequence to Sequence Learning

Classical Structured Prediction Losses for Sequence to Sequence Learning

14 November 2017
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
    AIMat
ArXivPDFHTML

Papers citing "Classical Structured Prediction Losses for Sequence to Sequence Learning"

41 / 41 papers shown
Title
Context Consistency between Training and Testing in Simultaneous Machine
  Translation
Context Consistency between Training and Testing in Simultaneous Machine Translation
M. Zhong
Lemao Liu
Kehai Chen
Mingming Yang
Min Zhang
LRM
27
0
0
13 Nov 2023
Tuna: Instruction Tuning using Feedback from Large Language Models
Tuna: Instruction Tuning using Feedback from Large Language Models
Haoran Li
Yiran Liu
Xingxing Zhang
Wei Lu
Furu Wei
ALM
30
3
0
20 Oct 2023
Large-Scale and Multi-Perspective Opinion Summarization with Diverse
  Review Subsets
Large-Scale and Multi-Perspective Opinion Summarization with Diverse Review Subsets
Han Jiang
Rui Wang
Zhihua Wei
Yu Li
Xinpeng Wang
37
4
0
20 Oct 2023
Improving Large Language Model Fine-tuning for Solving Math Problems
Improving Large Language Model Fine-tuning for Solving Math Problems
Yixin Liu
Avi Singh
C. D. Freeman
John D. Co-Reyes
Peter J. Liu
LRM
ReLM
35
45
0
16 Oct 2023
Multi-Granularity Optimization for Non-Autoregressive Translation
Multi-Granularity Optimization for Non-Autoregressive Translation
Yafu Li
Leyang Cui
Yongjing Yin
Yue Zhang
27
7
0
20 Oct 2022
GROOT: Corrective Reward Optimization for Generative Sequential Labeling
GROOT: Corrective Reward Optimization for Generative Sequential Labeling
Kazuma Hashimoto
K. Raman
VLM
11
1
0
29 Sep 2022
Are Neighbors Enough? Multi-Head Neural n-gram can be Alternative to
  Self-attention
Are Neighbors Enough? Multi-Head Neural n-gram can be Alternative to Self-attention
Mengsay Loem
Sho Takase
Masahiro Kaneko
Naoaki Okazaki
11
1
0
27 Jul 2022
Quark: Controllable Text Generation with Reinforced Unlearning
Quark: Controllable Text Generation with Reinforced Unlearning
Ximing Lu
Sean Welleck
Jack Hessel
Liwei Jiang
Lianhui Qin
Peter West
Prithviraj Ammanabrolu
Yejin Choi
MU
63
206
0
26 May 2022
RankGen: Improving Text Generation with Large Ranking Models
RankGen: Improving Text Generation with Large Ranking Models
Kalpesh Krishna
Yapei Chang
John Wieting
Mohit Iyyer
AIMat
24
68
0
19 May 2022
Quality-Aware Decoding for Neural Machine Translation
Quality-Aware Decoding for Neural Machine Translation
Patrick Fernandes
António Farinhas
Ricardo Rei
José G. C. de Souza
Perez Ogayo
Graham Neubig
André F. T. Martins
33
57
0
02 May 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
26
6
0
11 Apr 2022
High Quality Rather than High Model Probability: Minimum Bayes Risk
  Decoding with Neural Metrics
High Quality Rather than High Model Probability: Minimum Bayes Risk Decoding with Neural Metrics
Markus Freitag
David Grangier
Qijun Tan
Bowen Liang
22
92
0
17 Nov 2021
What's Hidden in a One-layer Randomly Weighted Transformer?
What's Hidden in a One-layer Randomly Weighted Transformer?
Sheng Shen
Z. Yao
Douwe Kiela
Kurt Keutzer
Michael W. Mahoney
24
4
0
08 Sep 2021
Survey of Low-Resource Machine Translation
Survey of Low-Resource Machine Translation
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindvrich Helcl
Alexandra Birch
AIMat
31
147
0
01 Sep 2021
Controlled Text Generation as Continuous Optimization with Multiple
  Constraints
Controlled Text Generation as Continuous Optimization with Multiple Constraints
Sachin Kumar
Eric Malmi
Aliaksei Severyn
Yulia Tsvetkov
BDL
AI4CE
37
76
0
04 Aug 2021
Modelling Latent Translations for Cross-Lingual Transfer
Modelling Latent Translations for Cross-Lingual Transfer
E. Ponti
Julia Kreutzer
Ivan Vulić
Siva Reddy
26
18
0
23 Jul 2021
Revisiting the Weaknesses of Reinforcement Learning for Neural Machine
  Translation
Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation
Samuel Kiegeland
Julia Kreutzer
AAML
31
46
0
16 Jun 2021
Reward Optimization for Neural Machine Translation with Learned Metrics
Reward Optimization for Neural Machine Translation with Learned Metrics
Raphael Shu
Kang Min Yoo
Jung-Woo Ha
29
12
0
15 Apr 2021
Domain Adaptation and Multi-Domain Adaptation for Neural Machine
  Translation: A Survey
Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
Danielle Saunders
AI4CE
17
85
0
14 Apr 2021
Neural Machine Translation: A Review of Methods, Resources, and Tools
Neural Machine Translation: A Review of Methods, Resources, and Tools
Zhixing Tan
Shuo Wang
Zonghan Yang
Gang Chen
Xuancheng Huang
Maosong Sun
Yang Liu
3DV
AI4TS
15
105
0
31 Dec 2020
Reservoir Transformers
Reservoir Transformers
Sheng Shen
Alexei Baevski
Ari S. Morcos
Kurt Keutzer
Michael Auli
Douwe Kiela
32
17
0
30 Dec 2020
Efficient Wait-k Models for Simultaneous Machine Translation
Efficient Wait-k Models for Simultaneous Machine Translation
Maha Elbayad
Laurent Besacier
Jakob Verbeek
VLM
24
77
0
18 May 2020
On Exposure Bias, Hallucination and Domain Shift in Neural Machine
  Translation
On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation
Chaojun Wang
Rico Sennrich
6
155
0
07 May 2020
Residual Energy-Based Models for Text Generation
Residual Energy-Based Models for Text Generation
Yuntian Deng
A. Bakhtin
Myle Ott
Arthur Szlam
MarcÁurelio Ranzato
20
125
0
22 Apr 2020
On Feature Normalization and Data Augmentation
On Feature Normalization and Data Augmentation
Boyi Li
Felix Wu
Ser-Nam Lim
Serge J. Belongie
Kilian Q. Weinberger
21
134
0
25 Feb 2020
Sequential Latent Knowledge Selection for Knowledge-Grounded Dialogue
Sequential Latent Knowledge Selection for Knowledge-Grounded Dialogue
Byeongchang Kim
Jaewoo Ahn
Gunhee Kim
BDL
33
167
0
18 Feb 2020
Estimating Gradients for Discrete Random Variables by Sampling without
  Replacement
Estimating Gradients for Discrete Random Variables by Sampling without Replacement
W. Kool
H. V. Hoof
Max Welling
BDL
23
49
0
14 Feb 2020
LAVA NAT: A Non-Autoregressive Translation Model with Look-Around
  Decoding and Vocabulary Attention
LAVA NAT: A Non-Autoregressive Translation Model with Look-Around Decoding and Vocabulary Attention
Xiaoya Li
Yuxian Meng
Arianna Yuan
Fei Wu
Jiwei Li
32
12
0
08 Feb 2020
Explicit Sparse Transformer: Concentrated Attention Through Explicit
  Selection
Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection
Guangxiang Zhao
Junyang Lin
Zhiyuan Zhang
Xuancheng Ren
Qi Su
Xu Sun
14
108
0
25 Dec 2019
Depth-Adaptive Transformer
Depth-Adaptive Transformer
Maha Elbayad
Jiatao Gu
Edouard Grave
Michael Auli
19
188
0
22 Oct 2019
Hint-Based Training for Non-Autoregressive Machine Translation
Hint-Based Training for Non-Autoregressive Machine Translation
Zhuohan Li
Zi Lin
Di He
Fei Tian
Tao Qin
Liwei Wang
Tie-Yan Liu
23
72
0
15 Sep 2019
Sparse Sequence-to-Sequence Models
Sparse Sequence-to-Sequence Models
Ben Peters
Vlad Niculae
André F. T. Martins
TPM
19
209
0
14 May 2019
Benchmarking Approximate Inference Methods for Neural Structured
  Prediction
Benchmarking Approximate Inference Methods for Neural Structured Prediction
Lifu Tu
Kevin Gimpel
BDL
33
17
0
01 Apr 2019
Abstractive Summarization of Reddit Posts with Multi-level Memory
  Networks
Abstractive Summarization of Reddit Posts with Multi-level Memory Networks
Byeongchang Kim
Hyunwoo J. Kim
Gunhee Kim
12
181
0
02 Nov 2018
Sequence to Sequence Mixture Model for Diverse Machine Translation
Sequence to Sequence Mixture Model for Diverse Machine Translation
Xuanli He
Gholamreza Haffari
Mohammad Norouzi
12
57
0
17 Oct 2018
A Study of Reinforcement Learning for Neural Machine Translation
A Study of Reinforcement Learning for Neural Machine Translation
Lijun Wu
Fei Tian
Tao Qin
Jianhuang Lai
Tie-Yan Liu
OffRL
27
182
0
27 Aug 2018
Policy Gradient as a Proxy for Dynamic Oracles in Constituency Parsing
Policy Gradient as a Proxy for Dynamic Oracles in Constituency Parsing
Daniel Fried
Dan Klein
26
27
0
08 Jun 2018
RETURNN as a Generic Flexible Neural Toolkit with Application to
  Translation and Speech Recognition
RETURNN as a Generic Flexible Neural Toolkit with Application to Translation and Speech Recognition
Albert Zeyer
Tamer Alkhouli
Hermann Ney
29
90
0
14 May 2018
Analyzing Uncertainty in Neural Machine Translation
Analyzing Uncertainty in Neural Machine Translation
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
UQLM
29
270
0
28 Feb 2018
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015
1