ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.09943
  4. Cited By
Revisiting Character-Based Neural Machine Translation with Capacity and
  Compression

Revisiting Character-Based Neural Machine Translation with Capacity and Compression

29 August 2018
Colin Cherry
George F. Foster
Ankur Bapna
Orhan Firat
Wolfgang Macherey
ArXivPDFHTML

Papers citing "Revisiting Character-Based Neural Machine Translation with Capacity and Compression"

25 / 25 papers shown
Title
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
71
3
0
28 Oct 2024
Smirk: An Atomically Complete Tokenizer for Molecular Foundation Models
Smirk: An Atomically Complete Tokenizer for Molecular Foundation Models
Alexius Wadell
Anoushka Bhutani
Venkatasubramanian Viswanathan
361
0
0
19 Sep 2024
Controlling the Output Length of Neural Machine Translation
Controlling the Output Length of Neural Machine Translation
Surafel Melaku Lakew
Mattia Antonino Di Gangi
Marcello Federico
80
67
0
23 Oct 2019
Revisiting the Hierarchical Multiscale LSTM
Revisiting the Hierarchical Multiscale LSTM
Ákos Kádár
Marc-Alexandre Côté
Grzegorz Chrupała
Afra Alishahi
37
13
0
10 Jul 2018
Focused Hierarchical RNNs for Conditional Sequence Processing
Focused Hierarchical RNNs for Conditional Sequence Processing
Nan Rosemary Ke
Konrad Zolna
Alessandro Sordoni
Zhouhan Lin
Adam Trischler
Yoshua Bengio
Joelle Pineau
Laurent Charlin
C. Pal
AIMat
38
25
0
12 Jun 2018
Subword Regularization: Improving Neural Network Translation Models with
  Multiple Subword Candidates
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
Taku Kudo
165
1,153
0
29 Apr 2018
The Best of Both Worlds: Combining Recent Advances in Neural Machine
  Translation
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation
Mengzhao Chen
Orhan Firat
Ankur Bapna
Melvin Johnson
Wolfgang Macherey
...
Niki Parmar
M. Schuster
Zhifeng Chen
Yonghui Wu
Macduff Hughes
AIMat
52
457
0
26 Apr 2018
A Call for Clarity in Reporting BLEU Scores
A Call for Clarity in Reporting BLEU Scores
Matt Post
116
2,941
0
23 Apr 2018
Fast Decoding in Sequence Models using Discrete Latent Variables
Fast Decoding in Sequence Models using Discrete Latent Variables
Łukasz Kaiser
Aurko Roy
Ashish Vaswani
Niki Parmar
Samy Bengio
Jakob Uszkoreit
Noam M. Shazeer
44
231
0
09 Mar 2018
Plan, Attend, Generate: Character-level Neural Machine Translation with
  Planning in the Decoder
Plan, Attend, Generate: Character-level Neural Machine Translation with Planning in the Decoder
Çağlar Gülçehre
Francis Dutil
Adam Trischler
Yoshua Bengio
41
7
0
13 Jun 2017
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Online and Linear-Time Attention by Enforcing Monotonic Alignments
Colin Raffel
Minh-Thang Luong
Peter J. Liu
Ron J. Weiss
Douglas Eck
56
258
0
03 Apr 2017
How Grammatical is Character-level Neural Machine Translation? Assessing
  MT Quality with Contrastive Translation Pairs
How Grammatical is Character-level Neural Machine Translation? Assessing MT Quality with Contrastive Translation Pairs
Rico Sennrich
73
165
0
14 Dec 2016
Google's Multilingual Neural Machine Translation System: Enabling
  Zero-Shot Translation
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation
Melvin Johnson
M. Schuster
Quoc V. Le
M. Krikun
Yonghui Wu
...
F. Viégas
Martin Wattenberg
Gregory S. Corrado
Macduff Hughes
Jeffrey Dean
102
2,087
0
14 Nov 2016
Quasi-Recurrent Neural Networks
Quasi-Recurrent Neural Networks
James Bradbury
Stephen Merity
Caiming Xiong
R. Socher
105
441
0
05 Nov 2016
Fully Character-Level Neural Machine Translation without Explicit
  Segmentation
Fully Character-Level Neural Machine Translation without Explicit Segmentation
Jason D. Lee
Kyunghyun Cho
Thomas Hofmann
VLM
112
457
0
10 Oct 2016
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
832
6,768
0
26 Sep 2016
Hierarchical Multiscale Recurrent Neural Networks
Hierarchical Multiscale Recurrent Neural Networks
Junyoung Chung
Sungjin Ahn
Yoshua Bengio
BDL
76
534
0
06 Sep 2016
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
307
10,412
0
21 Jul 2016
Achieving Open Vocabulary Neural Machine Translation with Hybrid
  Word-Character Models
Achieving Open Vocabulary Neural Machine Translation with Hybrid Word-Character Models
Minh-Thang Luong
Christopher D. Manning
VLM
78
373
0
04 Apr 2016
Character-based Neural Machine Translation
Character-based Neural Machine Translation
Marta R. Costa-jussá
José A. R. Fonollosa
AIMat
64
338
0
02 Mar 2016
Modeling Coverage for Neural Machine Translation
Modeling Coverage for Neural Machine Translation
Zhaopeng Tu
Zhengdong Lu
Yang Liu
Xiaohua Liu
Hang Li
76
746
0
19 Jan 2016
Neural Machine Translation of Rare Words with Subword Units
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
174
7,683
0
31 Aug 2015
Listen, Attend and Spell
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
147
2,261
0
05 Aug 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.1K
149,474
0
22 Dec 2014
Estimating or Propagating Gradients Through Stochastic Neurons for
  Conditional Computation
Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation
Yoshua Bengio
Nicholas Léonard
Aaron Courville
334
3,099
0
15 Aug 2013
1