ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1508.07909
  4. Cited By
Neural Machine Translation of Rare Words with Subword Units
v1v2v3v4v5 (latest)

Neural Machine Translation of Rare Words with Subword Units

31 August 2015
Rico Sennrich
Barry Haddow
Alexandra Birch
ArXiv (abs)PDFHTML

Papers citing "Neural Machine Translation of Rare Words with Subword Units"

50 / 101 papers shown
Title
Autoregressive Speech Synthesis without Vector Quantization
Autoregressive Speech Synthesis without Vector Quantization
Lingwei Meng
Long Zhou
Shujie Liu
Sanyuan Chen
Bing Han
...
Jinyu Li
Sheng Zhao
Xixin Wu
Helen M. Meng
Furu Wei
150
43
0
11 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
148
56
0
09 Jul 2024
A Principled Framework for Evaluating on Typologically Diverse Languages
A Principled Framework for Evaluating on Typologically Diverse Languages
Esther Ploeger
Wessel Poelman
Andreas Holck Høeg-Petersen
Anders Schlichtkrull
Miryam de Lhoneux
Johannes Bjerva
117
1
0
06 Jul 2024
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
Jinghui Lu
Haiyang Yu
Yanjie Wang
Yongjie Ye
Jingqun Tang
...
Qi Liu
Hao Feng
Han Wang
Hao Liu
Can Huang
159
23
0
02 Jul 2024
Large Vocabulary Size Improves Large Language Models
Large Vocabulary Size Improves Large Language Models
Sho Takase
Ryokan Ri
Shun Kiyono
Takuya Kato
129
4
0
24 Jun 2024
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement
Yifan Yang
Zheshu Song
Jianheng Zhuo
Mingyu Cui
Jinpeng Li
...
Shuai Fan
Kai Yu
Wei Zhang
Guoguo Chen
Xie Chen
124
12
0
17 Jun 2024
Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation
Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation
Boxuan Lyu
Hidetaka Kamigaito
Kotaro Funakoshi
Manabu Okumura
159
0
0
17 Jun 2024
Language Models are Crossword Solvers
Language Models are Crossword Solvers
Soumadeep Saha
Sutanoya Chakraborty
Saptarshi Saha
Utpal Garain
LRMReLM
111
3
0
13 Jun 2024
MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension
MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension
Khiem Le
Zhichun Guo
Kaiwen Dong
Xiaobao Huang
B. Nan
Roshni G. Iyer
Xiangliang Zhang
Olaf Wiest
Wei Wang
Nitesh Chawla
105
0
0
10 Jun 2024
Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Saierdaer Yusuyin
Te Ma
Hao Huang
Wenbo Zhao
Zhijian Ou
116
4
0
04 Jun 2024
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
Dixuan Wang
Yanda Li
Junyuan Jiang
Zepeng Ding
Ziqin Luo
Guochao Jiang
Jiaqing Liang
Deqing Yang
103
15
0
27 May 2024
Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations
Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations
Ziqiao Ma
Zekun Wang
Joyce Chai
127
4
0
22 May 2024
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon Team
MLLM
203
338
0
16 May 2024
Full Line Code Completion: Bringing AI to Desktop
Full Line Code Completion: Bringing AI to Desktop
Anton Semenkin
Vitaliy Bibaev
Yaroslav Sokolov
Kirill Krylov
Alexey Kalina
...
Mikhail Podvitskii
Petr Surkov
Yaroslav Golubev
Nikita Povarov
T. Bryksin
78
2
0
14 May 2024
Constructing a BPE Tokenization DFA
Constructing a BPE Tokenization DFA
Martin Berglund
Willeke Martens
Brink van der Merwe
67
2
0
13 May 2024
Understanding Emergent Abilities of Language Models from the Loss Perspective
Understanding Emergent Abilities of Language Models from the Loss Perspective
Zhengxiao Du
Aohan Zeng
Yuxiao Dong
Jie Tang
UQCVLRM
135
55
0
23 Mar 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
Hanlei Jin
Yang Zhang
Dan Meng
Jun Wang
Jinghua Tan
247
96
0
05 Mar 2024
Non-autoregressive Sequence-to-Sequence Vision-Language Models
Non-autoregressive Sequence-to-Sequence Vision-Language Models
Kunyu Shi
Qi Dong
Luis Goncalves
Zhuowen Tu
Stefano Soatto
VLM
133
3
0
04 Mar 2024
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
Yuqiao Wen
Behzad Shayegh
Chenyang Huang
Yanshuai Cao
Lili Mou
132
5
0
29 Feb 2024
Subobject-level Image Tokenization
Subobject-level Image Tokenization
Delong Chen
Samuel Cahyawijaya
Jianfeng Liu
Baoyuan Wang
Pascale Fung
VLMOCL
261
9
0
22 Feb 2024
Streaming Sequence Transduction through Dynamic Compression
Streaming Sequence Transduction through Dynamic Compression
Weiting Tan
Yunmo Chen
Tongfei Chen
Guanghui Qin
Haoran Xu
Heidi C. Zhang
Benjamin Van Durme
Philipp Koehn
148
2
0
02 Feb 2024
CroissantLLM: A Truly Bilingual French-English Language Model
CroissantLLM: A Truly Bilingual French-English Language Model
Manuel Faysse
Patrick Fernandes
Nuno M. Guerreiro
António Loison
Duarte M. Alves
...
François Yvon
André F.T. Martins
Gautier Viaud
C´eline Hudelot
Pierre Colombo
145
37
0
01 Feb 2024
Local Grammar-Based Coding Revisited
Local Grammar-Based Coding Revisited
L. Debowski
68
0
0
27 Sep 2022
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds
Yu Zhang
Yu Meng
Xuan Wang
Sheng Wang
Jiawei Han
148
13
0
04 May 2022
AtteSTNet -- An attention and subword tokenization based approach for code-switched text hate speech detection
AtteSTNet -- An attention and subword tokenization based approach for code-switched text hate speech detection
Geet Shingi
Vedangi Wagh
134
0
0
10 Dec 2021
FFR v1.1: Fon-French Neural Machine Translation
FFR v1.1: Fon-French Neural Machine Translation
Bonaventure F. P. Dossou
Chris C. Emezue
69
26
0
14 Jun 2020
Simplify-then-Translate: Automatic Preprocessing for Black-Box Machine
  Translation
Simplify-then-Translate: Automatic Preprocessing for Black-Box Machine Translation
Sneha Mehta
Bahareh Azarnoush
Boris Chen
Avneesh Singh Saluja
Vinith Misra
Ballav Bihani
Ritwik K. Kumar
94
17
0
22 May 2020
PowerNorm: Rethinking Batch Normalization in Transformers
PowerNorm: Rethinking Batch Normalization in Transformers
Sheng Shen
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
BDL
91
16
0
17 Mar 2020
PhoBERT: Pre-trained language models for Vietnamese
PhoBERT: Pre-trained language models for Vietnamese
Dat Quoc Nguyen
A. Nguyen
248
356
0
02 Mar 2020
Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Zewen Chi
Li Dong
Furu Wei
Xian-Ling Mao
Heyan Huang
LRMVLM
101
13
0
10 Nov 2019
Lipschitz Constrained Parameter Initialization for Deep Transformers
Lipschitz Constrained Parameter Initialization for Deep Transformers
Hongfei Xu
Qiuhui Liu
Josef van Genabith
Deyi Xiong
Jingyi Zhang
ODL
87
26
0
08 Nov 2019
Controlling the Output Length of Neural Machine Translation
Controlling the Output Length of Neural Machine Translation
Surafel Melaku Lakew
Mattia Antonino Di Gangi
Marcello Federico
120
69
0
23 Oct 2019
Fine-Grained Attention Mechanism for Neural Machine Translation
Fine-Grained Attention Mechanism for Neural Machine Translation
Heeyoul Choi
Kyunghyun Cho
Yoshua Bengio
74
175
0
30 Mar 2018
Data Augmentation for Low-Resource Neural Machine Translation
Data Augmentation for Low-Resource Neural Machine Translation
Marzieh Fadaee
Arianna Bisazza
Christof Monz
112
471
0
01 May 2017
Trainable Greedy Decoding for Neural Machine Translation
Trainable Greedy Decoding for Neural Machine Translation
Jiatao Gu
Kyunghyun Cho
Victor O.K. Li
162
74
0
08 Feb 2017
Beam Search Strategies for Neural Machine Translation
Beam Search Strategies for Neural Machine Translation
Markus Freitag
Yaser Al-Onaizan
121
396
0
06 Feb 2017
SYSTRAN's Pure Neural Machine Translation Systems
SYSTRAN's Pure Neural Machine Translation Systems
Josep Crego
Jungi Kim
Guillaume Klein
Anabel Rebollo
Kathy Yang
...
Bo Wang
Jin Yang
Dakun Zhang
Jing Zhou
Peter Zoldan
103
125
0
18 Oct 2016
Pre-Translation for Neural Machine Translation
Pre-Translation for Neural Machine Translation
Jan Niehues
Eunah Cho
Thanh-Le Ha
A. Waibel
AIMat
75
91
0
17 Oct 2016
Fully Character-Level Neural Machine Translation without Explicit
  Segmentation
Fully Character-Level Neural Machine Translation without Explicit Segmentation
Jason D. Lee
Kyunghyun Cho
Thomas Hofmann
VLM
155
457
0
10 Oct 2016
Character-based Neural Machine Translation
Character-based Neural Machine Translation
Wang Ling
Isabel Trancoso
Chris Dyer
A. Black
60
72
0
14 Nov 2015
Character-Aware Neural Language Models
Character-Aware Neural Language Models
Yoon Kim
Yacine Jernite
David Sontag
Alexander M. Rush
113
1,671
0
26 Aug 2015
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
430
7,971
0
17 Aug 2015
Finding Function in Form: Compositional Character Models for Open
  Vocabulary Word Representation
Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation
Wang Ling
Tiago Luís
Luís Marujo
Ramón Fernández Astudillo
Silvio Amir
Chris Dyer
A. Black
Isabel Trancoso
CoGe
73
643
0
09 Aug 2015
On Using Very Large Target Vocabulary for Neural Machine Translation
On Using Very Large Target Vocabulary for Neural Machine Translation
Sébastien Jean
Kyunghyun Cho
Roland Memisevic
Yoshua Bengio
163
1,011
0
05 Dec 2014
Addressing the Rare Word Problem in Neural Machine Translation
Addressing the Rare Word Problem in Neural Machine Translation
Thang Luong
Ilya Sutskever
Quoc V. Le
Oriol Vinyals
Wojciech Zaremba
AIMatAAML
131
788
0
30 Oct 2014
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
452
20,611
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
589
27,345
0
01 Sep 2014
Learning Phrase Representations using RNN Encoder-Decoder for
  Statistical Machine Translation
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
1.1K
23,414
0
03 Jun 2014
Compositional Morphology for Word Representations and Language Modelling
Compositional Morphology for Word Representations and Language Modelling
Jan A. Botha
Phil Blunsom
96
253
0
16 May 2014
ADADELTA: An Adaptive Learning Rate Method
ADADELTA: An Adaptive Learning Rate Method
Matthew D. Zeiler
ODL
165
6,635
0
22 Dec 2012
Previous
123
Next