ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.06823
  4. Cited By
Incorporating BERT into Neural Machine Translation

Incorporating BERT into Neural Machine Translation

17 February 2020
Jinhua Zhu
Yingce Xia
Lijun Wu
Di He
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
    FedML
    AIMat
ArXivPDFHTML

Papers citing "Incorporating BERT into Neural Machine Translation"

50 / 74 papers shown
Title
Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model
Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model
Phan Tran Minh Dat
Vo Hoang Nhat Khang
Quan Thanh Tho
17
0
0
16 May 2025
A Combination of BERT and Transformer for Vietnamese Spelling Correction
A Combination of BERT and Transformer for Vietnamese Spelling Correction
Trung Hieu Ngo
Ham Duong Tran
Tin Huynh
Kiem Hoang
45
5
0
04 May 2024
Enhancing Context Through Contrast
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
26
0
0
06 Jan 2024
Conditional Prompt Tuning for Multimodal Fusion
Conditional Prompt Tuning for Multimodal Fusion
Ruixia Jiang
Lingbo Liu
Changwen Chen
41
0
0
28 Nov 2023
Interpreting Pretrained Language Models via Concept Bottlenecks
Interpreting Pretrained Language Models via Concept Bottlenecks
Zhen Tan
Lu Cheng
Song Wang
Yuan Bo
Wenlin Yao
Huan Liu
LRM
40
20
0
08 Nov 2023
Diversifying Question Generation over Knowledge Base via External Natural Questions
Diversifying Question Generation over Knowledge Base via External Natural Questions
Shasha Guo
Jing Zhang
Xirui Ke
Cuiping Li
Hong Chen
45
3
0
23 Sep 2023
Improving Language Model Integration for Neural Machine Translation
Improving Language Model Integration for Neural Machine Translation
Christian Herold
Yingbo Gao
Mohammad Zeineldeen
Hermann Ney
29
2
0
08 Jun 2023
Document-Level Machine Translation with Large Language Models
Document-Level Machine Translation with Large Language Models
Longyue Wang
Chenyang Lyu
Tianbo Ji
Zhirui Zhang
Dian Yu
Shuming Shi
Zhaopeng Tu
ELM
28
116
0
05 Apr 2023
AccelTran: A Sparsity-Aware Accelerator for Dynamic Inference with
  Transformers
AccelTran: A Sparsity-Aware Accelerator for Dynamic Inference with Transformers
Shikhar Tuli
N. Jha
36
32
0
28 Feb 2023
How to prepare your task head for finetuning
How to prepare your task head for finetuning
Yi Ren
Shangmin Guo
Wonho Bae
Danica J. Sutherland
24
14
0
11 Feb 2023
Plan-then-Seam: Towards Efficient Table-to-Text Generation
Plan-then-Seam: Towards Efficient Table-to-Text Generation
Liang Li
Ruiying Geng
Chengyang Fang
Bing Li
Can Ma
Binhua Li
Yongbin Li
LMTD
33
2
0
10 Feb 2023
Findings of the Covid-19 MLIA Machine Translation Task
Findings of the Covid-19 MLIA Machine Translation Task
F. Casacuberta
Alexandru Ceausu
K. Choukri
Miltos Deligiannis
Miguel Domingo
...
V. Papavassiliou
Stelios Piperidis
Prokopis Prokopidis
Dimitris Roussis
M. Salah
16
0
0
14 Nov 2022
Mask More and Mask Later: Efficient Pre-training of Masked Language
  Models by Disentangling the [MASK] Token
Mask More and Mask Later: Efficient Pre-training of Masked Language Models by Disentangling the [MASK] Token
Baohao Liao
David Thulke
Sanjika Hewavitharana
Hermann Ney
Christof Monz
36
9
0
09 Nov 2022
RoChBert: Towards Robust BERT Fine-tuning for Chinese
RoChBert: Towards Robust BERT Fine-tuning for Chinese
Zihan Zhang
Jinfeng Li
Ning Shi
Bo Yuan
Xiangyu Liu
Rong Zhang
Hui Xue
Donghong Sun
Chao Zhang
AAML
34
4
0
28 Oct 2022
Active Countermeasures for Email Fraud
Active Countermeasures for Email Fraud
Wentao Chen
Fuzhou Wang
Matthew Edwards
43
5
0
26 Oct 2022
The Shared Task on Gender Rewriting
The Shared Task on Gender Rewriting
Bashar Alhafni
Nizar Habash
Houda Bouamor
Ossama Obeid
Sultan Alrowili
...
Mohamed Gabr
Abderrahmane Issam
Abdelrahim Qaddoumi
K. Vijay-Shanker
Mahmoud Zyate
34
1
0
22 Oct 2022
A baseline revisited: Pushing the limits of multi-segment models for
  context-aware translation
A baseline revisited: Pushing the limits of multi-segment models for context-aware translation
Suvodeep Majumde
Stanislas Lauly
Maria Nadejde
Marcello Federico
Georgiana Dinu
43
13
0
19 Oct 2022
On the Complementarity between Pre-Training and Random-Initialization
  for Resource-Rich Machine Translation
On the Complementarity between Pre-Training and Random-Initialization for Resource-Rich Machine Translation
Changtong Zan
Liang Ding
Li Shen
Yu Cao
Weifeng Liu
Dacheng Tao
37
21
0
07 Sep 2022
Discourse Cohesion Evaluation for Document-Level Neural Machine
  Translation
Discourse Cohesion Evaluation for Document-Level Neural Machine Translation
Xin Tan
Longyin Zhang
Guodong Zhou
18
1
0
19 Aug 2022
Learning to Generalize to More: Continuous Semantic Augmentation for
  Neural Machine Translation
Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation
Xiangpeng Wei
Heng Yu
Yue Hu
Rongxiang Weng
Weihua Luo
Jun Xie
Rong Jin
CLL
17
24
0
14 Apr 2022
CipherDAug: Ciphertext based Data Augmentation for Neural Machine
  Translation
CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation
Nishant Kambhatla
Logan Born
Anoop Sarkar
21
16
0
01 Apr 2022
elBERto: Self-supervised Commonsense Learning for Question Answering
elBERto: Self-supervised Commonsense Learning for Question Answering
Xunlin Zhan
Yuan Li
Xiao Dong
Xiaodan Liang
Zhiting Hu
Lawrence Carin
SSL
RALM
LRM
24
7
0
17 Mar 2022
Understanding and Improving Sequence-to-Sequence Pretraining for Neural
  Machine Translation
Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation
Wenxuan Wang
Wenxiang Jiao
Yongchang Hao
Xing Wang
Shuming Shi
Zhaopeng Tu
Michael Lyu
AIMat
39
26
0
16 Mar 2022
Prompt-Learning for Short Text Classification
Prompt-Learning for Short Text Classification
Yi Zhu
Xinke Zhou
Jipeng Qiang
Yun Li
Yunhao Yuan
Xindong Wu
VLM
18
34
0
23 Feb 2022
Pre-Trained Language Models for Interactive Decision-Making
Pre-Trained Language Models for Interactive Decision-Making
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
50
249
0
03 Feb 2022
Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models
Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models
Lu Dong
Zhiyuan Guo
Chao-Hong Tan
Ya-Jun Hu
Yuan Jiang
Zhenhua Ling
38
11
0
26 Jan 2022
Semi-supervised Domain Adaptive Structure Learning
Semi-supervised Domain Adaptive Structure Learning
Can Qin
Lichen Wang
Qianqian Ma
Yu Yin
Huan Wang
Y. Fu
TTA
42
20
0
12 Dec 2021
Say What? Collaborative Pop Lyric Generation Using Multitask Transfer
  Learning
Say What? Collaborative Pop Lyric Generation Using Multitask Transfer Learning
Naveen Ram
Tanay Gummadi
Rahul Bhethanabotla
Richard J. Savery
Gil Weinberg
20
9
0
15 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained
  Language Models: A Survey
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
83
1,038
0
01 Nov 2021
Interpreting Deep Learning Models in Natural Language Processing: A
  Review
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
19
45
0
20 Oct 2021
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain
  Language Model Compression
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression
Chenhe Dong
Yaliang Li
Ying Shen
Minghui Qiu
VLM
43
7
0
16 Oct 2021
MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better
  Translators
MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators
Zhixing Tan
Xiangwen Zhang
Shuo Wang
Yang Liu
VLM
LRM
213
52
0
13 Oct 2021
Discovering Drug-Target Interaction Knowledge from Biomedical Literature
Discovering Drug-Target Interaction Knowledge from Biomedical Literature
Yutai Hou
Yingce Xia
Lijun Wu
Shufang Xie
Yang Fan
Jinhua Zhu
Wanxiang Che
Tao Qin
Tie-Yan Liu
25
14
0
27 Sep 2021
Multilingual Translation via Grafting Pre-trained Language Models
Multilingual Translation via Grafting Pre-trained Language Models
Zewei Sun
Mingxuan Wang
Lei Li
AI4CE
191
22
0
11 Sep 2021
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural
  Machine Translation
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation
Haoran Xu
Benjamin Van Durme
Kenton W. Murray
50
57
0
09 Sep 2021
Paraphrase Generation as Unsupervised Machine Translation
Paraphrase Generation as Unsupervised Machine Translation
Xiaofei Sun
Yufei Tian
Yuxian Meng
Nanyun Peng
Fei Wu
Jiwei Li
Chun Fan
LRM
22
5
0
07 Sep 2021
Self Training with Ensemble of Teacher Models
Self Training with Ensemble of Teacher Models
Soumyadeep Ghosh
Sanjay Kumar
Janu Verma
Awanish Kumar
25
3
0
17 Jul 2021
Noise Stability Regularization for Improving BERT Fine-tuning
Noise Stability Regularization for Improving BERT Fine-tuning
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
19
43
0
10 Jul 2021
A Primer on Pretrained Multilingual Language Models
A Primer on Pretrained Multilingual Language Models
Sumanth Doddapaneni
Gowtham Ramesh
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
LRM
43
74
0
01 Jul 2021
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin
  Information
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information
Zijun Sun
Xiaoya Li
Xiaofei Sun
Yuxian Meng
Xiang Ao
Qing He
Fei Wu
Jiwei Li
SSeg
57
184
0
30 Jun 2021
Neural Machine Translation for Low-Resource Languages: A Survey
Neural Machine Translation for Low-Resource Languages: A Survey
Surangika Ranathunga
E. Lee
Marjana Prifti Skenduli
Ravi Shekhar
Mehreen Alam
Rishemjit Kaur
40
236
0
29 Jun 2021
R-Drop: Regularized Dropout for Neural Networks
R-Drop: Regularized Dropout for Neural Networks
Xiaobo Liang
Lijun Wu
Juntao Li
Yue Wang
Qi Meng
Tao Qin
Wei Chen
Mengdi Zhang
Tie-Yan Liu
47
424
0
28 Jun 2021
Dual-view Molecule Pre-training
Dual-view Molecule Pre-training
Jinhua Zhu
Yingce Xia
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
AI4CE
27
51
0
17 Jun 2021
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
Inigo Jauregi Unanue
Jacob Parnell
Massimo Piccardi
26
32
0
04 Jun 2021
Transfer Learning for Sequence Generation: from Single-source to
  Multi-source
Transfer Learning for Sequence Generation: from Single-source to Multi-source
Xuancheng Huang
Jingfang Xu
Maosong Sun
Yang Liu
33
5
0
31 May 2021
On Compositional Generalization of Neural Machine Translation
On Compositional Generalization of Neural Machine Translation
Yafu Li
Yongjing Yin
Yulong Chen
Yue Zhang
156
45
0
31 May 2021
Good for Misconceived Reasons: An Empirical Revisiting on the Need for
  Visual Context in Multimodal Machine Translation
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation
Zhiyong Wu
Lingpeng Kong
W. Bi
Xiang Li
B. Kao
LRM
23
77
0
30 May 2021
MathBERT: A Pre-Trained Model for Mathematical Formula Understanding
MathBERT: A Pre-Trained Model for Mathematical Formula Understanding
Shuai Peng
Ke Yuan
Liangcai Gao
Zhi Tang
AIMat
49
105
0
02 May 2021
MOROCCO: Model Resource Comparison Framework
MOROCCO: Model Resource Comparison Framework
Valentin Malykh
Alexander Kukushkin
Ekaterina Artemova
Vladislav Mikhailov
Maria Tikhonova
Tatiana Shavrina
24
0
0
29 Apr 2021
TransVG: End-to-End Visual Grounding with Transformers
TransVG: End-to-End Visual Grounding with Transformers
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
28
330
0
17 Apr 2021
12
Next