Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.09685
Cited By
Noisy Parallel Data Alignment
23 January 2023
Ruoyu Xie
Antonios Anastasopoulos
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Noisy Parallel Data Alignment"
17 / 17 papers shown
Title
OCR Improves Machine Translation for Low-Resource Languages
Oana Ignat
Jean Maillard
Vishrav Chaudhary
Francisco Guzmán
61
10
0
27 Feb 2022
Lexically Aware Semi-Supervised Learning for OCR Post-Correction
Shruti Rijhwani
Daisy Rosenblum
Antonios Anastasopoulos
Graham Neubig
42
16
0
04 Nov 2021
MirrorAlign: A Super Lightweight Unsupervised Word Alignment Model via Cross-Lingual Contrastive Learning
Di Wu
Liang Ding
Shuo Yang
Mingyang Li
60
7
0
08 Feb 2021
Word Alignment by Fine-tuning Embeddings on Parallel Corpora
Zi-Yi Dou
Graham Neubig
118
260
0
20 Jan 2021
Mask-Align: Self-Supervised Neural Word Alignment
Chi Chen
Maosong Sun
Yang Liu
36
34
0
13 Dec 2020
OCR Post Correction for Endangered Language Texts
Shruti Rijhwani
Antonios Anastasopoulos
Graham Neubig
40
45
0
10 Nov 2020
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
Benjamin Muller
Antonis Anastasopoulos
Benoît Sagot
Djamé Seddah
LRM
156
167
0
24 Oct 2020
Accurate Word Alignment Induction from Neural Machine Translation
Yun-Nung Chen
Yang Liu
Guanhua Chen
Xin Jiang
Qun Liu
58
61
0
30 Apr 2020
End-to-End Neural Word Alignment Outperforms GIZA++
Thomas Zenkel
Joern Wuebker
John DeNero
26
55
0
30 Apr 2020
A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT
Masaaki Nagata
Katsuki Chousa
Masaaki Nishino
31
47
0
29 Apr 2020
Jointly Learning to Align and Translate with Transformer Models
Sarthak Garg
Stephan Peitz
Udhyakumar Nallasamy
Matthias Paulik
32
172
0
04 Sep 2019
Low-Resource Corpus Filtering using Multilingual Sentence Embeddings
Vishrav Chaudhary
Y. Tang
Francisco Guzmán
Holger Schwenk
Philipp Koehn
59
79
0
20 Jun 2019
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
Mikel Artetxe
Holger Schwenk
3DV
119
1,008
0
26 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.2K
93,936
0
11 Oct 2018
Part-of-Speech Tagging on an Endangered Language: a Parallel Griko-Italian Resource
Antonios Anastasopoulos
M. Lekakou
J. Quer
Eleni Zimianiti
Justin DeBenedetto
David Chiang
80
28
0
11 Jun 2018
Leveraging translations for speech transcription in low-resource settings
Antonios Anastasopoulos
David Chiang
39
26
0
23 Mar 2018
Incorporating Structural Alignment Biases into an Attentional Neural Translation Model
Trevor Cohn
Cong Duy Vu Hoang
Ekaterina Vymolova
Kaisheng Yao
Chris Dyer
Gholamreza Haffari
59
174
0
06 Jan 2016
1