arXiv:1901.07291
Cross-lingual Language Model Pretraining
22 January 2019
Guillaume Lample
Alexis Conneau
Papers citing
"Cross-lingual Language Model Pretraining"
50 / 53 papers shown
Title
Unveil Multi-Picture Descriptions for Multilingual Mild Cognitive Impairment Detection via Contrastive Learning
Kristin Qi
Jiali Cheng
Youxiang Zhu
Hadi Amiri
Xiaohui Liang
103
0
0
19 May 2025
Catch Me if You Search: When Contextual Web Search Results Affect the Detection of Hallucinations
Mahjabin Nahar
Eun-Ju Lee
Jin Won Park
Dongwon Lee
HILM
130
0
0
01 Apr 2025
EuroBERT: Scaling Multilingual Encoders for European Languages
Nicolas Boizard
Hippolyte Gisserot-Boukhlef
Duarte M. Alves
André F. T. Martins
Ayoub Hammal
...
Maxime Peyrard
Nuno M. Guerreiro
Patrick Fernandes
Ricardo Rei
Pierre Colombo
486
3
0
07 Mar 2025
A kinetic-based regularization method for data science applications
Abhisek Ganguly
Alessandro Gabbana
Vybhav Rao
Sauro Succi
Santosh Ansumali
118
1
0
06 Mar 2025
Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting
Gyanendra Shrestha
Chutain Jiang
Sai Akula
Vivek Yannam
Anna Pyayt
Michael Gubanov
LMTD
143
0
0
20 Feb 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
193
11
0
23 Jan 2025
MAS-Attention: Memory-Aware Stream Processing for Attention Acceleration on Resource-Constrained Edge Devices
Mohammadali Shakerdargah
Shan Lu
Chao Gao
Di Niu
130
0
0
20 Nov 2024
Training Bilingual LMs with Data Constraints in the Targeted Language
Skyler Seto
Maartje ter Hoeve
He Bai
Natalie Schluter
David Grangier
163
1
0
20 Nov 2024
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Genta Indra Winata
Frederikus Hudi
Patrick Amadeus Irawan
David Anugraha
Rifki Afina Putri
...
Alham Fikri Aji
Taro Watanabe
Derry Wijaya
Alice Oh
Chong-Wah Ngo
CoGe
153
14
0
16 Oct 2024
Language Imbalance Driven Rewarding for Multilingual Self-improving
Wen Yang
Junhong Wu
Chen Wang
Chengqing Zong
J.N. Zhang
ALM
LRM
155
7
0
11 Oct 2024
DEPT: Decoupled Embeddings for Pre-training Language Models
Alex Iacob
Lorenzo Sani
Meghdad Kurmanji
William F. Shen
Xinchi Qiu
Dongqi Cai
Yan Gao
Nicholas D. Lane
VLM
560
1
0
07 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
83
2
0
06 Oct 2024
LangSAMP: Language-Script Aware Multilingual Pretraining
Yihong Liu
Haotian Ye
Chunlan Ma
Mingyang Wang
Hinrich Schütze
VLM
195
0
0
26 Sep 2024
Towards Zero-Shot Multimodal Machine Translation
Matthieu Futeral
Cordelia Schmid
Benoît Sagot
Rachel Bawden
79
4
0
18 Jul 2024
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Lucio La Cava
Davide Costa
Andrea Tagarelli
DeLMO
93
3
0
12 Jul 2024
A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models
Peiqin Lin
André F. T. Martins
Hinrich Schütze
110
3
0
29 Jun 2024
Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models
Lynn Chua
Badih Ghazi
Yangsibo Huang
Pritish Kamath
Ravi Kumar
Pasin Manurangsi
Amer Sinha
Chulin Xie
Chiyuan Zhang
134
2
0
23 Jun 2024
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Orevaoghene Ahia
Shuyue Stella Li
Vidhisha Balachandran
Sunayana Sitaram
Yulia Tsvetkov
111
7
0
22 Jun 2024
Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Jiexin Wang
Adam Jatowt
Yi Cai
AI4CE
65
1
0
04 Jun 2024
Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Saierdaer Yusuyin
Te Ma
Hao Huang
Wenbo Zhao
Zhijian Ou
96
4
0
04 Jun 2024
Large Language Models: A Survey
Shervin Minaee
Tomas Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
195
408
0
09 Feb 2024
Measuring Catastrophic Forgetting in Cross-Lingual Transfer Paradigms: Exploring Tuning Strategies
Boshko Koloski
Blaž Škrlj
Marko Robnik-Šikonja
Senja Pollak
CLL
86
2
0
12 Sep 2023
Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability
Yoshinari Fujinuma
Jordan L. Boyd-Graber
Katharina Kann
AAML
116
24
0
21 Mar 2022
PhoBERT: Pre-trained language models for Vietnamese
Dat Quoc Nguyen
A. Nguyen
217
355
0
02 Mar 2020
Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Zewen Chi
Li Dong
Furu Wei
Xian-Ling Mao
Heyan Huang
LRM
VLM
89
13
0
10 Nov 2019
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
Mikel Artetxe
Holger Schwenk
3DV
154
1,016
0
26 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
94,891
0
11 Oct 2018
XNLI: Evaluating Cross-lingual Sentence Representations
Alexis Conneau
Guillaume Lample
Ruty Rinott
Adina Williams
Samuel R. Bowman
Holger Schwenk
Veselin Stoyanov
ELM
73
1,386
0
13 Sep 2018
Zero-Shot Cross-lingual Classification Using Multilingual Neural Machine Translation
Akiko Eriguchi
Melvin Johnson
Orhan Firat
Hideto Kazawa
Wolfgang Macherey
39
62
0
12 Sep 2018
Unsupervised Cross-lingual Word Embedding by Multilingual Neural Language Models
Takashi Wada
Tomoharu Iwata
50
27
0
07 Sep 2018
Character-Level Language Modeling with Deeper Self-Attention
Rami Al-Rfou
Dokook Choe
Noah Constant
Mandy Guo
Llion Jones
134
391
0
09 Aug 2018
Phrase-Based & Neural Unsupervised Machine Translation
Guillaume Lample
Myle Ott
Alexis Conneau
Ludovic Denoyer
Marc'Aurelio Ranzato
86
682
0
20 Apr 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
1.1K
7,182
0
20 Apr 2018
SentEval: An Evaluation Toolkit for Universal Sentence Representations
Alexis Conneau
Douwe Kiela
100
641
0
14 Mar 2018
Unsupervised Machine Translation Using Monolingual Corpora Only
Guillaume Lample
Alexis Conneau
Ludovic Denoyer
Marc'Aurelio Ranzato
SSL
114
1,097
0
31 Oct 2017
Unsupervised Neural Machine Translation
Mikel Artetxe
Gorka Labaka
Eneko Agirre
Kyunghyun Cho
89
772
0
30 Oct 2017
Word Translation Without Parallel Data
Alexis Conneau
Guillaume Lample
Marc'Aurelio Ranzato
Ludovic Denoyer
Hervé Jégou
291
1,660
0
11 Oct 2017
The IIT Bombay English-Hindi Parallel Corpus
Anoop Kunchukuttan
Pratik Mehta
P. Bhattacharyya
AIMat
82
251
0
08 Oct 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
701
131,652
0
12 Jun 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams
Nikita Nangia
Samuel R. Bowman
524
4,479
0
18 Apr 2017
Offline bilingual word vectors, orthogonal transformations and the inverted softmax
Samuel L. Smith
David H. P. Turban
Steven Hamblin
Nils Y. Hammerla
OffRL
66
536
0
13 Feb 2017
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation
Melvin Johnson
M. Schuster
Quoc V. Le
M. Krikun
Yonghui Wu
...
F. Viégas
Martin Wattenberg
Gregory S. Corrado
Macduff Hughes
Jeffrey Dean
122
2,092
0
14 Nov 2016
Unsupervised Pretraining for Sequence to Sequence Learning
Prajit Ramachandran
Peter J. Liu
Quoc V. Le
SSL
AIMat
84
282
0
08 Nov 2016
Enriching Word Vectors with Subword Information
Piotr Bojanowski
Edouard Grave
Armand Joulin
Tomas Mikolov
NAI
SSL
VLM
229
9,972
0
15 Jul 2016
Edinburgh Neural Machine Translation Systems for WMT 16
Rico Sennrich
Barry Haddow
Alexandra Birch
70
524
0
09 Jun 2016
Exploring the Limits of Language Modeling
Rafal Jozefowicz
Oriol Vinyals
M. Schuster
Noam M. Shazeer
Yonghui Wu
191
1,145
0
07 Feb 2016
Massively Multilingual Word Embeddings
Waleed Ammar
George Mulcaire
Yulia Tsvetkov
Guillaume Lample
Chris Dyer
Noah A. Smith
101
271
0
05 Feb 2016
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
224
7,745
0
31 Aug 2015
A large annotated corpus for learning natural language inference
Samuel R. Bowman
Gabor Angeli
Christopher Potts
Christopher D. Manning
315
4,287
0
21 Aug 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.9K
150,115
0
22 Dec 2014