1911.02116
Unsupervised Cross-lingual Representation Learning at Scale
5 November 2019
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov
Papers citing "Unsupervised Cross-lingual Representation Learning at Scale"
50 / 1,236 papers shown
CL-XABSA: Contrastive Learning for Cross-lingual Aspect-based Sentiment Analysis
Nankai Lin
Yingwen Fu
Xiaotian Lin
Aimin Yang
Shengyi Jiang
45
16
0
02 Apr 2022
indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages
Anirudh Gupta
Neeraj Chhimwal
Ankur Dhuriya
Rishabh Gaur
Priyanshi Shah
Harveen Singh Chadha
Vivek Raghavan
22
2
0
31 Mar 2022
Auto-MLM: Improved Contrastive Learning for Self-supervised Multi-lingual Knowledge Retrieval
Wenshen Xu
M. Maimaiti
Yuanhang Zheng
Xin Tang
Ji Zhang
RALM
SSL
16
2
0
30 Mar 2022
TextPruner: A Model Pruning Toolkit for Pre-Trained Language Models
Ziqing Yang
Yiming Cui
Zhigang Chen
SyDa
VLM
31
12
0
30 Mar 2022
bitsa_nlp@LT-EDI-ACL2022: Leveraging Pretrained Language Models for Detecting Homophobia and Transphobia in Social Media Comments
Vitthal Bhandari
Poonam Goyal
36
16
0
27 Mar 2022
Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages
Ehsan Aghazadeh
Mohsen Fayyaz
Yadollah Yaghoobzadeh
36
51
0
26 Mar 2022
L3Cube-MahaHate: A Tweet-based Marathi Hate Speech Detection Dataset and BERT models
Abhishek Velankar
H. Patil
Amol Gore
Shubham Salunke
Raviraj Joshi
35
39
0
25 Mar 2022
Probing Pre-Trained Language Models for Cross-Cultural Differences in Values
Arnav Arora
Lucie-Aimée Kaffee
Isabelle Augenstein
VLM
48
124
0
25 Mar 2022
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
Ade Romadhony
...
David Moeljadi
Radityo Eko Prasojo
Timothy Baldwin
Jey Han Lau
Sebastian Ruder
45
100
0
24 Mar 2022
Probing for Labeled Dependency Trees
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
19
7
0
24 Mar 2022
Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments
Antonis Maronikolakis
Axel Wisiorek
Leah Nann
Haris Jabbar
Sahana Udupa
Hinrich Schütze
30
24
0
22 Mar 2022
Factual Consistency of Multilingual Pretrained Language Models
Constanza Fierro
Anders Søgaard
HILM
27
15
0
22 Mar 2022
VLSP 2021 - ViMRC Challenge: Vietnamese Machine Reading Comprehension
Kiet Van Nguyen
Son Quoc Tran
Luan Thanh Nguyen
Tin Van Huynh
Son T. Luu
Ngan Luu-Thuy Nguyen
35
12
0
22 Mar 2022
Towards Explainable Evaluation Metrics for Natural Language Generation
Christoph Leiter
Piyawat Lertvittayakumjorn
M. Fomicheva
Wei Zhao
Yang Gao
Steffen Eger
AAML
ELM
40
20
0
21 Mar 2022
Document-Level Relation Extraction with Adaptive Focal Loss and Knowledge Distillation
Qingyu Tan
Ruidan He
Lidong Bing
Hwee Tou Ng
26
97
0
21 Mar 2022
Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability
Yoshinari Fujinuma
Jordan L. Boyd-Graber
Katharina Kann
AAML
64
23
0
21 Mar 2022
Pretraining with Artificial Language: Studying Transferable Knowledge in Language Models
Ryokan Ri
Yoshimasa Tsuruoka
37
26
0
19 Mar 2022
Challenges and Strategies in Cross-Cultural NLP
Daniel Hershcovich
Stella Frank
Heather Lent
Miryam de Lhoneux
Mostafa Abdou
...
Ruixiang Cui
Constanza Fierro
Katerina Margatina
Phillip Rust
Anders Søgaard
48
163
0
18 Mar 2022
Do Multilingual Language Models Capture Differing Moral Norms?
Katharina Hämmerl
Bjorn Deiseroth
P. Schramowski
Jindrich Libovický
Alexander Fraser
Kristian Kersting
21
15
0
18 Mar 2022
Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation
Xinyi Wang
Sebastian Ruder
Graham Neubig
47
61
0
17 Mar 2022
Combining Static and Contextualised Multilingual Embeddings
Katharina Hämmerl
Jindrich Libovický
Alexander Fraser
32
10
0
17 Mar 2022
Geographic Adaptation of Pretrained Language Models
Valentin Hofmann
Goran Glavaš
Nikola Ljubešić
J. Pierrehumbert
Hinrich Schütze
VLM
26
16
0
16 Mar 2022
Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation
Wenxuan Wang
Wenxiang Jiao
Yongchang Hao
Xing Wang
Shuming Shi
Zhaopeng Tu
Michael Lyu
AIMat
39
26
0
16 Mar 2022
Cross-Lingual Ability of Multilingual Masked Language Models: A Study of Language Structure
Yuan Chai
Yaobo Liang
Nan Duan
LRM
29
21
0
16 Mar 2022
Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction
Kuan-Hao Huang
I-Hung Hsu
Premkumar Natarajan
Kai-Wei Chang
Nanyun Peng
41
65
0
15 Mar 2022
Does Corpus Quality Really Matter for Low-Resource Languages?
Mikel Artetxe
Itziar Aldabe
Rodrigo Agerri
Olatz Perez-de-Viñaspre
Aitor Soroa Etxabe
49
19
0
15 Mar 2022
ViWOZ: A Multi-Domain Task-Oriented Dialogue Systems Dataset For Low-resource Language
Phi Nguyen Van
Tung Cao Hoang
Dũng Nguyễn Mạnh
Q. Minh
Long Tran Quoc
37
2
0
15 Mar 2022
FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing
Ilias Chalkidis
Tommaso Pasini
Shenmin Zhang
Letizia Tomada
Sebastian Felix Schwemer
Anders Søgaard
AILaw
42
54
0
14 Mar 2022
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu
Changhua Meng
Ke Wang
Jun Lan
Weiqiang Wang
Ming Gu
Liqing Zhang
39
78
0
14 Mar 2022
Efficient Language Modeling with Sparse all-MLP
Ping Yu
Mikel Artetxe
Myle Ott
Sam Shleifer
Hongyu Gong
Ves Stoyanov
Xian Li
MoE
23
11
0
14 Mar 2022
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities
Hsiang-Sheng Tsai
Heng-Jui Chang
Wen-Chin Huang
Zili Huang
Kushal Lakhotia
...
Hsuan-Jui Chen
Shang-Wen Li
Shinji Watanabe
Abdel-rahman Mohamed
Hung-yi Lee
31
109
0
14 Mar 2022
Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice
Andreas Grivas
Nikolay Bogoychev
Adam Lopez
17
9
0
12 Mar 2022
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation
Yutong Chen
Fangyun Wei
Xiao Sun
Zhirong Wu
Stephen Lin
SLR
35
98
0
08 Mar 2022
IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
Gabriele Sarti
Malvina Nissim
AILaw
23
42
0
07 Mar 2022
USTC-NELSLIP at SemEval-2022 Task 11: Gazetteer-Adapted Integration Network for Multilingual Complex Named Entity Recognition
Beiduo Chen
Jun-Yu Ma
Jiajun Qi
Wu Guo
Zhen-Hua Ling
Quan Liu
30
16
0
07 Mar 2022
Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages
Vaidehi Patil
Partha P. Talukdar
Sunita Sarawagi
26
21
0
03 Mar 2022
As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning
Jannis Vamvas
Rico Sennrich
30
19
0
03 Mar 2022
Large-Scale Hate Speech Detection with Cross-Domain Transfer
Cagri Toraman
Furkan Şahinuç
E. Yilmaz
37
60
0
02 Mar 2022
A Survey on Aspect-Based Sentiment Analysis: Tasks, Methods, and Challenges
Wenxuan Zhang
Xin Li
Yang Deng
Lidong Bing
W. Lam
48
242
0
02 Mar 2022
DeepNet: Scaling Transformers to 1,000 Layers
Hongyu Wang
Shuming Ma
Li Dong
Shaohan Huang
Dongdong Zhang
Furu Wei
MoE
AI4CE
42
157
0
01 Mar 2022
DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition
Xinyu Wang
Yongliang Shen
Jiong Cai
Tao Wang
Xiaobin Wang
...
Weiming Lu
Yueting Zhuang
Kewei Tu
Wei Lu
Yong-jia Jiang
76
43
0
01 Mar 2022
Multilingual Abusiveness Identification on Code-Mixed Social Media Text
Ekagra Ranjan
Naman Poddar
29
0
0
01 Mar 2022
Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
Mingyang Zhou
Licheng Yu
Amanpreet Singh
Mengjiao MJ Wang
Zhou Yu
Ning Zhang
VLM
33
31
0
01 Mar 2022
LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding
Jiapeng Wang
Lianwen Jin
Kai Ding
VLM
35
140
0
28 Feb 2022
CINO: A Chinese Minority Pre-trained Language Model
Ziqing Yang
Zihang Xu
Yiming Cui
Baoxin Wang
Min Lin
Dayong Wu
Zhigang Chen
28
25
0
28 Feb 2022
OCR Improves Machine Translation for Low-Resource Languages
Oana Ignat
Jean Maillard
Vishrav Chaudhary
Francisco Guzmán
45
10
0
27 Feb 2022
Multi-Level Contrastive Learning for Cross-Lingual Alignment
Beiduo Chen
Wu Guo
Bin Gu
Quan Liu
Yongchao Wang
33
5
0
26 Feb 2022
NoisyTune: A Little Noise Can Help You Finetune Pretrained Language Models Better
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
27
58
0
24 Feb 2022
Using natural language prompts for machine translation
Xavier Garcia
Orhan Firat
AI4CE
35
30
0
23 Feb 2022
MuMiN: A Large-Scale Multilingual Multimodal Fact-Checked Misinformation Social Network Dataset
Dan Saattrup Nielsen
Ryan McConville
22
72
0
23 Feb 2022