ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.02116
  4. Cited By
Unsupervised Cross-lingual Representation Learning at Scale

Unsupervised Cross-lingual Representation Learning at Scale

5 November 2019
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov
ArXivPDFHTML

Papers citing "Unsupervised Cross-lingual Representation Learning at Scale"

50 / 1,223 papers shown
Title
LENS: A Learnable Evaluation Metric for Text Simplification
LENS: A Learnable Evaluation Metric for Text Simplification
Mounica Maddela
Yao Dou
David Heineman
Wei Xu
31
63
0
19 Dec 2022
MANER: Mask Augmented Named Entity Recognition for Extreme Low-Resource
  Languages
MANER: Mask Augmented Named Entity Recognition for Extreme Low-Resource Languages
Shashank Sonkar
Zichao Wang
Richard G. Baraniuk
42
1
0
19 Dec 2022
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code
  Completion
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code Completion
Zi Gong
Yinpeng Guo
Pingyi Zhou
Cuiyun Gao
Yasheng Wang
Zenglin Xu
17
8
0
19 Dec 2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Samuel Cahyawijaya
Holy Lovenia
Alham Fikri Aji
Genta Indra Winata
Bryan Wilie
...
Timothy Baldwin
Sebastian Ruder
Herry Sujaini
S. Sakti
Ayu Purwarianti
44
48
0
19 Dec 2022
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Zheng-Xin Yong
Hailey Schoelkopf
Niklas Muennighoff
Alham Fikri Aji
David Ifeoluwa Adelani
...
Genta Indra Winata
Stella Biderman
Edward Raff
Dragomir R. Radev
Vassilina Nikoulina
CLL
VLM
AI4CE
LRM
40
81
0
19 Dec 2022
Controlling Styles in Neural Machine Translation with Activation Prompt
Controlling Styles in Neural Machine Translation with Activation Prompt
Yifan Wang
Zewei Sun
Shanbo Cheng
Weiguo Zheng
Mingxuan Wang
35
10
0
17 Dec 2022
DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue
DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue
William B. Held
Christopher Hidey
Fei Liu
Eric Zhu
Rahul Goel
Diyi Yang
Rushin Shah
44
0
0
15 Dec 2022
Multi-VALUE: A Framework for Cross-Dialectal English NLP
Multi-VALUE: A Framework for Cross-Dialectal English NLP
Caleb Ziems
William B. Held
Jingfeng Yang
Jwala Dhamala
Rahul Gupta
Diyi Yang
51
41
0
15 Dec 2022
Advancing Multilingual Pre-training: TRIP Triangular Document-level
  Pre-training for Multilingual Language Models
Advancing Multilingual Pre-training: TRIP Triangular Document-level Pre-training for Multilingual Language Models
Hongyuan Lu
Haoyang Huang
Shuming Ma
Dongdong Zhang
W. Lam
Furu Wei
39
4
0
15 Dec 2022
Causes and Cures for Interference in Multilingual Translation
Causes and Cures for Interference in Multilingual Translation
Uri Shaham
Maha Elbayad
Vedanuj Goswami
Omer Levy
Shruti Bhosale
23
26
0
14 Dec 2022
VTCC-NLP at NL4Opt competition subtask 1: An Ensemble Pre-trained
  language models for Named Entity Recognition
VTCC-NLP at NL4Opt competition subtask 1: An Ensemble Pre-trained language models for Named Entity Recognition
Xuan-Dung Doan
35
6
0
14 Dec 2022
Towards Linguistically Informed Multi-Objective Pre-Training for Natural
  Language Inference
Towards Linguistically Informed Multi-Objective Pre-Training for Natural Language Inference
Maren Pielka
Svetlana Schmidt
Lisa Pucknat
R. Sifa
CLIP
AI4CE
19
2
0
14 Dec 2022
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for
  Programming Languages
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
Yekun Chai
Shuohuan Wang
Chao Pang
Yu Sun
Hao Tian
Hua Wu
38
35
0
13 Dec 2022
A Survey on Natural Language Processing for Programming
A Survey on Natural Language Processing for Programming
Qingfu Zhu
Xianzhen Luo
Fang Liu
Cuiyun Gao
Wanxiang Che
25
2
0
12 Dec 2022
Ensembling Transformers for Cross-domain Automatic Term Extraction
Ensembling Transformers for Cross-domain Automatic Term Extraction
T. Hanh
Matej Martinc
Andraz Pelicon
Antoine Doucet
Senja Pollak
27
5
0
12 Dec 2022
Punctuation Restoration for Singaporean Spoken Languages: English,
  Malay, and Mandarin
Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin
Abhinav Rao
Ho Thi-Nga
Chng Eng Siong
26
3
0
10 Dec 2022
Multimodal Vision Transformers with Forced Attention for Behavior
  Analysis
Multimodal Vision Transformers with Forced Attention for Behavior Analysis
Tanay Agrawal
Michal Balazia
Philippe Muller
Franccois Brémond
ViT
33
9
0
07 Dec 2022
Video Games as a Corpus: Sentiment Analysis using Fallout New Vegas
  Dialog
Video Games as a Corpus: Sentiment Analysis using Fallout New Vegas Dialog
Mika Hämäläinen
Khalid Alnajjar
Thierry Poibeau
16
5
0
05 Dec 2022
Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Ana Kotarcic
Dominik Hangartner
Fabrizio Gilardi
Selina Kurer
K. Donnay
29
2
0
05 Dec 2022
MiLMo:Minority Multilingual Pre-trained Language Model
MiLMo:Minority Multilingual Pre-trained Language Model
Sisi Liu
Hanru Shi
Xinhe Yu
Wugedele Bao
Yuan Sun
Xiaobing Zhao
31
0
0
04 Dec 2022
Languages You Know Influence Those You Learn: Impact of Language
  Characteristics on Multi-Lingual Text-to-Text Transfer
Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer
Benjamin Muller
Deepanshu Gupta
Siddharth Patwardhan
J. Fauconnier
David Vandyke
Sachin Agarwal
43
5
0
04 Dec 2022
Nonparametric Masked Language Modeling
Nonparametric Masked Language Modeling
Sewon Min
Weijia Shi
M. Lewis
Xilun Chen
Wen-tau Yih
Hannaneh Hajishirzi
Luke Zettlemoyer
RALM
50
48
0
02 Dec 2022
SOLD: Sinhala Offensive Language Dataset
SOLD: Sinhala Offensive Language Dataset
Tharindu Ranasinghe
Isuri Anuradha
Damith Premasiri
Kanishka Silva
Hansi Hettiarachchi
Lasitha Uyangodage
Marcos Zampieri
46
8
0
01 Dec 2022
Embedding generation for text classification of Brazilian Portuguese
  user reviews: from bag-of-words to transformers
Embedding generation for text classification of Brazilian Portuguese user reviews: from bag-of-words to transformers
F. Souza
J. B. O. S. Filho
21
6
0
01 Dec 2022
A Commonsense-Infused Language-Agnostic Learning Framework for Enhancing
  Prediction of Political Polarity in Multilingual News Headlines
A Commonsense-Infused Language-Agnostic Learning Framework for Enhancing Prediction of Political Polarity in Multilingual News Headlines
Swati Swati
Adrian Mladenic Grobelnik
Dunja Mladenić
M. Grobelnik
37
3
0
01 Dec 2022
Word Alignment in the Era of Deep Learning: A Tutorial
Word Alignment in the Era of Deep Learning: A Tutorial
Bryan Li
37
5
0
30 Nov 2022
Automatic Identification of Motivation for Code-Switching in Speech
  Transcripts
Automatic Identification of Motivation for Code-Switching in Speech Transcripts
Ritu Belani
Jeffrey Flanigan
35
0
0
30 Nov 2022
Extending the Subwording Model of Multilingual Pretrained Models for New
  Languages
Extending the Subwording Model of Multilingual Pretrained Models for New Languages
K. Imamura
Eiichiro Sumita
VLM
29
3
0
29 Nov 2022
Compressing Cross-Lingual Multi-Task Models at Qualtrics
Compressing Cross-Lingual Multi-Task Models at Qualtrics
Daniel Fernando Campos
Daniel J. Perry
S. Joshi
Yashmeet Gambhir
Wei Du
Zhengzheng Xing
Aaron Colak
29
1
0
29 Nov 2022
Frustratingly Easy Label Projection for Cross-lingual Transfer
Frustratingly Easy Label Projection for Cross-lingual Transfer
Yang Chen
Chao Jiang
Alan Ritter
Wei Xu
32
31
0
28 Nov 2022
Understanding BLOOM: An empirical study on diverse NLP tasks
Understanding BLOOM: An empirical study on diverse NLP tasks
Parag Dakle
Sai Krishna Rallabandi
Preethi Raghavan
AI4CE
39
3
0
27 Nov 2022
Transformer-based Model for Word Level Language Identification in
  Code-mixed Kannada-English Texts
Transformer-based Model for Word Level Language Identification in Code-mixed Kannada-English Texts
A. Tonja
M. Yigezu
Olga Kolesnikova
Moein Shahiki Tash
Grigori Sidorov
Alexander Gelbukh
28
23
0
26 Nov 2022
Breaking the Representation Bottleneck of Chinese Characters: Neural
  Machine Translation with Stroke Sequence Modeling
Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence Modeling
Zhijun Wang
Xuebo Liu
Min Zhang
35
11
0
23 Nov 2022
Word-Level Representation From Bytes For Language Modeling
Word-Level Representation From Bytes For Language Modeling
Chul Lee
Qipeng Guo
Xipeng Qiu
23
1
0
23 Nov 2022
Predicting the Type and Target of Offensive Social Media Posts in
  Marathi
Predicting the Type and Target of Offensive Social Media Posts in Marathi
Marcos Zampieri
Tharindu Ranasinghe
Mrinal Chaudhari
Saurabh Gaikwad
P. Krishna
Mayuresh Nene
Shrunali Paygude
34
24
0
22 Nov 2022
L3Cube-HindBERT and DevBERT: Pre-Trained BERT Transformer models for
  Devanagari based Hindi and Marathi Languages
L3Cube-HindBERT and DevBERT: Pre-Trained BERT Transformer models for Devanagari based Hindi and Marathi Languages
Raviraj Joshi
52
56
0
21 Nov 2022
ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on
  Diversity over Language and Culture
ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on Diversity over Language and Culture
Youssef Mohamed
Mohamed AbdelFattah
Shyma Alhuwaider
Feifan Li
Xiangliang Zhang
Kenneth Church
Mohamed Elhoseiny
VLM
22
14
0
19 Nov 2022
Efficient Transformers with Dynamic Token Pooling
Efficient Transformers with Dynamic Token Pooling
Piotr Nawrot
J. Chorowski
Adrian Lañcucki
Edoardo Ponti
22
42
0
17 Nov 2022
GLAMI-1M: A Multilingual Image-Text Fashion Dataset
GLAMI-1M: A Multilingual Image-Text Fashion Dataset
Vaclav Kosar
A. Hoskovec
Milan Šulc
Radek Bartyzal
VLM
32
3
0
17 Nov 2022
ConNER: Consistency Training for Cross-lingual Named Entity Recognition
ConNER: Consistency Training for Cross-lingual Named Entity Recognition
Ran Zhou
Xin Li
Lidong Bing
Min Zhang
Luo Si
Steven C. H. Hoi
47
18
0
17 Nov 2022
TSMind: Alibaba and Soochow University's Submission to the WMT22
  Translation Suggestion Task
TSMind: Alibaba and Soochow University's Submission to the WMT22 Translation Suggestion Task
Xin Ge
Ke Min Wang
Jiayi Wang
Nini Xiao
Xiangyu Duan
Yu Zhao
Yuqi Zhang
35
2
0
16 Nov 2022
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed
  Representations
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations
Linlin Liu
Xingxuan Li
Megh Thakkar
Xin Li
Shafiq Joty
Luo Si
Lidong Bing
32
2
0
16 Nov 2022
QueryForm: A Simple Zero-shot Form Entity Query Framework
QueryForm: A Simple Zero-shot Form Entity Query Framework
Zifeng Wang
Zizhao Zhang
Jacob Devlin
Chen-Yu Lee
Guolong Su
Hao Zhang
Jennifer Dy
Vincent Perot
Tomas Pfister
27
8
0
14 Nov 2022
mOKB6: A Multilingual Open Knowledge Base Completion Benchmark
mOKB6: A Multilingual Open Knowledge Base Completion Benchmark
Shubham Mittal
Keshav Kolluru
Soumen Chakrabarti
Mausam
38
4
0
13 Nov 2022
AltCLIP: Altering the Language Encoder in CLIP for Extended Language
  Capabilities
AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities
Zhongzhi Chen
Guangyi Liu
Bo Zhang
Fulong Ye
Qinghong Yang
Ledell Yu Wu
VLM
42
81
0
12 Nov 2022
ConceptX: A Framework for Latent Concept Analysis
ConceptX: A Framework for Latent Concept Analysis
Firoj Alam
Fahim Dalvi
Nadir Durrani
Hassan Sajjad
A. Khan
Jia Xu
33
5
0
12 Nov 2022
English Contrastive Learning Can Learn Universal Cross-lingual Sentence
  Embeddings
English Contrastive Learning Can Learn Universal Cross-lingual Sentence Embeddings
Yau-Shian Wang
Ashley Wu
Graham Neubig
SSL
43
31
0
11 Nov 2022
CoRAL: a Context-aware Croatian Abusive Language Dataset
CoRAL: a Context-aware Croatian Abusive Language Dataset
Ravi Shekhar
Mladen Karan
Matthew Purver
46
5
0
11 Nov 2022
MINION: a Large-Scale and Diverse Dataset for Multilingual Event
  Detection
MINION: a Large-Scale and Diverse Dataset for Multilingual Event Detection
Amir Pouran Ben Veyseh
Minh Le Nguyen
Franck Dernoncourt
Thien Huu Nguyen
34
15
0
11 Nov 2022
Collateral facilitation in humans and language models
Collateral facilitation in humans and language models
J. Michaelov
Benjamin Bergen
25
11
0
09 Nov 2022
Previous
123...101112...232425
Next