Unsupervised Cross-lingual Representation Learning at Scale

5 November 2019
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov

Papers citing "Unsupervised Cross-lingual Representation Learning at Scale"

50 / 1,223 papers shown
Title
HierCat: Hierarchical Query Categorization from Weakly Supervised Data at Facebook Marketplace
Yunzhong He
Congle Zhang
Ruoyan Kong
Chaitanya Kulkarni
Qing Liu
A. Gandhe
Amit Nithianandan
Arul T. Prakash
11
12
0
21 Feb 2023
Exploring the Potential of Machine Translation for Generating Named Entity Datasets: A Case Study between Persian and English
A. Sartipi
A. Fatemi
31
3
0
19 Feb 2023
Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors
Keqin Bao
Boyi Deng
Dayiheng Liu
Baosong Yang
Wenqiang Lei
Xiangnan He
Derek F. Wong
Jun Xie
42
4
0
17 Feb 2023
LEALLA: Learning Lightweight Language-agnostic Sentence Embeddings with Knowledge Distillation
Zhuoyuan Mao
Tetsuji Nakagawa
FedML
19
19
0
16 Feb 2023
Why Can't Discourse Parsing Generalize? A Thorough Investigation of the Impact of Data Diversity
Yang Liu
Amir Zeldes
8
18
0
13 Feb 2023
Unified Vision-Language Representation Modeling for E-Commerce Same-Style Products Retrieval
Ben Chen
Linbo Jin
Xinxin Wang
D. Gao
Wen Jiang
Wei Ning
22
3
0
10 Feb 2023
A Novel Approach for Auto-Formulation of Optimization Problems
Yuting Ning
Jia-Yin Liu
Longhu Qin
Tong Xiao
Shan Xue
Zhenya Huang
Qi Liu
Enhong Chen
Jinze Wu
19
9
0
09 Feb 2023
Robust Question Answering against Distribution Shifts with Test-Time Adaptation: An Empirical Study
Hai Ye
Yuyang Ding
Juntao Li
Hwee Tou Ng
OOD
TTA
31
9
0
09 Feb 2023
Measuring The Impact Of Programming Language Distribution
Gabriel Orlanski
Kefan Xiao
Xavier Garcia
Jeffrey Hui
Joshua Howland
J. Malmaud
Jacob Austin
Rishabh Singh
Michele Catasta
32
28
0
03 Feb 2023
Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval
Shunyu Zhang
Yaobo Liang
Ming Gong
Daxin Jiang
Nan Duan
30
4
0
03 Feb 2023
The unreasonable effectiveness of few-shot learning for machine translation
Xavier Garcia
Yamini Bansal
Colin Cherry
George F. Foster
M. Krikun
Fan Feng
Melvin Johnson
Orhan Firat
40
103
0
02 Feb 2023
idT5: Indonesian Version of Multilingual T5 Transformer
Mukhlish Fuadi
A. Wibawa
S. Sumpeno
19
6
0
02 Feb 2023
Zero-shot cross-lingual transfer language selection using linguistic similarity
J. Eronen
M. Ptaszynski
Fumito Masui
24
33
0
31 Jan 2023
ZhichunRoad at Amazon KDD Cup 2022: MultiTask Pre-Training for E-Commerce Product Search
Xuange Cui
Wei Xiong
Songlin Wang
35
1
0
31 Jan 2023
Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks
Antoine Louis
Gijs van Dijck
Gerasimos Spanakis
AILaw
23
9
0
30 Jan 2023
KG-BERTScore: Incorporating Knowledge Graph into BERTScore for Reference-Free Machine Translation Evaluation
Zhanglin Wu
Min Zhang
Mingsheng Zhu
Yinglu Li
Tingting Zhu
Hao Yang
Song Peng
Ying Qin
30
5
0
30 Jan 2023
Improving Cross-lingual Information Retrieval on Low-Resource Languages via Optimal Transport Distillation
Zhiqi Huang
Puxuan Yu
James Allan
VLM
43
26
0
29 Jan 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
55
2
0
26 Jan 2023
A benchmark for toxic comment classification on Civil Comments dataset
Corentin Duchene
Henri Jamet
Pierre Guillaume
Reda Dehak
41
8
0
26 Jan 2023
Cross-lingual Argument Mining in the Medical Domain
Anar Yeginbergenova
Rodrigo Agerri
50
7
0
25 Jan 2023
FewShotTextGCN: K-hop neighborhood regularization for few-shot learning on graphs
Niels van der Heijden
Ekaterina Shutova
H. Yannakoudakis
23
0
0
25 Jan 2023
An Experimental Study on Pretraining Transformers from Scratch for IR
Carlos Lassance
Hervé Déjean
S. Clinchant
28
11
0
25 Jan 2023
ViHOS: Hate Speech Spans Detection for Vietnamese
Phu Gia Hoang
Canh Duc Luu
K. Tran
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
31
20
0
24 Jan 2023
PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development
Avirup Sil
Jaydeep Sen
Bhavani Iyer
M. Franz
Kshitij P. Fadnis
...
Yulong Li
Md Arafat Sultan
Riyaz Ahmad Bhat
Radu Florian
Salim Roukos
37
4
0
23 Jan 2023
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
Malte Ostendorff
Georg Rehm
CLIP
VLM
CLL
43
24
0
23 Jan 2023
Ensemble Transfer Learning for Multilingual Coreference Resolution
T. Lai
Heng Ji
18
1
0
22 Jan 2023
Language Embeddings Sometimes Contain Typological Generalizations
Robert Östling
Murathan Kurfali
NAI
38
9
0
19 Jan 2023
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection
Weijia Xu
Sweta Agrawal
Eleftheria Briakou
Marianna J. Martindale
Marine Carpuat
HILM
27
47
0
18 Jan 2023
Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining
Karol Nowakowski
M. Ptaszynski
Kyoko Murasaki
Jagna Nieuwazny
23
24
0
18 Jan 2023
Curriculum Script Distillation for Multilingual Visual Question Answering
Khyathi Raghavi Chandu
A. Geramifard
30
0
0
17 Jan 2023
ClassBases at CASE-2022 Multilingual Protest Event Detection Tasks: Multilingual Protest News Detection and Automatically Replicating Manually Created Event Datasets
Peratham Wiriyathammabhum
24
3
0
16 Jan 2023
FUN with Fisher: Improving Generalization of Adapter-Based Cross-lingual Transfer with Scheduled Unfreezing
Chen Cecilia Liu
Jonas Pfeiffer
Ivan Vulić
Iryna Gurevych
CLL
32
9
0
13 Jan 2023
See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning
Zhenfang Chen
Qinhong Zhou
Songlin Yang
Yining Hong
Hao Zhang
Chuang Gan
LRM
VLM
42
36
0
12 Jan 2023
Few-shot Learning for Cross-Target Stance Detection by Aggregating Multimodal Embeddings
Parisa Jamadi Khiabani
A. Zubiaga
34
10
0
11 Jan 2023
Multilingual Entity and Relation Extraction from Unified to Language-specific Training
Zixiang Wang
Jian Yang
Tongliang Li
Jiaheng Liu
Ying Mo
Jiaqi Bai
Longtao He
Zhoujun Li
29
2
0
11 Jan 2023
FullStop: Punctuation and Segmentation Prediction for Dutch with Transformers
Vincent Vandeghinste
Oliver Guhr
10
6
0
09 Jan 2023
A Survey of Code-switching: Linguistic and Social Perspectives for Language Technologies
A. Seza Doğruöz
Sunayana Sitaram
Barbara E. Bullock
Almeida Jacqueline Toribio
81
74
0
05 Jan 2023
Linear programming word problems formulation using EnsembleCRF NER labeler and T5 text generator with data augmentations
Jianglong He
N. Mamatha
S. Vignesh
Deepak Kumar
Akshay Uppal
AIMat
26
7
0
30 Dec 2022
GAE-ISumm: Unsupervised Graph-Based Summarization of Indian Languages
Lakshmi Sireesha Vakada
Anudeep Ch
Mounika Marreddy
S. Oota
R. Mamidi
27
1
0
25 Dec 2022
ORCA: A Challenging Benchmark for Arabic Language Understanding
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM
17
40
0
21 Dec 2022
Mini-Model Adaptation: Efficiently Extending Pretrained Models to New Languages via Aligned Shallow Training
Kelly Marchisio
Patrick Lewis
Yihong Chen
Mikel Artetxe
35
16
0
20 Dec 2022
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models
Jonas Belouadi
Steffen Eger
57
24
0
20 Dec 2022
BMX: Boosting Natural Language Generation Metrics with Explainability
Christoph Leiter
Hoang-Quan Nguyen
Steffen Eger
ELM
24
0
0
20 Dec 2022
MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue
Nikita Moghe
E. Razumovskaia
Liane Guillou
Ivan Vulić
Anna Korhonen
Alexandra Birch
45
13
0
20 Dec 2022
Extrinsic Evaluation of Machine Translation Metrics
Nikita Moghe
Tom Sherborne
Mark Steedman
Alexandra Birch
ELM
36
18
0
20 Dec 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
Jian Yang
Shuming Ma
Li Dong
Shaohan Huang
Haoyang Huang
Yuwei Yin
Dongdong Zhang
Liqun Yang
Furu Wei
Zhoujun Li
SyDa
AI4CE
37
25
0
20 Dec 2022
On the Role of Parallel Data in Cross-lingual Transfer Learning
Machel Reid
Mikel Artetxe
21
10
0
20 Dec 2022
Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages
A. Mhaske
Harsh Kedia
Sumanth Doddapaneni
Mitesh M. Khapra
Pratyush Kumar
V. Rudramurthy
Anoop Kunchukuttan
59
26
0
20 Dec 2022
Synthetic Pre-Training Tasks for Neural Machine Translation
Zexue He
Graeme W. Blackwood
Yikang Shen
Julian McAuley
Rogerio Feris
29
3
0
19 Dec 2022
Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model
Yeskendir Koishekenov
Alexandre Berard
Vassilina Nikoulina
MoE
40
29
0
19 Dec 2022