Unsupervised Cross-lingual Representation Learning at Scale


5 November 2019
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov

Papers citing "Unsupervised Cross-lingual Representation Learning at Scale"

50 / 1,190 papers shown
Medical Spoken Named Entity Recognition
Khai Le-Duc
David Thulke
Hung-Phong Tran
Long Vo-Dang
Khai-Nguyen Nguyen
Truong-Son Hy
Ralf Schluter
49
0
0
19 Jun 2024
Multilingual Large Language Models and Curse of Multilinguality
Daniil Gurgurov
Tanja Bäumel
Tatiana Anikina
86
4
0
15 Jun 2024
Datasets for Multilingual Answer Sentence Selection
Matteo Gabburo
S. Campese
Federico Agostini
Alessandro Moschitti
46
0
0
14 Jun 2024
UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages
Trinh Pham
Khoi M. Le
Luu Anh Tuan
42
1
0
14 Jun 2024
Decoding the Diversity: A Review of the Indic AI Research Landscape
Sankalp KJ
Vinija Jain
S. Bhaduri
Tamoghna Roy
Aman Chadha
55
5
0
13 Jun 2024
Bilingual Sexism Classification: Fine-Tuned XLM-RoBERTa and GPT-3.5 Few-Shot Learning
AmirMohammad Azadi
Baktash Ansari
Sina Zamani
Sauleh Eetemadi
21
1
0
11 Jun 2024
Decipherment-Aware Multilingual Learning in Jointly Trained Language Models
Grandee Lee
34
0
0
11 Jun 2024
AGB-DE: A Corpus for the Automated Legal Assessment of Clauses in German Consumer Contracts
Daniel Braun
Florian Matthes
AILaw
ELM
42
3
0
10 Jun 2024
ThaiCoref: Thai Coreference Resolution Dataset
Pontakorn Trakuekul
Wei Qi Leong
Charin Polpanumas
Jitkapat Sawatphol
William-Chandra Tjhi
Attapol T. Rutherford
23
0
0
10 Jun 2024
Innovations in Cover Song Detection: A Lyrics-Based Approach
Maximilian Balluff
Peter Mandl
Christian Wolff
21
1
0
06 Jun 2024
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
David Ifeoluwa Adelani
Jessica Ojo
Israel Abebe Azime
Jian Yun Zhuang
Jesujoba Oluwadara Alabi
...
Salomey Osei
Sokhar Samb
Tadesse Kebede Guge
Pontus Stenetorp
ELM
67
7
0
05 Jun 2024
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
Kenneth Enevoldsen
Márton Kardos
Niklas Muennighoff
Kristoffer Nielbo
42
9
0
04 Jun 2024
Multimodal Metadata Assignment for Cultural Heritage Artifacts
Luis Rei
Dunja Mladenić
M. Dorozynski
Franz Rottensteiner
Thomas Schleider
Raphael Troncy
J. Lozano
Mar Gaitán Salvatella
31
6
0
01 Jun 2024
Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
E. Chimoto
Jay Gala
Orevaoghene Ahia
Julia Kreutzer
Bruce A. Bassett
Sara Hooker
VLM
44
4
0
29 May 2024
XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser
Xianfu Cheng
Hang Zhang
Jian Yang
Xiang Li
Weixiao Zhou
...
Fei Liu
Wei Zhang
Tao Sun
Tongliang Li
Zhoujun Li
52
2
0
27 May 2024
Chasing COMET: Leveraging Minimum Bayes Risk Decoding for Self-Improving Machine Translation
Kamil Guttmann
Miko Pokrywka
Adrian Charkiewicz
Artur Nowakowski
58
3
0
20 May 2024
Cyber Risks of Machine Translation Critical Errors : Arabic Mental Health Tweets as a Case Study
Hadeel Saadany
Ashraf Tantawy
Constantin Orasan
45
1
0
19 May 2024
Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous Translation
Matthew Raffel
Victor Agostinelli
Lizhong Chen
41
5
0
16 May 2024
LyS at SemEval-2024 Task 3: An Early Prototype for End-to-End Multimodal Emotion Linking as Graph-Based Parsing
Ana Ezquerro
David Vilares
38
1
0
10 May 2024
From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences
Prashant Kodali
Anmol Goel
Likhith Asapu
Vamshi Krishna Bonagiri
Anirudh Govil
Monojit Choudhury
Manish Shrivastava
Ponnurangam Kumaraguru
55
0
0
09 May 2024
SUTRA: Scalable Multilingual Language Model Architecture
Abhijit Bendale
Michael Sapienza
Steven Ripplinger
Simon Gibbs
Jaewon Lee
Pranav Mistry
LRM
ELM
36
4
0
07 May 2024
Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech
Menglin Li
Kwan Hui Lim
46
0
0
02 May 2024
ViTHSD: Exploiting Hatred by Targets for Hate Speech Detection on Vietnamese Social Media Texts
Cuong Nhat Vo
Khanh Bao Huynh
Son T. Luu
Trong-Hop Do
47
1
0
30 Apr 2024
What Drives Performance in Multilingual Language Models?
Sina Bagheri Nezhad
Ameeta Agrawal
LRM
42
9
0
29 Apr 2024
Unknown Script: Impact of Script on Cross-Lingual Transfer
Wondimagegnhue Tufa
Ilia Markov
Piek Vossen
45
0
0
29 Apr 2024
Explainability of machine learning approaches in forensic linguistics: a case study in geolinguistic authorship profiling
Dana Roemling
Yves Scherrer
Aleksandra Miletic
61
0
0
29 Apr 2024
Comparing LLM prompting with Cross-lingual transfer performance on Indigenous and Low-resource Brazilian Languages
David Ifeoluwa Adelani
A. S. Dougruoz
André Coneglian
Atul Kr. Ojha
36
2
0
28 Apr 2024
Can Perplexity Predict Fine-Tuning Performance? An Investigation of Tokenization Effects on Sequential Language Models for Nepali
Nishant Luitel
Nirajan Bekoju
Anand Kumar Sah
Subarna Shakya
52
1
0
28 Apr 2024
Building a Large Japanese Web Corpus for Large Language Models
Naoaki Okazaki
Kakeru Hattori
Hirai Shota
Hiroki Iida
Masanari Ohi
Kazuki Fujii
Taishi Nakamura
Mengsay Loem
Rio Yokota
Sakae Mizuki
55
7
0
27 Apr 2024
TIGQA:An Expert Annotated Question Answering Dataset in Tigrinya
Hailay Teklehaymanot
Dren Fazlija
Niloy Ganguly
Gourab K. Patro
Wolfgang Nejdl
36
0
0
26 Apr 2024
Automatic Speech Recognition System-Independent Word Error Rate Estimation
Chanho Park
Mingjie Chen
Thomas Hain
26
0
0
25 Apr 2024
No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement
Mateusz Klimaszewski
Piotr Andruszkiewicz
Alexandra Birch
MoMe
47
4
0
24 Apr 2024
Software Mention Recognition with a Three-Stage Framework Based on BERTology Models at SOMD 2024
Thuy Nguyen Thi
Anh Nguyen Viet
D. Thin
Ngan Luu-Thuy Nguyen
34
0
0
23 Apr 2024
Multi-Head Mixture-of-Experts
Xun Wu
Shaohan Huang
Wenhui Wang
Furu Wei
MoE
39
12
0
23 Apr 2024
Do "English" Named Entity Recognizers Work Well on Global Englishes?
Alexander Shan
John Bauer
Riley Carlson
Christopher D. Manning
36
2
0
20 Apr 2024
Latent Concept-based Explanation of NLP Models
Xuemin Yu
Fahim Dalvi
Nadir Durrani
Marzia Nouri
Hassan Sajjad
LRM
FAtt
29
2
0
18 Apr 2024
Grammatical Error Correction for Code-Switched Sentences by Learners of English
Kelvin Wey Han Chan
Christopher Bryant
Li Nguyen
Andrew Caines
Zheng Yuan
54
2
0
18 Apr 2024
GeMQuAD : Generating Multilingual Question Answering Datasets from Large Language Models using Few Shot Learning
Amani Namboori
Shivam Mangale
Andrew Rosenbaum
Saleh Soltan
45
0
0
14 Apr 2024
Data-Augmentation-Based Dialectal Adaptation for LLMs
Fahim Faisal
Antonios Anastasopoulos
39
2
0
11 Apr 2024
Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations
Dayeon Ki
Marine Carpuat
38
17
0
11 Apr 2024
AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports
Lukas Lange
Marc Müller
G. H. Torbati
Dragan Milchevski
Patrick Grau
S. Pujari
Annemarie Friedrich
33
0
0
11 Apr 2024
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya
Delong Chen
Yejin Bang
Leila Khalatbari
Bryan Wilie
Ziwei Ji
Etsuko Ishii
Pascale Fung
71
5
0
11 Apr 2024
Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis
Mikel Zubillaga
Oscar Sainz
A. Estarrona
Oier López de Lacalle
Eneko Agirre
51
2
0
09 Apr 2024
Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding
Ahmad Idrissi-Yaghir
Amin Dada
Henning Schafer
Kamyar Arzideh
Giulia Baldini
...
Peter A. Horn
Christin Seifert
F. Nensa
Jens Kleesiek
Christoph M. Friedrich
AI4MH
39
2
0
08 Apr 2024
OPSD: an Offensive Persian Social media Dataset and its baseline evaluations
M. Safayani
Amir Sartipi
Amir Hossein Ahmadi
Parniyan Jalali
Amir Hossein Mansouri
Mohammad Bisheh-Niasar
Zahra Pourbahman
21
0
0
08 Apr 2024
Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind
Hongchuan Zeng
Hongshen Xu
Lu Chen
Kai Yu
59
5
0
06 Apr 2024
ANGOFA: Leveraging OFA Embedding Initialization and Synthetic Data for Angolan Language Model
Osvaldo Luamba Quinjica
David Ifeoluwa Adelani
46
0
0
03 Apr 2024
Optical Text Recognition in Nepali and Bengali: A Transformer-based Approach
Rakib Hasan
Aakar Dhakal
Kabir Mehedi
Annajiat Alim Rasel
27
1
0
03 Apr 2024
Can Humans Identify Domains?
Maria Barrett
Max Müller-Eberstein
Elisa Bassignana
Amalie Brogaard Pauli
Mike Zhang
Rob van der Goot
47
1
0
02 Apr 2024
A Study on Scaling Up Multilingual News Framing Analysis
Syeda Sabrina Akter
Antonios Anastasopoulos
34
0
0
01 Apr 2024