XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization

24 March 2020
Junjie Hu
Sebastian Ruder
Aditya Siddhant
Graham Neubig
Orhan Firat
Melvin Johnson
    ELM

Papers citing "XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization"

50 / 666 papers shown
Domain Mastery Benchmark: An Ever-Updating Benchmark for Evaluating Holistic Domain Knowledge of Large Language Model--A Preliminary Release
Zhouhong Gu
Xiaoxuan Zhu
Haoning Ye
Lin Zhang
Zhuozhi Xiong
Zihan Li
Qi He
Sihang Jiang
Hongwei Feng
Yanghua Xiao
ELM ALM
74
2
0
23 Apr 2023
UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Hyung Won Chung
Noah Constant
Xavier Garcia
Adam Roberts
Yi Tay
Sharan Narang
Orhan Firat
116
57
0
18 Apr 2023
Romanization-based Large-scale Adaptation of Multilingual Language Models
Sukannya Purkayastha
Sebastian Ruder
Jonas Pfeiffer
Iryna Gurevych
Ivan Vulić
93
13
0
18 Apr 2023
Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese
Vésteinn Snaebjarnarson
A. Simonsen
Goran Glavaš
Ivan Vulić
84
23
0
18 Apr 2023
VECO 2.0: Cross-lingual Language Model Pre-training with Multi-granularity Contrastive Learning
Zhen-Ru Zhang
Chuanqi Tan
Songfang Huang
Fei Huang
VLM
64
5
0
17 Apr 2023
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
Viet Dac Lai
Nghia Trung Ngo
Amir Pouran Ben Veyseh
Hieu Man
Franck Dernoncourt
Trung Bui
Thien Huu Nguyen
ELM LM&MA
69
291
0
12 Apr 2023
Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning
Lifu Tu
Jin Qu
Semih Yavuz
Shafiq Joty
Wenhao Liu
Caiming Xiong
Yingbo Zhou
82
9
0
03 Apr 2023
MEGA: Multilingual Evaluation of Generative AI
Kabir Ahuja
Harshita Diddee
Rishav Hada
Millicent Ochieng
Krithika Ramesh
...
T. Ganu
Sameer Segal
Maxamed Axmed
Kalika Bali
Sunayana Sitaram
LM&MA LRM ELM
122
292
0
22 Mar 2023
Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models
Xinnian Liang
Zefan Zhou
Hui Huang
Shuangzhi Wu
Tong Xiao
Muyun Yang
Zhoujun Li
Chao Bian
VLM
65
2
0
20 Mar 2023
DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer
Shanu Kumar
Abbaraju Soujanya
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
VLM
78
1
0
04 Mar 2023
CLICKER: Attention-Based Cross-Lingual Commonsense Knowledge Transfer
Ruolin Su
Zhongkai Sun
Sixing Lu
Chengyuan Ma
Chenlei Guo
LRM
53
0
0
26 Feb 2023
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMe OOD
159
80
0
22 Feb 2023
Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions
Philippe Laban
Chien-Sheng Wu
Lidiya Murakhovs'ka
Xiang 'Anthony' Chen
Caiming Xiong
42
8
0
17 Feb 2023
Robust Question Answering against Distribution Shifts with Test-Time Adaptation: An Empirical Study
Hai Ye
Yuyang Ding
Juntao Li
Hwee Tou Ng
OOD TTA
116
10
0
09 Feb 2023
Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval
Shunyu Zhang
Yaobo Liang
Ming Gong
Daxin Jiang
Nan Duan
83
4
0
03 Feb 2023
Zero-shot cross-lingual transfer language selection using linguistic similarity
J. Eronen
M. Ptaszynski
Fumito Masui
95
38
0
31 Jan 2023
LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
Joel Niklaus
Veton Matoshi
Pooja Rani
Andrea Galassi
Matthias Sturmer
Ilias Chalkidis
ELM AILaw
100
60
0
30 Jan 2023
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models
Davis Liang
Hila Gonen
Yuning Mao
Rui Hou
Naman Goyal
Marjan Ghazvininejad
Luke Zettlemoyer
Madian Khabsa
93
80
0
25 Jan 2023
Cross-lingual German Biomedical Information Extraction: from Zero-shot to Human-in-the-Loop
Siting Liang
Mareike Hartmann
Daniel Sonntag
59
3
0
24 Jan 2023
XNLI 2.0: Improving XNLI dataset and performance on Cross Lingual Understanding (XLU)
A. Upadhyay
Harsit Kumar Upadhya
36
1
0
16 Jan 2023
FUN with Fisher: Improving Generalization of Adapter-Based Cross-lingual Transfer with Scheduled Unfreezing
Chen Cecilia Liu
Jonas Pfeiffer
Ivan Vulić
Iryna Gurevych
CLL
74
9
0
13 Jan 2023
Boosting Neural Networks to Decompile Optimized Binaries
Ying Cao
Ruigang Liang
Kai Chen
Peiwei Hu
75
20
0
03 Jan 2023
CAMeMBERT: Cascading Assistant-Mediated Multilingual BERT
Dan DeGenaro
Jugal Kalita
46
0
0
22 Dec 2022
ORCA: A Challenging Benchmark for Arabic Language Understanding
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM
109
46
0
21 Dec 2022
T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks
Iker García-Ferrero
Rodrigo Agerri
German Rigau
102
16
0
20 Dec 2022
MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue
Nikita Moghe
E. Razumovskaia
Liane Guillou
Ivan Vulić
Anna Korhonen
Alexandra Birch
88
13
0
20 Dec 2022
Extrinsic Evaluation of Machine Translation Metrics
Nikita Moghe
Tom Sherborne
Mark Steedman
Alexandra Birch
ELM
103
20
0
20 Dec 2022
Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages
A. Mhaske
Harsh Kedia
Sumanth Doddapaneni
Mitesh M. Khapra
Pratyush Kumar
V. Rudramurthy
Anoop Kunchukuttan
104
31
0
20 Dec 2022
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages
Ercong Nie
Sheng Liang
Helmut Schmid
Hinrich Schütze
VLM RALM LRM
114
22
0
19 Dec 2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Samuel Cahyawijaya
Holy Lovenia
Alham Fikri Aji
Genta Indra Winata
Bryan Wilie
...
Timothy Baldwin
Sebastian Ruder
Herry Sujaini
S. Sakti
Ayu Purwarianti
127
50
0
19 Dec 2022
Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models
Yong Cheng
Yu Zhang
Melvin Johnson
Wolfgang Macherey
Ankur Bapna
66
8
0
19 Dec 2022
DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue
William B. Held
Christopher Hidey
Fei Liu
Eric Zhu
Rahul Goel
Diyi Yang
Rushin Shah
102
0
0
15 Dec 2022
Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages
Sumanth Doddapaneni
Rahul Aralikatte
Gowtham Ramesh
Shreyansh Goyal
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
ELM
107
86
0
11 Dec 2022
Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer
Benjamin Muller
Deepanshu Gupta
Siddharth Patwardhan
J. Fauconnier
David Vandyke
Sachin Agarwal
94
5
0
04 Dec 2022
TyDiP: A Dataset for Politeness Classification in Nine Typologically Diverse Languages
A. Srinivasan
Eunsol Choi
89
15
0
29 Nov 2022
Beyond Counting Datasets: A Survey of Multilingual Dataset Construction and Necessary Resources
Xinyan Velocity Yu
Akari Asai
Trina Chatterjee
Junjie Hu
Eunsol Choi
102
23
0
28 Nov 2022
Frustratingly Easy Label Projection for Cross-lingual Transfer
Yang Chen
Chao Jiang
Alan Ritter
Wei Xu
97
32
0
28 Nov 2022
TESSP: Text-Enhanced Self-Supervised Speech Pre-training
Zhuoyuan Yao
Shuo Ren
Sanyuan Chen
Ziyang Ma
Pengcheng Guo
Linfu Xie
90
5
0
24 Nov 2022
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
Lukasz Augustyniak
Kamil Tagowski
Albert Sawczyn
Denis Janiak
Roman Bartusiak
...
Arkadiusz Janz
Piotr Szymański
M. Morzy
Tomasz Kajdanowicz
Maciej Piasecki
62
12
0
23 Nov 2022
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations
Linlin Liu
Xingxuan Li
Megh Thakkar
Xin Li
Shafiq Joty
Luo Si
Lidong Bing
86
2
0
16 Nov 2022
Disentangling Task Relations for Few-shot Text Classification via Self-Supervised Hierarchical Task Clustering
Juan Zha
Zheng Li
Ying Wei
Yu Zhang
90
5
0
16 Nov 2022
ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training
Henry Tang
Ameet Deshpande
Karthik Narasimhan
106
5
0
15 Nov 2022
DualNER: A Dual-Teaching framework for Zero-shot Cross-lingual Named Entity Recognition
Jiali Zeng
Yu Jiang
Yongjing Yin
Xu Wang
Binghuai Lin
Yunbo Cao
80
3
0
15 Nov 2022
English Contrastive Learning Can Learn Universal Cross-lingual Sentence Embeddings
Yau-Shian Wang
Ashley Wu
Graham Neubig
SSL
93
33
0
11 Nov 2022
Casual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness
C. Hazirbas
Yejin Bang
Tiezheng Yu
Parisa Assar
Bilal Porgali
...
Jacqueline Pan
Emily McReynolds
Miranda Bogen
Pascale Fung
Cristian Canton Ferrer
81
8
0
10 Nov 2022
Assistive Completion of Agrammatic Aphasic Sentences: A Transfer Learning Approach using Neurolinguistics-based Synthetic Dataset
Rohit Misra
S. Mishra
Tapan K. Gandhi
93
2
0
10 Nov 2022
Local Structure Matters Most in Most Languages
Louis Clouâtre
Prasanna Parthasarathi
Payel Das
Sarath Chandar
74
1
0
09 Nov 2022
Detecting Languages Unintelligible to Multilingual Models through Local Structure Probes
Louis Clouâtre
Prasanna Parthasarathi
Payel Das
Sarath Chandar
82
3
0
09 Nov 2022
Discord Questions: A Computational Approach To Diversity Analysis in News Coverage
Philippe Laban
Chien-Sheng Wu
Lidiya Murakhovs'ka
Xiang 'Anthony' Chen
Caiming Xiong
84
14
0
09 Nov 2022
Third-Party Aligner for Neural Word Alignments
Jinpeng Zhang
C. Dong
Xiangyu Duan
Yuqi Zhang
Hao Fei
67
0
0
08 Nov 2022