XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization

24 March 2020
Junjie Hu
Sebastian Ruder
Aditya Siddhant
Graham Neubig
Orhan Firat
Melvin Johnson
    ELM

Papers citing "XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization"

50 / 661 papers shown
MEGA: Multilingual Evaluation of Generative AI
Kabir Ahuja
Harshita Diddee
Rishav Hada
Millicent Ochieng
Krithika Ramesh
...
T. Ganu
Sameer Segal
Maxamed Axmed
Kalika Bali
Sunayana Sitaram
LM&MA
LRM
ELM
27
269
0
22 Mar 2023
Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models
Xinnian Liang
Zefan Zhou
Hui Huang
Shuangzhi Wu
Tong Xiao
Muyun Yang
Zhoujun Li
Chao Bian
VLM
38
2
0
20 Mar 2023
DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer
Shanu Kumar
Abbaraju Soujanya
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
VLM
33
1
0
04 Mar 2023
CLICKER: Attention-Based Cross-Lingual Commonsense Knowledge Transfer
Ruolin Su
Zhongkai Sun
Sixing Lu
Chengyuan Ma
Chenlei Guo
LRM
26
0
0
26 Feb 2023
Modular Deep Learning
Jonas Pfeiffer
Sebastian Ruder
Ivan Vulić
Edoardo Ponti
MoMe
OOD
32
73
0
22 Feb 2023
Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions
Philippe Laban
Chien-Sheng Wu
Lidiya Murakhovs'ka
Xiang 'Anthony' Chen
Caiming Xiong
24
8
0
17 Feb 2023
Robust Question Answering against Distribution Shifts with Test-Time Adaptation: An Empirical Study
Hai Ye
Yuyang Ding
Juntao Li
Hwee Tou Ng
OOD
TTA
29
9
0
09 Feb 2023
Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval
Shunyu Zhang
Yaobo Liang
Ming Gong
Daxin Jiang
Nan Duan
25
4
0
03 Feb 2023
Zero-shot cross-lingual transfer language selection using linguistic similarity
J. Eronen
M. Ptaszynski
Fumito Masui
24
33
0
31 Jan 2023
LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
Joel Niklaus
Veton Matoshi
Pooja Rani
Andrea Galassi
Matthias Sturmer
Ilias Chalkidis
ELM
AILaw
19
55
0
30 Jan 2023
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models
Davis Liang
Hila Gonen
Yuning Mao
Rui Hou
Naman Goyal
Marjan Ghazvininejad
Luke Zettlemoyer
Madian Khabsa
20
72
0
25 Jan 2023
Cross-lingual German Biomedical Information Extraction: from Zero-shot to Human-in-the-Loop
Siting Liang
Mareike Hartmann
Daniel Sonntag
23
3
0
24 Jan 2023
XNLI 2.0: Improving XNLI dataset and performance on Cross Lingual Understanding (XLU)
A. Upadhyay
Harsit Kumar Upadhya
21
1
0
16 Jan 2023
FUN with Fisher: Improving Generalization of Adapter-Based Cross-lingual Transfer with Scheduled Unfreezing
Chen Cecilia Liu
Jonas Pfeiffer
Ivan Vulić
Iryna Gurevych
CLL
32
9
0
13 Jan 2023
Boosting Neural Networks to Decompile Optimized Binaries
Ying Cao
Ruigang Liang
Kai Chen
Peiwei Hu
31
17
0
03 Jan 2023
CAMeMBERT: Cascading Assistant-Mediated Multilingual BERT
Dan DeGenaro
Jugal Kalita
35
0
0
22 Dec 2022
ORCA: A Challenging Benchmark for Arabic Language Understanding
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM
17
40
0
21 Dec 2022
T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks
Iker García-Ferrero
Rodrigo Agerri
German Rigau
41
13
0
20 Dec 2022
MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue
Nikita Moghe
E. Razumovskaia
Liane Guillou
Ivan Vulić
Anna Korhonen
Alexandra Birch
40
13
0
20 Dec 2022
Extrinsic Evaluation of Machine Translation Metrics
Nikita Moghe
Tom Sherborne
Mark Steedman
Alexandra Birch
ELM
26
18
0
20 Dec 2022
Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages
A. Mhaske
Harsh Kedia
Sumanth Doddapaneni
Mitesh M. Khapra
Pratyush Kumar
V. Rudramurthy
Anoop Kunchukuttan
54
26
0
20 Dec 2022
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages
Ercong Nie
Sheng Liang
Helmut Schmid
Hinrich Schütze
VLM
RALM
LRM
30
22
0
19 Dec 2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Samuel Cahyawijaya
Holy Lovenia
Alham Fikri Aji
Genta Indra Winata
Bryan Wilie
...
Timothy Baldwin
Sebastian Ruder
Herry Sujaini
S. Sakti
Ayu Purwarianti
39
48
0
19 Dec 2022
Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models
Yong Cheng
Yu Zhang
Melvin Johnson
Wolfgang Macherey
Ankur Bapna
33
8
0
19 Dec 2022
DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue
William B. Held
Christopher Hidey
Fei Liu
Eric Zhu
Rahul Goel
Diyi Yang
Rushin Shah
34
0
0
15 Dec 2022
Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages
Sumanth Doddapaneni
Rahul Aralikatte
Gowtham Ramesh
Shreyansh Goyal
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
ELM
47
81
0
11 Dec 2022
Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer
Benjamin Muller
Deepanshu Gupta
Siddharth Patwardhan
J. Fauconnier
David Vandyke
Sachin Agarwal
41
5
0
04 Dec 2022
TyDiP: A Dataset for Politeness Classification in Nine Typologically Diverse Languages
A. Srinivasan
Eunsol Choi
37
15
0
29 Nov 2022
Beyond Counting Datasets: A Survey of Multilingual Dataset Construction and Necessary Resources
Xinyan Velocity Yu
Akari Asai
Trina Chatterjee
Junjie Hu
Eunsol Choi
29
21
0
28 Nov 2022
Frustratingly Easy Label Projection for Cross-lingual Transfer
Yang Chen
Chao Jiang
Alan Ritter
Wei-ping Xu
27
31
0
28 Nov 2022
TESSP: Text-Enhanced Self-Supervised Speech Pre-training
Zhuoyuan Yao
Shuo Ren
Sanyuan Chen
Ziyang Ma
Pengcheng Guo
Linfu Xie
24
5
0
24 Nov 2022
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
Lukasz Augustyniak
Kamil Tagowski
Albert Sawczyn
Denis Janiak
Roman Bartusiak
...
Arkadiusz Janz
Piotr Szymański
M. Morzy
Tomasz Kajdanowicz
Maciej Piasecki
18
10
0
23 Nov 2022
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations
Linlin Liu
Xingxuan Li
Megh Thakkar
Xin Li
Chenyu You
Luo Si
Lidong Bing
27
2
0
16 Nov 2022
Disentangling Task Relations for Few-shot Text Classification via Self-Supervised Hierarchical Task Clustering
Juan Zha
Zheng Li
Ying Wei
Yu Zhang
31
5
0
16 Nov 2022
ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training
Henry Tang
Ameet Deshpande
Karthik R. Narasimhan
32
5
0
15 Nov 2022
DualNER: A Dual-Teaching framework for Zero-shot Cross-lingual Named Entity Recognition
Jiali Zeng
Yu Jiang
Yongjing Yin
Xu Wang
Binghuai Lin
Yunbo Cao
12
2
0
15 Nov 2022
English Contrastive Learning Can Learn Universal Cross-lingual Sentence Embeddings
Yau-Shian Wang
Ashley Wu
Graham Neubig
SSL
35
31
0
11 Nov 2022
Casual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness
C. Hazirbas
Yejin Bang
Tiezheng Yu
Parisa Assar
Bilal Porgali
...
Jacqueline Pan
Emily McReynolds
Miranda Bogen
Pascale Fung
Cristian Canton Ferrer
32
8
0
10 Nov 2022
Assistive Completion of Agrammatic Aphasic Sentences: A Transfer Learning Approach using Neurolinguistics-based Synthetic Dataset
Rohit Misra
S. Mishra
Tapan K. Gandhi
21
2
0
10 Nov 2022
Local Structure Matters Most in Most Languages
Louis Clouâtre
Prasanna Parthasarathi
Amal Zouaq
Sarath Chandar
39
1
0
09 Nov 2022
Detecting Languages Unintelligible to Multilingual Models through Local Structure Probes
Louis Clouâtre
Prasanna Parthasarathi
Amal Zouaq
Sarath Chandar
33
3
0
09 Nov 2022
Discord Questions: A Computational Approach To Diversity Analysis in News Coverage
Philippe Laban
Chien-Sheng Wu
Lidiya Murakhovs'ka
Xiang 'Anthony' Chen
Caiming Xiong
24
12
0
09 Nov 2022
Third-Party Aligner for Neural Word Alignments
Jinpeng Zhang
C. Dong
Xiangyu Duan
Yuqi Zhang
Hao Fei
22
0
0
08 Nov 2022
Intriguing Properties of Compression on Multilingual Models
Kelechi Ogueji
Orevaoghene Ahia
Gbemileke Onilude
Sebastian Gehrmann
Sara Hooker
Julia Kreutzer
21
12
0
04 Nov 2022
Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model
Mingqi Li
Fei Ding
Dan Zhang
Long Cheng
Hongxin Hu
Feng Luo
41
6
0
02 Nov 2022
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text
Xianghu Yue
Junyi Ao
Xiaoxue Gao
Haizhou Li
SSL
26
8
0
30 Oct 2022
ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation Metrics
Chantal Amrhein
Nikita Moghe
Liane Guillou
ELM
34
22
0
27 Oct 2022
What Language Model to Train if You Have One Million GPU Hours?
Teven Le Scao
Thomas Wang
Daniel Hesslow
Lucile Saulnier
Stas Bekman
...
Lintang Sutawika
Jaesung Tae
Zheng-Xin Yong
Julien Launay
Iz Beltagy
MoE
AI4CE
230
103
0
27 Oct 2022
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning
Barun Patra
Saksham Singhal
Shaohan Huang
Zewen Chi
Li Dong
Furu Wei
Vishrav Chaudhary
Xia Song
71
23
0
26 Oct 2022
Adapters for Enhanced Modeling of Multilingual Knowledge and Text
Yifan Hou
Wenxiang Jiao
Mei-Jun Liu
Carl Allen
Zhaopeng Tu
Mrinmaya Sachan
34
11
0
24 Oct 2022