ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.01401
  4. Cited By
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training,
  Understanding and Generation

XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation

3 April 2020
Yaobo Liang
Nan Duan
Yeyun Gong
Ning Wu
Fenfei Guo
Weizhen Qi
Ming Gong
Linjun Shou
Daxin Jiang
Guihong Cao
Xiaodong Fan
Bruce Zhang
Rahul Agrawal
Edward Cui
Sining Wei
Taroon Bharti
Ying Qiao
Jiun-Hung Chen
Winnie Wu
Shuguang Liu
Fan Yang
Daniel Fernando Campos
Rangan Majumder
Ming Zhou
    ELM
    VLM
ArXivPDFHTML

Papers citing "XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation"

50 / 216 papers shown
Title
ReLI: A Language-Agnostic Approach to Human-Robot Interaction
ReLI: A Language-Agnostic Approach to Human-Robot Interaction
Linus Nwankwo
Bjoern Ellensohn
Ozan Özdenizci
Elmar Rueckert
LM&Ro
67
0
0
03 May 2025
A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning
A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning
Jieming Bian
Yuanzhe Peng
Lei Wang
Yin Huang
Jie Xu
FedML
65
0
0
29 Apr 2025
Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM
Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM
Yazeed Alnumay
Alexandre Barbet
Anna Bialas
William Darling
Shaan Desai
...
Stephanie Howe
Olivia Lasche
Justin Lee
Anirudh Shrinivason
Jennifer Tracey
94
0
0
18 Mar 2025
TLUE: A Tibetan Language Understanding Evaluation Benchmark
TLUE: A Tibetan Language Understanding Evaluation Benchmark
Fan Gao
Cheng Huang
Nyima Tashi
Xiangxiang Wang
Thupten Tsering
...
Gadeng Luosang
Rinchen Dongrub
Dorje Tashi
Xiao Feng
Yongbin Yu
ELM
84
2
0
15 Mar 2025
LAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and Giriama
LAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and Giriama
Naome A. Etori
Kevin Lu
Randu Karisa
Arturs Kanepajs
LRM
ELM
190
0
0
14 Mar 2025
EuroBERT: Scaling Multilingual Encoders for European Languages
EuroBERT: Scaling Multilingual Encoders for European Languages
Nicolas Boizard
Hippolyte Gisserot-Boukhlef
Duarte M. Alves
André F. T. Martins
Ayoub Hammal
...
Maxime Peyrard
Nuno M. Guerreiro
Patrick Fernandes
Ricardo Rei
Pierre Colombo
152
1
0
07 Mar 2025
Where Are We? Evaluating LLM Performance on African Languages
Where Are We? Evaluating LLM Performance on African Languages
Ife Adebara
Hawau Olamide Toyin
Nahom Tesfu Ghebremichael
AbdelRahim Elmadany
Muhammad Abdul-Mageed
57
0
0
26 Feb 2025
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts
Muhammad Farid Adilazuarda
M. Wijanarko
Lucky Susanto
Khumaisa Nuráini
Derry Wijaya
Alham Fikri Aji
52
0
0
25 Feb 2025
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding
Sankalp KJ
Ashutosh Kumar
Laxmaan Balaji
Nikunj Kotecha
Vinija Jain
Aman Chadha
S. Bhaduri
ELM
177
1
0
27 Jan 2025
SailCompass: Towards Reproducible and Robust Evaluation for Southeast
  Asian Languages
SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages
Jia Guo
Longxu Dou
Guangtao Zeng
Stanley Kok
Wei Lu
Qian Liu
ELM
LRM
83
1
0
02 Dec 2024
ChemTEB: Chemical Text Embedding Benchmark, an Overview of Embedding Models Performance & Efficiency on a Specific Domain
ChemTEB: Chemical Text Embedding Benchmark, an Overview of Embedding Models Performance & Efficiency on a Specific Domain
Ali Shiraee Kasmaee
Mohammad Khodadad
Mohammad Arshi Saloot
Nick Sherck
Stephen Dokas
H. Mahyar
Soheila Samiee
ELM
213
0
0
30 Nov 2024
INCLUDE: Evaluating Multilingual Language Understanding with Regional
  Knowledge
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Angelika Romanou
Negar Foroutan
Anna Sotnikova
Zeming Chen
Sree Harsha Nelaturu
...
Mike Zhang
Imanol Schlag
Marzieh Fadaee
Sara Hooker
Antoine Bosselut
ELM
113
6
0
29 Nov 2024
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual
  Semantic Textual Relatedness Task
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual Semantic Textual Relatedness Task
Jianjian Li
Shengwei Liang
Yong Liao
Hongping Deng
Haiyang Yu
68
2
0
28 Nov 2024
LLäMmlein: Compact and Competitive German-Only Language Models from Scratch
Jan Pfister
Julia Wunderle
Andreas Hotho
23
1
0
17 Nov 2024
Delta: A Cloud-assisted Data Enrichment Framework for On-Device
  Continual Learning
Delta: A Cloud-assisted Data Enrichment Framework for On-Device Continual Learning
Chen Gong
Zhenzhe Zheng
Fan Wu
Xiaofeng Jia
Guihai Chen
LMTD
FedML
39
2
0
24 Oct 2024
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic
  Reasoning Tasks
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Shailaja Keyur Sampat
Mutsumi Nakamura
Shankar Kailas
Kartik Aggarwal
Mandy Zhou
Yezhou Yang
Chitta Baral
MLLM
CoGe
ReLM
VLM
LRM
37
0
0
17 Oct 2024
XTRUST: On the Multilingual Trustworthiness of Large Language Models
XTRUST: On the Multilingual Trustworthiness of Large Language Models
Yahan Li
Yi Wang
Yi-Ju Chang
Yuan Wu
HILM
LRM
34
0
0
24 Sep 2024
AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs
AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs
Basel Mousi
Nadir Durrani
Fatema Ahmad
Md. Arid Hasan
Maram Hasanain
Tameem Kabbani
Fahim Dalvi
Shammur A. Chowdhury
Firoj Alam
51
8
0
17 Sep 2024
SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning
SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning
Joseph Marvin Imperial
Harish Tayyar Madabushi
46
1
0
18 Jul 2024
Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models
Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models
Nikhil Sharma
Kenton Murray
Ziang Xiao
55
1
0
07 Jul 2024
Multilingual Trolley Problems for Language Models
Multilingual Trolley Problems for Language Models
Zhijing Jin
Sydney Levine
Max Kleiman-Weiner
Giorgio Piatti
Jiarui Liu
...
András Strausz
Mrinmaya Sachan
Rada Mihalcea
Yejin Choi
Bernhard Schölkopf
LRM
53
5
0
02 Jul 2024
PARIKSHA : A Large-Scale Investigation of Human-LLM Evaluator Agreement
  on Multilingual and Multi-Cultural Data
PARIKSHA : A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data
Ishaan Watts
Varun Gumma
Aditya Yadavalli
Vivek Seshadri
Manohar Swaminathan
Sunayana Sitaram
ELM
51
9
0
21 Jun 2024
On the Evaluation Practices in Multilingual NLP: Can Machine Translation
  Offer an Alternative to Human Translations?
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?
Rochelle Choenni
Sara Rajaee
Christof Monz
Ekaterina Shutova
39
1
0
20 Jun 2024
Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation
  Language Model
Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model
Runzhe Zhan
Xinyi Yang
Derek F. Wong
Lidia S. Chao
Yue Zhang
58
6
0
25 Apr 2024
Translation of Multifaceted Data without Re-Training of Machine
  Translation Systems
Translation of Multifaceted Data without Re-Training of Machine Translation Systems
Hyeonseok Moon
Seungyoon Lee
Seongtae Hong
Seungjun Lee
Chanjun Park
Heu-Jeoung Lim
28
0
0
25 Apr 2024
TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and
  Historical Languages
TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages
Aleksei Dorkin
Kairit Sirts
30
1
0
19 Apr 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models
  Using Multisense Consistency
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
AI4CE
39
6
0
18 Apr 2024
PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for
  the Neural Processing of Portuguese
PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese
T. Osório
Bernardo Leite
Henrique Lopes Cardoso
Luís Gomes
João Rodrigues
Rodrigo Santos
António Branco
35
3
0
08 Apr 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and
  Frontiers
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
Libo Qin
Qiguang Chen
Yuhang Zhou
Zhi Chen
Hai-Tao Zheng
Lizi Liao
Min Li
Wanxiang Che
Philip S. Yu
LRM
55
36
0
07 Apr 2024
MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in
  Cross-Lingual Textual Relatedness
MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness
Shijia Zhou
Huangyan Shan
Barbara Plank
Robert Litschko
45
2
0
03 Apr 2024
Can Machine Translation Bridge Multilingual Pretraining and
  Cross-lingual Transfer Learning?
Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?
Shaoxiong Ji
Timothee Mickus
Vincent Segonne
Jörg Tiedemann
CLL
39
3
0
25 Mar 2024
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for
  Vietnamese Natural Language Understanding
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding
Phong Nguyen-Thuan Do
Son Quoc Tran
Phu Gia Hoang
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
ELM
50
3
0
23 Mar 2024
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and
  Closely-Related Languages
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Fahim Faisal
Orevaoghene Ahia
Aarohi Srivastava
Kabir Ahuja
David Chiang
Yulia Tsvetkov
Antonios Anastasopoulos
64
27
0
16 Mar 2024
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in
  Korean
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
Eunsu Kim
Juyoung Suk
Philhoon Oh
Haneul Yoo
James Thorne
Alice H. Oh
ELM
75
15
0
11 Mar 2024
Cost-Performance Optimization for Processing Low-Resource Language Tasks
  Using Commercial LLMs
Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs
Arijit Nag
Animesh Mukherjee
Niloy Ganguly
Soumen Chakrabarti
46
2
0
08 Mar 2024
A Measure for Transparent Comparison of Linguistic Diversity in
  Multilingual NLP Data Sets
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets
Tanja Samardzic
Ximena Gutierrez-Vasques
Christian Bentz
Steven Moran
Olga Pelloni
34
4
0
06 Mar 2024
Evaluating the Elementary Multilingual Capabilities of Large Language
  Models with MultiQ
Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ
Carolin Holtermann
Paul Röttger
Timm Dill
Anne Lauscher
ELM
LRM
40
22
0
06 Mar 2024
Natural Language Processing Methods for Symbolic Music Generation and
  Information Retrieval: a Survey
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey
Dinh-Viet-Toan Le
Louis Bigo
Mikaela Keller
Dorien Herremans
MedIm
32
9
0
27 Feb 2024
$C^3$: Confidence Calibration Model Cascade for Inference-Efficient
  Cross-Lingual Natural Language Understanding
C3C^3C3: Confidence Calibration Model Cascade for Inference-Efficient Cross-Lingual Natural Language Understanding
Taixi Lu
Haoyu Wang
Huajie Shao
Jing Gao
Huaxiu Yao
35
0
0
25 Feb 2024
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
Fajri Koto
Haonan Li
Sara Shatnawi
Jad Doughman
Abdelrahman Boda Sadallah
...
Neha Sengupta
Shady Shehata
Nizar Habash
Preslav Nakov
Timothy Baldwin
ELM
LRM
80
30
0
20 Feb 2024
On the importance of Data Scale in Pretraining Arabic Language Models
On the importance of Data Scale in Pretraining Arabic Language Models
Abbas Ghaddar
Philippe Langlais
Mehdi Rezagholizadeh
Boxing Chen
27
0
0
15 Jan 2024
TransliCo: A Contrastive Learning Framework to Address the Script
  Barrier in Multilingual Pretrained Language Models
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models
Yihong Liu
Chunlan Ma
Haotian Ye
Hinrich Schütze
33
1
0
12 Jan 2024
Cheetah: Natural Language Generation for 517 African Languages
Cheetah: Natural Language Generation for 517 African Languages
Ife Adebara
AbdelRahim Elmadany
Muhammad Abdul-Mageed
29
4
0
02 Jan 2024
ARES: An Automated Evaluation Framework for Retrieval-Augmented
  Generation Systems
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems
Jon Saad-Falcon
Omar Khattab
Christopher Potts
Matei A. Zaharia
RALM
30
106
0
16 Nov 2023
MELA: Multilingual Evaluation of Linguistic Acceptability
MELA: Multilingual Evaluation of Linguistic Acceptability
Ziyin Zhang
Yikang Liu
Wei Huang
Junyu Mao
Rui Wang
Hai Hu
30
3
0
15 Nov 2023
On the Analysis of Cross-Lingual Prompt Tuning for Decoder-based
  Multilingual Model
On the Analysis of Cross-Lingual Prompt Tuning for Decoder-based Multilingual Model
Nohil Park
Joonsuk Park
Kang Min Yoo
Sungroh Yoon
33
3
0
14 Nov 2023
XFEVER: Exploring Fact Verification across Languages
XFEVER: Exploring Fact Verification across Languages
Yi-Chen Chang
Canasai Kruengkrai
Junichi Yamagishi
HILM
16
3
0
25 Oct 2023
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64
  Languages
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
Chiyu Zhang
Khai Duy Doan
Qisheng Liao
Muhammad Abdul-Mageed
36
6
0
23 Oct 2023
Surveying the Landscape of Text Summarization with Deep Learning: A
  Comprehensive Review
Surveying the Landscape of Text Summarization with Deep Learning: A Comprehensive Review
Guanghua Wang
Weili Wu
AI4TS
AILaw
38
3
0
13 Oct 2023
Large Language Models Only Pass Primary School Exams in Indonesia: A
  Comprehensive Test on IndoMMLU
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
Fajri Koto
Nurul Aisyah
Haonan Li
Timothy Baldwin
AI4Ed
LRM
ELM
32
37
0
07 Oct 2023
12345
Next