2310.20470
Cited By
Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation
31 October 2023
A. Seza Doğruöz
Sunayana Sitaram
Zheng-Xin Yong
Papers citing
"Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation"
10 / 10 papers shown
Leveraging LLMs for Translating and Classifying Mental Health Data
Konstantinos Skianis
A. Seza Doğruöz
John Pavlopoulos
AI4MH
23
0
0
16 Oct 2024
Grammatical Error Correction for Code-Switched Sentences by Learners of English
Kelvin Wey Han Chan
Christopher Bryant
Li Nguyen
Andrew Caines
Zheng Yuan
49
2
0
18 Apr 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
Libo Qin
Qiguang Chen
Yuhang Zhou
Zhi Chen
Hai-Tao Zheng
Lizi Liao
Min Li
Wanxiang Che
Philip S. Yu
LRM
55
36
0
07 Apr 2024
TRUCE: Private Benchmarking to Prevent Contamination and Improve Comparative Evaluation of LLMs
Tanmay Rajore
Nishanth Chandran
Sunayana Sitaram
Divya Gupta
Rahul Sharma
Kashish Mittal
Manohar Swaminathan
47
14
0
01 Mar 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
Shivalika Singh
Freddie Vargus
Daniel D'souza
Börje F. Karlsson
Abinaya Mahendiran
...
Max Bartolo
Julia Kreutzer
Ahmet Üstün
Marzieh Fadaee
Sara Hooker
122
118
0
09 Feb 2024
ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification
Taja Kuzman
I. Mozetič
Nikola Ljubesic
57
92
0
07 Mar 2023
A Survey of Code-switching: Linguistic and Social Perspectives for Language Technologies
A. Seza Doğruöz
Sunayana Sitaram
Barbara E. Bullock
Almeida Jacqueline Toribio
78
72
0
05 Jan 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
339
12,003
0
04 Mar 2022
From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text
Ishan Tarunesh
Syamantak Kumar
P. Jyothi
44
45
0
14 Jul 2021
Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences
Genta Indra Winata
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
SyDa
135
92
0
18 Sep 2019