Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1911.03894
Cited By
CamemBERT: a Tasty French Language Model
10 November 2019
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CamemBERT: a Tasty French Language Model"
50 / 361 papers shown
Title
Cascaded Cross-Modal Transformer for Request and Complaint Detection
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
36
3
0
27 Jul 2023
Combating the Curse of Multilinguality in Cross-Lingual WSD by Aligning Sparse Contextualized Word Representations
Gábor Berend
38
7
0
25 Jul 2023
Zero-th Order Algorithm for Softmax Attention Optimization
Yichuan Deng
Zhihang Li
Sridhar Mahadevan
Zhao Song
38
13
0
17 Jul 2023
Automatic Annotation of Direct Speech in Written French Narratives
Noé Durandard
Viet Tan
Gaspard Michel
Elena V. Epure
21
1
0
27 Jun 2023
CamemBERT-bio: Leveraging Continual Pre-training for Cost-Effective Models on French Biomedical Data
Rian Touchent
Laurent Romary
Eric Villemonte de la Clergerie
MedIm
34
4
0
27 Jun 2023
Comparison of Pre-trained Language Models for Turkish Address Parsing
Muhammed Cihat Unal
Betul Aygun
Aydın Gerek
16
4
0
24 Jun 2023
Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation
Andrei-Marius Avram
V. Mititelu
V. Pais
Dumitru-Clementin Cercel
Stefan Trausan-Matu
43
3
0
17 Jun 2023
Lost in Translation: Large Language Models in Non-English Content Analysis
Gabriel Nicholas
Aliya Bhatia
ELM
18
35
0
12 Jun 2023
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?
Wissam Antoun
Virginie Mouilleron
Benoît Sagot
Djamé Seddah
DeLMO
22
33
0
09 Jun 2023
Multilingual Clinical NER: Translation or Cross-lingual Transfer?
X. Fontaine
Félix Gaschi
Parisa Rastin
Y. Toussaint
37
8
0
07 Jun 2023
Towards End-to-end Speech-to-text Summarization
Raul Monteiro
Diogo Pernes
9
1
0
06 Jun 2023
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Momchil Hardalov
Pepa Atanasova
Todor Mihaylov
G. Angelova
K. Simov
P. Osenova
Ves Stoyanov
Ivan Koychev
Preslav Nakov
Dragomir R. Radev
ELM
FedML
29
4
0
04 Jun 2023
Impact of translation on biomedical information extraction from real-life clinical notes
C. Gérardin
Yu Xiong
Perceval Wajsburt
F. Carrat
X. Tannier
22
1
0
03 Jun 2023
Data-Efficient French Language Modeling with CamemBERTa
Wissam Antoun
Benoît Sagot
Djamé Seddah
28
7
0
02 Jun 2023
ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment
Tarek Naous
Michael Joseph Ryan
Anton Lavrouk
Mohit Chandra
Wei-ping Xu
31
7
0
23 May 2023
Language-Agnostic Bias Detection in Language Models with Bias Probing
Abdullatif Köksal
Omer F. Yalcin
Ahmet Akbiyik
M. Kilavuz
Anna Korhonen
Hinrich Schütze
41
1
0
22 May 2023
PrOnto: Language Model Evaluations for 859 Languages
Luke Gessler
21
1
0
22 May 2023
ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain
Mike Zhang
Rob van der Goot
Barbara Plank
24
14
0
20 May 2023
Multilingual Event Extraction from Historical Newspaper Adverts
Nadav Borenstein
N. Perez
Isabelle Augenstein
33
4
0
18 May 2023
Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks
Anas Himmi
Ekhine Irurozki
Nathan Noiry
Stéphan Clémençon
Pierre Colombo
34
5
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip Torr
Adel Bibi
42
97
0
17 May 2023
Emotion Recognition based on Psychological Components in Guided Narratives for Emotion Regulation
Gustave Cortal
Alain Finkel
P. Paroubek
Ye Lin
36
10
0
15 May 2023
Advancing Neural Encoding of Portuguese with Transformer Albertina PT-*
João Rodrigues
Luís Gomes
Joao Silva
António Branco
Rodrigo Santos
Henrique Lopes Cardoso
T. Osório
27
43
0
11 May 2023
An Iterative Algorithm for Rescaled Hyperbolic Functions Regression
Yeqi Gao
Zhao Song
Junze Yin
31
33
0
01 May 2023
Large Scale Genealogical Information Extraction From Handwritten Quebec Parish Records
Solène Tarride
Martin Maarand
Mélodie Boillet
James McGrath
Eugénie Capel
H. Vézina
Christopher Kermorvant
30
10
0
27 Apr 2023
Attention Scheme Inspired Softmax Regression
Yichuan Deng
Zhihang Li
Zhao Song
44
42
0
20 Apr 2023
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages
Verena Blaschke
Hinrich Schütze
Barbara Plank
39
14
0
20 Apr 2023
Measuring Normative and Descriptive Biases in Language Models Using Census Data
Samia Touileb
Lilja Ovrelid
Erik Velldal
27
4
0
12 Apr 2023
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
Viet Dac Lai
Nghia Trung Ngo
Amir Pouran Ben Veyseh
Hieu Man
Franck Dernoncourt
Trung Bui
Thien Huu Nguyen
ELM
LM&MA
30
270
0
12 Apr 2023
Randomized and Deterministic Attention Sparsification Algorithms for Over-parameterized Feature Dimension
Yichuan Deng
Sridhar Mahadevan
Zhao Song
14
35
0
10 Apr 2023
FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domain
Yanis Labrak
Adrien Bazoge
Richard Dufour
Mickael Rouvier
Emmanuel Morin
B. Daille
P. Gourraud
27
30
0
09 Apr 2023
Automatic ICD-10 Code Association: A Challenging Task on French Clinical Texts
Yakini Tchouka
Jean-François Couchot
David Laiymani
Philippe Selles
Azzedine Rahmani
33
3
0
06 Apr 2023
The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents
Xing Han Lù
Siva Reddy
H. D. Vries
LMTD
25
7
0
03 Apr 2023
DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains
Yanis Labrak
Adrien Bazoge
Richard Dufour
Mickael Rouvier
Emmanuel Morin
B. Daille
P. Gourraud
LM&MA
25
54
0
03 Apr 2023
GreekBART: The First Pretrained Greek Sequence-to-Sequence Model
Iakovos Evdaimon
Hadi Abdine
Christos Xypolopoulos
Stamatis Outsios
Michalis Vazirgiannis
Giorgos Stamou
VLM
36
7
0
03 Apr 2023
BERTino: an Italian DistilBERT model
Matteo Muffo
E. Bertino
VLM
18
14
0
31 Mar 2023
Development and validation of a natural language processing algorithm to pseudonymize documents in the context of a clinical data warehouse
X. Tannier
Perceval Wajsburt
Alice Calliger
Basile Dura
Alexandre Mouchet
M. Hilka
R. Bey
26
10
0
23 Mar 2023
SwissBERT: The Multilingual Language Model for Switzerland
Jannis Vamvas
Johannes Graen
Rico Sennrich
38
6
0
23 Mar 2023
Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset
Thanh-Dung Le
P. Jouvet
R. Noumeir
MoE
MedIm
72
5
0
22 Mar 2023
Trained on 100 million words and still in shape: BERT meets British National Corpus
David Samuel
Andrey Kutuzov
Lilja Øvrelid
Erik Velldal
27
28
0
17 Mar 2023
Revealing Weaknesses of Vietnamese Language Models Through Unanswerable Questions in Machine Reading Comprehension
Son Quoc Tran
Phong Nguyen-Thuan Do
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
39
0
0
16 Mar 2023
SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification
Ben Wu
Olesya Razuvayevskaya
Freddy Heppell
João A. Leite
Carolina Scarton
Kalina Bontcheva
Xingyi Song
16
9
0
16 Mar 2023
Learning for Amalgamation: A Multi-Source Transfer Learning Framework For Sentiment Classification
Cuong V. Nguyen
Khiem H. Le
Anh Tran
Quang Pham
Binh T. Nguyen
15
14
0
16 Mar 2023
MEDBERT.de: A Comprehensive German BERT Model for the Medical Domain
Keno K. Bressem
Jens-Michalis Papaioannou
Paul Grundmann
Florian Borchert
Lisa Christine Adams
...
Moritz Augustin
Lennart Grosser
Marcus R. Makowski
Hugo J. W. L. Aerts
Alexander Loser
AI4MH
13
29
0
14 Mar 2023
Alloprof: a new French question-answer education dataset and its use in an information retrieval case study
Antoine Lefebvre-Brossard
Stephane Gazaille
Michel C. Desmarais
AI4Ed
26
1
0
10 Feb 2023
LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
Joel Niklaus
Veton Matoshi
Pooja Rani
Andrea Galassi
Matthias Sturmer
Ilias Chalkidis
ELM
AILaw
19
55
0
30 Jan 2023
Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks
Antoine Louis
Gijs van Dijck
Gerasimos Spanakis
AILaw
23
9
0
30 Jan 2023
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
Malte Ostendorff
Georg Rehm
CLIP
VLM
CLL
41
23
0
23 Jan 2023
SPEC5G: A Dataset for 5G Cellular Network Protocol Analysis
Imtiaz Karim
Kazi Samin Mubasshir
Mirza Masfiqur Rahman
Elisa Bertino
19
22
0
22 Jan 2023
Dissociating language and thought in large language models
Kyle Mahowald
Anna A. Ivanova
I. Blank
Nancy Kanwisher
J. Tenenbaum
Evelina Fedorenko
ELM
ReLM
29
209
0
16 Jan 2023
Previous
1
2
3
4
5
6
7
8
Next