ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.03894
  4. Cited By
CamemBERT: a Tasty French Language Model

CamemBERT: a Tasty French Language Model

10 November 2019
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
ArXivPDFHTML

Papers citing "CamemBERT: a Tasty French Language Model"

50 / 361 papers shown
Title
Cascaded Cross-Modal Transformer for Request and Complaint Detection
Cascaded Cross-Modal Transformer for Request and Complaint Detection
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
36
3
0
27 Jul 2023
Combating the Curse of Multilinguality in Cross-Lingual WSD by Aligning
  Sparse Contextualized Word Representations
Combating the Curse of Multilinguality in Cross-Lingual WSD by Aligning Sparse Contextualized Word Representations
Gábor Berend
38
7
0
25 Jul 2023
Zero-th Order Algorithm for Softmax Attention Optimization
Zero-th Order Algorithm for Softmax Attention Optimization
Yichuan Deng
Zhihang Li
Sridhar Mahadevan
Zhao Song
38
13
0
17 Jul 2023
Automatic Annotation of Direct Speech in Written French Narratives
Automatic Annotation of Direct Speech in Written French Narratives
Noé Durandard
Viet Tan
Gaspard Michel
Elena V. Epure
21
1
0
27 Jun 2023
CamemBERT-bio: Leveraging Continual Pre-training for Cost-Effective
  Models on French Biomedical Data
CamemBERT-bio: Leveraging Continual Pre-training for Cost-Effective Models on French Biomedical Data
Rian Touchent
Laurent Romary
Eric Villemonte de la Clergerie
MedIm
34
4
0
27 Jun 2023
Comparison of Pre-trained Language Models for Turkish Address Parsing
Comparison of Pre-trained Language Models for Turkish Address Parsing
Muhammed Cihat Unal
Betul Aygun
Aydın Gerek
16
4
0
24 Jun 2023
Multilingual Multiword Expression Identification Using Lateral
  Inhibition and Domain Adaptation
Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation
Andrei-Marius Avram
V. Mititelu
V. Pais
Dumitru-Clementin Cercel
Stefan Trausan-Matu
43
3
0
17 Jun 2023
Lost in Translation: Large Language Models in Non-English Content
  Analysis
Lost in Translation: Large Language Models in Non-English Content Analysis
Gabriel Nicholas
Aliya Bhatia
ELM
18
35
0
12 Jun 2023
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT
  that Easy to Detect?
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?
Wissam Antoun
Virginie Mouilleron
Benoît Sagot
Djamé Seddah
DeLMO
22
33
0
09 Jun 2023
Multilingual Clinical NER: Translation or Cross-lingual Transfer?
Multilingual Clinical NER: Translation or Cross-lingual Transfer?
X. Fontaine
Félix Gaschi
Parisa Rastin
Y. Toussaint
37
8
0
07 Jun 2023
Towards End-to-end Speech-to-text Summarization
Towards End-to-end Speech-to-text Summarization
Raul Monteiro
Diogo Pernes
9
1
0
06 Jun 2023
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Momchil Hardalov
Pepa Atanasova
Todor Mihaylov
G. Angelova
K. Simov
P. Osenova
Ves Stoyanov
Ivan Koychev
Preslav Nakov
Dragomir R. Radev
ELM
FedML
29
4
0
04 Jun 2023
Impact of translation on biomedical information extraction from
  real-life clinical notes
Impact of translation on biomedical information extraction from real-life clinical notes
C. Gérardin
Yu Xiong
Perceval Wajsburt
F. Carrat
X. Tannier
22
1
0
03 Jun 2023
Data-Efficient French Language Modeling with CamemBERTa
Data-Efficient French Language Modeling with CamemBERTa
Wissam Antoun
Benoît Sagot
Djamé Seddah
28
7
0
02 Jun 2023
ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain
  Readability Assessment
ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment
Tarek Naous
Michael Joseph Ryan
Anton Lavrouk
Mohit Chandra
Wei-ping Xu
31
7
0
23 May 2023
Language-Agnostic Bias Detection in Language Models with Bias Probing
Language-Agnostic Bias Detection in Language Models with Bias Probing
Abdullatif Köksal
Omer F. Yalcin
Ahmet Akbiyik
M. Kilavuz
Anna Korhonen
Hinrich Schütze
41
1
0
22 May 2023
PrOnto: Language Model Evaluations for 859 Languages
PrOnto: Language Model Evaluations for 859 Languages
Luke Gessler
21
1
0
22 May 2023
ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market
  Domain
ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain
Mike Zhang
Rob van der Goot
Barbara Plank
24
14
0
20 May 2023
Multilingual Event Extraction from Historical Newspaper Adverts
Multilingual Event Extraction from Historical Newspaper Adverts
Nadav Borenstein
N. Perez
Isabelle Augenstein
33
4
0
18 May 2023
Towards More Robust NLP System Evaluation: Handling Missing Scores in
  Benchmarks
Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks
Anas Himmi
Ekhine Irurozki
Nathan Noiry
Stéphan Clémençon
Pierre Colombo
34
5
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip Torr
Adel Bibi
42
97
0
17 May 2023
Emotion Recognition based on Psychological Components in Guided
  Narratives for Emotion Regulation
Emotion Recognition based on Psychological Components in Guided Narratives for Emotion Regulation
Gustave Cortal
Alain Finkel
P. Paroubek
Ye Lin
36
10
0
15 May 2023
Advancing Neural Encoding of Portuguese with Transformer Albertina PT-*
Advancing Neural Encoding of Portuguese with Transformer Albertina PT-*
João Rodrigues
Luís Gomes
Joao Silva
António Branco
Rodrigo Santos
Henrique Lopes Cardoso
T. Osório
27
43
0
11 May 2023
An Iterative Algorithm for Rescaled Hyperbolic Functions Regression
An Iterative Algorithm for Rescaled Hyperbolic Functions Regression
Yeqi Gao
Zhao Song
Junze Yin
31
33
0
01 May 2023
Large Scale Genealogical Information Extraction From Handwritten Quebec
  Parish Records
Large Scale Genealogical Information Extraction From Handwritten Quebec Parish Records
Solène Tarride
Martin Maarand
Mélodie Boillet
James McGrath
Eugénie Capel
H. Vézina
Christopher Kermorvant
30
10
0
27 Apr 2023
Attention Scheme Inspired Softmax Regression
Attention Scheme Inspired Softmax Regression
Yichuan Deng
Zhihang Li
Zhao Song
44
42
0
20 Apr 2023
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on
  POS Tagging for Non-Standardized Languages
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages
Verena Blaschke
Hinrich Schütze
Barbara Plank
39
14
0
20 Apr 2023
Measuring Normative and Descriptive Biases in Language Models Using
  Census Data
Measuring Normative and Descriptive Biases in Language Models Using Census Data
Samia Touileb
Lilja Ovrelid
Erik Velldal
27
4
0
12 Apr 2023
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large
  Language Models in Multilingual Learning
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning
Viet Dac Lai
Nghia Trung Ngo
Amir Pouran Ben Veyseh
Hieu Man
Franck Dernoncourt
Trung Bui
Thien Huu Nguyen
ELM
LM&MA
30
270
0
12 Apr 2023
Randomized and Deterministic Attention Sparsification Algorithms for
  Over-parameterized Feature Dimension
Randomized and Deterministic Attention Sparsification Algorithms for Over-parameterized Feature Dimension
Yichuan Deng
Sridhar Mahadevan
Zhao Song
14
35
0
10 Apr 2023
FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for
  Medical domain
FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domain
Yanis Labrak
Adrien Bazoge
Richard Dufour
Mickael Rouvier
Emmanuel Morin
B. Daille
P. Gourraud
27
30
0
09 Apr 2023
Automatic ICD-10 Code Association: A Challenging Task on French Clinical
  Texts
Automatic ICD-10 Code Association: A Challenging Task on French Clinical Texts
Yakini Tchouka
Jean-François Couchot
David Laiymani
Philippe Selles
Azzedine Rahmani
33
3
0
06 Apr 2023
The StatCan Dialogue Dataset: Retrieving Data Tables through
  Conversations with Genuine Intents
The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents
Xing Han Lù
Siva Reddy
H. D. Vries
LMTD
25
7
0
03 Apr 2023
DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical
  domains
DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains
Yanis Labrak
Adrien Bazoge
Richard Dufour
Mickael Rouvier
Emmanuel Morin
B. Daille
P. Gourraud
LM&MA
25
54
0
03 Apr 2023
GreekBART: The First Pretrained Greek Sequence-to-Sequence Model
GreekBART: The First Pretrained Greek Sequence-to-Sequence Model
Iakovos Evdaimon
Hadi Abdine
Christos Xypolopoulos
Stamatis Outsios
Michalis Vazirgiannis
Giorgos Stamou
VLM
36
7
0
03 Apr 2023
BERTino: an Italian DistilBERT model
BERTino: an Italian DistilBERT model
Matteo Muffo
E. Bertino
VLM
18
14
0
31 Mar 2023
Development and validation of a natural language processing algorithm to
  pseudonymize documents in the context of a clinical data warehouse
Development and validation of a natural language processing algorithm to pseudonymize documents in the context of a clinical data warehouse
X. Tannier
Perceval Wajsburt
Alice Calliger
Basile Dura
Alexandre Mouchet
M. Hilka
R. Bey
26
10
0
23 Mar 2023
SwissBERT: The Multilingual Language Model for Switzerland
SwissBERT: The Multilingual Language Model for Switzerland
Jannis Vamvas
Johannes Graen
Rico Sennrich
38
6
0
23 Mar 2023
Improving Transformer Performance for French Clinical Notes
  Classification Using Mixture of Experts on a Limited Dataset
Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset
Thanh-Dung Le
P. Jouvet
R. Noumeir
MoE
MedIm
72
5
0
22 Mar 2023
Trained on 100 million words and still in shape: BERT meets British
  National Corpus
Trained on 100 million words and still in shape: BERT meets British National Corpus
David Samuel
Andrey Kutuzov
Lilja Øvrelid
Erik Velldal
27
28
0
17 Mar 2023
Revealing Weaknesses of Vietnamese Language Models Through Unanswerable
  Questions in Machine Reading Comprehension
Revealing Weaknesses of Vietnamese Language Models Through Unanswerable Questions in Machine Reading Comprehension
Son Quoc Tran
Phong Nguyen-Thuan Do
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
39
0
0
16 Mar 2023
SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches
  for news genre, topic and persuasion technique classification
SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification
Ben Wu
Olesya Razuvayevskaya
Freddy Heppell
João A. Leite
Carolina Scarton
Kalina Bontcheva
Xingyi Song
16
9
0
16 Mar 2023
Learning for Amalgamation: A Multi-Source Transfer Learning Framework
  For Sentiment Classification
Learning for Amalgamation: A Multi-Source Transfer Learning Framework For Sentiment Classification
Cuong V. Nguyen
Khiem H. Le
Anh Tran
Quang Pham
Binh T. Nguyen
15
14
0
16 Mar 2023
MEDBERT.de: A Comprehensive German BERT Model for the Medical Domain
MEDBERT.de: A Comprehensive German BERT Model for the Medical Domain
Keno K. Bressem
Jens-Michalis Papaioannou
Paul Grundmann
Florian Borchert
Lisa Christine Adams
...
Moritz Augustin
Lennart Grosser
Marcus R. Makowski
Hugo J. W. L. Aerts
Alexander Loser
AI4MH
13
29
0
14 Mar 2023
Alloprof: a new French question-answer education dataset and its use in
  an information retrieval case study
Alloprof: a new French question-answer education dataset and its use in an information retrieval case study
Antoine Lefebvre-Brossard
Stephane Gazaille
Michel C. Desmarais
AI4Ed
26
1
0
10 Feb 2023
LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
Joel Niklaus
Veton Matoshi
Pooja Rani
Andrea Galassi
Matthias Sturmer
Ilias Chalkidis
ELM
AILaw
19
55
0
30 Jan 2023
Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural
  Networks
Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks
Antoine Louis
Gijs van Dijck
Gerasimos Spanakis
AILaw
23
9
0
30 Jan 2023
Efficient Language Model Training through Cross-Lingual and Progressive
  Transfer Learning
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
Malte Ostendorff
Georg Rehm
CLIP
VLM
CLL
41
23
0
23 Jan 2023
SPEC5G: A Dataset for 5G Cellular Network Protocol Analysis
SPEC5G: A Dataset for 5G Cellular Network Protocol Analysis
Imtiaz Karim
Kazi Samin Mubasshir
Mirza Masfiqur Rahman
Elisa Bertino
19
22
0
22 Jan 2023
Dissociating language and thought in large language models
Dissociating language and thought in large language models
Kyle Mahowald
Anna A. Ivanova
I. Blank
Nancy Kanwisher
J. Tenenbaum
Evelina Fedorenko
ELM
ReLM
29
209
0
16 Jan 2023
Previous
12345678
Next