ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.07076
  4. Cited By
Multilingual is not enough: BERT for Finnish

Multilingual is not enough: BERT for Finnish

15 December 2019
Antti Virtanen
Jenna Kanerva
Rami Ilo
Jouni Luoma
Juhani Luotolahti
T. Salakoski
Filip Ginter
S. Pyysalo
ArXivPDFHTML

Papers citing "Multilingual is not enough: BERT for Finnish"

50 / 52 papers shown
Title
Finnish SQuAD: A Simple Approach to Machine Translation of Span Annotations
Finnish SQuAD: A Simple Approach to Machine Translation of Span Annotations
Emil Nuutinen
Iiro Rastas
Filip Ginter
45
1
0
10 Jan 2025
Transformer-based Entity Legal Form Classification
Transformer-based Entity Legal Form Classification
Alexander Arimond
Mauro Molteni
Dominik Jany
Zornitsa Manolova
Damian Borth
Andreas G. F. Hoepner
MedIm
AILaw
19
1
0
19 Oct 2023
Testing the Predictions of Surprisal Theory in 11 Languages
Testing the Predictions of Surprisal Theory in 11 Languages
Ethan Gotlieb Wilcox
Tiago Pimentel
Clara Meister
Ryan Cotterell
R. Levy
LRM
52
63
0
07 Jul 2023
Multilingual Multiword Expression Identification Using Lateral
  Inhibition and Domain Adaptation
Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation
Andrei-Marius Avram
V. Mititelu
V. Pais
Dumitru-Clementin Cercel
Stefan Trausan-Matu
43
3
0
17 Jun 2023
Do All Languages Cost the Same? Tokenization in the Era of Commercial
  Language Models
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models
Orevaoghene Ahia
Sachin Kumar
Hila Gonen
Jungo Kasai
David R. Mortensen
Noah A. Smith
Yulia Tsvetkov
53
82
0
23 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip Torr
Adel Bibi
42
97
0
17 May 2023
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on
  POS Tagging for Non-Standardized Languages
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages
Verena Blaschke
Hinrich Schütze
Barbara Plank
39
14
0
20 Apr 2023
BERTino: an Italian DistilBERT model
BERTino: an Italian DistilBERT model
Matteo Muffo
E. Bertino
VLM
18
14
0
31 Mar 2023
Cross-lingual German Biomedical Information Extraction: from Zero-shot
  to Human-in-the-Loop
Cross-lingual German Biomedical Information Extraction: from Zero-shot to Human-in-the-Loop
Siting Liang
Mareike Hartmann
Daniel Sonntag
23
3
0
24 Jan 2023
Efficient Language Model Training through Cross-Lingual and Progressive
  Transfer Learning
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
Malte Ostendorff
Georg Rehm
CLIP
VLM
CLL
41
23
0
23 Jan 2023
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code
  Completion
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code Completion
Zi Gong
Yinpeng Guo
Pingyi Zhou
Cuiyun Gao
Yasheng Wang
Zenglin Xu
14
8
0
19 Dec 2022
Applying Multilingual Models to Question Answering (QA)
Applying Multilingual Models to Question Answering (QA)
Ayrton San Joaquin
Filip Skubacz
18
1
0
04 Dec 2022
Exploring Predictive Uncertainty and Calibration in NLP: A Study on the
  Impact of Method & Data Scarcity
Exploring Predictive Uncertainty and Calibration in NLP: A Study on the Impact of Method & Data Scarcity
Dennis Ulmer
J. Frellsen
Christian Hardmeier
195
22
0
20 Oct 2022
Self-Adaptive Named Entity Recognition by Retrieving Unstructured
  Knowledge
Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge
Kosuke Nishida
Naoki Yoshinaga
Kyosuke Nishida
30
2
0
14 Oct 2022
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language
  Models for Sinhala Text Classification
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification
Vinura Dhananjaya
Piyumal Demotte
Surangika Ranathunga
Sanath Jayasena
27
14
0
16 Aug 2022
Sort by Structure: Language Model Ranking as Dependency Probing
Sort by Structure: Language Model Ranking as Dependency Probing
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
41
3
0
10 Jun 2022
State-of-the-art in Open-domain Conversational AI: A Survey
State-of-the-art in Open-domain Conversational AI: A Survey
Tosin P. Adewumi
F. Liwicki
Marcus Liwicki
32
15
0
02 May 2022
You Are What You Write: Preserving Privacy in the Era of Large Language
  Models
You Are What You Write: Preserving Privacy in the Era of Large Language Models
Richard Plant
V. Giuffrida
Dimitra Gkatzia
PILM
38
19
0
20 Apr 2022
BERTuit: Understanding Spanish language in Twitter through a native
  transformer
BERTuit: Understanding Spanish language in Twitter through a native transformer
Javier Huertas-Tato
Alejandro Martín
David Camacho
26
9
0
07 Apr 2022
Punctuation restoration in Swedish through fine-tuned KB-BERT
Punctuation restoration in Swedish through fine-tuned KB-BERT
J. Nilsson
18
0
0
14 Feb 2022
A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language
  Models
A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models
Vésteinn Snæbjarnarson
Haukur Barri Símonarson
Pétur Orri Ragnarsson
Svanhvít Lilja Ingólfsdóttir
H. Jónsson
Vilhjálmur Þorsteinsson
H. Einarsson
24
26
0
14 Jan 2022
Recent Advances in Natural Language Processing via Large Pre-Trained
  Language Models: A Survey
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
83
1,035
0
01 Nov 2021
Cross-lingual Transfer of Monolingual Models
Cross-lingual Transfer of Monolingual Models
Evangelia Gogoulou
Ariel Ekgren
T. Isbister
Magnus Sahlgren
29
17
0
15 Sep 2021
Evaluating Transferability of BERT Models on Uralic Languages
Evaluating Transferability of BERT Models on Uralic Languages
Judit Ács
Dániel Lévai
András Kornai
30
6
0
13 Sep 2021
MultiEURLEX -- A multi-lingual and multi-label legal document
  classification dataset for zero-shot cross-lingual transfer
MultiEURLEX -- A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer
Ilias Chalkidis
Manos Fergadiotis
Ion Androutsopoulos
AILaw
27
107
0
02 Sep 2021
Are the Multilingual Models Better? Improving Czech Sentiment with
  Transformers
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel Přibáň
J. Steinberger
36
11
0
24 Aug 2021
PyEuroVoc: A Tool for Multilingual Legal Document Classification with
  EuroVoc Descriptors
PyEuroVoc: A Tool for Multilingual Legal Document Classification with EuroVoc Descriptors
Andrei-Marius Avram
V. Pais
D. Tufis
AILaw
VLM
24
17
0
02 Aug 2021
Context-aware Adversarial Training for Name Regularity Bias in Named
  Entity Recognition
Context-aware Adversarial Training for Name Regularity Bias in Named Entity Recognition
Abbas Ghaddar
Philippe Langlais
Ahmad Rashid
Mehdi Rezagholizadeh
39
42
0
24 Jul 2021
Evaluation of contextual embeddings on less-resourced languages
Evaluation of contextual embeddings on less-resourced languages
Matej Ulvcar
Alevs vZagar
C. S. Armendariz
Andravz Repar
Senja Pollak
Matthew Purver
Marko Robnik-vSikonja
36
11
0
22 Jul 2021
Are Multilingual Models the Best Choice for Moderately Under-resourced
  Languages? A Comprehensive Assessment for Catalan
Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? A Comprehensive Assessment for Catalan
Jordi Armengol-Estapé
C. Carrino
Carlos Rodríguez-Penagos
Ona de Gibert Bonet
Carme Armentano-Oller
Aitor Gonzalez-Agirre
Maite Melero
Marta Villegas
68
42
0
16 Jul 2021
A Primer on Pretrained Multilingual Language Models
A Primer on Pretrained Multilingual Language Models
Sumanth Doddapaneni
Gowtham Ramesh
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
LRM
43
74
0
01 Jul 2021
RobeCzech: Czech RoBERTa, a monolingual contextualized language
  representation model
RobeCzech: Czech RoBERTa, a monolingual contextualized language representation model
Milan Straka
Jakub Náplava
Jana Straková
David Samuel
31
47
0
24 May 2021
Quantitative Evaluation of Alternative Translations in a Corpus of
  Highly Dissimilar Finnish Paraphrases
Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases
Li-Hsin Chang
S. Pyysalo
Jenna Kanerva
Filip Ginter
28
2
0
06 May 2021
HerBERT: Efficiently Pretrained Transformer-based Language Model for
  Polish
HerBERT: Efficiently Pretrained Transformer-based Language Model for Polish
Robert Mroczkowski
Piotr Rybak
Alina Wróblewska
Ireneusz Gawlik
36
81
0
04 May 2021
Deep learning for sentence clustering in essay grading support
Deep learning for sentence clustering in essay grading support
Li-Hsin Chang
Iiro Rastas
S. Pyysalo
Filip Ginter
26
8
0
23 Apr 2021
Bertinho: Galician BERT Representations
Bertinho: Galician BERT Representations
David Vilares
Marcos Garcia
Carlos Gómez-Rodríguez
65
22
0
25 Mar 2021
Czert -- Czech BERT-like Model for Language Representation
Czert -- Czech BERT-like Model for Language Representation
Jakub Sido
O. Pražák
P. Pribán
Jan Pasek
Michal Seják
Miloslav Konopík
31
43
0
24 Mar 2021
Pre-Training BERT on Arabic Tweets: Practical Considerations
Pre-Training BERT on Arabic Tweets: Practical Considerations
Ahmed Abdelali
Sabit Hassan
Hamdy Mubarak
Kareem Darwish
Younes Samih
25
96
0
21 Feb 2021
FLERT: Document-Level Features for Named Entity Recognition
FLERT: Document-Level Features for Named Entity Recognition
Stefan Schweter
Alan Akbik
22
111
0
13 Nov 2020
EstBERT: A Pretrained Language-Specific BERT for Estonian
EstBERT: A Pretrained Language-Specific BERT for Estonian
Hasan Tanvir
Claudia Kittask
Sandra Eiche
Kairit Sirts
20
36
0
09 Nov 2020
German's Next Language Model
German's Next Language Model
Branden Chan
Stefan Schweter
Timo Möller
27
264
0
21 Oct 2020
The birth of Romanian BERT
The birth of Romanian BERT
Stefan Daniel Dumitrescu
Andrei-Marius Avram
S. Pyysalo
VLM
8
76
0
18 Sep 2020
KR-BERT: A Small-Scale Korean-Specific Language Model
KR-BERT: A Small-Scale Korean-Specific Language Model
Sangah Lee
Hansol Jang
Yunmee Baik
Suzi Park
Hyopil Shin
24
51
0
10 Aug 2020
FinEst BERT and CroSloEngual BERT: less is more in multilingual models
FinEst BERT and CroSloEngual BERT: less is more in multilingual models
Matej Ulvcar
Marko Robnik-Šikonja
19
48
0
14 Jun 2020
Transferring Monolingual Model to Low-Resource Language: The Case of
  Tigrinya
Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya
Abrhalei Tela
Abraham Woubie
Ville Hautamaki
37
12
0
13 Jun 2020
Pre-trained Models for Natural Language Processing: A Survey
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
243
1,452
0
18 Mar 2020
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual
  Lexical Semantic Similarity
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity
Ivan Vulić
Simon Baker
Edoardo Ponti
Ulla Petti
Ira Leviant
...
Eden Bar
Matt Malone
Thierry Poibeau
Roi Reichart
Anna Korhonen
21
82
0
10 Mar 2020
BERTje: A Dutch BERT Model
BERTje: A Dutch BERT Model
Wietse de Vries
Andreas van Cranenburgh
Arianna Bisazza
Tommaso Caselli
Gertjan van Noord
Malvina Nissim
VLM
SSeg
16
291
0
19 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
49
395
0
11 Dec 2019
CamemBERT: a Tasty French Language Model
CamemBERT: a Tasty French Language Model
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
42
956
0
10 Nov 2019
12
Next