ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 18,335 papers shown
Cross-Lingual Ability of Multilingual BERT: An Empirical Study
Karthikeyan K
Zihan Wang
Stephen D. Mayhew
Dan Roth
LRM
36
333
0
17 Dec 2019
Multilingual is not enough: BERT for Finnish
Antti Virtanen
Jenna Kanerva
Rami Ilo
Jouni Luoma
Juhani Luotolahti
T. Salakoski
Filip Ginter
S. Pyysalo
36
277
0
15 Dec 2019
Knowledge-based Conversational Search
Svitlana Vakulenko
22
13
0
14 Dec 2019
Text as Environment: A Deep Reinforcement Learning Text Readability Assessment Model
Hamid Reza Mohammadi
S. H. Khasteh
Tahereh Firoozi
Taha Samavati
29
20
0
12 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
54
395
0
11 Dec 2019
CoSimLex: A Resource for Evaluating Graded Word Similarity in Context
C. S. Armendariz
Matthew Purver
Matej Ulčar
Senja Pollak
Nikola Ljubesic
Marko Robnik-Šikonja
Mark Granroth-Wilding
Kristiina Vaik
29
35
0
11 Dec 2019
Improving Neural Protein-Protein Interaction Extraction with Knowledge Selection
Huiwei Zhou
Xuefei Li
Weihong Yao
Zhuang Liu
Shixian Ning
Chengkun Lang
Lei Du
24
7
0
11 Dec 2019
Adversarial Analysis of Natural Language Inference Systems
Tiffany Chien
Jugal Kalita
AAML
39
12
0
07 Dec 2019
Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
21
15
0
06 Dec 2019
Self-Supervised Learning of Video-Induced Visual Invariances
Michael Tschannen
Josip Djolonga
Marvin Ritter
Aravindh Mahendran
Xiaohua Zhai
N. Houlsby
Sylvain Gelly
Mario Lucic
SSL
18
61
0
05 Dec 2019
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Vishvak Murahari
Dhruv Batra
Devi Parikh
Abhishek Das
VLM
23
115
0
05 Dec 2019
Scratch that! An Evolution-based Adversarial Attack against Neural Networks
Malhar Jere
Loris Rossi
Briland Hitaj
Gabriela F. Cretu-Ciocarlie
Giacomo Boracchi
F. Koushanfar
AAML
19
18
0
05 Dec 2019
Plug and Play Language Models: A Simple Approach to Controlled Text Generation
Sumanth Dathathri
Andrea Madotto
Janice Lan
Jane Hung
Eric Frank
Piero Molino
J. Yosinski
Rosanne Liu
KELM
64
946
0
04 Dec 2019
An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering
Shayne Longpre
Yi Lu
Zhucheng Tu
Christopher DuBois
27
70
0
04 Dec 2019
AMUSED: A Multi-Stream Vector Representation Method for Use in Natural Dialogue
Gaurav Kumar
Rishabh Joshi
Jaspreet Singh
Promod Yenigalla
23
7
0
04 Dec 2019
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
33
313
0
04 Dec 2019
Scalable Bayesian Preference Learning for Crowds
Edwin Simpson
Iryna Gurevych
BDL
19
24
0
04 Dec 2019
Reading the Manual: Event Extraction as Definition Comprehension
Yunmo Chen
Tongfei Chen
Seth Ebner
Aaron Steven White
Benjamin Van Durme
30
63
0
03 Dec 2019
A Comparative Study of Pretrained Language Models on Thai Social Text Categorization
Thanapapas Horsuwan
Kasidis Kanwatchara
P. Vateekul
B. Kijsirikul
19
9
0
03 Dec 2019
An Annotated Dataset of Coreference in English Literature
David Bamman
Olivia Lewke
A. Mansoor
16
105
0
03 Dec 2019
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection
Shubhi Tyagi
M. Nicolis
Jonas Rohnke
Thomas Drugman
Jaime Lorenzo-Trueba
32
32
0
02 Dec 2019
Solving Arithmetic Word Problems Automatically Using Transformer and Unambiguous Representations
Kaden Griffith
Jugal Kalita
18
16
0
02 Dec 2019
Multi-Scale Self-Attention for Text Classification
Qipeng Guo
Xipeng Qiu
Pengfei Liu
Xiangyang Xue
Zheng-Wei Zhang
ViT
13
62
0
02 Dec 2019
Knowledge Infused Learning (K-IL): Towards Deep Incorporation of Knowledge in Deep Learning
Ugur Kursuncu
Manas Gaur
A. Sheth
NAI
26
57
0
01 Dec 2019
Preserving Patient Privacy while Training a Predictive Model of In-hospital Mortality
Pulkit Sharma
Farah E. Shamout
David Clifton
25
25
0
01 Dec 2019
Automatic Creation of Text Corpora for Low-Resource Languages from the Internet: The Case of Swiss German
Lucy Linder
Michael Jungo
J. Hennebert
C. Musat
Andreas Fischer
22
15
0
30 Nov 2019
Deconstructing and reconstructing word embedding algorithms
Edward Newell
Kian Kenyon-Dean
Jackie C.K. Cheung
39
4
0
29 Nov 2019
An Iterative Polishing Framework based on Quality Aware Masked Language Model for Chinese Poetry Generation
Li-Ming Deng
Jie Wang
Hangming Liang
Hui-ping Chen
Zhiqiang Xie
Bojin Zhuang
Shaojun Wang
Jing Xiao
57
22
0
29 Nov 2019
Blockwisely Supervised Neural Architecture Search with Knowledge Distillation
Changlin Li
Jiefeng Peng
Liuchun Yuan
Guangrun Wang
Xiaodan Liang
Liang Lin
Xiaojun Chang
36
180
0
29 Nov 2019
Inducing Relational Knowledge from BERT
Zied Bouraoui
Jose Camacho-Collados
Steven Schockaert
34
166
0
28 Nov 2019
DeFINE: DEep Factorized INput Token Embeddings for Neural Sequence Modeling
Sachin Mehta
Rik Koncel-Kedziorski
Mohammad Rastegari
Hannaneh Hajishirzi
AI4TS
38
23
0
27 Nov 2019
FairyTED: A Fair Rating Predictor for TED Talk Data
Rupam Acharyya
Shouman Das
Ankani Chattoraj
Md. Iftekhar Tanveer
27
12
0
25 Nov 2019
Corpus Wide Argument Mining -- a Working Solution
L. Ein-Dor
Eyal Shnarch
Lena Dankin
Alon Halfon
Benjamin Sznajder
...
Leshem Choshen
Yufang Hou
Yonatan Bilu
R. Aharonov
Noam Slonim
22
62
0
25 Nov 2019
End-to-End Trainable Non-Collaborative Dialog System
Yu Li
Kun Qian
Weiyan Shi
Zhou Yu
21
45
0
25 Nov 2019
Who did They Respond to? Conversation Structure Modeling using Masked Hierarchical Transformer
Henghui Zhu
Feng Nan
Zhiguo Wang
Ramesh Nallapati
Bing Xiang
30
39
0
25 Nov 2019
Invenio: Discovering Hidden Relationships Between Tasks/Domains Using Structured Meta Learning
Sameeksha Katoch
Kowshik Thopalli
Jayaraman J. Thiagarajan
Pavan Turaga
A. Spanias
14
4
0
24 Nov 2019
Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context
Yichi Zhang
Zhijian Ou
Zhou Yu
27
182
0
24 Nov 2019
A Transformer-based approach to Irony and Sarcasm detection
Rolandos Alexandros Potamias
Georgios Siolas
A. Stafylopatis
33
206
0
23 Nov 2019
Factorized Multimodal Transformer for Multimodal Sequential Learning
Amir Zadeh
Chengfeng Mao
Kelly Shi
Yiwei Zhang
Paul Pu Liang
Soujanya Poria
Louis-Philippe Morency
25
44
0
22 Nov 2019
WildMix Dataset and Spectro-Temporal Transformer Model for Monoaural Audio Source Separation
Amir Zadeh
Tianjun Ma
Soujanya Poria
Louis-Philippe Morency
19
8
0
21 Nov 2019
Separate and Attend in Personal Email Search
Yu Meng
Maryam Karimzadehgan
Honglei Zhuang
Donald Metzler
FedML
220
2
0
21 Nov 2019
Improving Conditioning in Context-Aware Sequence to Sequence Models
Xinyi Wang
Jason Weston
Michael Auli
Yacine Jernite
31
13
0
21 Nov 2019
Automatically Neutralizing Subjective Bias in Text
Reid Pryzant
Richard Diehl Martinez
Nathan Dass
Sadao Kurohashi
Dan Jurafsky
Diyi Yang
33
175
0
21 Nov 2019
Attention-Informed Mixed-Language Training for Zero-shot Cross-lingual Task-oriented Dialogue Systems
Zihan Liu
Genta Indra Winata
Zhaojiang Lin
Peng Xu
Pascale Fung
25
98
0
21 Nov 2019
Generating Interactive Worlds with Text
Angela Fan
Jack Urbanek
Pratik Ringshia
Emily Dinan
Emma Qian
...
Shrimai Prabhumoye
Douwe Kiela
Tim Rocktaschel
Arthur Szlam
Jason Weston
27
27
0
20 Nov 2019
Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization
Shiori Sagawa
Pang Wei Koh
Tatsunori B. Hashimoto
Percy Liang
OOD
16
1,200
0
20 Nov 2019
Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
Junliang Guo
Xu Tan
Linli Xu
Tao Qin
Enhong Chen
Tie-Yan Liu
14
85
0
20 Nov 2019
DermGAN: Synthetic Generation of Clinical Skin Images with Pathology
Amirata Ghorbani
Vivek Natarajan
David Coz
Yuan Liu
GAN
MedIm
21
98
0
20 Nov 2019
Global Greedy Dependency Parsing
Z. Li
Zhao Hai
Kevin Parnow
31
31
0
20 Nov 2019
Towards Lingua Franca Named Entity Recognition with BERT
Taesun Moon
Parul Awasthy
Jian Ni
Radu Florian
19
29
0
19 Nov 2019