ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,511 papers shown
Title
Better Automatic Evaluation of Open-Domain Dialogue Systems with
  Contextualized Embeddings
Better Automatic Evaluation of Open-Domain Dialogue Systems with Contextualized Embeddings
Sarik Ghazarian
Johnny Tian-Zheng Wei
Aram Galstyan
Nanyun Peng
58
90
0
24 Apr 2019
Low-Memory Neural Network Training: A Technical Report
Low-Memory Neural Network Training: A Technical Report
N. Sohoni
Christopher R. Aberger
Megan Leszczynski
Jian Zhang
Christopher Ré
92
103
0
24 Apr 2019
Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference
Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference
Nikita Kitaev
Dan Klein
67
23
0
22 Apr 2019
Exploring Unsupervised Pretraining and Sentence Structure Modelling for
  Winograd Schema Challenge
Exploring Unsupervised Pretraining and Sentence Structure Modelling for Winograd Schema Challenge
Yu-Ping Ruan
Xiao-Dan Zhu
Zhenhua Ling
Zhan Shi
Quan Liu
Si Wei
58
16
0
22 Apr 2019
Poly-encoders: Transformer Architectures and Pre-training Strategies for
  Fast and Accurate Multi-sentence Scoring
Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring
Samuel Humeau
Kurt Shuster
Marie-Anne Lachaux
Jason Weston
190
289
0
22 Apr 2019
Fine-Grained Argument Unit Recognition and Classification
Fine-Grained Argument Unit Recognition and Classification
Dietrich Trautmann
Johannes Daxenberger
Christian Stab
Hinrich Schütze
Iryna Gurevych
63
64
0
22 Apr 2019
Investigating Prior Knowledge for Challenging Chinese Machine Reading
  Comprehension
Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension
Kai Sun
Dian Yu
Dong Yu
Claire Cardie
100
103
0
21 Apr 2019
BERTScore: Evaluating Text Generation with BERT
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
649
5,896
0
21 Apr 2019
Model Compression with Multi-Task Knowledge Distillation for Web-scale
  Question Answering System
Model Compression with Multi-Task Knowledge Distillation for Web-scale Question Answering System
Ze Yang
Linjun Shou
Ming Gong
Wutao Lin
Daxin Jiang
KELM
74
20
0
21 Apr 2019
Obfuscation for Privacy-preserving Syntactic Parsing
Obfuscation for Privacy-preserving Syntactic Parsing
Zhifeng Hu
Serhii Havrylov
Ivan Titov
Shay B. Cohen
59
7
0
21 Apr 2019
PullNet: Open Domain Question Answering with Iterative Retrieval on
  Knowledge Bases and Text
PullNet: Open Domain Question Answering with Iterative Retrieval on Knowledge Bases and Text
Haitian Sun
Tania Bedrax-Weiss
William W. Cohen
RALMReLM
103
356
0
21 Apr 2019
Few-Shot NLG with Pre-Trained Language Model
Few-Shot NLG with Pre-Trained Language Model
Zhiyu Zoey Chen
H. Eavani
Wenhu Chen
Yinyin Liu
William Yang Wang
LMTD
122
142
0
21 Apr 2019
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for
  Natural Language Understanding
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
FedML
92
183
0
20 Apr 2019
Language Models with Transformers
Language Models with Transformers
Chenguang Wang
Mu Li
Alex Smola
102
122
0
20 Apr 2019
Mask-Predict: Parallel Decoding of Conditional Masked Language Models
Mask-Predict: Parallel Decoding of Conditional Masked Language Models
Marjan Ghazvininejad
Omer Levy
Yinhan Liu
Luke Zettlemoyer
MoE
74
35
0
19 Apr 2019
An Evaluation of Transfer Learning for Classifying Sales Engagement
  Emails at Large Scale
An Evaluation of Transfer Learning for Classifying Sales Engagement Emails at Large Scale
Yong Liu
Pavel A. Dmitriev
Yifei Huang
Andrew Brooks
Li Dong
54
4
0
19 Apr 2019
Unifying Question Answering, Text Classification, and Regression via
  Span Extraction
Unifying Question Answering, Text Classification, and Regression via Span Extraction
N. Keskar
Bryan McCann
Caiming Xiong
R. Socher
BDL
79
21
0
19 Apr 2019
ERNIE: Enhanced Representation through Knowledge Integration
ERNIE: Enhanced Representation through Knowledge Integration
Yu Sun
Shuohuan Wang
Yukun Li
Shikun Feng
Xuyi Chen
Han Zhang
Xin Tian
Danxiang Zhu
Hao Tian
Hua Wu
135
907
0
19 Apr 2019
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
Shijie Wu
Mark Dredze
VLMSSeg
151
681
0
19 Apr 2019
Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
Christine Basta
Marta R. Costa-jussá
Noe Casas
83
195
0
18 Apr 2019
Inpatient2Vec: Medical Representation Learning for Inpatients
Inpatient2Vec: Medical Representation Learning for Inpatients
Ying Wang
Xiao Xu
Tao Jin
Xiang Li
Guotong Xie
Jianmin Wang
AI4TS
136
21
0
18 Apr 2019
Headline Generation: Learning from Decomposable Document Titles
Headline Generation: Learning from Decomposable Document Titles
Oleg V. Vasilyev
Tom Grek
John Bohannon
74
10
0
17 Apr 2019
DocBERT: BERT for Document Classification
DocBERT: BERT for Document Classification
Ashutosh Adhikari
Achyudh Ram
Raphael Tang
Jimmy J. Lin
LLMAGVLM
103
301
0
17 Apr 2019
Casting Light on Invisible Cities: Computationally Engaging with
  Literary Criticism
Casting Light on Invisible Cities: Computationally Engaging with Literary Criticism
Shufan Wang
Mohit Iyyer
36
4
0
17 Apr 2019
Document Expansion by Query Prediction
Document Expansion by Query Prediction
Rodrigo Nogueira
Wei Yang
Jimmy J. Lin
Kyunghyun Cho
152
418
0
17 Apr 2019
Amobee at SemEval-2019 Tasks 5 and 6: Multiple Choice CNN Over
  Contextual Embedding
Amobee at SemEval-2019 Tasks 5 and 6: Multiple Choice CNN Over Contextual Embedding
A. Rozental
Dadi Biton
49
15
0
17 Apr 2019
Complementary Fusion of Multi-Features and Multi-Modalities in Sentiment
  Analysis
Complementary Fusion of Multi-Features and Multi-Modalities in Sentiment Analysis
Feiyang Chen
Ziqian Luo
Yanyan Xu
Dengfeng Ke
64
76
0
17 Apr 2019
Causality Extraction based on Self-Attentive BiLSTM-CRF with Transferred
  Embeddings
Causality Extraction based on Self-Attentive BiLSTM-CRF with Transferred Embeddings
Zhaoning Li
Qi Li
Xiaotian Zou
Jiangtao Ren
85
122
0
16 Apr 2019
Understanding the Behaviors of BERT in Ranking
Understanding the Behaviors of BERT in Ranking
Yifan Qiao
Chenyan Xiong
Zhenghao Liu
Zhiyuan Liu
113
217
0
16 Apr 2019
Just-in-Time Dynamic-Batching
Just-in-Time Dynamic-Batching
Sheng Zha
Ziheng Jiang
Yanghua Peng
Zhi-Li Zhang
31
4
0
16 Apr 2019
Something's Brewing! Early Prediction of Controversy-causing Posts from
  Discussion Features
Something's Brewing! Early Prediction of Controversy-causing Posts from Discussion Features
Jack Hessel
Lillian Lee
59
70
0
15 Apr 2019
Multi-Head Multi-Layer Attention to Deep Language Representations for
  Grammatical Error Detection
Multi-Head Multi-Layer Attention to Deep Language Representations for Grammatical Error Detection
Masahiro Kaneko
Mamoru Komachi
48
31
0
15 Apr 2019
Natural Language Semantics With Pictures: Some Language & Vision
  Datasets and Potential Uses for Computational Semantics
Natural Language Semantics With Pictures: Some Language & Vision Datasets and Potential Uses for Computational Semantics
David Schlangen
67
6
0
15 Apr 2019
CEDR: Contextualized Embeddings for Document Ranking
CEDR: Contextualized Embeddings for Document Ranking
Sean MacAvaney
Andrew Yates
Arman Cohan
Nazli Goharian
80
335
0
15 Apr 2019
Characterization of citizens using word2vec and latent topic analysis in
  a large set of tweets
Characterization of citizens using word2vec and latent topic analysis in a large set of tweets
V. Vladimir
Camargo Jorge
25
42
0
15 Apr 2019
Rare Words: A Major Problem for Contextualized Embeddings And How to Fix
  it by Attentive Mimicking
Rare Words: A Major Problem for Contextualized Embeddings And How to Fix it by Attentive Mimicking
Timo Schick
Hinrich Schütze
99
98
0
14 Apr 2019
BERT4Rec: Sequential Recommendation with Bidirectional Encoder
  Representations from Transformer
BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer
Fei Sun
Jun Liu
Jian Wu
Changhua Pei
Xiao Lin
Wenwu Ou
Peng Jiang
BDLHAI
208
2,207
0
14 Apr 2019
Data Augmentation for BERT Fine-Tuning in Open-Domain Question Answering
Data Augmentation for BERT Fine-Tuning in Open-Domain Question Answering
Wei Yang
Yuqing Xie
Luchen Tan
Kun Xiong
Ming Li
Jimmy J. Lin
RALMOOD
73
64
0
14 Apr 2019
HAKE: Human Activity Knowledge Engine
HAKE: Human Activity Knowledge Engine
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Mingyang Chen
Ze Ma
Shiyi Wang
Haoshu Fang
Cewu Lu
HAI
67
61
0
13 Apr 2019
A Repository of Conversational Datasets
A Repository of Conversational Datasets
Matthew Henderson
Paweł Budzianowski
I. Casanueva
Sam Coope
D. Gerz
...
N. Mrksic
Georgios P. Spithourakis
Pei-hao Su
Ivan Vulić
Tsung-Hsien Wen
73
89
0
13 Apr 2019
Legal Area Classification: A Comparative Study of Text Classifiers on
  Singapore Supreme Court Judgments
Legal Area Classification: A Comparative Study of Text Classifiers on Singapore Supreme Court Judgments
Jerrold Soh Tsin Howe
Lim How Khang
Ian Chai
AILawELM
62
54
0
13 Apr 2019
wav2vec: Unsupervised Pre-training for Speech Recognition
wav2vec: Unsupervised Pre-training for Speech Recognition
Steffen Schneider
Alexei Baevski
R. Collobert
Michael Auli
SSL
123
575
0
11 Apr 2019
Membership Inference Attacks on Sequence-to-Sequence Models: Is My Data
  In Your Machine Translation System?
Membership Inference Attacks on Sequence-to-Sequence Models: Is My Data In Your Machine Translation System?
Sorami Hisamoto
Matt Post
Kevin Duh
MIACVSLR
81
107
0
11 Apr 2019
ClinicalBERT: Modeling Clinical Notes and Predicting Hospital
  Readmission
ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission
Kexin Huang
Jaan Altosaar
Rajesh Ranganath
OOD
150
913
0
10 Apr 2019
Deep Neural Networks Ensemble for Detecting Medication Mentions in
  Tweets
Deep Neural Networks Ensemble for Detecting Medication Mentions in Tweets
D. Weissenbacher
A. Sarker
A. Klein
K. O’Connor
Arjun Magge Ranganatha
G. Gonzalez-Hernandez
36
47
0
10 Apr 2019
Simple BERT Models for Relation Extraction and Semantic Role Labeling
Simple BERT Models for Relation Extraction and Semantic Role Labeling
Peng Shi
Jimmy J. Lin
VLM
81
446
0
10 Apr 2019
Quizbowl: The Case for Incremental Question Answering
Quizbowl: The Case for Incremental Question Answering
Pedro Rodriguez
Shi Feng
Mohit Iyyer
He He
Jordan L. Boyd-Graber
76
50
0
09 Apr 2019
Knowledge-Augmented Language Model and its Application to Unsupervised
  Named-Entity Recognition
Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition
Angli Liu
Jingfei Du
Veselin Stoyanov
82
38
0
09 Apr 2019
$L_0$-ARM: Network Sparsification via Stochastic Binary Optimization
L0L_0L0​-ARM: Network Sparsification via Stochastic Binary Optimization
Yang Li
Shihao Ji
MQ
49
15
0
09 Apr 2019
Jointly Measuring Diversity and Quality in Text Generation Models
Jointly Measuring Diversity and Quality in Text Generation Models
Ehsan Montahaei
Danial Alihosseini
M. Baghshah
63
77
0
08 Apr 2019
Previous
123...464465466...469470471
Next