1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,511 papers shown
Title
Better Automatic Evaluation of Open-Domain Dialogue Systems with Contextualized Embeddings
Sarik Ghazarian
Johnny Tian-Zheng Wei
Aram Galstyan
Nanyun Peng
58
90
0
24 Apr 2019
Low-Memory Neural Network Training: A Technical Report
N. Sohoni
Christopher R. Aberger
Megan Leszczynski
Jian Zhang
Christopher Ré
92
103
0
24 Apr 2019
Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference
Nikita Kitaev
Dan Klein
67
23
0
22 Apr 2019
Exploring Unsupervised Pretraining and Sentence Structure Modelling for Winograd Schema Challenge
Yu-Ping Ruan
Xiao-Dan Zhu
Zhenhua Ling
Zhan Shi
Quan Liu
Si Wei
58
16
0
22 Apr 2019
Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring
Samuel Humeau
Kurt Shuster
Marie-Anne Lachaux
Jason Weston
190
289
0
22 Apr 2019
Fine-Grained Argument Unit Recognition and Classification
Dietrich Trautmann
Johannes Daxenberger
Christian Stab
Hinrich Schütze
Iryna Gurevych
63
64
0
22 Apr 2019
Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension
Kai Sun
Dian Yu
Dong Yu
Claire Cardie
100
103
0
21 Apr 2019
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
649
5,896
0
21 Apr 2019
Model Compression with Multi-Task Knowledge Distillation for Web-scale Question Answering System
Ze Yang
Linjun Shou
Ming Gong
Wutao Lin
Daxin Jiang
KELM
74
20
0
21 Apr 2019
Obfuscation for Privacy-preserving Syntactic Parsing
Zhifeng Hu
Serhii Havrylov
Ivan Titov
Shay B. Cohen
59
7
0
21 Apr 2019
PullNet: Open Domain Question Answering with Iterative Retrieval on Knowledge Bases and Text
Haitian Sun
Tania Bedrax-Weiss
William W. Cohen
RALM
ReLM
103
356
0
21 Apr 2019
Few-Shot NLG with Pre-Trained Language Model
Zhiyu Zoey Chen
H. Eavani
Wenhu Chen
Yinyin Liu
William Yang Wang
LMTD
122
142
0
21 Apr 2019
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
FedML
92
183
0
20 Apr 2019
Language Models with Transformers
Chenguang Wang
Mu Li
Alex Smola
102
122
0
20 Apr 2019
Mask-Predict: Parallel Decoding of Conditional Masked Language Models
Marjan Ghazvininejad
Omer Levy
Yinhan Liu
Luke Zettlemoyer
MoE
74
35
0
19 Apr 2019
An Evaluation of Transfer Learning for Classifying Sales Engagement Emails at Large Scale
Yong Liu
Pavel A. Dmitriev
Yifei Huang
Andrew Brooks
Li Dong
54
4
0
19 Apr 2019
Unifying Question Answering, Text Classification, and Regression via Span Extraction
N. Keskar
Bryan McCann
Caiming Xiong
R. Socher
BDL
79
21
0
19 Apr 2019
ERNIE: Enhanced Representation through Knowledge Integration
Yu Sun
Shuohuan Wang
Yukun Li
Shikun Feng
Xuyi Chen
Han Zhang
Xin Tian
Danxiang Zhu
Hao Tian
Hua Wu
135
907
0
19 Apr 2019
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
Shijie Wu
Mark Dredze
VLM
SSeg
151
681
0
19 Apr 2019
Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
Christine Basta
Marta R. Costa-jussá
Noe Casas
83
195
0
18 Apr 2019
Inpatient2Vec: Medical Representation Learning for Inpatients
Ying Wang
Xiao Xu
Tao Jin
Xiang Li
Guotong Xie
Jianmin Wang
AI4TS
136
21
0
18 Apr 2019
Headline Generation: Learning from Decomposable Document Titles
Oleg V. Vasilyev
Tom Grek
John Bohannon
74
10
0
17 Apr 2019
DocBERT: BERT for Document Classification
Ashutosh Adhikari
Achyudh Ram
Raphael Tang
Jimmy J. Lin
LLMAG
VLM
103
301
0
17 Apr 2019
Casting Light on Invisible Cities: Computationally Engaging with Literary Criticism
Shufan Wang
Mohit Iyyer
36
4
0
17 Apr 2019
Document Expansion by Query Prediction
Rodrigo Nogueira
Wei Yang
Jimmy J. Lin
Kyunghyun Cho
152
418
0
17 Apr 2019
Amobee at SemEval-2019 Tasks 5 and 6: Multiple Choice CNN Over Contextual Embedding
A. Rozental
Dadi Biton
49
15
0
17 Apr 2019
Complementary Fusion of Multi-Features and Multi-Modalities in Sentiment Analysis
Feiyang Chen
Ziqian Luo
Yanyan Xu
Dengfeng Ke
64
76
0
17 Apr 2019
Causality Extraction based on Self-Attentive BiLSTM-CRF with Transferred Embeddings
Zhaoning Li
Qi Li
Xiaotian Zou
Jiangtao Ren
85
122
0
16 Apr 2019
Understanding the Behaviors of BERT in Ranking
Yifan Qiao
Chenyan Xiong
Zhenghao Liu
Zhiyuan Liu
113
217
0
16 Apr 2019
Just-in-Time Dynamic-Batching
Sheng Zha
Ziheng Jiang
Yanghua Peng
Zhi-Li Zhang
31
4
0
16 Apr 2019
Something's Brewing! Early Prediction of Controversy-causing Posts from Discussion Features
Jack Hessel
Lillian Lee
59
70
0
15 Apr 2019
Multi-Head Multi-Layer Attention to Deep Language Representations for Grammatical Error Detection
Masahiro Kaneko
Mamoru Komachi
48
31
0
15 Apr 2019
Natural Language Semantics With Pictures: Some Language & Vision Datasets and Potential Uses for Computational Semantics
David Schlangen
67
6
0
15 Apr 2019
CEDR: Contextualized Embeddings for Document Ranking
Sean MacAvaney
Andrew Yates
Arman Cohan
Nazli Goharian
80
335
0
15 Apr 2019
Characterization of citizens using word2vec and latent topic analysis in a large set of tweets
V. Vladimir
Camargo Jorge
25
42
0
15 Apr 2019
Rare Words: A Major Problem for Contextualized Embeddings And How to Fix it by Attentive Mimicking
Timo Schick
Hinrich Schütze
99
98
0
14 Apr 2019
BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer
Fei Sun
Jun Liu
Jian Wu
Changhua Pei
Xiao Lin
Wenwu Ou
Peng Jiang
BDL
HAI
208
2,207
0
14 Apr 2019
Data Augmentation for BERT Fine-Tuning in Open-Domain Question Answering
Wei Yang
Yuqing Xie
Luchen Tan
Kun Xiong
Ming Li
Jimmy J. Lin
RALM
OOD
73
64
0
14 Apr 2019
HAKE: Human Activity Knowledge Engine
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Mingyang Chen
Ze Ma
Shiyi Wang
Haoshu Fang
Cewu Lu
HAI
67
61
0
13 Apr 2019
A Repository of Conversational Datasets
Matthew Henderson
Paweł Budzianowski
I. Casanueva
Sam Coope
D. Gerz
...
N. Mrksic
Georgios P. Spithourakis
Pei-hao Su
Ivan Vulić
Tsung-Hsien Wen
73
89
0
13 Apr 2019
Legal Area Classification: A Comparative Study of Text Classifiers on Singapore Supreme Court Judgments
Jerrold Soh Tsin Howe
Lim How Khang
Ian Chai
AILaw
ELM
62
54
0
13 Apr 2019
wav2vec: Unsupervised Pre-training for Speech Recognition
Steffen Schneider
Alexei Baevski
R. Collobert
Michael Auli
SSL
123
575
0
11 Apr 2019
Membership Inference Attacks on Sequence-to-Sequence Models: Is My Data In Your Machine Translation System?
Sorami Hisamoto
Matt Post
Kevin Duh
MIACV
SLR
81
107
0
11 Apr 2019
ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission
Kexin Huang
Jaan Altosaar
Rajesh Ranganath
OOD
150
913
0
10 Apr 2019
Deep Neural Networks Ensemble for Detecting Medication Mentions in Tweets
D. Weissenbacher
A. Sarker
A. Klein
K. O’Connor
Arjun Magge Ranganatha
G. Gonzalez-Hernandez
36
47
0
10 Apr 2019
Simple BERT Models for Relation Extraction and Semantic Role Labeling
Peng Shi
Jimmy J. Lin
VLM
81
446
0
10 Apr 2019
Quizbowl: The Case for Incremental Question Answering
Pedro Rodriguez
Shi Feng
Mohit Iyyer
He He
Jordan L. Boyd-Graber
76
50
0
09 Apr 2019
Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition
Angli Liu
Jingfei Du
Veselin Stoyanov
82
38
0
09 Apr 2019
$L_0$-ARM: Network Sparsification via Stochastic Binary Optimization
Yang Li
Shihao Ji
MQ
49
15
0
09 Apr 2019
Jointly Measuring Diversity and Quality in Text Generation Models
Ehsan Montahaei
Danial Alihosseini
M. Baghshah
63
77
0
08 Apr 2019