1810.04805
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,534 papers shown
Title
Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation
Ke Wang
Hang Hua
Xiaojun Wan
82
89
0
30 May 2019
A Compare-Aggregate Model with Latent Clustering for Answer Selection
Seunghyun Yoon
Franck Dernoncourt
Doo Soon Kim
Trung Bui
Kyomin Jung
72
69
0
30 May 2019
Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention
Wenhu Chen
Jianshu Chen
Pengda Qin
Xifeng Yan
William Yang Wang
90
129
0
30 May 2019
A Simple but Effective Method to Incorporate Multi-turn Context with BERT for Conversational Machine Comprehension
Yasuhito Ohsugi
Itsumi Saito
Kyosuke Nishida
Hisako Asano
J. Tomita
85
43
0
30 May 2019
A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models
Elman Mansimov
Alex Jinpeng Wang
Sean Welleck
Kyunghyun Cho
AIMat
93
46
0
29 May 2019
Educating Text Autoencoders: Latent Representation Guidance via Denoising
T. Shen
Jonas W. Mueller
Regina Barzilay
Tommi Jaakkola
46
4
0
29 May 2019
Unsupervised Paraphrasing without Translation
Aurko Roy
David Grangier
BDL
LRM
124
61
0
29 May 2019
Adapting Text Embeddings for Causal Inference
Victor Veitch
Dhanya Sridhar
David M. Blei
CML
61
21
0
29 May 2019
Defending Against Neural Fake News
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
AAML
153
1,035
0
29 May 2019
Towards better substitution-based word sense induction
Asaf Amrami
Yoav Goldberg
100
40
0
29 May 2019
Learning Task-specific Representation for Novel Words in Sequence Labeling
Minlong Peng
Qi Zhang
Xiaoyu Xing
Tao Gui
Jinlan Fu
Xuanjing Huang
70
8
0
29 May 2019
Strategies for Pre-training Graph Neural Networks
Weihua Hu
Bowen Liu
Joseph Gomes
Marinka Zitnik
Percy Liang
Vijay S. Pande
J. Leskovec
SSL
AI4CE
133
1,424
0
29 May 2019
Better Long-Range Dependency By Bootstrapping A Mutual Information Regularizer
Yanshuai Cao
Peng Xu
28
2
0
28 May 2019
On Variational Learning of Controllable Representations for Text without Supervision
Peng Xu
Jackie C.K. Cheung
Yanshuai Cao
SSL
DRL
70
9
0
28 May 2019
EDUCE: Explaining model Decisions through Unsupervised Concepts Extraction
Diane Bouchacourt
Ludovic Denoyer
FAtt
74
21
0
28 May 2019
Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)
Mariya Toneva
Leila Wehbe
MILM
AI4CE
108
235
0
28 May 2019
DSReg: Using Distant Supervision as a Regularizer
Yuxian Meng
Muyu Li
Xiaoya Li
Wei Wu
Jiwei Li
84
3
0
28 May 2019
XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering
Jasdeep Singh
Bryan McCann
N. Keskar
Caiming Xiong
R. Socher
ELM
81
81
0
27 May 2019
Combating Adversarial Misspellings with Robust Word Recognition
Danish Pruthi
Bhuwan Dhingra
Zachary Chase Lipton
221
309
0
27 May 2019
STAR-GCN: Stacked and Reconstructed Graph Convolutional Networks for Recommender Systems
Jiani Zhang
Xingjian Shi
Shenglin Zhao
Irwin King
63
228
0
27 May 2019
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient Models
Linfeng Zhang
Zhanhong Tan
Jiebo Song
Jingwei Chen
Chenglong Bao
Kaisheng Ma
55
71
0
27 May 2019
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume
H. K. Ekenel
Jean-Philippe Thiran
187
372
0
27 May 2019
Levenshtein Transformer
Jiatao Gu
Changhan Wang
Jake Zhao
165
359
0
27 May 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
148
122
0
27 May 2019
QuesNet: A Unified Representation for Heterogeneous Test Questions
Yu Yin
Qi Liu
Zhenya Huang
Enhong Chen
Wei Tong
Shijin Wang
Yu-Ho Su
28
47
0
27 May 2019
Extreme Multi-Label Legal Text Classification: A case study in EU Legislation
Ilias Chalkidis
Manos Fergadiotis
Prodromos Malakasiotis
Nikolaos Aletras
Ion Androutsopoulos
AILaw
81
75
0
26 May 2019
Where's My Head? Definition, Dataset and Models for Numeric Fused-Heads Identification and Resolution
Yanai Elazar
Yoav Goldberg
108
23
0
26 May 2019
TIGS: An Inference Algorithm for Text Infilling with Gradient Search
Dayiheng Liu
Jie Fu
Pengfei Liu
Jiancheng Lv
DiffM
139
27
0
26 May 2019
TACAM: Topic And Context Aware Argument Mining
Michael Fromm
Evgeniy Faerman
T. Seidl
75
25
0
26 May 2019
Hashing based Answer Selection
Dong Xu
Wu-Jun Li
53
6
0
26 May 2019
Graph Attention Auto-Encoders
Amin Salehi
H. Davulcu
GNN
72
125
0
26 May 2019
Are Sixteen Heads Really Better than One?
Paul Michel
Omer Levy
Graham Neubig
MoE
120
1,073
0
25 May 2019
Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers
Liwei Wu
Shuqing Li
Cho-Jui Hsieh
James Sharpnack
84
33
0
25 May 2019
SemEval-2019 Task 8: Fact Checking in Community Question Answering Forums
Tsvetomila Mihaylova
Georgi Karadzhov
Pepa Atanasova
R. Baly
Mitra Mohtarami
Preslav Nakov
79
62
0
25 May 2019
Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark
Nikita Nangia
Samuel R. Bowman
ELM
ALM
84
76
0
24 May 2019
Discrete Flows: Invertible Generative Models of Discrete Data
Dustin Tran
Keyon Vafa
Kumar Krishna Agrawal
Laurent Dinh
Ben Poole
DRL
166
117
0
24 May 2019
SCRAM: Spatially Coherent Randomized Attention Maps
D. A. Calian
P. Roelants
Jacques Calì
B. Carr
K. Dubba
John E. Reid
Dell Zhang
54
2
0
24 May 2019
Controlling Risk of Web Question Answering
Lixin Su
Jiafeng Guo
Yixing Fan
Yanyan Lan
Xueqi Cheng
56
9
0
24 May 2019
Label-aware Document Representation via Hybrid Attention for Extreme Multi-Label Text Classification
Xin Huang
Boli Chen
Lin Xiao
L. Jing
62
36
0
24 May 2019
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions
Christopher Clark
Kenton Lee
Ming-Wei Chang
Tom Kwiatkowski
Michael Collins
Kristina Toutanova
399
1,564
0
24 May 2019
Personalizing Dialogue Agents via Meta-Learning
Zhaojiang Lin
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
140
187
0
24 May 2019
Fair is Better than Sensational: Man is to Doctor as Woman is to Doctor
Malvina Nissim
Rik van Noord
Rob van der Goot
FaML
92
103
0
23 May 2019
Zero-shot Knowledge Transfer via Adversarial Belief Matching
P. Micaelli
Amos Storkey
82
230
0
23 May 2019
Misspelling Oblivious Word Embeddings
Bora Edizel
Aleksandra Piktus
Piotr Bojanowski
Rui A. Ferreira
Edouard Grave
Fabrizio Silvestri
74
65
0
23 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
76
70
0
23 May 2019
An Investigation of Transfer Learning-Based Sentiment Analysis in Japanese
Enkhbold Bataa
Joshua Wu
78
33
0
23 May 2019
AMSI-Based Detection of Malicious PowerShell Code Using Contextual Embeddings
Amir Rubin
Shay Kels
Danny Hendler
57
2
0
23 May 2019
Data-Efficient Image Recognition with Contrastive Predictive Coding
Olivier J. Hénaff
A. Srinivas
J. Fauw
Ali Razavi
Carl Doersch
S. M. Ali Eslami
Aaron van den Oord
SSL
204
1,437
0
22 May 2019
Deeper Text Understanding for IR with Contextual Neural Language Modeling
Zhuyun Dai
Jamie Callan
80
449
0
22 May 2019
Simplified Neural Unsupervised Domain Adaptation
Timothy A. Miller
67
29
0
22 May 2019