1810.04805
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,491 papers shown
Title
A Comprehensive Exploration on WikiSQL with Table-Aware Word Contextualization
Wonseok Hwang
Ji-Yoon Yim
Seunghyun Park
Minjoon Seo
109
232
0
04 Feb 2019
Extracting Multiple-Relations in One-Pass with Pre-Trained Transformers
Haoyu Wang
Ming Tan
Mo Yu
Shiyu Chang
Dakuo Wang
Kun Xu
Xiaoxiao Guo
Saloni Potdar
ViT
104
98
0
04 Feb 2019
Graph Warp Module: an Auxiliary Module for Boosting the Power of Graph Neural Networks in Molecular Graph Analysis
Katsuhiko Ishiguro
S. Maeda
Masanori Koyama
GNN
87
33
0
04 Feb 2019
Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference
R. Thomas McCoy
Ellie Pavlick
Tal Linzen
161
1,244
0
04 Feb 2019
Improving Question Answering with External Knowledge
Xiaoman Pan
Kai Sun
Dian Yu
Jianshu Chen
Heng Ji
Claire Cardie
Dong Yu
KELM
101
66
0
03 Feb 2019
Incremental Learning with Maximum Entropy Regularization: Rethinking Forgetting and Intransigence
Dahyun Kim
Jihwan Bae
Yeonsik Jo
Jonghyun Choi
OOD
CLL
75
20
0
03 Feb 2019
Review Conversational Reading Comprehension
Hu Xu
Bing-Quan Liu
Lei Shu
Philip S. Yu
75
18
0
03 Feb 2019
Parameter-Efficient Transfer Learning for NLP
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
240
4,556
0
02 Feb 2019
A Multi-Resolution Word Embedding for Document Retrieval from Large Unstructured Knowledge Bases
Tolgahan Cakaloglu
Xiaowei Xu
RALM
110
5
0
02 Feb 2019
tax2vec: Constructing Interpretable Features from Taxonomies for Short Text Classification
Blaž Škrlj
Matej Martinc
Jan Kralj
Nada Lavrac
Senja Pollak
103
44
0
01 Feb 2019
Compressing Gradient Optimizers via Count-Sketches
Ryan Spring
Anastasios Kyrillidis
Vijai Mohan
Anshumali Shrivastava
58
36
0
01 Feb 2019
The Second Conversational Intelligence Challenge (ConvAI2)
Emily Dinan
V. Logacheva
Valentin Malykh
Alexander H. Miller
Kurt Shuster
...
Alexander I. Rudnicky
Jason Williams
Joelle Pineau
Andrey Kravchenko
Jason Weston
DRL
106
368
0
31 Jan 2019
Multi-Task Deep Neural Networks for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
AI4CE
158
1,275
0
31 Jan 2019
A large-scale crowdsourced analysis of abuse against women journalists and politicians on Twitter
Laure Delisle
Freddie Kalaitzis
Krzysztof Majewski
A. D. Berker
M. Marin
Julien Cornebise
50
30
0
31 Jan 2019
Learning and Evaluating General Linguistic Intelligence
Dani Yogatama
Cyprien de Masson d'Autume
Jerome T. Connor
Tomáš Kočiský
Mike Chrzanowski
...
Angeliki Lazaridou
Wang Ling
Lei Yu
Chris Dyer
Phil Blunsom
ELM
AI4CE
174
211
0
31 Jan 2019
EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks
Jason W. Wei
Kai Zou
121
1,965
0
31 Jan 2019
Memory-Efficient Adaptive Optimization
Rohan Anil
Vineet Gupta
Tomer Koren
Y. Singer
ODL
92
49
0
30 Jan 2019
The Evolved Transformer
David R. So
Chen Liang
Quoc V. Le
ViT
143
467
0
30 Jan 2019
Tensorized Embedding Layers for Efficient Model Compression
Oleksii Hrinchuk
Valentin Khrulkov
L. Mirvakhabova
Elena Orlova
Ivan Oseledets
91
73
0
30 Jan 2019
Glyce: Glyph-vectors for Chinese Character Representations
Yuxian Meng
Wei Wu
Fei Wang
Xiaoya Li
Ping Nie
J. Mei
Muyu Li
Qinghong Han
Xiaofei Sun
Jiwei Li
VLM
109
193
0
29 Jan 2019
Evaluating Word Embedding Models: Methods and Experimental Results
Bin Wang
Angela Wang
Fenxiao Chen
Yun Cheng Wang
C.-C. Jay Kuo
ELM
98
265
0
28 Jan 2019
Language Independent Sequence Labelling for Opinion Target Extraction
Rodrigo Agerri
German Rigau
29
22
0
28 Jan 2019
Stiffness: A New Perspective on Generalization in Neural Networks
Stanislav Fort
Pawel Krzysztof Nowak
Stanislaw Jastrzebski
S. Narayanan
155
94
0
28 Jan 2019
Dual Co-Matching Network for Multi-choice Reading Comprehension
Shuailiang Zhang
Zhao Hai
Yuwei Wu
Zhuosheng Zhang
Xi Zhou
Xiaoping Zhou
102
131
0
27 Jan 2019
Context in Neural Machine Translation: A Review of Models and Evaluations
Andrei Popescu-Belis
MedIm
69
28
0
25 Jan 2019
Deep Learning on Small Datasets without Pre-Training using Cosine Loss
Björn Barz
Joachim Denzler
84
132
0
25 Jan 2019
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
Jinhyuk Lee
Wonjin Yoon
Sungdong Kim
Donghyeon Kim
Sunkyu Kim
Chan Ho So
Jaewoo Kang
OOD
243
5,728
0
25 Jan 2019
A BERT Baseline for the Natural Questions
Chris Alberti
Kenton Lee
Michael Collins
ELM
AI4MH
108
127
0
24 Jan 2019
Large-Batch Training for LSTM and Beyond
Yang You
Jonathan Hseu
Chris Ying
J. Demmel
Kurt Keutzer
Cho-Jui Hsieh
65
91
0
24 Jan 2019
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
Thomas Wolf
Victor Sanh
Julien Chaumond
Clement Delangue
132
500
0
23 Jan 2019
A Question-Entailment Approach to Question Answering
Asma Ben Abacha
Dina Demner-Fushman
88
196
0
23 Jan 2019
Automated Essay Scoring based on Two-Stage Learning
Jiawei Liu
Yang Xu
Yaguang Zhu
31
61
0
23 Jan 2019
Deep learning and sub-word-unit approach in written art generation
K. Wołk
Emilia Zawadzka-Gosk
Wojciech Czarnowski
38
1
0
22 Jan 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
171
2,752
0
22 Jan 2019
Spatial Broadcast Decoder: A Simple Architecture for Learning Disentangled Representations in VAEs
Nicholas Watters
Loic Matthey
Christopher P. Burgess
Alexander Lerchner
CoGe
111
169
0
21 Jan 2019
Mixed Formal Learning: A Path to Transparent Machine Learning
Sandra Carrico
AI4CE
21
1
0
20 Jan 2019
Physics-Constrained Deep Learning for High-dimensional Surrogate Modeling and Uncertainty Quantification without Labeled Data
Yinhao Zhu
N. Zabaras
P. Koutsourelakis
P. Perdikaris
PINN
AI4CE
148
876
0
18 Jan 2019
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!
Braden Hancock
Antoine Bordes
Pierre-Emmanuel Mazaré
Jason Weston
203
194
0
16 Jan 2019
Assessing BERT's Syntactic Abilities
Yoav Goldberg
89
496
0
16 Jan 2019
Sentence transition matrix: An efficient approach that preserves sentence semantics
Myeongjun Jang
Pilsung Kang
31
2
0
16 Jan 2019
Investigating Antigram Behaviour using Distributional Semantics
Saptarshi Sengupta
36
0
0
15 Jan 2019
Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks using PAC-Bayesian Analysis
Yusuke Tsuzuku
Issei Sato
Masashi Sugiyama
84
77
0
15 Jan 2019
Passage Re-ranking with BERT
Rodrigo Nogueira
Kyunghyun Cho
OOD
130
1,099
0
13 Jan 2019
Linguistic Analysis of Pretrained Sentence Encoders with Acceptability Judgments
Alex Warstadt
Samuel R. Bowman
92
23
0
11 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
277
3,758
0
09 Jan 2019
On the Possibilities and Limitations of Multi-hop Reasoning Under Linguistic Imperfections
Daniel Khashabi
Erfan Sadeqi Azer
Tushar Khot
Ashish Sabharwal
Dan Roth
LRM
76
8
0
08 Jan 2019
Multi-style Generative Reading Comprehension
Kyosuke Nishida
Itsumi Saito
Kosuke Nishida
Kazutoshi Shinoda
Atsushi Otsuka
Hisako Asano
J. Tomita
85
71
0
08 Jan 2019
Feature reinforcement with word embedding and parsing information in neural TTS
Huaiping Ming
Lei He
Haohan Guo
Frank Soong
165
15
0
03 Jan 2019
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation
Cristina Garbacea
Samuel Carton
Shiyan Yan
Qiaozhu Mei
ELM
85
30
0
02 Jan 2019
Text Infilling
Wanrong Zhu
Zhiting Hu
Eric Xing
137
63
0
01 Jan 2019