Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 19,786 papers shown
Title
Dual Co-Matching Network for Multi-choice Reading Comprehension
Shuailiang Zhang
Zhao Hai
Yuwei Wu
Zhuosheng Zhang
Xi Zhou
Xiaoping Zhou
44
131
0
27 Jan 2019
Deep Learning on Small Datasets without Pre-Training using Cosine Loss
Björn Barz
Joachim Denzler
32
130
0
25 Jan 2019
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
Jinhyuk Lee
Wonjin Yoon
Sungdong Kim
Donghyeon Kim
Sunkyu Kim
Chan Ho So
Jaewoo Kang
OOD
90
5,545
0
25 Jan 2019
A BERT Baseline for the Natural Questions
Chris Alberti
Kenton Lee
Michael Collins
ELM
AI4MH
25
127
0
24 Jan 2019
Large-Batch Training for LSTM and Beyond
Yang You
Jonathan Hseu
Chris Ying
J. Demmel
Kurt Keutzer
Cho-Jui Hsieh
33
89
0
24 Jan 2019
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
Thomas Wolf
Victor Sanh
Julien Chaumond
Clement Delangue
45
493
0
23 Jan 2019
A Question-Entailment Approach to Question Answering
Asma Ben Abacha
Dina Demner-Fushman
32
191
0
23 Jan 2019
Programmable Neural Network Trojan for Pre-Trained Feature Extractor
Yu Ji
Zixin Liu
Xing Hu
Peiqi Wang
Youhui Zhang
AAML
27
17
0
23 Jan 2019
Deep learning and sub-word-unit approach in written art generation
K. Wołk
Emilia Zawadzka-Gosk
Wojciech Czarnowski
27
1
0
22 Jan 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
25
2,721
0
22 Jan 2019
Mixed Formal Learning: A Path to Transparent Machine Learning
Sandra Carrico
AI4CE
19
1
0
20 Jan 2019
Physics-Constrained Deep Learning for High-dimensional Surrogate Modeling and Uncertainty Quantification without Labeled Data
Yinhao Zhu
N. Zabaras
P. Koutsourelakis
P. Perdikaris
PINN
AI4CE
51
857
0
18 Jan 2019
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!
Braden Hancock
Antoine Bordes
Pierre-Emmanuel Mazaré
Jason Weston
57
190
0
16 Jan 2019
Assessing BERT's Syntactic Abilities
Yoav Goldberg
25
494
0
16 Jan 2019
Sentence transition matrix: An efficient approach that preserves sentence semantics
Myeongjun Jang
Pilsung Kang
19
2
0
16 Jan 2019
Investigating Antigram Behaviour using Distributional Semantics
Saptarshi Sengupta
16
0
0
15 Jan 2019
Exploiting Synchronized Lyrics And Vocal Features For Music Emotion Detection
Loreto Parisi
Simone Francia
Silvio Olivastri
Maria Stella Tavella
26
11
0
15 Jan 2019
Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks using PAC-Bayesian Analysis
Yusuke Tsuzuku
Issei Sato
Masashi Sugiyama
35
76
0
15 Jan 2019
Passage Re-ranking with BERT
Rodrigo Nogueira
Kyunghyun Cho
OOD
72
1,078
0
13 Jan 2019
Linguistic Analysis of Pretrained Sentence Encoders with Acceptability Judgments
Alex Warstadt
Samuel R. Bowman
22
23
0
11 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
48
3,700
0
09 Jan 2019
On the Possibilities and Limitations of Multi-hop Reasoning Under Linguistic Imperfections
Daniel Khashabi
Erfan Sadeqi Azer
Tushar Khot
Ashish Sabharwal
Dan Roth
LRM
17
8
0
08 Jan 2019
Multi-style Generative Reading Comprehension
Kyosuke Nishida
Itsumi Saito
Kosuke Nishida
Kazutoshi Shinoda
Atsushi Otsuka
Hisako Asano
J. Tomita
32
71
0
08 Jan 2019
Feature reinforcement with word embedding and parsing information in neural TTS
Huaiping Ming
Lei He
Haohan Guo
Frank Soong
82
15
0
03 Jan 2019
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation
Cristina Garbacea
Samuel Carton
Shiyan Yan
Qiaozhu Mei
ELM
35
30
0
02 Jan 2019
Text Infilling
Wanrong Zhu
Zhiting Hu
Eric Xing
38
62
0
01 Jan 2019
A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing
Dat Quoc Nguyen
40
12
0
30 Dec 2018
Adversarial Attack and Defense on Graph Data: A Survey
Lichao Sun
Yingtong Dou
Carl Yang
Ji Wang
Yixin Liu
Philip S. Yu
Lifang He
Yangqiu Song
GNN
AAML
42
276
0
26 Dec 2018
Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification
Mozhi Zhang
Yoshinari Fujinuma
Jordan L. Boyd-Graber
33
21
0
22 Dec 2018
Joint Slot Filling and Intent Detection via Capsule Neural Networks
Chenwei Zhang
Yaliang Li
Nan Du
Wei Fan
Philip S. Yu
19
234
0
22 Dec 2018
A Survey on Deep Learning for Named Entity Recognition
Junlin Li
Aixin Sun
Jianglei Han
Chenliang Li
3DV
52
1,150
0
22 Dec 2018
Graph Neural Networks: A Review of Methods and Applications
Jie Zhou
Ganqu Cui
Shengding Hu
Zhengyan Zhang
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
AI4CE
GNN
137
5,433
0
20 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
36
364
0
13 Dec 2018
Detecting weak and strong Islamophobic hate speech on social media
Bertie Vidgen
T. Yasseri
22
138
0
12 Dec 2018
SMIT: Stochastic Multi-Label Image-to-Image Translation
Andrés Romero
Pablo Arbelaez
Luc Van Gool
Radu Timofte
25
66
0
10 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
92
696
0
06 Dec 2018
Efficient Attention: Attention with Linear Complexities
Zhuoran Shen
Mingyuan Zhang
Haiyu Zhao
Shuai Yi
Hongsheng Li
61
518
0
04 Dec 2018
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
100
872
0
27 Nov 2018
Synergistic Drug Combination Prediction by Integrating Multi-omics Data in Deep Learning Models
Tianyu Zhang
Liwei Zhang
Philip R. O. Payne
Fuhai Li
30
96
0
16 Nov 2018
Survey of Computational Approaches to Lexical Semantic Change
Nina Tahmasebi
L. Borin
Adam Jatowt
52
163
0
15 Nov 2018
Extractive Summary as Discrete Latent Variables
Aran Komatsuzaki
26
3
0
14 Nov 2018
An Introductory Survey on Attention Mechanisms in NLP Problems
Dichao Hu
AIMat
32
246
0
12 Nov 2018
Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency
Won Ik Cho
Hyeon Seung Lee
J. Yoon
Seokhwan Kim
N. Kim
56
5
0
10 Nov 2018
Elastic CRFs for Open-ontology Slot Filling
Yinpei Dai
Yichi Zhang
Hong Liu
Zhijian Ou
Yanmeng Wang
Junlan Feng
63
2
0
04 Nov 2018
Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs
Gaurav Maheshwari
Priyansh Trivedi
Denis Lukovnikov
Nilesh Chakraborty
Asja Fischer
Jens Lehmann
GNN
31
72
0
02 Nov 2018
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks
Jason Phang
Thibault Févry
Samuel R. Bowman
33
467
0
02 Nov 2018
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
79
1,653
0
02 Nov 2018
On the Generation of Medical Question-Answer Pairs
Sheng Shen
Yaliang Li
Nan Du
X. Wu
Yusheng Xie
Shen Ge
Tao Yang
Kai Wang
Xin-Fang Liang
Wei Fan
MedIm
18
21
0
01 Nov 2018
Improving Machine Reading Comprehension with General Reading Strategies
Kai Sun
Dian Yu
Dong Yu
Claire Cardie
AI4CE
47
116
0
31 Oct 2018
A Pragmatic Guide to Geoparsing Evaluation
Milan Gritta
Mohammad Taher Pilehvar
Nigel Collier
19
67
0
29 Oct 2018
Previous
1
2
3
...
394
395
396
Next