Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,491 papers shown
Title
Multilingual Constituency Parsing with Self-Attention and Pre-Training
Nikita Kitaev
Steven Cao
Dan Klein
LRM
93
255
0
31 Dec 2018
A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing
Dat Quoc Nguyen
81
12
0
30 Dec 2018
Double Neural Counterfactual Regret Minimization
Hui Li
Kailiang Hu
Zhibang Ge
Tao Jiang
Yuan Qi
Le Song
76
52
0
27 Dec 2018
Adversarial Attack and Defense on Graph Data: A Survey
Lichao Sun
Yingtong Dou
Carl Yang
Ji Wang
Yixin Liu
Philip S. Yu
Lifang He
Yangqiu Song
GNN
AAML
139
286
0
26 Dec 2018
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
Mikel Artetxe
Holger Schwenk
3DV
194
1,020
0
26 Dec 2018
Deep Representation Learning for Clustering of Health Tweets
O. Gencoglu
SSL
30
10
0
25 Dec 2018
Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification
Mozhi Zhang
Yoshinari Fujinuma
Jordan L. Boyd-Graber
85
21
0
22 Dec 2018
Joint Slot Filling and Intent Detection via Capsule Neural Networks
Chenwei Zhang
Yaliang Li
Nan Du
Wei Fan
Philip S. Yu
62
234
0
22 Dec 2018
A Survey on Deep Learning for Named Entity Recognition
Junlin Li
Aixin Sun
Jianglei Han
Chenliang Li
3DV
110
1,173
0
22 Dec 2018
Graph Neural Networks: A Review of Methods and Applications
Jie Zhou
Ganqu Cui
Shengding Hu
Zhengyan Zhang
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
AI4CE
GNN
1.2K
5,605
0
20 Dec 2018
Found in Translation: Learning Robust Joint Representations by Cyclic Translations Between Modalities
Hai Pham
Paul Pu Liang
Thomas Manzini
Louis-Philippe Morency
Barnabás Póczós
89
417
0
19 Dec 2018
A Tutorial on Deep Latent Variable Models of Natural Language
Yoon Kim
Sam Wiseman
Alexander M. Rush
BDL
VLM
121
42
0
17 Dec 2018
Conditional BERT Contextual Augmentation
Xing Wu
Shangwen Lv
Liangjun Zang
Jizhong Han
Songlin Hu
77
315
0
17 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
106
368
0
13 Dec 2018
Detecting weak and strong Islamophobic hate speech on social media
Bertie Vidgen
T. Yasseri
92
140
0
12 Dec 2018
SMIT: Stochastic Multi-Label Image-to-Image Translation
Andrés Romero
Pablo Arbelaez
Luc Van Gool
Radu Timofte
73
66
0
10 Dec 2018
SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering
Chenguang Zhu
Michael Zeng
Xuedong Huang
87
125
0
10 Dec 2018
What is the Effect of Importance Weighting in Deep Learning?
Jonathon Byrd
Zachary Chase Lipton
165
467
0
08 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
182
704
0
06 Dec 2018
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
140
234
0
05 Dec 2018
Efficient Attention: Attention with Linear Complexities
Zhuoran Shen
Mingyuan Zhang
Haiyu Zhao
Shuai Yi
Hongsheng Li
138
536
0
04 Dec 2018
Practical Text Classification With Large Pre-Trained Language Models
Neel Kant
Raul Puri
Nikolai Yakovenko
Bryan Catanzaro
VLM
59
68
0
04 Dec 2018
Flexible and Scalable State Tracking Framework for Goal-Oriented Dialogue Systems
Rahul Goel
Shachi Paul
Tagyoung Chung
Jérémie Lecomte
Arindam Mandal
Dilek Z. Hakkani-Tür
61
15
0
30 Nov 2018
Visual Question Answering as Reading Comprehension
Hui Li
Peng Wang
Chunhua Shen
Anton Van Den Hengel
62
41
0
29 Nov 2018
Unsupervised Multi-modal Neural Machine Translation
Yuanhang Su
Kai Fan
Nguyen Bach
C.-C. Jay Kuo
Fei Huang
152
63
0
28 Nov 2018
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
248
885
0
27 Nov 2018
Synergistic Drug Combination Prediction by Integrating Multi-omics Data in Deep Learning Models
Tianyu Zhang
Liwei Zhang
Philip R. O. Payne
Fuhai Li
45
98
0
16 Nov 2018
Survey of Computational Approaches to Lexical Semantic Change
Nina Tahmasebi
L. Borin
Adam Jatowt
115
164
0
15 Nov 2018
An Introductory Survey on Attention Mechanisms in NLP Problems
Dichao Hu
AIMat
83
247
0
12 Nov 2018
Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency
Won Ik Cho
Hyeon Seung Lee
J. Yoon
Seokhwan Kim
N. Kim
85
5
0
10 Nov 2018
Language GANs Falling Short
Massimo Caccia
Lucas Caccia
W. Fedus
Hugo Larochelle
Joelle Pineau
Laurent Charlin
236
220
0
06 Nov 2018
Elastic CRFs for Open-ontology Slot Filling
Yinpei Dai
Yichi Zhang
Hong Liu
Zhijian Ou
Yanmeng Wang
Junlan Feng
87
2
0
04 Nov 2018
Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs
Gaurav Maheshwari
Priyansh Trivedi
Denis Lukovnikov
Nilesh Chakraborty
Asja Fischer
Jens Lehmann
GNN
87
72
0
02 Nov 2018
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks
Jason Phang
Thibault Févry
Samuel R. Bowman
125
470
0
02 Nov 2018
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
172
1,754
0
02 Nov 2018
On the Generation of Medical Question-Answer Pairs
Sheng Shen
Yaliang Li
Nan Du
X. Wu
Yusheng Xie
Shen Ge
Tao Yang
Kai Wang
Xin-Fang Liang
Wei Fan
MedIm
61
21
0
01 Nov 2018
A Corpus for Reasoning About Natural Language Grounded in Photographs
Alane Suhr
Stephanie Zhou
Ally Zhang
Iris Zhang
Huajun Bai
Yoav Artzi
LRM
122
610
0
01 Nov 2018
Improving Machine Reading Comprehension with General Reading Strategies
Kai Sun
Dian Yu
Dong Yu
Claire Cardie
AI4CE
87
116
0
31 Oct 2018
Cross-Lingual Transfer Learning for Multilingual Task Oriented Dialog
Sebastian Schuster
S. Gupta
Rushin Shah
M. Lewis
112
286
0
31 Oct 2018
Automated Machine Learning: From Principles to Practices
Quanming Yao
Mengshuo Wang
Hugo Jair Escalante
Huan Zhao
Qiang Yang
121
259
0
31 Oct 2018
Giving Space to Your Message: Assistive Word Segmentation for the Electronic Typing of Digital Minorities
Won Ik Cho
Sung Jun Cheon
Woohyun Kang
Jiwon Kim
N. Kim
41
2
0
31 Oct 2018
GraphIE: A Graph-Based Framework for Information Extraction
Yujie Qian
Enrico Santus
Zhijing Jin
Jiang Guo
Regina Barzilay
GNN
94
113
0
31 Oct 2018
Advancing PICO Element Detection in Biomedical Text via Deep Neural Networks
Di Jin
Peter Szolovits
56
32
0
30 Oct 2018
A Simple Recurrent Unit with Reduced Tensor Product Representations
Shuai Tang
P. Smolensky
V. D. Sa
90
2
0
29 Oct 2018
A Pragmatic Guide to Geoparsing Evaluation
Milan Gritta
Mohammad Taher Pilehvar
Nigel Collier
97
69
0
29 Oct 2018
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks
K. Chahal
Manraj Singh Grover
Kuntal Dey
3DH
OOD
90
54
0
28 Oct 2018
Variational Semi-supervised Aspect-term Sentiment Analysis via Transformer
Xingyi Cheng
Weidi Xu
Taifeng Wang
Wei Chu
63
23
0
24 Oct 2018
Testing the Generalization Power of Neural Network Models Across NLI Benchmarks
Aarne Talman
S. Chatzikyriakidis
ELM
91
48
0
23 Oct 2018
Compositional Coding Capsule Network with K-Means Routing for Text Classification
Hao Ren
Hong-wei Lu
104
53
0
22 Oct 2018
pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference
Mandar Joshi
Eunsol Choi
Omer Levy
Daniel S. Weld
Luke Zettlemoyer
CoGe
67
47
0
20 Oct 2018
Previous
1
2
3
...
468
469
470
Next