Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,573 papers shown
Title
Multi-Task Self-Supervised Learning for Disfluency Detection
Shaolei Wang
Wanxiang Che
Qi Liu
Pengda Qin
Ting Liu
William Yang Wang
SSL
76
56
0
15 Aug 2019
Towards Debiasing Fact Verification Models
Tal Schuster
Darsh J. Shah
Yun Jie Serene Yeo
Daniel Filizzola
Enrico Santus
Regina Barzilay
124
212
0
14 Aug 2019
Debiasing Personal Identities in Toxicity Classification
Apik Zorian
Chandra Shekar Bikkanur
27
2
0
14 Aug 2019
SG-Net: Syntax-Guided Machine Reading Comprehension
Zhuosheng Zhang
Yuwei Wu
Junru Zhou
Sufeng Duan
Hai Zhao
Rui Wang
100
188
0
14 Aug 2019
FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension
Yi-Ting Yeh
Yun-Nung Chen
96
41
0
14 Aug 2019
X-WikiRE: A Large, Multilingual Resource for Relation Extraction as Machine Comprehension
Mostafa Abdou
Cezar Sas
Rahul Aralikatte
Isabelle Augenstein
Anders Søgaard
70
8
0
14 Aug 2019
Fusion of Detected Objects in Text for Visual Question Answering
Chris Alberti
Jeffrey Ling
Michael Collins
David Reitter
97
173
0
14 Aug 2019
FlexNER: A Flexible LSTM-CNN Stack Framework for Named Entity Recognition
Hongyin Zhu
Wenpeng Hu
Yi Zeng
40
5
0
14 Aug 2019
Unsupervised Out-of-Distribution Detection by Maximum Classifier Discrepancy
Qing Yu
Kiyoharu Aizawa
OODD
72
168
0
14 Aug 2019
Entity-aware ELMo: Learning Contextual Entity Representation for Entity Disambiguation
Hamed Shahbazi
Xiaoli Z. Fern
Reza Ghaeini
Rasha Obeidat
Prasad Tadepalli
109
21
0
14 Aug 2019
Establishing Strong Baselines for the New Decade: Sequence Tagging, Syntactic and Semantic Parsing with BERT
Han He
Jinho Choi
3DV
84
51
0
14 Aug 2019
Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation
Yu Chen
Lingfei Wu
Mohammed J Zaki
GNN
97
156
0
14 Aug 2019
HorNet: A Hierarchical Offshoot Recurrent Network for Improving Person Re-ID via Image Captioning
Shiyang Yan
Jun Xu
Yuai Liu
Lin Xu
104
7
0
14 Aug 2019
An Effective Domain Adaptive Post-Training Method for BERT in Response Selection
Taesun Whang
Dongyub Lee
Chanhee Lee
Kisu Yang
Dongsuk Oh
Heuiseok Lim
78
26
0
13 Aug 2019
Fine-grained Information Status Classification Using Discourse Context-Aware Self-Attention
Yufang Hou
31
0
0
13 Aug 2019
BioFLAIR: Pretrained Pooled Contextualized Embeddings for Biomedical Sequence Labeling Tasks
Shreyas Sharma
Ron Daniel
53
33
0
13 Aug 2019
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding
Wei Wang
Bin Bi
Ming Yan
Chen Henry Wu
Zuyi Bao
Jiangnan Xia
Liwei Peng
Luo Si
102
264
0
13 Aug 2019
Tackling Online Abuse: A Survey of Automated Abuse Detection Methods
Pushkar Mishra
H. Yannakoudakis
Ekaterina Shutova
95
79
0
13 Aug 2019
Incorporating Relation Knowledge into Commonsense Reading Comprehension with Multi-task Learning
Jiangnan Xia
Chen Henry Wu
Ming Yan
73
21
0
13 Aug 2019
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System
Ye Liu
Chenwei Zhang
Xiaohui Yan
Yi-Ju Chang
Philip S. Yu
63
20
0
13 Aug 2019
Understanding Spatial Language in Radiology: Representation Framework, Annotation, and Spatial Relation Extraction from Chest X-ray Reports using Deep Learning
Surabhi Datta
Yuqi Si
Laritza M. Rodriguez
S. E. Shooshan
Dina Demner-Fushman
Kirk Roberts
MedIm
64
35
0
13 Aug 2019
On the Convergence of AdaBound and its Connection to SGD
Pedro H. P. Savarese
ODL
54
19
0
13 Aug 2019
On Identifiability in Transformers
Gino Brunner
Yang Liu
Damian Pascual
Oliver Richter
Massimiliano Ciaramita
Roger Wattenhofer
ViT
107
189
0
12 Aug 2019
Taming Unbalanced Training Workloads in Deep Learning with Partial Collective Operations
Shigang Li
Tal Ben-Nun
Salvatore Di Girolamo
Dan Alistarh
Torsten Hoefler
149
59
0
12 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
109
38
0
12 Aug 2019
TAPER: Time-Aware Patient EHR Representation
Sajad Darabi
Mohammad Kachuee
Shayan Fazeli
Majid Sarrafzadeh
77
57
0
11 Aug 2019
Exploiting Temporal Relationships in Video Moment Localization with Natural Language
Songyang Zhang
Jinsong Su
Jiebo Luo
65
74
0
11 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
69
82
0
10 Aug 2019
A Generate-Validate Approach to Answering Questions about Qualitative Relationships
Arindam Mitra
Chitta Baral
Aurgho Bhattacharjee
Ishan Shrivastava
32
6
0
09 Aug 2019
VisualBERT: A Simple and Performant Baseline for Vision and Language
Liunian Harold Li
Mark Yatskar
Da Yin
Cho-Jui Hsieh
Kai-Wei Chang
VLM
267
1,975
0
09 Aug 2019
BERT-based Ranking for Biomedical Entity Normalization
Zongcheng Ji
Qiang Wei
Hua Xu
OOD
MedIm
68
126
0
09 Aug 2019
Neural Image Compression and Explanation
Xiang Li
Shihao Ji
35
10
0
09 Aug 2019
Uncheatable Machine Learning Inference
Mustafa Canim
A. Kundu
Josh Payne
42
1
0
08 Aug 2019
On the Variance of the Adaptive Learning Rate and Beyond
Liyuan Liu
Haoming Jiang
Pengcheng He
Weizhu Chen
Xiaodong Liu
Jianfeng Gao
Jiawei Han
ODL
397
1,916
0
08 Aug 2019
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense
Difei Gao
Ruiping Wang
Shiguang Shan
Xilin Chen
CoGe
LRM
129
28
0
08 Aug 2019
Do Neural Language Representations Learn Physical Commonsense?
Maxwell Forbes
Ari Holtzman
Yejin Choi
NAI
LRM
AI4CE
58
110
0
08 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
415
3,720
0
06 Aug 2019
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations
Aarne Talman
Antti Suni
H. Çelikkanat
Sofoklis Kakouros
Jörg Tiedemann
M. Vainio
77
31
0
06 Aug 2019
Classification of Hand Movements from EEG using a Deep Attention-based LSTM Network
Guangyi Zhang
Vandad Davoodnia
Alireza Sepas-Moghaddam
Yaoxue Zhang
Ali Etemad
83
130
0
06 Aug 2019
Clustering of Deep Contextualized Representations for Summarization of Biomedical Texts
M. Moradi
Matthias Samwald
28
7
0
06 Aug 2019
Triplet Based Embedding Distance and Similarity Learning for Text-independent Speaker Verification
Zongze Ren
Zhiyong Chen
Shugong Xu
33
8
0
06 Aug 2019
Dialog State Tracking: A Neural Reading Comprehension Approach
Shuyang Gao
Abhishek Sethi
Sanchit Agarwal
Tagyoung Chung
Dilek Z. Hakkani-Tür
77
161
0
06 Aug 2019
Exploring Neural Net Augmentation to BERT for Question Answering on SQUAD 2.0
Suhas Gupta
AI4MH
23
1
0
04 Aug 2019
Semi-supervised Thai Sentence Segmentation Using Local and Distant Word Representations
Chanatip Saetia
Ekapol Chuangsuwanich
Tawunrat Chalothorn
P. Vateekul
74
5
0
04 Aug 2019
TABOR: A Highly Accurate Approach to Inspecting and Restoring Trojan Backdoors in AI Systems
Wenbo Guo
Lun Wang
Masashi Sugiyama
Min Du
Basel Alomair
96
231
0
02 Aug 2019
DELTA: A DEep learning based Language Technology plAtform
Kun Han
Junwen Chen
Hui Zhang
Haiyang Xu
Yiping Peng
...
Cheng Gong
Yunbo Wang
Wei Zou
Hui Song
Xiangang Li
VLM
18
10
0
02 Aug 2019
MSnet: A BERT-based Network for Gendered Pronoun Resolution
Zili Wang
47
4
0
01 Aug 2019
GraphFlow: Exploiting Conversation Flow with Graph Neural Networks for Conversational Machine Comprehension
Yu Chen
Lingfei Wu
Mohammed J Zaki
79
76
0
31 Jul 2019
On Mutual Information Maximization for Representation Learning
Michael Tschannen
Josip Djolonga
Paul Kishan Rubenstein
Sylvain Gelly
Mario Lucic
SSL
206
502
0
31 Jul 2019
What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models
Allyson Ettinger
128
610
0
31 Jul 2019
Previous
1
2
3
...
456
457
458
...
470
471
472
Next