Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 18,282 papers shown
Title
Automatic Fact-guided Sentence Modification
Darsh J. Shah
Tal Schuster
Regina Barzilay
KELM
21
40
0
30 Sep 2019
A Closer Look at Data Bias in Neural Extractive Summarization Models
Ming Zhong
Danqing Wang
Pengfei Liu
Xipeng Qiu
Xuanjing Huang
48
42
0
30 Sep 2019
A Simple and Effective Model for Answering Multi-span Questions
Elad Segal
Avia Efrat
Mor Shoham
Amir Globerson
Jonathan Berant
KELM
25
30
0
29 Sep 2019
OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction
Xu Han
Tianyu Gao
Yuan Yao
Deming Ye
Zhiyuan Liu
Maosong Sun
KELM
VLM
27
150
0
28 Sep 2019
Self-Attention Transducers for End-to-End Speech Recognition
Zhengkun Tian
Jiangyan Yi
J. Tao
Ye Bai
Zhengqi Wen
AI4TS
29
70
0
28 Sep 2019
LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval
Reuben Tan
Huijuan Xu
Kate Saenko
Bryan A. Plummer
28
67
0
27 Sep 2019
On the use of BERT for Neural Machine Translation
S. Clinchant
K. Jung
Vassilina Nikoulina
27
89
0
27 Sep 2019
HateMonitors: Language Agnostic Abuse Detection in Social Media
Punyajoy Saha
Binny Mathew
Pawan Goyal
Animesh Mukherjee
21
28
0
27 Sep 2019
Improving Pre-Trained Multilingual Models with Vocabulary Expansion
Hai Wang
Dian Yu
Kai Sun
Jianshu Chen
Dong Yu
30
41
0
26 Sep 2019
Learning the Difference that Makes a Difference with Counterfactually-Augmented Data
Divyansh Kaushik
Eduard H. Hovy
Zachary Chase Lipton
CML
28
562
0
26 Sep 2019
Biomedical relation extraction with pre-trained language representations and minimal task-specific architecture
Ashok Thillaisundaram
Theodosia Togia
24
17
0
26 Sep 2019
Scaling data-driven robotics with reward sketching and batch reinforcement learning
Serkan Cabi
Sergio Gomez Colmenarejo
Alexander Novikov
Ksenia Konyushkova
Scott E. Reed
...
David Barker
Jonathan Scholz
Misha Denil
Nando de Freitas
Ziyun Wang
OffRL
28
29
0
26 Sep 2019
DARTS: Dialectal Arabic Transcription System
Sameer Khurana
Ahmed M. Ali
James R. Glass
19
11
0
26 Sep 2019
Symplectic ODE-Net: Learning Hamiltonian Dynamics with Control
Yaofeng Desmond Zhong
Biswadip Dey
Amit Chakraborty
PINN
54
269
0
26 Sep 2019
Towards Understanding the Transferability of Deep Representations
Hong Liu
Mingsheng Long
Jianmin Wang
Michael I. Jordan
30
25
0
26 Sep 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
112
6,380
0
26 Sep 2019
Fine-tune Bert for DocRED with Two-step Process
Hong Wang
C. Focke
Rob Sylvester
Nilesh Mishra
Wenjie Wang
22
115
0
26 Sep 2019
Extremely Small BERT Models from Mixed-Vocabulary Training
Sanqiang Zhao
Raghav Gupta
Yang Song
Denny Zhou
VLM
14
53
0
25 Sep 2019
Reducing Transformer Depth on Demand with Structured Dropout
Angela Fan
Edouard Grave
Armand Joulin
43
585
0
25 Sep 2019
Synthetic Data for Deep Learning
Sergey I. Nikolenko
46
349
0
25 Sep 2019
A Survey of Binary Code Similarity
I. Haq
Juan Caballero
16
134
0
25 Sep 2019
Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models
Cheolhyoung Lee
Kyunghyun Cho
Wanmo Kang
MoE
249
208
0
25 Sep 2019
TalkDown: A Corpus for Condescension Detection in Context
Zijian Wang
Christopher Potts
16
51
0
25 Sep 2019
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
252
928
0
24 Sep 2019
Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding
Chao-Wei Huang
Yun-Nung Chen
16
43
0
24 Sep 2019
Talk2Car: Taking Control of Your Self-Driving Car
Thierry Deruyttere
Simon Vandenhende
Dusan Grujicic
Luc Van Gool
Marie-Francine Moens
LM&Ro
31
124
0
24 Sep 2019
An Empirical Study of Content Understanding in Conversational Question Answering
Ting-Rui Chiang
Hao-Tong Ye
Yun-Nung Chen
ELM
33
8
0
24 Sep 2019
Do Massively Pretrained Language Models Make Better Storytellers?
A. See
Aneesh S. Pappu
Rohun Saxena
Akhila Yerukola
Christopher D. Manning
45
166
0
24 Sep 2019
Knowledge-Enriched Transformer for Emotion Detection in Textual Conversations
Peixiang Zhong
Di Wang
Chunyan Miao
24
269
0
24 Sep 2019
Portuguese Named Entity Recognition using BERT-CRF
Fábio Souza
Rodrigo Nogueira
R. Lotufo
22
251
0
23 Sep 2019
AI Matrix: A Deep Learning Benchmark for Alibaba Data Centers
Wei Zhang
Wei Wei
Lingjie Xu
Lingling Jin
Cheng Li
ELM
30
18
0
23 Sep 2019
Learning Dense Representations for Entity Retrieval
D. Gillick
Sayali Kulkarni
L. Lansing
Alessandro Presta
Jason Baldridge
Eugene Ie
Diego Garcia-Olano
RALM
30
201
0
23 Sep 2019
Cross-Lingual Natural Language Generation via Pre-Training
Zewen Chi
Li Dong
Furu Wei
Wenhui Wang
Xian-Ling Mao
Heyan Huang
27
136
0
23 Sep 2019
Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings
Gregor Wiedemann
Steffen Remus
Avi Chawla
Chris Biemann
27
174
0
23 Sep 2019
Self-supervised 6D Object Pose Estimation for Robot Manipulation
Xinke Deng
Yu Xiang
Arsalan Mousavian
Clemens Eppner
Timothy Bretl
Dieter Fox
3DPC
SSL
35
183
0
23 Sep 2019
Using Chinese Glyphs for Named Entity Recognition
Arijit Sehanobish
Chan Hee Song
26
22
0
22 Sep 2019
CodeSearchNet Challenge: Evaluating the State of Semantic Code Search
Hamel Husain
Hongqiu Wu
Tiferet Gazit
Miltiadis Allamanis
Marc Brockschmidt
ELM
81
1,056
0
20 Sep 2019
Sampling Bias in Deep Active Classification: An Empirical Study
Ameya Prabhu
Charles Dognin
M. Singh
19
64
0
20 Sep 2019
What's Missing: A Knowledge Gap Guided Approach for Multi-hop Question Answering
Tushar Khot
Ashish Sabharwal
Peter Clark
RALM
24
22
0
19 Sep 2019
AllenNLP Interpret: A Framework for Explaining Predictions of NLP Models
Eric Wallace
Jens Tuyls
Junlin Wang
Sanjay Subramanian
Matt Gardner
Sameer Singh
MILM
28
137
0
19 Sep 2019
Representation Learning for Electronic Health Records
W. Weng
Peter Szolovits
36
19
0
19 Sep 2019
CogniVal: A Framework for Cognitive Word Embedding Evaluation
Nora Hollenstein
A. D. L. Torre
N. Langer
Ce Zhang
38
66
0
19 Sep 2019
Deep Contextualized Pairwise Semantic Similarity for Arabic Language Questions
Hesham Al-Bataineh
Wael Farhan
Ahmad Mustafa
Haitham Seelawi
Hussein T. Al-Natsheh
24
16
0
19 Sep 2019
ASU at TextGraphs 2019 Shared Task: Explanation ReGeneration using Language Models and Iterative Re-Ranking
Pratyay Banerjee
LRM
19
21
0
19 Sep 2019
How Additional Knowledge can Improve Natural Language Commonsense Question Answering?
Arindam Mitra
Pratyay Banerjee
Kuntal Kumar Pal
Swaroop Mishra
Chitta Baral
KELM
27
31
0
19 Sep 2019
Made for Each Other: Broad-coverage Semantic Structures Meet Preposition Supersenses
Jakob Prange
Nathan Schneider
Omri Abend
12
11
0
19 Sep 2019
Summary Level Training of Sentence Rewriting for Abstractive Summarization
Sanghwan Bae
Taeuk Kim
Jihoon Kim
Sang-goo Lee
38
68
0
19 Sep 2019
Language models and Automated Essay Scoring
Pedro Uría Rodríguez
Amir Jafari
C. Ormerod
30
82
0
18 Sep 2019
Simple, Scalable Adaptation for Neural Machine Translation
Ankur Bapna
N. Arivazhagan
Orhan Firat
AI4CE
56
408
0
18 Sep 2019
Enriching BERT with Knowledge Graph Embeddings for Document Classification
Malte Ostendorff
Peter Bourgonje
Maria Berger
J. Moreno-Schneider
Georg Rehm
Bela Gipp
25
80
0
18 Sep 2019
Previous
1
2
3
...
354
355
356
...
364
365
366
Next