Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,641 papers shown
Title
Understanding Mobile GUI: from Pixel-Words to Screen-Sentences
Jingwen Fu
Xiaoyi Zhang
Yuwang Wang
Wenjun Zeng
Sam Yang
Grayson Hilliard
81
15
0
25 May 2021
Focus Attention: Promoting Faithfulness and Diversity in Summarization
Rahul Aralikatte
Shashi Narayan
Joshua Maynez
S. Rothe
Ryan T. McDonald
117
46
0
25 May 2021
Empirical Error Modeling Improves Robustness of Noisy Neural Sequence Labeling
Marcin Namysl
Sven Behnke
Joachim Kohler
NoLa
51
5
0
25 May 2021
Estimating Redundancy in Clinical Text
Thomas Searle
Zina M. Ibrahim
J. Teo
Richard J. B. Dobson
69
21
0
25 May 2021
Argument Undermining: Counter-Argument Generation by Attacking Weak Premises
Milad Alshomary
S. Syed
Arkajit Dhar
Martin Potthast
Henning Wachsmuth
54
25
0
25 May 2021
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
Yuanmeng Yan
Rumei Li
Sirui Wang
Fuzheng Zhang
Wei Wu
Weiran Xu
SSL
138
563
0
25 May 2021
Writing by Memorizing: Hierarchical Retrieval-based Medical Report Generation
Xingyi Yang
Muchao Ye
Quanzeng You
Fenglong Ma
MedIm
57
38
0
25 May 2021
Transfer Learning and Curriculum Learning in Sokoban
Zhao Yang
Mike Preuss
Aske Plaat
OffRL
50
3
0
25 May 2021
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Weihong Lin
Qifang Gao
Lei-huan Sun
Zhuoyao Zhong
Kaiqin Hu
Qin Ren
Qiang Huo
72
39
0
25 May 2021
TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference
Deming Ye
Yankai Lin
Yufei Huang
Maosong Sun
MQ
84
65
0
25 May 2021
Personalized Transformer for Explainable Recommendation
Lei Li
Yongfeng Zhang
Li Chen
136
141
0
25 May 2021
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation
Tao Tu
Q. Ping
Govind Thattai
Gokhan Tur
Premkumar Natarajan
75
18
0
24 May 2021
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
Zhengyuan Yang
Songyang Zhang
Liwei Wang
Jiebo Luo
3DPC
122
126
0
24 May 2021
True Few-Shot Learning with Language Models
Ethan Perez
Douwe Kiela
Kyunghyun Cho
169
440
0
24 May 2021
Diacritics Restoration using BERT with Analysis on Czech language
Jakub Náplava
Milan Straka
Jana Straková
39
11
0
24 May 2021
VANiLLa : Verbalized Answers in Natural Language at Large Scale
Debanjali Biswas
Mohnish Dubey
Md. Rony
Jens Lehmann
43
9
0
24 May 2021
View Distillation with Unlabeled Data for Extracting Adverse Drug Effects from User-Generated Data
Payam Karisani
Jinho Choi
Li Xiong
MedIm
60
2
0
24 May 2021
Classifying Math KCs via Task-Adaptive Pre-Trained BERT
J. Shen
Michiharu Yamashita
Ethan Prihar
Neil T. Heffernan
Xintao Wu
Sean McGrew
Dongwon Lee
28
0
0
24 May 2021
Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training
Jong Hak Moon
HyunGyung Lee
W. Shin
Young-Hak Kim
Edward Choi
MedIm
113
161
0
24 May 2021
Neural Language Models for Nineteenth-Century English
Kasra Hosseini
K. Beelen
Giovanni Colavizza
Mariona Coll Ardanuy
89
18
0
24 May 2021
RobeCzech: Czech RoBERTa, a monolingual contextualized language representation model
Milan Straka
Jakub Náplava
Jana Straková
David Samuel
75
47
0
24 May 2021
DaN+: Danish Nested Named Entities and Lexical Normalization
Barbara Plank
Kristian Nørgaard Jensen
Rob van der Goot
79
36
0
24 May 2021
Neural Machine Translation with Monolingual Translation Memory
Deng Cai
Yan Wang
Huayang Li
Wai Lam
Lemao Liu
99
103
0
24 May 2021
PTR: Prompt Tuning with Rules for Text Classification
Xu Han
Weilin Zhao
Ning Ding
Zhiyuan Liu
Maosong Sun
VLM
110
533
0
24 May 2021
Cross-lingual Text Classification with Heterogeneous Graph Neural Network
ZiYun Wang
Xuan Liu
Pei-Yin Yang
Shixing Liu
Zhisheng Wang
84
32
0
24 May 2021
Distantly-Supervised Long-Tailed Relation Extraction Using Constraint Graphs
Tianming Liang
Yang Liu
Xiaoyan Liu
Hao Zhang
Gaurav Sharma
Maozu Guo
101
25
0
24 May 2021
StructuralLM: Structural Pre-training for Form Understanding
Chenliang Li
Bin Bi
Ming Yan
Wei Wang
Songfang Huang
Fei Huang
Luo Si
LMTD
AI4CE
120
134
0
24 May 2021
Improved OOD Generalization via Adversarial Training and Pre-training
Mingyang Yi
Lu Hou
Jiacheng Sun
Lifeng Shang
Xin Jiang
Qun Liu
Zhi-Ming Ma
VLM
79
84
0
24 May 2021
Using Adversarial Attacks to Reveal the Statistical Bias in Machine Reading Comprehension Models
Jieyu Lin
Jiajie Zou
Nai Ding
AAML
81
45
0
24 May 2021
Unsupervised Speech Recognition
Alexei Baevski
Wei-Ning Hsu
Alexis Conneau
Michael Auli
SSL
169
275
0
24 May 2021
One4all User Representation for Recommender Systems in E-commerce
Kyuyong Shin
Hanock Kwak
KyungHyun Kim
Minkyu Kim
Young-Jin Park
Jisu Jeong
Seungjae Jung
69
28
0
24 May 2021
Automatic Product Ontology Extraction from Textual Reviews
Joel Oksanen
O. Cocarascu
Francesca Toni
47
4
0
23 May 2021
Structural Pre-training for Dialogue Comprehension
Zhuosheng Zhang
Hai Zhao
94
31
0
23 May 2021
OntoED: Low-resource Event Detection with Ontology Embedding
Shumin Deng
Ningyu Zhang
Luoqiu Li
Hui Chen
Huaixiao Tou
Mosha Chen
Fei Huang
Huajun Chen
101
56
0
23 May 2021
CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding
Dustin Wright
Isabelle Augenstein
103
25
0
23 May 2021
Killing One Bird with Two Stones: Model Extraction and Attribute Inference Attacks against BERT-based APIs
Chen Chen
Xuanli He
Lingjuan Lyu
Fangzhao Wu
SILM
MIACV
102
8
0
23 May 2021
DepressionNet: A Novel Summarization Boosted Deep Framework for Depression Detection on Social Media
Hamad Zogan
Imran Razzak
Shoaib Jameel
Guandong Xu
83
60
0
23 May 2021
Hypergraph Pre-training with Graph Neural Networks
Boxin Du
Changhe Yuan
Robert A. Barton
T. Neiman
Hanghang Tong
AI4CE
62
14
0
23 May 2021
AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly
Yuchen Jin
Dinesh Manocha
Liangyu Zhao
Yibo Zhu
Chuanxiong Guo
Marco Canini
Arvind Krishnamurthy
95
19
0
22 May 2021
Denoising Noisy Neural Networks: A Bayesian Approach with Compensation
Yulin Shao
Soung Chang Liew
Deniz Gunduz
94
14
0
22 May 2021
CEREC: A Corpus for Entity Resolution in Email Conversations
Parag Dakle
D. Moldovan
25
4
0
21 May 2021
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining
Ivana Kvapilíková
Mikel Artetxe
Gorka Labaka
Eneko Agirre
Ondrej Bojar
SSL
74
36
0
21 May 2021
Pretrained Language Models for Text Generation: A Survey
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Ji-Rong Wen
LM&MA
VLM
SyDa
106
192
0
21 May 2021
Stance Detection with BERT Embeddings for Credibility Analysis of Information on Social Media
Hema R. Karande
Rahee Walambe
Victor A. Benjamin
K. Kotecha
TS Raghu
95
39
0
21 May 2021
Semantic Representation for Dialogue Modeling
Xuefeng Bai
Yulong Chen
Linfeng Song
Yue Zhang
129
53
0
21 May 2021
A Non-Linear Structural Probe
Jennifer C. White
Tiago Pimentel
Naomi Saphra
Ryan Cotterell
55
25
0
21 May 2021
Revisiting the Negative Data of Distantly Supervised Relation Extraction
Chenhao Xie
Jiaqing Liang
Jingping Liu
Chengsong Huang
Wenhao Huang
Yanghua Xiao
96
29
0
21 May 2021
Training Bi-Encoders for Word Sense Disambiguation
Harsh Kohli
100
4
0
21 May 2021
Towards Automatic Comparison of Data Privacy Documents: A Preliminary Experiment on GDPR-like Laws
Kornraphop Kawintiranon
Yaguang Liu
AILaw
26
4
0
21 May 2021
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking
Zhiyi Ma
Kawin Ethayarajh
Tristan Thrush
Somya Jain
Ledell Yu Wu
Robin Jia
Christopher Potts
Adina Williams
Douwe Kiela
ELM
117
59
0
21 May 2021
Previous
1
2
3
...
334
335
336
...
471
472
473
Next