Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 18,765 papers shown
Title
Active Imitation Learning with Noisy Guidance
Kianté Brantley
Amr Sharaf
Hal Daumé
27
23
0
26 May 2020
Exploring aspects of similarity between spoken personal narratives by disentangling them into narrative clause types
Belén Saldías
D. Roy
37
13
0
26 May 2020
Machine Learning-Based Unbalance Detection of a Rotating Shaft Using Vibration Data
Oliver Mey
Willi Neudeck
André Schneider
Olaf Enge-Rosenblatt
22
28
0
26 May 2020
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
Kostiantyn Omelianchuk
Vitaliy Atrasevych
Artem Chernodub
Oleksandr Skurzhanskyi
36
305
0
26 May 2020
What Are People Asking About COVID-19? A Question Classification Dataset
Jerry W. Wei
Chengyu Huang
Soroush Vosoughi
Jason W. Wei
19
34
0
26 May 2020
BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection
Jihyung Moon
Won Ik Cho
Junbum Lee
30
94
0
26 May 2020
Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media during the COVID-19 Crisis
Bing He
Caleb Ziems
Sandeep Soni
Naren Ramakrishnan
Diyi Yang
Srijan Kumar
32
172
0
25 May 2020
NILE : Natural Language Inference with Faithful Natural Language Explanations
Sawan Kumar
Partha P. Talukdar
XAI
LRM
27
160
0
25 May 2020
Køpsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
Daniel Hershcovich
Miryam de Lhoneux
Artur Kulmizev
E. Pejhan
Joakim Nivre
19
6
0
25 May 2020
Sentiment Analysis: Automatically Detecting Valence, Emotions, and Other Affectual States from Text
Saif M. Mohammad
27
312
0
25 May 2020
Stronger Baselines for Grammatical Error Correction Using Pretrained Encoder-Decoder Model
Satoru Katsumata
Mamoru Komachi
33
53
0
24 May 2020
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
Anne Lauscher
Olga Majewska
Leonardo F. R. Ribeiro
Iryna Gurevych
Nikolai Rozanov
Goran Glavaš
KELM
39
79
0
24 May 2020
Query Resolution for Conversational Search with Limited Supervision
Nikos Voskarides
Dan Li
Pengjie Ren
Evangelos Kanoulas
Maarten de Rijke
30
123
0
24 May 2020
A Novel Distributed Representation of News (DRNews) for Stock Market Predictions
Ye Ma
Lu Zong
Peiwan Wang
AIFin
19
5
0
24 May 2020
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding
Chen Liu
Su Zhu
Zijian Zhao
Ruisheng Cao
Lu Chen
Kai Yu
39
19
0
24 May 2020
L2R2: Leveraging Ranking for Abductive Reasoning
Yunchang Zhu
Liang Pang
Yanyan Lan
Xueqi Cheng
24
14
0
22 May 2020
A Generative Approach to Titling and Clustering Wikipedia Sections
Anjalie Field
S. Rothe
Simon Baumgartner
Cong Yu
Abe Ittycheriah
37
4
0
22 May 2020
End-to-end Named Entity Recognition from English Speech
Hemant Yadav
Sreyan Ghosh
Yi Yu
R. Shah
34
56
0
22 May 2020
Living Machines: A study of atypical animacy
Mariona Coll Ardanuy
F. Nanni
K. Beelen
Kasra Hosseini
R. Ahnert
J. Lawrence
Katherine McDonough
Giorgia Tolfo
Daniel C. S. Wilson
Barbara McGillivray
21
20
0
22 May 2020
Bootstrapping Named Entity Recognition in E-Commerce with Positive Unlabeled Learning
Hanchu Zhang
Leonhard Hennig
Christoph Alt
Changjian Hu
Yao Meng
Chao Wang
35
15
0
22 May 2020
Customized Graph Neural Networks
Yiqi Wang
Yao Ma
Wei Jin
Chaozhuo Li
Charu C. Aggarwal
Jiliang Tang
40
2
0
22 May 2020
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
L. Rasmy
Yang Xiang
Z. Xie
Cui Tao
Degui Zhi
AI4MH
LM&MA
41
662
0
22 May 2020
The Frankfurt Latin Lexicon: From Morphological Expansion and Word Embeddings to SemioGraphs
Alexander Mehler
Bernhard Jussen
T. Geelhaar
Alexander Henlein
Giuseppe Abrami
Daniel Baumartz
Tolga Uslu
Wahed Hemati
27
8
0
21 May 2020
Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation
Weixin Liang
James Zou
Zhou Yu
ELM
40
33
0
21 May 2020
Sequential Recommendation with Self-Attentive Multi-Adversarial Network
Ruiyang Ren
Zhaoyang Liu
Yaliang Li
Wayne Xin Zhao
Hongya Wang
Bolin Ding
Ji-Rong Wen
GAN
14
107
0
21 May 2020
Text-to-Text Pre-Training for Data-to-Text Tasks
Mihir Kale
Abhinav Rastogi
AI4CE
19
200
0
21 May 2020
Stance Prediction and Claim Verification: An Arabic Perspective
Jude Khouja
25
58
0
21 May 2020
Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models
Dan Iter
Kelvin Guu
L. Lansing
Dan Jurafsky
6
77
0
20 May 2020
BERTweet: A pre-trained language model for English Tweets
Dat Quoc Nguyen
Thanh Tien Vu
A. Nguyen
VLM
36
902
0
20 May 2020
A Large-Scale Multi-Document Summarization Dataset from the Wikipedia Current Events Portal
D. Ghalandari
Chris Hokamp
N. Pham
John Glover
Georgiana Ifrim
21
108
0
20 May 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs
Yongkweon Jeon
Baeseong Park
S. Kwon
Byeongwook Kim
Jeongin Yun
Dongsoo Lee
MQ
38
30
0
20 May 2020
Graph-based, Self-Supervised Program Repair from Diagnostic Feedback
Michihiro Yasunaga
Percy Liang
LRM
37
172
0
20 May 2020
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
D. Gao
Linbo Jin
Ben Chen
Minghui Qiu
Peng Li
Yi Wei
Yitao Hu
Haozhe Jasper Wang
OOD
25
133
0
20 May 2020
Normalized Attention Without Probability Cage
Oliver Richter
Roger Wattenhofer
16
21
0
19 May 2020
Adversarial Alignment of Multilingual Models for Extracting Temporal Expressions from Text
Lukas Lange
Anastasiia Iurshina
Heike Adel
Jannik Strötgen
30
29
0
19 May 2020
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text
Felix Hill
Soňa Mokrá
Nathaniel Wong
Tim Harley
LM&Ro
26
81
0
19 May 2020
Staying True to Your Word: (How) Can Attention Become Explanation?
Martin Tutek
Jan Snajder
24
27
0
19 May 2020
The Effect of Moderation on Online Mental Health Conversations
David Wadden
Tal August
Qisheng Li
Tim Althoff
AI4MH
11
44
0
19 May 2020
Table Search Using a Deep Contextualized Language Model
Zhiyu Zoey Chen
M. Trabelsi
J. Heflin
Yinan Xu
Brian D. Davison
LMTD
26
56
0
19 May 2020
Quantifying the Uncertainty of Precision Estimates for Rule based Text Classifiers
J. Nutaro
Özgür Özmen
21
0
0
19 May 2020
Human-like general language processing
Feng Qi
Guanjun Jiang
AI4CE
27
2
0
19 May 2020
GPT-too: A language-model-first approach for AMR-to-text generation
Manuel Mager
Ramón Fernández Astudillo
Tahira Naseem
Md Arafat Sultan
Young-Suk Lee
Radu Florian
Salim Roukos
32
99
0
18 May 2020
Contextual Embeddings: When Are They Worth It?
Simran Arora
Avner May
Jian Zhang
Christopher Ré
21
59
0
18 May 2020
Are All Languages Created Equal in Multilingual BERT?
Shijie Wu
Mark Dredze
32
318
0
18 May 2020
Question-Driven Summarization of Answers to Consumer Health Questions
Max E. Savery
Asma Ben Abacha
Soumya Gayen
Dina Demner-Fushman
35
78
0
18 May 2020
Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations
Sam Coope
Tyler Farghly
D. Gerz
Ivan Vulić
Matthew Henderson
32
62
0
18 May 2020
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Yosuke Higuchi
Shinji Watanabe
Nanxin Chen
Tetsuji Ogawa
Tetsunori Kobayashi
19
137
0
18 May 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
11
147
0
18 May 2020
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun Yu
Xiao Ma
Jiawei Ren
Haiyu Zhao
Shuai Yi
29
460
0
18 May 2020
Text Classification with Few Examples using Controlled Generalization
A. Mahabal
Jason Baldridge
Burcu Karagol Ayan
Vincent Perot
Dan Roth
OOD
AI4CE
28
11
0
18 May 2020
Previous
1
2
3
...
344
345
346
...
374
375
376
Next