ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 18,765 papers shown
Title
Active Imitation Learning with Noisy Guidance
Active Imitation Learning with Noisy Guidance
Kianté Brantley
Amr Sharaf
Hal Daumé
27
23
0
26 May 2020
Exploring aspects of similarity between spoken personal narratives by
  disentangling them into narrative clause types
Exploring aspects of similarity between spoken personal narratives by disentangling them into narrative clause types
Belén Saldías
D. Roy
37
13
0
26 May 2020
Machine Learning-Based Unbalance Detection of a Rotating Shaft Using
  Vibration Data
Machine Learning-Based Unbalance Detection of a Rotating Shaft Using Vibration Data
Oliver Mey
Willi Neudeck
André Schneider
Olaf Enge-Rosenblatt
22
28
0
26 May 2020
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
Kostiantyn Omelianchuk
Vitaliy Atrasevych
Artem Chernodub
Oleksandr Skurzhanskyi
36
305
0
26 May 2020
What Are People Asking About COVID-19? A Question Classification Dataset
What Are People Asking About COVID-19? A Question Classification Dataset
Jerry W. Wei
Chengyu Huang
Soroush Vosoughi
Jason W. Wei
19
34
0
26 May 2020
BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection
BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection
Jihyung Moon
Won Ik Cho
Junbum Lee
30
94
0
26 May 2020
Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media
  during the COVID-19 Crisis
Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media during the COVID-19 Crisis
Bing He
Caleb Ziems
Sandeep Soni
Naren Ramakrishnan
Diyi Yang
Srijan Kumar
32
172
0
25 May 2020
NILE : Natural Language Inference with Faithful Natural Language
  Explanations
NILE : Natural Language Inference with Faithful Natural Language Explanations
Sawan Kumar
Partha P. Talukdar
XAI
LRM
27
160
0
25 May 2020
Køpsala: Transition-Based Graph Parsing via Efficient Training and
  Effective Encoding
Køpsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
Daniel Hershcovich
Miryam de Lhoneux
Artur Kulmizev
E. Pejhan
Joakim Nivre
19
6
0
25 May 2020
Sentiment Analysis: Automatically Detecting Valence, Emotions, and Other
  Affectual States from Text
Sentiment Analysis: Automatically Detecting Valence, Emotions, and Other Affectual States from Text
Saif M. Mohammad
27
312
0
25 May 2020
Stronger Baselines for Grammatical Error Correction Using Pretrained
  Encoder-Decoder Model
Stronger Baselines for Grammatical Error Correction Using Pretrained Encoder-Decoder Model
Satoru Katsumata
Mamoru Komachi
33
53
0
24 May 2020
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge
  Injection into Pretrained Transformers
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
Anne Lauscher
Olga Majewska
Leonardo F. R. Ribeiro
Iryna Gurevych
Nikolai Rozanov
Goran Glavaš
KELM
39
79
0
24 May 2020
Query Resolution for Conversational Search with Limited Supervision
Query Resolution for Conversational Search with Limited Supervision
Nikos Voskarides
Dan Li
Pengjie Ren
Evangelos Kanoulas
Maarten de Rijke
30
123
0
24 May 2020
A Novel Distributed Representation of News (DRNews) for Stock Market
  Predictions
A Novel Distributed Representation of News (DRNews) for Stock Market Predictions
Ye Ma
Lu Zong
Peiwan Wang
AIFin
19
5
0
24 May 2020
Jointly Encoding Word Confusion Network and Dialogue Context with BERT
  for Spoken Language Understanding
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding
Chen Liu
Su Zhu
Zijian Zhao
Ruisheng Cao
Lu Chen
Kai Yu
39
19
0
24 May 2020
L2R2: Leveraging Ranking for Abductive Reasoning
L2R2: Leveraging Ranking for Abductive Reasoning
Yunchang Zhu
Liang Pang
Yanyan Lan
Xueqi Cheng
24
14
0
22 May 2020
A Generative Approach to Titling and Clustering Wikipedia Sections
A Generative Approach to Titling and Clustering Wikipedia Sections
Anjalie Field
S. Rothe
Simon Baumgartner
Cong Yu
Abe Ittycheriah
37
4
0
22 May 2020
End-to-end Named Entity Recognition from English Speech
End-to-end Named Entity Recognition from English Speech
Hemant Yadav
Sreyan Ghosh
Yi Yu
R. Shah
34
56
0
22 May 2020
Living Machines: A study of atypical animacy
Living Machines: A study of atypical animacy
Mariona Coll Ardanuy
F. Nanni
K. Beelen
Kasra Hosseini
R. Ahnert
J. Lawrence
Katherine McDonough
Giorgia Tolfo
Daniel C. S. Wilson
Barbara McGillivray
21
20
0
22 May 2020
Bootstrapping Named Entity Recognition in E-Commerce with Positive
  Unlabeled Learning
Bootstrapping Named Entity Recognition in E-Commerce with Positive Unlabeled Learning
Hanchu Zhang
Leonhard Hennig
Christoph Alt
Changjian Hu
Yao Meng
Chao Wang
35
15
0
22 May 2020
Customized Graph Neural Networks
Customized Graph Neural Networks
Yiqi Wang
Yao Ma
Wei Jin
Chaozhuo Li
Charu C. Aggarwal
Jiliang Tang
40
2
0
22 May 2020
Med-BERT: pre-trained contextualized embeddings on large-scale
  structured electronic health records for disease prediction
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
L. Rasmy
Yang Xiang
Z. Xie
Cui Tao
Degui Zhi
AI4MH
LM&MA
41
662
0
22 May 2020
The Frankfurt Latin Lexicon: From Morphological Expansion and Word
  Embeddings to SemioGraphs
The Frankfurt Latin Lexicon: From Morphological Expansion and Word Embeddings to SemioGraphs
Alexander Mehler
Bernhard Jussen
T. Geelhaar
Alexander Henlein
Giuseppe Abrami
Daniel Baumartz
Tolga Uslu
Wahed Hemati
27
8
0
21 May 2020
Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for
  Automatic Dialog Evaluation
Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation
Weixin Liang
James Zou
Zhou Yu
ELM
40
33
0
21 May 2020
Sequential Recommendation with Self-Attentive Multi-Adversarial Network
Sequential Recommendation with Self-Attentive Multi-Adversarial Network
Ruiyang Ren
Zhaoyang Liu
Yaliang Li
Wayne Xin Zhao
Hongya Wang
Bolin Ding
Ji-Rong Wen
GAN
14
107
0
21 May 2020
Text-to-Text Pre-Training for Data-to-Text Tasks
Text-to-Text Pre-Training for Data-to-Text Tasks
Mihir Kale
Abhinav Rastogi
AI4CE
19
200
0
21 May 2020
Stance Prediction and Claim Verification: An Arabic Perspective
Stance Prediction and Claim Verification: An Arabic Perspective
Jude Khouja
25
58
0
21 May 2020
Pretraining with Contrastive Sentence Objectives Improves Discourse
  Performance of Language Models
Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models
Dan Iter
Kelvin Guu
L. Lansing
Dan Jurafsky
6
77
0
20 May 2020
BERTweet: A pre-trained language model for English Tweets
BERTweet: A pre-trained language model for English Tweets
Dat Quoc Nguyen
Thanh Tien Vu
A. Nguyen
VLM
36
902
0
20 May 2020
A Large-Scale Multi-Document Summarization Dataset from the Wikipedia
  Current Events Portal
A Large-Scale Multi-Document Summarization Dataset from the Wikipedia Current Events Portal
D. Ghalandari
Chris Hokamp
N. Pham
John Glover
Georgiana Ifrim
21
108
0
20 May 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based
  Quantized DNNs
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs
Yongkweon Jeon
Baeseong Park
S. Kwon
Byeongwook Kim
Jeongin Yun
Dongsoo Lee
MQ
38
30
0
20 May 2020
Graph-based, Self-Supervised Program Repair from Diagnostic Feedback
Graph-based, Self-Supervised Program Repair from Diagnostic Feedback
Michihiro Yasunaga
Percy Liang
LRM
37
172
0
20 May 2020
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal
  Retrieval
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
D. Gao
Linbo Jin
Ben Chen
Minghui Qiu
Peng Li
Yi Wei
Yitao Hu
Haozhe Jasper Wang
OOD
25
133
0
20 May 2020
Normalized Attention Without Probability Cage
Normalized Attention Without Probability Cage
Oliver Richter
Roger Wattenhofer
16
21
0
19 May 2020
Adversarial Alignment of Multilingual Models for Extracting Temporal
  Expressions from Text
Adversarial Alignment of Multilingual Models for Extracting Temporal Expressions from Text
Lukas Lange
Anastasiia Iurshina
Heike Adel
Jannik Strötgen
30
29
0
19 May 2020
Human Instruction-Following with Deep Reinforcement Learning via
  Transfer-Learning from Text
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text
Felix Hill
Soňa Mokrá
Nathaniel Wong
Tim Harley
LM&Ro
26
81
0
19 May 2020
Staying True to Your Word: (How) Can Attention Become Explanation?
Staying True to Your Word: (How) Can Attention Become Explanation?
Martin Tutek
Jan Snajder
24
27
0
19 May 2020
The Effect of Moderation on Online Mental Health Conversations
The Effect of Moderation on Online Mental Health Conversations
David Wadden
Tal August
Qisheng Li
Tim Althoff
AI4MH
11
44
0
19 May 2020
Table Search Using a Deep Contextualized Language Model
Table Search Using a Deep Contextualized Language Model
Zhiyu Zoey Chen
M. Trabelsi
J. Heflin
Yinan Xu
Brian D. Davison
LMTD
26
56
0
19 May 2020
Quantifying the Uncertainty of Precision Estimates for Rule based Text
  Classifiers
Quantifying the Uncertainty of Precision Estimates for Rule based Text Classifiers
J. Nutaro
Özgür Özmen
21
0
0
19 May 2020
Human-like general language processing
Human-like general language processing
Feng Qi
Guanjun Jiang
AI4CE
27
2
0
19 May 2020
GPT-too: A language-model-first approach for AMR-to-text generation
GPT-too: A language-model-first approach for AMR-to-text generation
Manuel Mager
Ramón Fernández Astudillo
Tahira Naseem
Md Arafat Sultan
Young-Suk Lee
Radu Florian
Salim Roukos
32
99
0
18 May 2020
Contextual Embeddings: When Are They Worth It?
Contextual Embeddings: When Are They Worth It?
Simran Arora
Avner May
Jian Zhang
Christopher Ré
21
59
0
18 May 2020
Are All Languages Created Equal in Multilingual BERT?
Are All Languages Created Equal in Multilingual BERT?
Shijie Wu
Mark Dredze
32
318
0
18 May 2020
Question-Driven Summarization of Answers to Consumer Health Questions
Question-Driven Summarization of Answers to Consumer Health Questions
Max E. Savery
Asma Ben Abacha
Soumya Gayen
Dina Demner-Fushman
35
78
0
18 May 2020
Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained
  Conversational Representations
Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations
Sam Coope
Tyler Farghly
D. Gerz
Ivan Vulić
Matthew Henderson
32
62
0
18 May 2020
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Yosuke Higuchi
Shinji Watanabe
Nanxin Chen
Tetsuji Ogawa
Tetsunori Kobayashi
19
137
0
18 May 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio
  Representation
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
11
147
0
18 May 2020
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory
  Prediction
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun Yu
Xiao Ma
Jiawei Ren
Haiyu Zhao
Shuai Yi
29
460
0
18 May 2020
Text Classification with Few Examples using Controlled Generalization
Text Classification with Few Examples using Controlled Generalization
A. Mahabal
Jason Baldridge
Burcu Karagol Ayan
Vincent Perot
Dan Roth
OOD
AI4CE
28
11
0
18 May 2020
Previous
123...344345346...374375376
Next