ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM, SSL, SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,641 papers shown
Title
Understanding Mobile GUI: from Pixel-Words to Screen-Sentences
Jingwen Fu
Xiaoyi Zhang
Yuwang Wang
Wenjun Zeng
Sam Yang
Grayson Hilliard
81
15
0
25 May 2021
Focus Attention: Promoting Faithfulness and Diversity in Summarization
Rahul Aralikatte
Shashi Narayan
Joshua Maynez
S. Rothe
Ryan T. McDonald
117
46
0
25 May 2021
Empirical Error Modeling Improves Robustness of Noisy Neural Sequence Labeling
Marcin Namysl
Sven Behnke
Joachim Kohler
NoLa
51
5
0
25 May 2021
Estimating Redundancy in Clinical Text
Thomas Searle
Zina M. Ibrahim
J. Teo
Richard J. B. Dobson
69
21
0
25 May 2021
Argument Undermining: Counter-Argument Generation by Attacking Weak Premises
Milad Alshomary
S. Syed
Arkajit Dhar
Martin Potthast
Henning Wachsmuth
54
25
0
25 May 2021
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
Yuanmeng Yan
Rumei Li
Sirui Wang
Fuzheng Zhang
Wei Wu
Weiran Xu
SSL
138
563
0
25 May 2021
Writing by Memorizing: Hierarchical Retrieval-based Medical Report Generation
Xingyi Yang
Muchao Ye
Quanzeng You
Fenglong Ma
MedIm
57
38
0
25 May 2021
Transfer Learning and Curriculum Learning in Sokoban
Zhao Yang
Mike Preuss
Aske Plaat
OffRL
50
3
0
25 May 2021
ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
Weihong Lin
Qifang Gao
Lei-huan Sun
Zhuoyao Zhong
Kaiqin Hu
Qin Ren
Qiang Huo
72
39
0
25 May 2021
TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference
Deming Ye
Yankai Lin
Yufei Huang
Maosong Sun
MQ
84
65
0
25 May 2021
Personalized Transformer for Explainable Recommendation
Lei Li
Yongfeng Zhang
Li Chen
136
141
0
25 May 2021
Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation
Tao Tu
Q. Ping
Govind Thattai
Gokhan Tur
Premkumar Natarajan
75
18
0
24 May 2021
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
Zhengyuan Yang
Songyang Zhang
Liwei Wang
Jiebo Luo
3DPC
122
126
0
24 May 2021
True Few-Shot Learning with Language Models
Ethan Perez
Douwe Kiela
Kyunghyun Cho
169
440
0
24 May 2021
Diacritics Restoration using BERT with Analysis on Czech language
Jakub Náplava
Milan Straka
Jana Straková
39
11
0
24 May 2021
VANiLLa : Verbalized Answers in Natural Language at Large Scale
Debanjali Biswas
Mohnish Dubey
Md. Rony
Jens Lehmann
43
9
0
24 May 2021
View Distillation with Unlabeled Data for Extracting Adverse Drug Effects from User-Generated Data
Payam Karisani
Jinho Choi
Li Xiong
MedIm
60
2
0
24 May 2021
Classifying Math KCs via Task-Adaptive Pre-Trained BERT
J. Shen
Michiharu Yamashita
Ethan Prihar
Neil T. Heffernan
Xintao Wu
Sean McGrew
Dongwon Lee
28
0
0
24 May 2021
Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training
Jong Hak Moon
HyunGyung Lee
W. Shin
Young-Hak Kim
Edward Choi
MedIm
113
161
0
24 May 2021
Neural Language Models for Nineteenth-Century English
Kasra Hosseini
K. Beelen
Giovanni Colavizza
Mariona Coll Ardanuy
89
18
0
24 May 2021
RobeCzech: Czech RoBERTa, a monolingual contextualized language representation model
Milan Straka
Jakub Náplava
Jana Straková
David Samuel
75
47
0
24 May 2021
DaN+: Danish Nested Named Entities and Lexical Normalization
Barbara Plank
Kristian Nørgaard Jensen
Rob van der Goot
79
36
0
24 May 2021
Neural Machine Translation with Monolingual Translation Memory
Deng Cai
Yan Wang
Huayang Li
Wai Lam
Lemao Liu
99
103
0
24 May 2021
PTR: Prompt Tuning with Rules for Text Classification
Xu Han
Weilin Zhao
Ning Ding
Zhiyuan Liu
Maosong Sun
VLM
110
533
0
24 May 2021
Cross-lingual Text Classification with Heterogeneous Graph Neural Network
ZiYun Wang
Xuan Liu
Pei-Yin Yang
Shixing Liu
Zhisheng Wang
84
32
0
24 May 2021
Distantly-Supervised Long-Tailed Relation Extraction Using Constraint Graphs
Tianming Liang
Yang Liu
Xiaoyan Liu
Hao Zhang
Gaurav Sharma
Maozu Guo
101
25
0
24 May 2021
StructuralLM: Structural Pre-training for Form Understanding
Chenliang Li
Bin Bi
Ming Yan
Wei Wang
Songfang Huang
Fei Huang
Luo Si
LMTD, AI4CE
120
134
0
24 May 2021
Improved OOD Generalization via Adversarial Training and Pre-training
Mingyang Yi
Lu Hou
Jiacheng Sun
Lifeng Shang
Xin Jiang
Qun Liu
Zhi-Ming Ma
VLM
79
84
0
24 May 2021
Using Adversarial Attacks to Reveal the Statistical Bias in Machine Reading Comprehension Models
Jieyu Lin
Jiajie Zou
Nai Ding
AAML
81
45
0
24 May 2021
Unsupervised Speech Recognition
Alexei Baevski
Wei-Ning Hsu
Alexis Conneau
Michael Auli
SSL
169
275
0
24 May 2021
One4all User Representation for Recommender Systems in E-commerce
Kyuyong Shin
Hanock Kwak
KyungHyun Kim
Minkyu Kim
Young-Jin Park
Jisu Jeong
Seungjae Jung
69
28
0
24 May 2021
Automatic Product Ontology Extraction from Textual Reviews
Joel Oksanen
O. Cocarascu
Francesca Toni
47
4
0
23 May 2021
Structural Pre-training for Dialogue Comprehension
Zhuosheng Zhang
Hai Zhao
94
31
0
23 May 2021
OntoED: Low-resource Event Detection with Ontology Embedding
Shumin Deng
Ningyu Zhang
Luoqiu Li
Hui Chen
Huaixiao Tou
Mosha Chen
Fei Huang
Huajun Chen
101
56
0
23 May 2021
CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding
Dustin Wright
Isabelle Augenstein
103
25
0
23 May 2021
Killing One Bird with Two Stones: Model Extraction and Attribute Inference Attacks against BERT-based APIs
Chen Chen
Xuanli He
Lingjuan Lyu
Fangzhao Wu
SILM, MIACV
102
8
0
23 May 2021
DepressionNet: A Novel Summarization Boosted Deep Framework for Depression Detection on Social Media
Hamad Zogan
Imran Razzak
Shoaib Jameel
Guandong Xu
83
60
0
23 May 2021
Hypergraph Pre-training with Graph Neural Networks
Boxin Du
Changhe Yuan
Robert A. Barton
T. Neiman
Hanghang Tong
AI4CE
62
14
0
23 May 2021
AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly
Yuchen Jin
Dinesh Manocha
Liangyu Zhao
Yibo Zhu
Chuanxiong Guo
Marco Canini
Arvind Krishnamurthy
95
19
0
22 May 2021
Denoising Noisy Neural Networks: A Bayesian Approach with Compensation
Yulin Shao
Soung Chang Liew
Deniz Gunduz
94
14
0
22 May 2021
CEREC: A Corpus for Entity Resolution in Email Conversations
Parag Dakle
D. Moldovan
25
4
0
21 May 2021
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining
Ivana Kvapilíková
Mikel Artetxe
Gorka Labaka
Eneko Agirre
Ondrej Bojar
SSL
74
36
0
21 May 2021
Pretrained Language Models for Text Generation: A Survey
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Ji-Rong Wen
LM&MA, VLM, SyDa
106
192
0
21 May 2021
Stance Detection with BERT Embeddings for Credibility Analysis of Information on Social Media
Hema R. Karande
Rahee Walambe
Victor A. Benjamin
K. Kotecha
TS Raghu
95
39
0
21 May 2021
Semantic Representation for Dialogue Modeling
Xuefeng Bai
Yulong Chen
Linfeng Song
Yue Zhang
129
53
0
21 May 2021
A Non-Linear Structural Probe
Jennifer C. White
Tiago Pimentel
Naomi Saphra
Ryan Cotterell
55
25
0
21 May 2021
Revisiting the Negative Data of Distantly Supervised Relation Extraction
Chenhao Xie
Jiaqing Liang
Jingping Liu
Chengsong Huang
Wenhao Huang
Yanghua Xiao
96
29
0
21 May 2021
Training Bi-Encoders for Word Sense Disambiguation
Harsh Kohli
100
4
0
21 May 2021
Towards Automatic Comparison of Data Privacy Documents: A Preliminary Experiment on GDPR-like Laws
Kornraphop Kawintiranon
Yaguang Liu
AILaw
26
4
0
21 May 2021
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking
Zhiyi Ma
Kawin Ethayarajh
Tristan Thrush
Somya Jain
Ledell Yu Wu
Robin Jia
Christopher Potts
Adina Williams
Douwe Kiela
ELM
117
59
0
21 May 2021