ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,688 papers shown
Title
Membership Inference Attacks on Machine Learning: A Survey
Membership Inference Attacks on Machine Learning: A Survey
Hongsheng Hu
Z. Salcic
Lichao Sun
Gillian Dobbie
Philip S. Yu
Xuyun Zhang
MIACV
125
449
0
14 Mar 2021
SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple
  Levels
SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple Levels
Chenliang Li
Ming Yan
Haiyang Xu
Fuli Luo
Wei Wang
Bin Bi
Songfang Huang
VLM
74
36
0
14 Mar 2021
Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent
  Prediction and Slot Filling
Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling
Jitin Krishnan
Antonios Anastasopoulos
Hemant Purohit
Huzefa Rangwala
99
41
0
13 Mar 2021
Context Transformer with Stacked Pointer Networks for Conversational
  Question Answering over Knowledge Graphs
Context Transformer with Stacked Pointer Networks for Conversational Question Answering over Knowledge Graphs
Joan Plepi
Endri Kacupaj
Kuldeep Singh
Harsh Thakkar
Jens Lehmann
GNN
121
23
0
13 Mar 2021
Bidirectional Machine Reading Comprehension for Aspect Sentiment Triplet
  Extraction
Bidirectional Machine Reading Comprehension for Aspect Sentiment Triplet Extraction
Shaowei Chen
Yu Wang
Jie Liu
Yuelin Wang
61
180
0
13 Mar 2021
Simpson's Bias in NLP Training
Simpson's Bias in NLP Training
Fei Yuan
Longtu Zhang
Bojun Huang
Yaobo Liang
AI4CE
42
3
0
13 Mar 2021
Text Mining of Stocktwits Data for Predicting Stock Prices
Text Mining of Stocktwits Data for Predicting Stock Prices
Mukul Jaggi
Priyanka Mandal
Shreya Narang
Usman Naseem
Matloob Khushi
AIFin
73
41
0
13 Mar 2021
Approximating How Single Head Attention Learns
Approximating How Single Head Attention Learns
Charles Burton Snell
Ruiqi Zhong
Dan Klein
Jacob Steinhardt
MLT
71
31
0
13 Mar 2021
Graph Ensemble Learning over Multiple Dependency Trees for Aspect-level
  Sentiment Classification
Graph Ensemble Learning over Multiple Dependency Trees for Aspect-level Sentiment Classification
Xiaochen Hou
Peng Qi
Guangtao Wang
Rex Ying
Jing Huang
Xiaodong He
Bowen Zhou
77
60
0
12 Mar 2021
Cooperative Self-training of Machine Reading Comprehension
Cooperative Self-training of Machine Reading Comprehension
Hongyin Luo
Shang-Wen Li
Ming Gao
Seunghak Yu
James R. Glass
SyDaRALM
59
12
0
12 Mar 2021
Explaining and Improving BERT Performance on Lexical Semantic Change
  Detection
Explaining and Improving BERT Performance on Lexical Semantic Change Detection
Severin Laicher
Sinan Kurtyigit
Dominik Schlechtweg
Jonas Kuhn
Sabine Schulte im Walde
80
54
0
12 Mar 2021
Constrained Text Generation with Global Guidance -- Case Study on
  CommonGen
Constrained Text Generation with Global Guidance -- Case Study on CommonGen
Yixian Liu
Liwen Zhang
Wenjuan Han
Yue Zhang
Kewei Tu
87
10
0
12 Mar 2021
Is BERT a Cross-Disciplinary Knowledge Learner? A Surprising Finding of
  Pre-trained Models' Transferability
Is BERT a Cross-Disciplinary Knowledge Learner? A Surprising Finding of Pre-trained Models' Transferability
Wei-Tsung Kao
Hung-yi Lee
52
16
0
12 Mar 2021
Training Networks in Null Space of Feature Covariance for Continual
  Learning
Training Networks in Null Space of Feature Covariance for Continual Learning
Shipeng Wang
Xiaorong Li
Jian Sun
Zongben Xu
CLL
103
144
0
12 Mar 2021
Inductive Relation Prediction by BERT
Inductive Relation Prediction by BERT
H. Zha
Zhiyu Zoey Chen
Xifeng Yan
146
58
0
12 Mar 2021
A Weakly Supervised Approach for Classifying Stance in Twitter Replies
A Weakly Supervised Approach for Classifying Stance in Twitter Replies
Sumeet Kumar
R. Villa-Cox
M. Babcock
Kathleen M. Carley
33
1
0
12 Mar 2021
SuperMeshing: A New Deep Learning Architecture for Increasing the Mesh
  Density of Metal Forming Stress Field with Attention Mechanism and Perceptual
  Features
SuperMeshing: A New Deep Learning Architecture for Increasing the Mesh Density of Metal Forming Stress Field with Attention Mechanism and Perceptual Features
Qingfeng Xu
Zhenguo Nie
Handing Xu
Hao Zhou
Xin-Jun Liu
AI4CE
36
1
0
12 Mar 2021
Severity Quantification and Lesion Localization of COVID-19 on CXR using
  Vision Transformer
Severity Quantification and Lesion Localization of COVID-19 on CXR using Vision Transformer
Gwanghyun Kim
Sangjoon Park
Y. Oh
J. Seo
Sang Min Lee
Jin Hwan Kim
Sungjun Moon
Jae-Kwang Lim
J. C. Ye
ViTMedIm
83
4
0
12 Mar 2021
Vision Transformer for COVID-19 CXR Diagnosis using Chest X-ray Feature
  Corpus
Vision Transformer for COVID-19 CXR Diagnosis using Chest X-ray Feature Corpus
Sangjoon Park
Gwanghyun Kim
Y. Oh
J. Seo
Sang Min Lee
Jin Hwan Kim
Sungjun Moon
Jae-Kwang Lim
J. C. Ye
ViTMedIm
98
34
0
12 Mar 2021
Towards Socially Intelligent Agents with Mental State Transition and
  Human Utility
Towards Socially Intelligent Agents with Mental State Transition and Human Utility
Liang Qiu
Yizhou Zhao
Yuan Liang
Pan Lu
Weiyan Shi
Zhou Yu
Song-Chun Zhu
LLMAG
91
15
0
12 Mar 2021
Continuous 3D Multi-Channel Sign Language Production via Progressive
  Transformers and Mixture Density Networks
Continuous 3D Multi-Channel Sign Language Production via Progressive Transformers and Mixture Density Networks
Ben Saunders
Necati Cihan Camgöz
Richard Bowden
SLR
73
80
0
11 Mar 2021
Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU
  Models
Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU Models
Mengnan Du
Varun Manjunatha
R. Jain
Ruchi Deshpande
Franck Dernoncourt
Jiuxiang Gu
Tong Sun
Helen Zhou
110
107
0
11 Mar 2021
On Improving Deep Learning Trace Analysis with System Call Arguments
On Improving Deep Learning Trace Analysis with System Call Arguments
Quentin Fournier
Daniel Aloise
S. V. Azhari
François Tetreault
64
10
0
11 Mar 2021
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language
  Representation
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation
J. Clark
Dan Garrette
Iulia Turc
John Wieting
129
224
0
11 Mar 2021
COVID-19 Smart Chatbot Prototype for Patient Monitoring
COVID-19 Smart Chatbot Prototype for Patient Monitoring
Hannah Lei
Weiqi Lu
A. Ji
Emmett Bertram
Paul Gao
Xiaoqian Jiang
Arko Barman
64
4
0
11 Mar 2021
MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding
MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding
Tuhin Chakrabarty
Xurui Zhang
Smaranda Muresan
Nanyun Peng
73
70
0
11 Mar 2021
ENTRUST: Argument Reframing with Language Models and Entailment
ENTRUST: Argument Reframing with Language Models and Entailment
Tuhin Chakrabarty
Christopher Hidey
Smaranda Muresan
71
13
0
11 Mar 2021
The Interplay of Variant, Size, and Task Type in Arabic Pre-trained
  Language Models
The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models
Go Inoue
Bashar Alhafni
Nurpeiis Baimukan
Houda Bouamor
Nizar Habash
115
237
0
11 Mar 2021
Domain State Tracking for a Simplified Dialogue System
Domain State Tracking for a Simplified Dialogue System
Hyunmin Jeon
G. G. Lee
81
19
0
11 Mar 2021
ASAP: A Chinese Review Dataset Towards Aspect Category Sentiment
  Analysis and Rating Prediction
ASAP: A Chinese Review Dataset Towards Aspect Category Sentiment Analysis and Rating Prediction
Jiahao Bu
Lei Ren
Shuang Zheng
Yang Yang
Jingang Wang
Fuzheng Zhang
Wei Wu
77
69
0
11 Mar 2021
Fair Mixup: Fairness via Interpolation
Fair Mixup: Fairness via Interpolation
Ching-Yao Chuang
Youssef Mroueh
79
140
0
11 Mar 2021
Conversational Answer Generation and Factuality for Reading
  Comprehension Question-Answering
Conversational Answer Generation and Factuality for Reading Comprehension Question-Answering
Stanislav Peshterliev
Barlas Oğuz
Debojeet Chatterjee
Hakan Inan
Vikas Bhardwaj
39
4
0
11 Mar 2021
Read Like Humans: Autonomous, Bidirectional and Iterative Language
  Modeling for Scene Text Recognition
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
Shancheng Fang
Hongtao Xie
Yuxin Wang
Zhendong Mao
Yongdong Zhang
90
306
0
11 Mar 2021
ReinforceBug: A Framework to Generate Adversarial Textual Examples
ReinforceBug: A Framework to Generate Adversarial Textual Examples
Bushra Sabir
M. Babar
R. Gaire
SILMAAML
64
3
0
11 Mar 2021
Towards Multi-Sense Cross-Lingual Alignment of Contextual Embeddings
Towards Multi-Sense Cross-Lingual Alignment of Contextual Embeddings
Linlin Liu
Thien Hai Nguyen
Shafiq Joty
Lidong Bing
Luo Si
108
5
0
11 Mar 2021
Full Page Handwriting Recognition via Image to Sequence Extraction
Full Page Handwriting Recognition via Image to Sequence Extraction
Sumeet S. Singh
Sergey Karayev
81
55
0
11 Mar 2021
Improving Adversarial Robustness via Channel-wise Activation Suppressing
Improving Adversarial Robustness via Channel-wise Activation Suppressing
Yang Bai
Yuyuan Zeng
Yong Jiang
Shutao Xia
Xingjun Ma
Yisen Wang
AAML
102
131
0
11 Mar 2021
LightMBERT: A Simple Yet Effective Method for Multilingual BERT
  Distillation
LightMBERT: A Simple Yet Effective Method for Multilingual BERT Distillation
Xiaoqi Jiao
Yichun Yin
Lifeng Shang
Xin Jiang
Xiao Chen
Linlin Li
Fang Wang
Qun Liu
65
9
0
11 Mar 2021
FairFil: Contrastive Neural Debiasing Method for Pretrained Text
  Encoders
FairFil: Contrastive Neural Debiasing Method for Pretrained Text Encoders
Pengyu Cheng
Weituo Hao
Siyang Yuan
Shijing Si
Lawrence Carin
82
105
0
11 Mar 2021
ReportAGE: Automatically extracting the exact age of Twitter users based
  on self-reports in tweets
ReportAGE: Automatically extracting the exact age of Twitter users based on self-reports in tweets
A. Klein
A. Magge
G. Gonzalez-Hernandez
28
20
0
10 Mar 2021
Unified Pre-training for Program Understanding and Generation
Unified Pre-training for Program Understanding and Generation
Wasi Uddin Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
162
775
0
10 Mar 2021
Hurdles to Progress in Long-form Question Answering
Hurdles to Progress in Long-form Question Answering
Kalpesh Krishna
Aurko Roy
Mohit Iyyer
80
200
0
10 Mar 2021
CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Dan Hendrycks
Collin Burns
Anya Chen
Spencer Ball
ELMAILaw
90
195
0
10 Mar 2021
Quantization-Guided Training for Compact TinyML Models
Quantization-Guided Training for Compact TinyML Models
Sedigh Ghamari
Koray Ozcan
Thu Dinh
A. Melnikov
Juan Carvajal
Jan Ernst
S. Chai
MQ
67
18
0
10 Mar 2021
U-Net Transformer: Self and Cross Attention for Medical Image
  Segmentation
U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
Olivier Petit
Nicolas Thome
Clément Rambour
L. Soler
ViTMedIm
116
251
0
10 Mar 2021
A Result based Portable Framework for Spoken Language Understanding
A Result based Portable Framework for Spoken Language Understanding
Lizhi Cheng
Weijia Jia
Wenmian Yang
70
8
0
10 Mar 2021
DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End
  Information Extraction
DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End Information Extraction
Freddy Chongtat Chua
Nigel P. Duffy
87
7
0
10 Mar 2021
Combining Context-Free and Contextualized Representations for Arabic
  Sarcasm Detection and Sentiment Identification
Combining Context-Free and Contextualized Representations for Arabic Sarcasm Detection and Sentiment Identification
Amey Hengle
Atharva Kshirsagar
Shaily Desai
M. Marathe
34
13
0
09 Mar 2021
Select, Substitute, Search: A New Benchmark for Knowledge-Augmented
  Visual Question Answering
Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering
Aman Jain
Mayank Kothyari
Vishwajeet Kumar
Preethi Jyothi
Ganesh Ramakrishnan
Soumen Chakrabarti
68
36
0
09 Mar 2021
When is it permissible for artificial intelligence to lie? A trust-based
  approach
When is it permissible for artificial intelligence to lie? A trust-based approach
Tae Wan Kim
Tong Lu
Lu
Kyusong Lee
Zhaoqi Cheng
Yanhan Tang
J. N. Hooker
56
4
0
09 Mar 2021
Previous
123...355356357...472473474
Next