ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,573 papers shown
Title
Multi-Task Self-Supervised Learning for Disfluency Detection
Multi-Task Self-Supervised Learning for Disfluency Detection
Shaolei Wang
Wanxiang Che
Qi Liu
Pengda Qin
Ting Liu
William Yang Wang
SSL
76
56
0
15 Aug 2019
Towards Debiasing Fact Verification Models
Towards Debiasing Fact Verification Models
Tal Schuster
Darsh J. Shah
Yun Jie Serene Yeo
Daniel Filizzola
Enrico Santus
Regina Barzilay
124
212
0
14 Aug 2019
Debiasing Personal Identities in Toxicity Classification
Debiasing Personal Identities in Toxicity Classification
Apik Zorian
Chandra Shekar Bikkanur
27
2
0
14 Aug 2019
SG-Net: Syntax-Guided Machine Reading Comprehension
SG-Net: Syntax-Guided Machine Reading Comprehension
Zhuosheng Zhang
Yuwei Wu
Junru Zhou
Sufeng Duan
Hai Zhao
Rui Wang
100
188
0
14 Aug 2019
FlowDelta: Modeling Flow Information Gain in Reasoning for
  Conversational Machine Comprehension
FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension
Yi-Ting Yeh
Yun-Nung Chen
96
41
0
14 Aug 2019
X-WikiRE: A Large, Multilingual Resource for Relation Extraction as
  Machine Comprehension
X-WikiRE: A Large, Multilingual Resource for Relation Extraction as Machine Comprehension
Mostafa Abdou
Cezar Sas
Rahul Aralikatte
Isabelle Augenstein
Anders Søgaard
70
8
0
14 Aug 2019
Fusion of Detected Objects in Text for Visual Question Answering
Fusion of Detected Objects in Text for Visual Question Answering
Chris Alberti
Jeffrey Ling
Michael Collins
David Reitter
97
173
0
14 Aug 2019
FlexNER: A Flexible LSTM-CNN Stack Framework for Named Entity
  Recognition
FlexNER: A Flexible LSTM-CNN Stack Framework for Named Entity Recognition
Hongyin Zhu
Wenpeng Hu
Yi Zeng
40
5
0
14 Aug 2019
Unsupervised Out-of-Distribution Detection by Maximum Classifier
  Discrepancy
Unsupervised Out-of-Distribution Detection by Maximum Classifier Discrepancy
Qing Yu
Kiyoharu Aizawa
OODD
72
168
0
14 Aug 2019
Entity-aware ELMo: Learning Contextual Entity Representation for Entity
  Disambiguation
Entity-aware ELMo: Learning Contextual Entity Representation for Entity Disambiguation
Hamed Shahbazi
Xiaoli Z. Fern
Reza Ghaeini
Rasha Obeidat
Prasad Tadepalli
109
21
0
14 Aug 2019
Establishing Strong Baselines for the New Decade: Sequence Tagging,
  Syntactic and Semantic Parsing with BERT
Establishing Strong Baselines for the New Decade: Sequence Tagging, Syntactic and Semantic Parsing with BERT
Han He
Jinho Choi
3DV
84
51
0
14 Aug 2019
Reinforcement Learning Based Graph-to-Sequence Model for Natural
  Question Generation
Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation
Yu Chen
Lingfei Wu
Mohammed J Zaki
GNN
97
156
0
14 Aug 2019
HorNet: A Hierarchical Offshoot Recurrent Network for Improving Person
  Re-ID via Image Captioning
HorNet: A Hierarchical Offshoot Recurrent Network for Improving Person Re-ID via Image Captioning
Shiyang Yan
Jun Xu
Yuai Liu
Lin Xu
104
7
0
14 Aug 2019
An Effective Domain Adaptive Post-Training Method for BERT in Response
  Selection
An Effective Domain Adaptive Post-Training Method for BERT in Response Selection
Taesun Whang
Dongyub Lee
Chanhee Lee
Kisu Yang
Dongsuk Oh
Heuiseok Lim
78
26
0
13 Aug 2019
Fine-grained Information Status Classification Using Discourse
  Context-Aware Self-Attention
Fine-grained Information Status Classification Using Discourse Context-Aware Self-Attention
Yufang Hou
31
0
0
13 Aug 2019
BioFLAIR: Pretrained Pooled Contextualized Embeddings for Biomedical
  Sequence Labeling Tasks
BioFLAIR: Pretrained Pooled Contextualized Embeddings for Biomedical Sequence Labeling Tasks
Shreyas Sharma
Ron Daniel
53
33
0
13 Aug 2019
StructBERT: Incorporating Language Structures into Pre-training for Deep
  Language Understanding
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding
Wei Wang
Bin Bi
Ming Yan
Chen Henry Wu
Zuyi Bao
Jiangnan Xia
Liwei Peng
Luo Si
102
264
0
13 Aug 2019
Tackling Online Abuse: A Survey of Automated Abuse Detection Methods
Tackling Online Abuse: A Survey of Automated Abuse Detection Methods
Pushkar Mishra
H. Yannakoudakis
Ekaterina Shutova
95
79
0
13 Aug 2019
Incorporating Relation Knowledge into Commonsense Reading Comprehension
  with Multi-task Learning
Incorporating Relation Knowledge into Commonsense Reading Comprehension with Multi-task Learning
Jiangnan Xia
Chen Henry Wu
Ming Yan
73
21
0
13 Aug 2019
Generative Question Refinement with Deep Reinforcement Learning in
  Retrieval-based QA System
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System
Ye Liu
Chenwei Zhang
Xiaohui Yan
Yi-Ju Chang
Philip S. Yu
63
20
0
13 Aug 2019
Understanding Spatial Language in Radiology: Representation Framework,
  Annotation, and Spatial Relation Extraction from Chest X-ray Reports using
  Deep Learning
Understanding Spatial Language in Radiology: Representation Framework, Annotation, and Spatial Relation Extraction from Chest X-ray Reports using Deep Learning
Surabhi Datta
Yuqi Si
Laritza M. Rodriguez
S. E. Shooshan
Dina Demner-Fushman
Kirk Roberts
MedIm
64
35
0
13 Aug 2019
On the Convergence of AdaBound and its Connection to SGD
On the Convergence of AdaBound and its Connection to SGD
Pedro H. P. Savarese
ODL
54
19
0
13 Aug 2019
On Identifiability in Transformers
On Identifiability in Transformers
Gino Brunner
Yang Liu
Damian Pascual
Oliver Richter
Massimiliano Ciaramita
Roger Wattenhofer
ViT
107
189
0
12 Aug 2019
Taming Unbalanced Training Workloads in Deep Learning with Partial
  Collective Operations
Taming Unbalanced Training Workloads in Deep Learning with Partial Collective Operations
Shigang Li
Tal Ben-Nun
Salvatore Di Girolamo
Dan Alistarh
Torsten Hoefler
149
59
0
12 Aug 2019
Multimodal Unified Attention Networks for Vision-and-Language
  Interactions
Multimodal Unified Attention Networks for Vision-and-Language Interactions
Zhou Yu
Yuhao Cui
Jun Yu
Dacheng Tao
Q. Tian
109
38
0
12 Aug 2019
TAPER: Time-Aware Patient EHR Representation
TAPER: Time-Aware Patient EHR Representation
Sajad Darabi
Mohammad Kachuee
Shayan Fazeli
Majid Sarrafzadeh
77
57
0
11 Aug 2019
Exploiting Temporal Relationships in Video Moment Localization with
  Natural Language
Exploiting Temporal Relationships in Video Moment Localization with Natural Language
Songyang Zhang
Jinsong Su
Jiebo Luo
65
74
0
11 Aug 2019
Multi-modality Latent Interaction Network for Visual Question Answering
Multi-modality Latent Interaction Network for Visual Question Answering
Peng Gao
Haoxuan You
Zhanpeng Zhang
Xiaogang Wang
Hongsheng Li
69
82
0
10 Aug 2019
A Generate-Validate Approach to Answering Questions about Qualitative
  Relationships
A Generate-Validate Approach to Answering Questions about Qualitative Relationships
Arindam Mitra
Chitta Baral
Aurgho Bhattacharjee
Ishan Shrivastava
32
6
0
09 Aug 2019
VisualBERT: A Simple and Performant Baseline for Vision and Language
VisualBERT: A Simple and Performant Baseline for Vision and Language
Liunian Harold Li
Mark Yatskar
Da Yin
Cho-Jui Hsieh
Kai-Wei Chang
VLM
267
1,975
0
09 Aug 2019
BERT-based Ranking for Biomedical Entity Normalization
BERT-based Ranking for Biomedical Entity Normalization
Zongcheng Ji
Qiang Wei
Hua Xu
OODMedIm
68
126
0
09 Aug 2019
Neural Image Compression and Explanation
Neural Image Compression and Explanation
Xiang Li
Shihao Ji
35
10
0
09 Aug 2019
Uncheatable Machine Learning Inference
Uncheatable Machine Learning Inference
Mustafa Canim
A. Kundu
Josh Payne
42
1
0
08 Aug 2019
On the Variance of the Adaptive Learning Rate and Beyond
On the Variance of the Adaptive Learning Rate and Beyond
Liyuan Liu
Haoming Jiang
Pengcheng He
Weizhu Chen
Xiaodong Liu
Jianfeng Gao
Jiawei Han
ODL
397
1,916
0
08 Aug 2019
CRIC: A VQA Dataset for Compositional Reasoning on Vision and
  Commonsense
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense
Difei Gao
Ruiping Wang
Shiguang Shan
Xilin Chen
CoGeLRM
129
28
0
08 Aug 2019
Do Neural Language Representations Learn Physical Commonsense?
Do Neural Language Representations Learn Physical Commonsense?
Maxwell Forbes
Ari Holtzman
Yejin Choi
NAILRMAI4CE
58
110
0
08 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for
  Vision-and-Language Tasks
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSLVLM
415
3,720
0
06 Aug 2019
Predicting Prosodic Prominence from Text with Pre-trained Contextualized
  Word Representations
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations
Aarne Talman
Antti Suni
H. Çelikkanat
Sofoklis Kakouros
Jörg Tiedemann
M. Vainio
77
31
0
06 Aug 2019
Classification of Hand Movements from EEG using a Deep Attention-based
  LSTM Network
Classification of Hand Movements from EEG using a Deep Attention-based LSTM Network
Guangyi Zhang
Vandad Davoodnia
Alireza Sepas-Moghaddam
Yaoxue Zhang
Ali Etemad
83
130
0
06 Aug 2019
Clustering of Deep Contextualized Representations for Summarization of
  Biomedical Texts
Clustering of Deep Contextualized Representations for Summarization of Biomedical Texts
M. Moradi
Matthias Samwald
28
7
0
06 Aug 2019
Triplet Based Embedding Distance and Similarity Learning for
  Text-independent Speaker Verification
Triplet Based Embedding Distance and Similarity Learning for Text-independent Speaker Verification
Zongze Ren
Zhiyong Chen
Shugong Xu
33
8
0
06 Aug 2019
Dialog State Tracking: A Neural Reading Comprehension Approach
Dialog State Tracking: A Neural Reading Comprehension Approach
Shuyang Gao
Abhishek Sethi
Sanchit Agarwal
Tagyoung Chung
Dilek Z. Hakkani-Tür
77
161
0
06 Aug 2019
Exploring Neural Net Augmentation to BERT for Question Answering on
  SQUAD 2.0
Exploring Neural Net Augmentation to BERT for Question Answering on SQUAD 2.0
Suhas Gupta
AI4MH
23
1
0
04 Aug 2019
Semi-supervised Thai Sentence Segmentation Using Local and Distant Word
  Representations
Semi-supervised Thai Sentence Segmentation Using Local and Distant Word Representations
Chanatip Saetia
Ekapol Chuangsuwanich
Tawunrat Chalothorn
P. Vateekul
74
5
0
04 Aug 2019
TABOR: A Highly Accurate Approach to Inspecting and Restoring Trojan
  Backdoors in AI Systems
TABOR: A Highly Accurate Approach to Inspecting and Restoring Trojan Backdoors in AI Systems
Wenbo Guo
Lun Wang
Masashi Sugiyama
Min Du
Basel Alomair
96
231
0
02 Aug 2019
DELTA: A DEep learning based Language Technology plAtform
DELTA: A DEep learning based Language Technology plAtform
Kun Han
Junwen Chen
Hui Zhang
Haiyang Xu
Yiping Peng
...
Cheng Gong
Yunbo Wang
Wei Zou
Hui Song
Xiangang Li
VLM
18
10
0
02 Aug 2019
MSnet: A BERT-based Network for Gendered Pronoun Resolution
MSnet: A BERT-based Network for Gendered Pronoun Resolution
Zili Wang
47
4
0
01 Aug 2019
GraphFlow: Exploiting Conversation Flow with Graph Neural Networks for
  Conversational Machine Comprehension
GraphFlow: Exploiting Conversation Flow with Graph Neural Networks for Conversational Machine Comprehension
Yu Chen
Lingfei Wu
Mohammed J Zaki
79
76
0
31 Jul 2019
On Mutual Information Maximization for Representation Learning
On Mutual Information Maximization for Representation Learning
Michael Tschannen
Josip Djolonga
Paul Kishan Rubenstein
Sylvain Gelly
Mario Lucic
SSL
206
502
0
31 Jul 2019
What BERT is not: Lessons from a new suite of psycholinguistic
  diagnostics for language models
What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models
Allyson Ettinger
128
610
0
31 Jul 2019
Previous
123...456457458...470471472
Next