BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 18,599 papers shown
Time Series Forecasting With Deep Learning: A Survey
Bryan Lim
S. Zohren
AI4TS
AI4CE
59
1,192
0
28 Apr 2020
Self-Attention with Cross-Lingual Position Representation
Liang Ding
Longyue Wang
Dacheng Tao
MILM
33
37
0
28 Apr 2020
VD-BERT: A Unified Vision and Dialog Transformer with BERT
Yue Wang
Chenyu You
Michael R. Lyu
Irwin King
Caiming Xiong
Guosheng Lin
24
102
0
28 Apr 2020
Deep Conversational Recommender Systems: A New Frontier for Goal-Oriented Dialogue Systems
Dai Hoang Tran
Quan Z. Sheng
W. Zhang
S. Hamad
Munazza Zaib
Nguyen H. Tran
Lina Yao
N. Khoa
16
6
0
28 Apr 2020
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
Ji Xin
Raphael Tang
Jaejun Lee
Yaoliang Yu
Jimmy J. Lin
17
365
0
27 Apr 2020
FlexSA: Flexible Systolic Array Architecture for Efficient Pruned DNN Model Training
Sangkug Lym
M. Erez
21
25
0
27 Apr 2020
Empirical Bayes Transductive Meta-Learning with Synthetic Gradients
S. Hu
Pablo G. Moreno
Yanghua Xiao
Xin Shen
G. Obozinski
Neil D. Lawrence
Andreas C. Damianou
BDL
30
125
0
27 Apr 2020
Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting
Sanyuan Chen
Yutai Hou
Yiming Cui
Wanxiang Che
Ting Liu
Xiangzhan Yu
KELM
CLL
21
214
0
27 Apr 2020
Sequential Interpretability: Methods, Applications, and Future Direction for Understanding Deep Learning Models in the Context of Sequential Data
B. Shickel
Parisa Rashidi
AI4TS
33
17
0
27 Apr 2020
Single-/Multi-Source Cross-Lingual NER via Teacher-Student Learning on Unlabeled Data in Target Language
Qianhui Wu
Zijia Lin
Börje F. Karlsson
Jian-Guang Lou
Biqing Huang
24
69
0
26 Apr 2020
GLUECoS : An Evaluation Benchmark for Code-Switched NLP
Simran Khanuja
Sandipan Dandapat
A. Srinivasan
Sunayana Sitaram
Monojit Choudhury
ELM
29
142
0
26 Apr 2020
Multi-Domain Dialogue Acts and Response Co-Generation
Kai Wang
Junfeng Tian
Rui Wang
Xiaojun Quan
Jianxing Yu
8
58
0
26 Apr 2020
Relational Graph Attention Network for Aspect-based Sentiment Analysis
Kai Wang
Weizhou Shen
Yunyi Yang
Xiaojun Quan
Rui Wang
43
547
0
26 Apr 2020
Dual Learning for Semi-Supervised Natural Language Understanding
Su Zhu
Ruisheng Cao
Kai Yu
45
31
0
26 Apr 2020
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching
Liu Yang
Mingyang Zhang
Cheng Li
Michael Bendersky
Marc Najork
38
87
0
26 Apr 2020
A Perspective on Deep Learning for Molecular Modeling and Simulations
Jun Zhang
Yao-Kun Lei
Zhen Zhang
Junhan Chang
Maodong Li
Xu Han
Lijiang Yang
Yue Yang
Y. Gao
AI4CE
42
8
0
25 Apr 2020
How Does NLP Benefit Legal System: A Summary of Legal Artificial Intelligence
Haoxiang Zhong
Chaojun Xiao
Cunchao Tu
Tianyang Zhang
Zhiyuan Liu
Maosong Sun
AILaw
67
313
0
25 Apr 2020
Reevaluating Adversarial Examples in Natural Language
John X. Morris
Eli Lifland
Jack Lanchantin
Yangfeng Ji
Yanjun Qi
SILM
AAML
20
111
0
25 Apr 2020
Syntactic Data Augmentation Increases Robustness to Inference Heuristics
Junghyun Min
R. Thomas McCoy
Dipanjan Das
Emily Pitler
Tal Linzen
35
175
0
24 Apr 2020
Template-Based Question Generation from Retrieved Sentences for Improved Unsupervised Question Answering
Alexander R. Fabbri
Patrick Ng
Zhiguo Wang
Ramesh Nallapati
Bing Xiang
32
77
0
24 Apr 2020
Lite Transformer with Long-Short Range Attention
Zhanghao Wu
Zhijian Liu
Ji Lin
Chengyue Wu
Song Han
23
318
0
24 Apr 2020
MoVie: Revisiting Modulated Convolutions for Visual Counting and Beyond
Duy-Kien Nguyen
Vedanuj Goswami
Xinlei Chen
39
23
0
24 Apr 2020
Learning the grammar of drug prescription: recurrent neural network grammars for medication information extraction in clinical texts
Ivan Lerner
Jordan Jouffroy
Anita Burgun
A. Neuraz
27
9
0
24 Apr 2020
Generative Data Augmentation for Commonsense Reasoning
Yiben Yang
Chaitanya Malaviya
Jared Fernandez
Swabha Swayamdipta
Ronan Le Bras
Ji-ping Wang
Chandra Bhagavatula
Yejin Choi
Doug Downey
LRM
30
91
0
24 Apr 2020
Distilling Knowledge for Fast Retrieval-based Chat-bots
Amir Vakili Tahami
Kamyar Ghajar
A. Shakery
32
31
0
23 Apr 2020
QURIOUS: Question Generation Pretraining for Text Generation
Shashi Narayan
Gonçalo Simões
Ji Ma
Hannah Craighead
Ryan T. McDonald
37
15
0
23 Apr 2020
A Review of Winograd Schema Challenge Datasets and Approaches
Vid Kocijan
Thomas Lukasiewicz
E. Davis
G. Marcus
L. Morgenstern
25
44
0
23 Apr 2020
Residual Energy-Based Models for Text Generation
Yuntian Deng
A. Bakhtin
Myle Ott
Arthur Szlam
Marc'Aurelio Ranzato
22
126
0
22 Apr 2020
AmbigQA: Answering Ambiguous Open-domain Questions
Sewon Min
Julian Michael
Hannaneh Hajishirzi
Luke Zettlemoyer
52
296
0
22 Apr 2020
Contextualised Graph Attention for Improved Relation Extraction
Angrosh Mandya
Danushka Bollegala
Frans Coenen
GNN
26
6
0
22 Apr 2020
Keyphrase Prediction With Pre-trained Language Model
R. Liu
Zheng Lin
Weiping Wang
28
17
0
22 Apr 2020
Logical Natural Language Generation from Open-Domain Tables
Wenhu Chen
Jianshu Chen
Yunde Su
Zhiyu Zoey Chen
William Yang Wang
LMTD
34
155
0
22 Apr 2020
Textual Visual Semantic Dataset for Text Spotting
Ahmed Sabir
Francesc Moreno-Noguer
Lluís Padró
21
3
0
21 Apr 2020
Deep Learning for Time Series Forecasting: Tutorial and Literature Survey
Konstantinos Benidis
Syama Sundar Rangapuram
Valentin Flunkert
Bernie Wang
Danielle C. Maddix
...
David Salinas
Lorenzo Stella
François-Xavier Aubet
Laurent Callot
Tim Januschowski
AI4TS
30
176
0
21 Apr 2020
MT-Clinical BERT: Scaling Clinical Information Extraction with Multitask Learning
Andriy Mulyar
Bridget T. McInnes
24
52
0
21 Apr 2020
Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning
Ryan Julian
Benjamin Swanson
Gaurav Sukhatme
Sergey Levine
Chelsea Finn
Karol Hausman
OnRL
CLL
35
43
0
21 Apr 2020
Logic-Guided Data Augmentation and Regularization for Consistent Question Answering
Akari Asai
Hannaneh Hajishirzi
NAI
21
111
0
21 Apr 2020
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
24
351
0
21 Apr 2020
Vector Quantized Contrastive Predictive Coding for Template-based Music Generation
Gaëtan Hadjeres
Léopold Crestel
34
18
0
21 Apr 2020
Curriculum Pre-training for End-to-End Speech Translation
Chengyi Wang
Yu Wu
Shujie Liu
Ming Zhou
Zhenglu Yang
29
108
0
21 Apr 2020
The Ivory Tower Lost: How College Students Respond Differently than the General Public to the COVID-19 Pandemic
Viet-An Duong
Phu Pham
Tongyu Yang
Yu Wang
Jiebo Luo
AI4CE
27
90
0
21 Apr 2020
DIET: Lightweight Language Understanding for Dialogue Systems
Tanja Bunk
Daksh Varshneya
Vladimir Vlasov
Alan Nichol
29
160
0
21 Apr 2020
A Generic Network Compression Framework for Sequential Recommender Systems
Yang Sun
Fajie Yuan
Ming Yang
Guoao Wei
Zhou Zhao
Duo Liu
26
54
0
21 Apr 2020
Train No Evil: Selective Masking for Task-Guided Pre-Training
Yuxian Gu
Zhengyan Zhang
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
32
59
0
21 Apr 2020
Word Embedding-based Text Processing for Comprehensive Summarization and Distinct Information Extraction
Xiangpeng Wan
Hakim Ghazzai
Y. Massoud
21
2
0
21 Apr 2020
Leveraging Personal Navigation Assistant Systems Using Automated Social Media Traffic Reporting
Xiangpeng Wan
Hakim Ghazzai
Y. Massoud
21
1
0
21 Apr 2020
Mirror Ritual: An Affective Interface for Emotional Self-Reflection
Nina Rajcic
Jon McCormack
19
48
0
21 Apr 2020
StereoSet: Measuring stereotypical bias in pretrained language models
Moin Nadeem
Anna Bethke
Siva Reddy
37
961
0
20 Apr 2020
A Study of Cross-Lingual Ability and Language-specific Information in Multilingual BERT
Chi-Liang Liu
Tsung-Yuan Hsu
Yung-Sung Chuang
Hung-yi Lee
34
14
0
20 Apr 2020
The State and Fate of Linguistic Diversity and Inclusion in the NLP World
Pratik M. Joshi
Sebastin Santy
A. Budhiraja
Kalika Bali
Monojit Choudhury
LMTD
31
806
0
20 Apr 2020