ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 19,786 papers shown
Title
Semantics-aware BERT for Language Understanding
Semantics-aware BERT for Language Understanding
Zhuosheng Zhang
Yuwei Wu
Zhao Hai
Z. Li
Shuailiang Zhang
Xi Zhou
Xiang Zhou
26
365
0
05 Sep 2019
NERO: A Neural Rule Grounding Framework for Label-Efficient Relation
  Extraction
NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction
Wenxuan Zhou
Hongtao Lin
Bill Yuchen Lin
Ziqi Wang
Junyi Du
Leonardo Neves
Xiang Ren
NAI
50
54
0
05 Sep 2019
KagNet: Knowledge-Aware Graph Networks for Commonsense Reasoning
KagNet: Knowledge-Aware Graph Networks for Commonsense Reasoning
Bill Yuchen Lin
Xinyue Chen
Jamin Chen
Xiang Ren
31
460
0
04 Sep 2019
An Evaluation Dataset for Intent Classification and Out-of-Scope
  Prediction
An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction
Stefan Larson
Anish Mahendran
Joseph Peper
Christopher Clarke
Andrew Lee
...
Jonathan K. Kummerfeld
Kevin Leach
M. Laurenzano
Lingjia Tang
Jason Mars
31
518
0
04 Sep 2019
From 'F' to Á' on the N.Y. Regents Science Exams: An Overview of the
  Aristo Project
From 'F' to Á' on the N.Y. Regents Science Exams: An Overview of the Aristo Project
Peter Clark
Oren Etzioni
Daniel Khashabi
Tushar Khot
Bhavana Dalvi
...
Niket Tandon
Sumithra Bhakthavatsalam
Dirk Groeneveld
Michal Guerquin
Michael Schmitz
ELM
31
99
0
04 Sep 2019
ParaQG: A System for Generating Questions and Answers from Paragraphs
ParaQG: A System for Generating Questions and Answers from Paragraphs
Vishwajeet Kumar
Sivaanandh Muneeswaran
Ganesh Ramakrishnan
Yuan-Fang Li
24
9
0
04 Sep 2019
Answers Unite! Unsupervised Metrics for Reinforced Summarization Models
Answers Unite! Unsupervised Metrics for Reinforced Summarization Models
Thomas Scialom
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
32
149
0
04 Sep 2019
Towards Better Modeling Hierarchical Structure for Self-Attention with
  Ordered Neurons
Towards Better Modeling Hierarchical Structure for Self-Attention with Ordered Neurons
Jie Hao
Xing Wang
Shuming Shi
Jinfeng Zhang
Zhaopeng Tu
34
12
0
04 Sep 2019
CrossWeigh: Training Named Entity Tagger from Imperfect Annotations
CrossWeigh: Training Named Entity Tagger from Imperfect Annotations
Zihan Wang
Jingbo Shang
Liyuan Liu
Lihao Lu
Jiacheng Liu
Jiawei Han
NoLa
24
102
0
03 Sep 2019
The Bottom-up Evolution of Representations in the Transformer: A Study
  with Machine Translation and Language Modeling Objectives
The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives
Elena Voita
Rico Sennrich
Ivan Titov
232
184
0
03 Sep 2019
Better Rewards Yield Better Summaries: Learning to Summarise Without
  References
Better Rewards Yield Better Summaries: Learning to Summarise Without References
F. Böhm
Yang Gao
Christian M. Meyer
Ori Shapira
Ido Dagan
Iryna Gurevych
38
107
0
03 Sep 2019
Encode, Tag, Realize: High-Precision Text Editing
Encode, Tag, Realize: High-Precision Text Editing
Eric Malmi
Sebastian Krause
S. Rothe
Daniil Mirylenka
Aliaksei Severyn
3DV
32
170
0
03 Sep 2019
Language Models as Knowledge Bases?
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
483
2,618
0
03 Sep 2019
Unicoder: A Universal Language Encoder by Pre-training with Multiple
  Cross-lingual Tasks
Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks
Haoyang Huang
Yaobo Liang
Nan Duan
Ming Gong
Linjun Shou
Daxin Jiang
M. Zhou
47
230
0
03 Sep 2019
Editing-Based SQL Query Generation for Cross-Domain Context-Dependent
  Questions
Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions
Rui Zhang
Tao Yu
H. Er
Sungrok Shim
Eric Xue
Xi Lin
Tianze Shi
Caiming Xiong
R. Socher
Dragomir R. Radev
32
146
0
02 Sep 2019
Scalable and Accurate Dialogue State Tracking via Hierarchical Sequence
  Generation
Scalable and Accurate Dialogue State Tracking via Hierarchical Sequence Generation
Liliang Ren
Jianmo Ni
Julian McAuley
40
80
0
02 Sep 2019
SumQE: a BERT-based Summary Quality Estimation Model
SumQE: a BERT-based Summary Quality Estimation Model
Stratos Xenouleas
Prodromos Malakasiotis
Marianna Apidianaki
Ion Androutsopoulos
28
37
0
02 Sep 2019
Syntax-aware Multilingual Semantic Role Labeling
Syntax-aware Multilingual Semantic Role Labeling
Shexia He
Z. Li
Zhao Hai
33
49
0
01 Sep 2019
Cosmos QA: Machine Reading Comprehension with Contextual Commonsense
  Reasoning
Cosmos QA: Machine Reading Comprehension with Contextual Commonsense Reasoning
Lifu Huang
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
AIMat
RALM
LRM
43
448
0
31 Aug 2019
Humor Detection: A Transformer Gets the Last Laugh
Humor Detection: A Transformer Gets the Last Laugh
Orion Weller
Kevin Seppi
72
122
0
31 Aug 2019
WSLLN: Weakly Supervised Natural Language Localization Networks
WSLLN: Weakly Supervised Natural Language Localization Networks
M. Gao
L. Davis
R. Socher
Caiming Xiong
21
80
0
31 Aug 2019
NEZHA: Neural Contextualized Representation for Chinese Language
  Understanding
NEZHA: Neural Contextualized Representation for Chinese Language Understanding
Junqiu Wei
Xiaozhe Ren
Xiaoguang Li
Wenyong Huang
Yi-Lun Liao
Yasheng Wang
Jianghao Lin
Xin Jiang
Xiao Chen
Qun Liu
8
116
0
31 Aug 2019
Benchmarking Zero-shot Text Classification: Datasets, Evaluation and
  Entailment Approach
Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach
Wenpeng Yin
Jamaal Hay
Dan Roth
65
542
0
31 Aug 2019
Adversarial Learning with Contextual Embeddings for Zero-resource
  Cross-lingual Classification and NER
Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER
Phillip Keung
Y. Lu
Vikas Bhardwaj
39
81
0
31 Aug 2019
Deep Reinforcement Learning with Distributional Semantic Rewards for
  Abstractive Summarization
Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization
Siyao Li
Deren Lei
Pengda Qin
William Yang Wang
19
43
0
31 Aug 2019
A Logic-Driven Framework for Consistency of Neural Models
A Logic-Driven Framework for Consistency of Neural Models
Tao Li
Vivek Gupta
Maitrey Mehta
Vivek Srikumar
AI4CE
34
101
0
31 Aug 2019
A Semantics-Assisted Video Captioning Model Trained with Scheduled
  Sampling
A Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling
Haoran Chen
Ke Lin
A. Maye
Jianmin Li
Xiaoling Hu
30
47
0
31 Aug 2019
(Male, Bachelor) and (Female, Ph.D) have different connotations:
  Parallelly Annotated Stylistic Language Dataset with Multiple Personas
(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Annotated Stylistic Language Dataset with Multiple Personas
Dongyeop Kang
Varun Gangal
Eduard H. Hovy
29
16
0
31 Aug 2019
Exploring Domain Shift in Extractive Text Summarization
Exploring Domain Shift in Extractive Text Summarization
Danqing Wang
Pengfei Liu
Ming Zhong
Jie Fu
Xipeng Qiu
Xuanjing Huang
34
36
0
30 Aug 2019
Shallow Syntax in Deep Water
Shallow Syntax in Deep Water
Swabha Swayamdipta
Matthew E. Peters
Brendan Roof
Chris Dyer
Noah A. Smith
29
10
0
29 Aug 2019
Neural Snowball for Few-Shot Relation Learning
Neural Snowball for Few-Shot Relation Learning
Tianyu Gao
Xu Han
Ruobing Xie
Zhiyuan Liu
Fen Lin
Leyu Lin
Maosong Sun
47
79
0
29 Aug 2019
Interactive Language Learning by Question Answering
Interactive Language Learning by Question Answering
Xingdi Yuan
Marc-Alexandre Côté
Jie Fu
Zhouhan Lin
C. Pal
Yoshua Bengio
Adam Trischler
34
47
0
28 Aug 2019
Analyzing Customer Feedback for Product Fit Prediction
Analyzing Customer Feedback for Product Fit Prediction
S. Baier
26
4
0
28 Aug 2019
Stochastic AUC Maximization with Deep Neural Networks
Stochastic AUC Maximization with Deep Neural Networks
Mingrui Liu
Zhuoning Yuan
Yiming Ying
Tianbao Yang
32
103
0
28 Aug 2019
Ensemble-Based Deep Reinforcement Learning for Chatbots
Ensemble-Based Deep Reinforcement Learning for Chatbots
Heriberto Cuayáhuitl
Donghyeon Lee
Seonghan Ryu
Yongjin Cho
Sungja Choi
Satish Reddy Indurthi
Seunghak Yu
Hyungtak Choi
Inchul Hwang
J. Kim
OffRL
29
69
0
27 Aug 2019
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers
Iryna Gurevych
132
11,894
0
27 Aug 2019
Real-world Conversational AI for Hotel Bookings
Real-world Conversational AI for Hotel Bookings
Bai Li
Nanyi Jiang
Joey Sham
Henry Shi
Hussein Fazal
34
15
0
27 Aug 2019
Improving Automatic Jazz Melody Generation by Transfer Learning
  Techniques
Improving Automatic Jazz Melody Generation by Transfer Learning Techniques
Hsiao-Tzu Hung
Chung-Yang Wang
Yi-Hsuan Yang
Hsin-Min Wang
17
19
0
26 Aug 2019
Patient Knowledge Distillation for BERT Model Compression
Patient Knowledge Distillation for BERT Model Compression
S. Sun
Yu Cheng
Zhe Gan
Jingjing Liu
83
832
0
25 Aug 2019
Adversarial Domain Adaptation for Machine Reading Comprehension
Adversarial Domain Adaptation for Machine Reading Comprehension
Huazheng Wang
Zhe Gan
Xiaodong Liu
Jingjing Liu
Jianfeng Gao
Hongning Wang
30
64
0
24 Aug 2019
DGSAN: Discrete Generative Self-Adversarial Network
DGSAN: Discrete Generative Self-Adversarial Network
Ehsan Montahaei
Danial Alihosseini
M. Baghshah
19
13
0
24 Aug 2019
BERT for Coreference Resolution: Baselines and Analysis
BERT for Coreference Resolution: Baselines and Analysis
Mandar Joshi
Omer Levy
Daniel S. Weld
Luke Zettlemoyer
28
320
0
24 Aug 2019
Fairness in Deep Learning: A Computational Perspective
Fairness in Deep Learning: A Computational Perspective
Mengnan Du
Fan Yang
Na Zou
Xia Hu
FaML
FedML
15
230
0
23 Aug 2019
Spiking Neural Predictive Coding for Continual Learning from Data
  Streams
Spiking Neural Predictive Coding for Continual Learning from Data Streams
Alexander Ororbia
43
25
0
23 Aug 2019
Unsupervised Text Summarization via Mixed Model Back-Translation
Unsupervised Text Summarization via Mixed Model Back-Translation
Yacine Jernite
46
2
0
22 Aug 2019
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
Weijie Su
Xizhou Zhu
Yue Cao
Bin Li
Lewei Lu
Furu Wei
Jifeng Dai
VLM
MLLM
SSL
87
1,652
0
22 Aug 2019
Unsupervised Lemmatization as Embeddings-Based Word Clustering
Unsupervised Lemmatization as Embeddings-Based Word Clustering
Rudolf Rosa
Z. Žabokrtský
20
10
0
22 Aug 2019
The many Shapley values for model explanation
The many Shapley values for model explanation
Mukund Sundararajan
A. Najmi
TDI
FAtt
11
625
0
22 Aug 2019
Text Summarization with Pretrained Encoders
Text Summarization with Pretrained Encoders
Yang Liu
Mirella Lapata
MILM
283
1,437
0
22 Aug 2019
Denoising based Sequence-to-Sequence Pre-training for Text Generation
Denoising based Sequence-to-Sequence Pre-training for Text Generation
Liang Wang
Wei Zhao
Ruoyu Jia
Sujian Li
Jingming Liu
VLM
AI4CE
42
37
0
22 Aug 2019
Previous
123...384385386...394395396
Next