ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 19,786 papers shown
Dual Co-Matching Network for Multi-choice Reading Comprehension
Shuailiang Zhang
Zhao Hai
Yuwei Wu
Zhuosheng Zhang
Xi Zhou
Xiaoping Zhou
44
131
0
27 Jan 2019
Deep Learning on Small Datasets without Pre-Training using Cosine Loss
Björn Barz
Joachim Denzler
32
130
0
25 Jan 2019
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
Jinhyuk Lee
Wonjin Yoon
Sungdong Kim
Donghyeon Kim
Sunkyu Kim
Chan Ho So
Jaewoo Kang
OOD
90
5,545
0
25 Jan 2019
A BERT Baseline for the Natural Questions
Chris Alberti
Kenton Lee
Michael Collins
ELM
AI4MH
25
127
0
24 Jan 2019
Large-Batch Training for LSTM and Beyond
Yang You
Jonathan Hseu
Chris Ying
J. Demmel
Kurt Keutzer
Cho-Jui Hsieh
33
89
0
24 Jan 2019
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
Thomas Wolf
Victor Sanh
Julien Chaumond
Clement Delangue
45
493
0
23 Jan 2019
A Question-Entailment Approach to Question Answering
Asma Ben Abacha
Dina Demner-Fushman
32
191
0
23 Jan 2019
Programmable Neural Network Trojan for Pre-Trained Feature Extractor
Yu Ji
Zixin Liu
Xing Hu
Peiqi Wang
Youhui Zhang
AAML
27
17
0
23 Jan 2019
Deep learning and sub-word-unit approach in written art generation
K. Wołk
Emilia Zawadzka-Gosk
Wojciech Czarnowski
27
1
0
22 Jan 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
25
2,721
0
22 Jan 2019
Mixed Formal Learning: A Path to Transparent Machine Learning
Sandra Carrico
AI4CE
19
1
0
20 Jan 2019
Physics-Constrained Deep Learning for High-dimensional Surrogate Modeling and Uncertainty Quantification without Labeled Data
Yinhao Zhu
N. Zabaras
P. Koutsourelakis
P. Perdikaris
PINN
AI4CE
51
857
0
18 Jan 2019
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!
Braden Hancock
Antoine Bordes
Pierre-Emmanuel Mazaré
Jason Weston
57
190
0
16 Jan 2019
Assessing BERT's Syntactic Abilities
Yoav Goldberg
25
494
0
16 Jan 2019
Sentence transition matrix: An efficient approach that preserves sentence semantics
Myeongjun Jang
Pilsung Kang
19
2
0
16 Jan 2019
Investigating Antigram Behaviour using Distributional Semantics
Saptarshi Sengupta
16
0
0
15 Jan 2019
Exploiting Synchronized Lyrics And Vocal Features For Music Emotion Detection
Loreto Parisi
Simone Francia
Silvio Olivastri
Maria Stella Tavella
26
11
0
15 Jan 2019
Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks using PAC-Bayesian Analysis
Yusuke Tsuzuku
Issei Sato
Masashi Sugiyama
35
76
0
15 Jan 2019
Passage Re-ranking with BERT
Rodrigo Nogueira
Kyunghyun Cho
OOD
72
1,078
0
13 Jan 2019
Linguistic Analysis of Pretrained Sentence Encoders with Acceptability Judgments
Alex Warstadt
Samuel R. Bowman
22
23
0
11 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
48
3,700
0
09 Jan 2019
On the Possibilities and Limitations of Multi-hop Reasoning Under Linguistic Imperfections
Daniel Khashabi
Erfan Sadeqi Azer
Tushar Khot
Ashish Sabharwal
Dan Roth
LRM
17
8
0
08 Jan 2019
Multi-style Generative Reading Comprehension
Kyosuke Nishida
Itsumi Saito
Kosuke Nishida
Kazutoshi Shinoda
Atsushi Otsuka
Hisako Asano
J. Tomita
32
71
0
08 Jan 2019
Feature reinforcement with word embedding and parsing information in neural TTS
Huaiping Ming
Lei He
Haohan Guo
Frank Soong
82
15
0
03 Jan 2019
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation
Cristina Garbacea
Samuel Carton
Shiyan Yan
Qiaozhu Mei
ELM
35
30
0
02 Jan 2019
Text Infilling
Wanrong Zhu
Zhiting Hu
Eric Xing
38
62
0
01 Jan 2019
A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing
Dat Quoc Nguyen
40
12
0
30 Dec 2018
Adversarial Attack and Defense on Graph Data: A Survey
Lichao Sun
Yingtong Dou
Carl Yang
Ji Wang
Yixin Liu
Philip S. Yu
Lifang He
Yangqiu Song
GNN
AAML
42
276
0
26 Dec 2018
Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification
Mozhi Zhang
Yoshinari Fujinuma
Jordan L. Boyd-Graber
33
21
0
22 Dec 2018
Joint Slot Filling and Intent Detection via Capsule Neural Networks
Chenwei Zhang
Yaliang Li
Nan Du
Wei Fan
Philip S. Yu
19
234
0
22 Dec 2018
A Survey on Deep Learning for Named Entity Recognition
Junlin Li
Aixin Sun
Jianglei Han
Chenliang Li
3DV
52
1,150
0
22 Dec 2018
Graph Neural Networks: A Review of Methods and Applications
Jie Zhou
Ganqu Cui
Shengding Hu
Zhengyan Zhang
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
AI4CE
GNN
137
5,433
0
20 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
36
364
0
13 Dec 2018
Detecting weak and strong Islamophobic hate speech on social media
Bertie Vidgen
T. Yasseri
22
138
0
12 Dec 2018
SMIT: Stochastic Multi-Label Image-to-Image Translation
Andrés Romero
Pablo Arbelaez
Luc Van Gool
Radu Timofte
25
66
0
10 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
92
696
0
06 Dec 2018
Efficient Attention: Attention with Linear Complexities
Zhuoran Shen
Mingyuan Zhang
Haiyu Zhao
Shuai Yi
Hongsheng Li
61
518
0
04 Dec 2018
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
100
872
0
27 Nov 2018
Synergistic Drug Combination Prediction by Integrating Multi-omics Data in Deep Learning Models
Tianyu Zhang
Liwei Zhang
Philip R. O. Payne
Fuhai Li
30
96
0
16 Nov 2018
Survey of Computational Approaches to Lexical Semantic Change
Nina Tahmasebi
L. Borin
Adam Jatowt
52
163
0
15 Nov 2018
Extractive Summary as Discrete Latent Variables
Aran Komatsuzaki
26
3
0
14 Nov 2018
An Introductory Survey on Attention Mechanisms in NLP Problems
Dichao Hu
AIMat
32
246
0
12 Nov 2018
Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency
Won Ik Cho
Hyeon Seung Lee
J. Yoon
Seokhwan Kim
N. Kim
56
5
0
10 Nov 2018
Elastic CRFs for Open-ontology Slot Filling
Yinpei Dai
Yichi Zhang
Hong Liu
Zhijian Ou
Yanmeng Wang
Junlan Feng
63
2
0
04 Nov 2018
Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs
Gaurav Maheshwari
Priyansh Trivedi
Denis Lukovnikov
Nilesh Chakraborty
Asja Fischer
Jens Lehmann
GNN
31
72
0
02 Nov 2018
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks
Jason Phang
Thibault Févry
Samuel R. Bowman
33
467
0
02 Nov 2018
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
79
1,653
0
02 Nov 2018
On the Generation of Medical Question-Answer Pairs
Sheng Shen
Yaliang Li
Nan Du
X. Wu
Yusheng Xie
Shen Ge
Tao Yang
Kai Wang
Xin-Fang Liang
Wei Fan
MedIm
18
21
0
01 Nov 2018
Improving Machine Reading Comprehension with General Reading Strategies
Kai Sun
Dian Yu
Dong Yu
Claire Cardie
AI4CE
47
116
0
31 Oct 2018
A Pragmatic Guide to Geoparsing Evaluation
Milan Gritta
Mohammad Taher Pilehvar
Nigel Collier
19
67
0
29 Oct 2018