ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

30 / 23,780 papers shown
Title
Exploiting Invertible Decoders for Unsupervised Sentence Representation
  Learning
Exploiting Invertible Decoders for Unsupervised Sentence Representation Learning
Shuai Tang
V. D. Sa
SSL
74
1
0
08 Sep 2018
Texar: A Modularized, Versatile, and Extensible Toolkit for Text
  Generation
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation
Zhiting Hu
Haoran Shi
Bowen Tan
Wentao Wang
Zichao Yang
...
Zhengzhong Liu
Xiaodan Liang
Wangrong Zhu
Devendra Singh Sachan
Eric Xing
VLM
155
56
0
04 Sep 2018
Question Answering by Reasoning Across Documents with Graph
  Convolutional Networks
Question Answering by Reasoning Across Documents with Graph Convolutional Networks
Nicola De Cao
Wilker Aziz
Ivan Titov
BDLRALMGNN
142
226
0
29 Aug 2018
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive
  Meaning Representations
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations
Mohammad Taher Pilehvar
Jose Camacho-Collados
229
493
0
28 Aug 2018
CoQA: A Conversational Question Answering Challenge
CoQA: A Conversational Question Answering Challenge
Siva Reddy
Danqi Chen
Christopher D. Manning
RALMHAI
176
1,213
0
21 Aug 2018
The Influence of Down-Sampling Strategies on SVD Word Embedding
  Stability
The Influence of Down-Sampling Strategies on SVD Word Embedding Stability
Johannes Hellrich
B. Kampe
U. Hahn
56
10
0
21 Aug 2018
Gender Bias in Neural Natural Language Processing
Gender Bias in Neural Natural Language Processing
Kaiji Lu
Piotr (Peter) Mardziel
Fangjing Wu
Preetam Amancharla
Anupam Datta
128
362
0
31 Jul 2018
Video Storytelling: Textual Summaries for Events
Video Storytelling: Textual Summaries for Events
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
DiffM
72
47
0
25 Jul 2018
Modeling Word Emotion in Historical Language: Quantity Beats Supposed
  Stability in Seed Word Selection
Modeling Word Emotion in Historical Language: Quantity Beats Supposed Stability in Seed Word Selection
Johannes Hellrich
Sven Buechel
U. Hahn
55
7
0
21 Jun 2018
DRCD: a Chinese Machine Reading Comprehension Dataset
DRCD: a Chinese Machine Reading Comprehension Dataset
C. Shao
Trois Liu
Yuting Lai
Yiying Tseng
Sam S. Tsai
102
128
0
04 Jun 2018
Like a Baby: Visually Situated Neural Language Acquisition
Like a Baby: Visually Situated Neural Language Acquisition
Alexander Ororbia
A. Mali
Mary Alexandria Kelly
David Reitter
40
4
0
29 May 2018
Explainable Recommendation: A Survey and New Perspectives
Explainable Recommendation: A Survey and New Perspectives
Yongfeng Zhang
Xu Chen
XAILRM
124
884
0
30 Apr 2018
Stochastic Answer Networks for Natural Language Inference
Stochastic Answer Networks for Natural Language Inference
Xiaodong Liu
Kevin Duh
Jianfeng Gao
BDL
76
45
0
21 Apr 2018
Utilizing Neural Networks and Linguistic Metadata for Early Detection of
  Depression Indications in Text Sequences
Utilizing Neural Networks and Linguistic Metadata for Early Detection of Depression Indications in Text Sequences
Marcel Trotzek
Sven Koitka
Christoph M. Friedrich
66
201
0
19 Apr 2018
Interact and Decide: Medley of Sub-Attention Networks for Effective
  Group Recommendation
Interact and Decide: Medley of Sub-Attention Networks for Effective Group Recommendation
Lucas Vinh Tran
T. Pham
Yi Tay
Yiding Liu
Gao Cong
Xiaoli Li
64
97
0
12 Apr 2018
Clinical Concept Embeddings Learned from Massive Sources of Multimodal
  Medical Data
Clinical Concept Embeddings Learned from Massive Sources of Multimodal Medical Data
Andrew L. Beam
Benjamin Kompa
A. Schmaltz
Inbar Fried
G. Weber
N. Palmer
Xu Shi
Tianxi Cai
I. Kohane
70
179
0
04 Apr 2018
The Geometry of Culture: Analyzing Meaning through Word Embeddings
The Geometry of Culture: Analyzing Meaning through Word Embeddings
Austin C. Kozlowski
Matt Taddy
James A. Evans
76
392
0
25 Mar 2018
SparCML: High-Performance Sparse Communication for Machine Learning
SparCML: High-Performance Sparse Communication for Machine Learning
Cédric Renggli
Saleh Ashkboos
Mehdi Aghagolzadeh
Dan Alistarh
Torsten Hoefler
102
127
0
22 Feb 2018
Deep Learning for Genomics: A Concise Overview
Deep Learning for Genomics: A Concise Overview
Tianwei Yue
Yuanxin Wang
Longxiang Zhang
Chunming Gu
Haohan Wang
Wenping Wang
Qi Lyu
Yujie Dun
AILawVLMBDL
86
91
0
02 Feb 2018
Natural Language Processing: State of The Art, Current Trends and
  Challenges
Natural Language Processing: State of The Art, Current Trends and Challenges
Diksha Khurana
Aditya Koli
Kiran Khatter
Sukhdev Singh
65
1,085
0
17 Aug 2017
Simple and Effective Dimensionality Reduction for Word Embeddings
Simple and Effective Dimensionality Reduction for Word Embeddings
Vikas Raunak
96
102
0
11 Aug 2017
Recent Trends in Deep Learning Based Natural Language Processing
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
153
2,849
0
09 Aug 2017
A Survey Of Cross-lingual Word Embedding Models
A Survey Of Cross-lingual Word Embedding Models
Sebastian Ruder
Ivan Vulić
Anders Søgaard
112
534
0
15 Jun 2017
Lifelong Generative Modeling
Lifelong Generative Modeling
Jason Ramapuram
Magda Gregorova
Alexandros Kalousis
BDLCLL
152
120
0
27 May 2017
Jointly Learning Sentence Embeddings and Syntax with Unsupervised
  Tree-LSTMs
Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs
Jean Maillard
S. Clark
Dani Yogatama
82
89
0
25 May 2017
A survey of embedding models of entities and relationships for knowledge
  graph completion
A survey of embedding models of entities and relationships for knowledge graph completion
Dat Quoc Nguyen
108
100
0
23 Mar 2017
Evolving Deep Neural Networks
Evolving Deep Neural Networks
Risto Miikkulainen
J. Liang
Elliot Meyerson
Aditya Rawal
Daniel Fink
...
B. Raju
Hormoz Shahrzad
Arshak Navruzyan
Nigel P. Duffy
Babak Hodjat
160
892
0
01 Mar 2017
Symbolic, Distributed and Distributional Representations for Natural
  Language Processing in the Era of Deep Learning: a Survey
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
62
38
0
02 Feb 2017
Quantifying the probable approximation error of probabilistic inference
  programs
Quantifying the probable approximation error of probabilistic inference programs
Marco F. Cusumano-Towner
Vikash K. Mansinghka
102
7
0
31 May 2016
Impact of Power System Partitioning on the Efficiency of Distributed
  Multi-Step Optimization
Impact of Power System Partitioning on the Efficiency of Distributed Multi-Step Optimization
Dongliang Chen
A. Bucchiarone
Zhihan Lv
47
12
0
31 May 2016
Previous
123...474475476