ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

8 / 23,508 papers shown
Title
A Survey Of Cross-lingual Word Embedding Models
A Survey Of Cross-lingual Word Embedding Models
Sebastian Ruder
Ivan Vulić
Anders Søgaard
110
534
0
15 Jun 2017
Lifelong Generative Modeling
Lifelong Generative Modeling
Jason Ramapuram
Magda Gregorova
Alexandros Kalousis
BDLCLL
152
120
0
27 May 2017
Jointly Learning Sentence Embeddings and Syntax with Unsupervised
  Tree-LSTMs
Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs
Jean Maillard
S. Clark
Dani Yogatama
82
89
0
25 May 2017
A survey of embedding models of entities and relationships for knowledge
  graph completion
A survey of embedding models of entities and relationships for knowledge graph completion
Dat Quoc Nguyen
108
100
0
23 Mar 2017
Evolving Deep Neural Networks
Evolving Deep Neural Networks
Risto Miikkulainen
J. Liang
Elliot Meyerson
Aditya Rawal
Daniel Fink
...
B. Raju
Hormoz Shahrzad
Arshak Navruzyan
Nigel P. Duffy
Babak Hodjat
139
892
0
01 Mar 2017
Symbolic, Distributed and Distributional Representations for Natural
  Language Processing in the Era of Deep Learning: a Survey
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
51
38
0
02 Feb 2017
Quantifying the probable approximation error of probabilistic inference
  programs
Quantifying the probable approximation error of probabilistic inference programs
Marco F. Cusumano-Towner
Vikash K. Mansinghka
102
5
0
31 May 2016
Impact of Power System Partitioning on the Efficiency of Distributed
  Multi-Step Optimization
Impact of Power System Partitioning on the Efficiency of Distributed Multi-Step Optimization
Dongliang Chen
A. Bucchiarone
Zhihan Lv
47
4
0
31 May 2016
Previous
123...469470471