ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

35 / 23,885 papers shown
Title
Multi-task Learning with Sample Re-weighting for Machine Reading
  Comprehension
Multi-task Learning with Sample Re-weighting for Machine Reading Comprehension
Yichong Xu
Xiaodong Liu
Yelong Shen
Jingjing Liu
Jianfeng Gao
113
51
0
18 Sep 2018
RumourEval 2019: Determining Rumour Veracity and Support for Rumours
RumourEval 2019: Determining Rumour Veracity and Support for Rumours
G. Gorrell
Kalina Bontcheva
Leon Derczynski
E. Kochkina
Maria Liakata
A. Zubiaga
123
218
0
18 Sep 2018
Categorizing Comparative Sentences
Categorizing Comparative Sentences
Sergey Petrakov
Alexander Bondarenko
Mirco Franzek
Matthias Hagen
Chris Biemann
55
3
0
17 Sep 2018
Meta-Embedding as Auxiliary Task Regularization
Meta-Embedding as Auxiliary Task Regularization
J. Ó. Neill
Danushka Bollegala
SSL
48
9
0
16 Sep 2018
Explicit Contextual Semantics for Text Comprehension
Explicit Contextual Semantics for Text Comprehension
Zhuosheng Zhang
Yuwei Wu
Z. Li
Hai Zhao
64
29
0
08 Sep 2018
Exploiting Invertible Decoders for Unsupervised Sentence Representation
  Learning
Exploiting Invertible Decoders for Unsupervised Sentence Representation Learning
Shuai Tang
V. D. Sa
SSL
74
1
0
08 Sep 2018
Texar: A Modularized, Versatile, and Extensible Toolkit for Text
  Generation
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation
Zhiting Hu
Haoran Shi
Bowen Tan
Wentao Wang
Zichao Yang
...
Zhengzhong Liu
Xiaodan Liang
Wangrong Zhu
Devendra Singh Sachan
Eric Xing
VLM
155
56
0
04 Sep 2018
Question Answering by Reasoning Across Documents with Graph
  Convolutional Networks
Question Answering by Reasoning Across Documents with Graph Convolutional Networks
Nicola De Cao
Wilker Aziz
Ivan Titov
BDLRALMGNN
150
226
0
29 Aug 2018
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive
  Meaning Representations
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations
Mohammad Taher Pilehvar
Jose Camacho-Collados
231
493
0
28 Aug 2018
CoQA: A Conversational Question Answering Challenge
CoQA: A Conversational Question Answering Challenge
Siva Reddy
Danqi Chen
Christopher D. Manning
RALMHAI
179
1,213
0
21 Aug 2018
The Influence of Down-Sampling Strategies on SVD Word Embedding
  Stability
The Influence of Down-Sampling Strategies on SVD Word Embedding Stability
Johannes Hellrich
B. Kampe
U. Hahn
56
10
0
21 Aug 2018
Gender Bias in Neural Natural Language Processing
Gender Bias in Neural Natural Language Processing
Kaiji Lu
Piotr (Peter) Mardziel
Fangjing Wu
Preetam Amancharla
Anupam Datta
128
362
0
31 Jul 2018
Video Storytelling: Textual Summaries for Events
Video Storytelling: Textual Summaries for Events
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
DiffM
72
47
0
25 Jul 2018
Modeling Word Emotion in Historical Language: Quantity Beats Supposed
  Stability in Seed Word Selection
Modeling Word Emotion in Historical Language: Quantity Beats Supposed Stability in Seed Word Selection
Johannes Hellrich
Sven Buechel
U. Hahn
55
7
0
21 Jun 2018
DRCD: a Chinese Machine Reading Comprehension Dataset
DRCD: a Chinese Machine Reading Comprehension Dataset
C. Shao
Trois Liu
Yuting Lai
Yiying Tseng
Sam S. Tsai
102
128
0
04 Jun 2018
Like a Baby: Visually Situated Neural Language Acquisition
Like a Baby: Visually Situated Neural Language Acquisition
Alexander Ororbia
A. Mali
Mary Alexandria Kelly
David Reitter
40
4
0
29 May 2018
Explainable Recommendation: A Survey and New Perspectives
Explainable Recommendation: A Survey and New Perspectives
Yongfeng Zhang
Xu Chen
XAILRM
124
884
0
30 Apr 2018
Stochastic Answer Networks for Natural Language Inference
Stochastic Answer Networks for Natural Language Inference
Xiaodong Liu
Kevin Duh
Jianfeng Gao
BDL
76
45
0
21 Apr 2018
Utilizing Neural Networks and Linguistic Metadata for Early Detection of
  Depression Indications in Text Sequences
Utilizing Neural Networks and Linguistic Metadata for Early Detection of Depression Indications in Text Sequences
Marcel Trotzek
Sven Koitka
Christoph M. Friedrich
75
201
0
19 Apr 2018
Interact and Decide: Medley of Sub-Attention Networks for Effective
  Group Recommendation
Interact and Decide: Medley of Sub-Attention Networks for Effective Group Recommendation
Lucas Vinh Tran
T. Pham
Yi Tay
Yiding Liu
Gao Cong
Xiaoli Li
79
97
0
12 Apr 2018
Clinical Concept Embeddings Learned from Massive Sources of Multimodal
  Medical Data
Clinical Concept Embeddings Learned from Massive Sources of Multimodal Medical Data
Andrew L. Beam
Benjamin Kompa
A. Schmaltz
Inbar Fried
G. Weber
N. Palmer
Xu Shi
Tianxi Cai
I. Kohane
70
179
0
04 Apr 2018
The Geometry of Culture: Analyzing Meaning through Word Embeddings
The Geometry of Culture: Analyzing Meaning through Word Embeddings
Austin C. Kozlowski
Matt Taddy
James A. Evans
76
392
0
25 Mar 2018
SparCML: High-Performance Sparse Communication for Machine Learning
SparCML: High-Performance Sparse Communication for Machine Learning
Cédric Renggli
Saleh Ashkboos
Mehdi Aghagolzadeh
Dan Alistarh
Torsten Hoefler
104
127
0
22 Feb 2018
Deep Learning for Genomics: A Concise Overview
Deep Learning for Genomics: A Concise Overview
Tianwei Yue
Yuanxin Wang
Longxiang Zhang
Chunming Gu
Haohan Wang
Wenping Wang
Qi Lyu
Yujie Dun
AILawVLMBDL
86
91
0
02 Feb 2018
Natural Language Processing: State of The Art, Current Trends and
  Challenges
Natural Language Processing: State of The Art, Current Trends and Challenges
Diksha Khurana
Aditya Koli
Kiran Khatter
Sukhdev Singh
67
1,085
0
17 Aug 2017
Simple and Effective Dimensionality Reduction for Word Embeddings
Simple and Effective Dimensionality Reduction for Word Embeddings
Vikas Raunak
96
102
0
11 Aug 2017
Recent Trends in Deep Learning Based Natural Language Processing
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
155
2,849
0
09 Aug 2017
A Survey Of Cross-lingual Word Embedding Models
A Survey Of Cross-lingual Word Embedding Models
Sebastian Ruder
Ivan Vulić
Anders Søgaard
114
534
0
15 Jun 2017
Lifelong Generative Modeling
Lifelong Generative Modeling
Jason Ramapuram
Magda Gregorova
Alexandros Kalousis
BDLCLL
152
120
0
27 May 2017
Jointly Learning Sentence Embeddings and Syntax with Unsupervised
  Tree-LSTMs
Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs
Jean Maillard
S. Clark
Dani Yogatama
82
89
0
25 May 2017
A survey of embedding models of entities and relationships for knowledge
  graph completion
A survey of embedding models of entities and relationships for knowledge graph completion
Dat Quoc Nguyen
108
100
0
23 Mar 2017
Evolving Deep Neural Networks
Evolving Deep Neural Networks
Risto Miikkulainen
J. Liang
Elliot Meyerson
Aditya Rawal
Daniel Fink
...
B. Raju
Hormoz Shahrzad
Arshak Navruzyan
Nigel P. Duffy
Babak Hodjat
172
892
0
01 Mar 2017
Symbolic, Distributed and Distributional Representations for Natural
  Language Processing in the Era of Deep Learning: a Survey
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
62
38
0
02 Feb 2017
Quantifying the probable approximation error of probabilistic inference
  programs
Quantifying the probable approximation error of probabilistic inference programs
Marco F. Cusumano-Towner
Vikash K. Mansinghka
102
5
0
31 May 2016
Impact of Power System Partitioning on the Efficiency of Distributed
  Multi-Step Optimization
Impact of Power System Partitioning on the Efficiency of Distributed Multi-Step Optimization
Dongliang Chen
A. Bucchiarone
Zhihan Lv
47
5
0
31 May 2016
Previous
123...476477478