ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 1,211 papers shown
TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
73
601
0
10 Mar 2020
Efficient Intent Detection with Dual Sentence Encoders
I. Casanueva
Tadas Temčinas
D. Gerz
Matthew Henderson
Ivan Vulić
VLM
190
463
0
10 Mar 2020
Keeping it simple: Implementation and performance of the proto-principle of adaptation and learning in the language sciences
P. Milin
Harish Tayyar Madabushi
Mike Croucher
Dagmar Divjak
34
13
0
08 Mar 2020
Communication optimization strategies for distributed deep neural network training: A survey
Shuo Ouyang
Dezun Dong
Yemao Xu
Liquan Xiao
40
12
0
06 Mar 2020
PhoBERT: Pre-trained language models for Vietnamese
Dat Quoc Nguyen
A. Nguyen
188
350
0
02 Mar 2020
A Dataset Independent Set of Baselines for Relation Prediction in Argument Mining
O. Cocarascu
Elena Cabrio
S. Villata
Francesca Toni
44
7
0
14 Feb 2020
Localized Flood Detection With Minimal Labeled Social Media Data Using Transfer Learning
Neha Singh
Nirmalya Roy
A. Gangopadhyay
42
6
0
10 Feb 2020
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering
Akari Asai
Kazuma Hashimoto
Hannaneh Hajishirzi
R. Socher
Caiming Xiong
RALM
KELM
LRM
37
284
0
24 Nov 2019
Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Zewen Chi
Li Dong
Furu Wei
Xian-Ling Mao
Heyan Huang
LRM
VLM
37
13
0
10 Nov 2019
Meta Label Correction for Noisy Label Learning
Guoqing Zheng
Ahmed Hassan Awadallah
S. Dumais
NoLa
OffRL
34
179
0
10 Nov 2019
Multi-Sentence Argument Linking
Seth Ebner
Patrick Xia
Ryan Culkin
Kyle Rawlins
Benjamin Van Durme
HAI
41
159
0
09 Nov 2019
Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity
Nina Poerner
Ulli Waltinger
Hinrich Schütze
AI4TS
39
20
0
09 Nov 2019
How Decoding Strategies Affect the Verifiability of Generated Text
Luca Massarelli
Fabio Petroni
Aleksandra Piktus
Myle Ott
Tim Rocktäschel
Vassilis Plachouras
Fabrizio Silvestri
Sebastian Riedel
38
50
0
09 Nov 2019
On the Relationship between Self-Attention and Convolutional Layers
Jean-Baptiste Cordonnier
Andreas Loukas
Martin Jaggi
50
530
0
08 Nov 2019
Hierarchical Contextualized Representation for Named Entity Recognition
Ying Luo
Fengshun Xiao
Zhao Hai
42
129
0
06 Nov 2019
Learning to Fix Build Errors with Graph2Diff Neural Networks
Daniel Tarlow
Subhodeep Moitra
Andrew Rice
Zimin Chen
Pierre-Antoine Manzagol
Charles Sutton
E. Aftandilian
GNN
33
62
0
04 Nov 2019
MRNN: A Multi-Resolution Neural Network with Duplex Attention for Document Retrieval in the Context of Question Answering
Tolgahan Cakaloglu
Xiaowei Xu
26
2
0
03 Nov 2019
Question Answering for Privacy Policies: Combining Computational and Legal Perspectives
Abhilasha Ravichander
A. Black
Shomir Wilson
Thomas B. Norton
Norman M. Sadeh
AILaw
45
106
0
03 Nov 2019
Discourse-Aware Neural Extractive Text Summarization
Jiacheng Xu
Zhe Gan
Yu Cheng
Jingjing Liu
BDL
97
279
0
30 Oct 2019
Inducing brain-relevant bias in natural language processing models
Dan Schwartz
Mariya Toneva
Leila Wehbe
21
80
0
29 Oct 2019
Weakly-Supervised Deep Learning for Domain Invariant Sentiment Classification
Pratik Kayal
M. Singh
Pawan Goyal
53
4
0
29 Oct 2019
Thieves on Sesame Street! Model Extraction of BERT-based APIs
Kalpesh Krishna
Gaurav Singh Tomar
Ankur P. Parikh
Nicolas Papernot
Mohit Iyyer
MIACV
MLAU
48
197
0
27 Oct 2019
Evaluation of Sentence Representations in Polish
Slawomir Dadas
Michal Perelkiewicz
Rafal Poswiata
45
13
0
25 Oct 2019
HUBERT Untangles BERT to Improve Transfer across NLP Tasks
M. Moradshahi
Hamid Palangi
M. Lam
P. Smolensky
Jianfeng Gao
42
16
0
25 Oct 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
72
372
0
25 Oct 2019
Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets
Ohad Rozen
Vered Shwartz
Roee Aharoni
Ido Dagan
AAML
37
38
0
21 Oct 2019
Discovering the Compositional Structure of Vector Representations with Role Learning Networks
Paul Soulos
R. Thomas McCoy
Tal Linzen
P. Smolensky
CoGe
43
43
0
21 Oct 2019
Multi-granularity hierarchical attention fusion networks for reading comprehension and question answering
Wei Wang
Ming Yan
Chen Henry Wu
45
173
0
29 Nov 2018
U-Net: Machine Reading Comprehension with Unanswerable Questions
Fu Sun
Linyang Li
Xipeng Qiu
Yang Liu
48
47
0
12 Oct 2018
Semi-Supervised Sequence Modeling with Cross-View Training
Kevin Clark
Minh-Thang Luong
Christopher D. Manning
Quoc V. Le
SSL
24
333
0
22 Sep 2018
Dissecting Contextual Word Embeddings: Architecture and Representation
Matthew E. Peters
Mark Neumann
Luke Zettlemoyer
Wen-tau Yih
58
429
0
27 Aug 2018
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference
Rowan Zellers
Yonatan Bisk
Roy Schwartz
Yejin Choi
45
710
0
16 Aug 2018
Character-Level Language Modeling with Deeper Self-Attention
Rami Al-Rfou
Dokook Choe
Noah Constant
Mandy Guo
Llion Jones
57
388
0
09 Aug 2018
Neural Network Acceptability Judgments
Alex Warstadt
Amanpreet Singh
Samuel R. Bowman
97
1,390
0
31 May 2018
QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension
Adams Wei Yu
David Dohan
Minh-Thang Luong
Rui Zhao
Kai Chen
Mohammad Norouzi
Quoc V. Le
RALM
AIMat
47
1,093
0
23 Apr 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
336
7,080
0
20 Apr 2018
An efficient framework for learning sentence representations
Lajanugen Logeswaran
Honglak Lee
35
540
0
07 Mar 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
21
11,520
0
15 Feb 2018
MaskGAN: Better Text Generation via Filling in the ______
W. Fedus
Ian Goodfellow
Andrew M. Dai
29
469
0
23 Jan 2018
Simple and Effective Multi-Paragraph Reading Comprehension
Christopher Clark
Matt Gardner
RALM
43
456
0
29 Oct 2017
Learned in Translation: Contextualized Word Vectors
Bryan McCann
James Bradbury
Caiming Xiong
R. Socher
71
907
0
01 Aug 2017
SemEval-2017 Task 1: Semantic Textual Similarity - Multilingual and Cross-lingual Focused Evaluation
Daniel Cer
Mona T. Diab
Eneko Agirre
I. Lopez-Gazpio
Lucia Specia
32
1,870
0
31 Jul 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
178
129,831
0
12 Jun 2017
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
118
2,576
0
09 May 2017
Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
Alexis Conneau
Douwe Kiela
Holger Schwenk
Loïc Barrault
Antoine Bordes
AI4TS
SSL
94
2,099
0
05 May 2017
Semi-supervised sequence tagging with bidirectional language models
Matthew E. Peters
Waleed Ammar
Chandra Bhagavatula
Russell Power
39
634
0
29 Apr 2017
Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning
Yacine Jernite
Samuel R. Bowman
David Sontag
34
111
0
23 Apr 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams
Nikita Nangia
Samuel R. Bowman
280
4,444
0
18 Apr 2017
Bidirectional Attention Flow for Machine Comprehension
Minjoon Seo
Aniruddha Kembhavi
Ali Farhadi
Hannaneh Hajishirzi
74
2,088
0
05 Nov 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhifeng Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
741
6,768
0
26 Sep 2016