ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 18,690 papers shown
Title
Deep Unsupervised Cardinality Estimation
Deep Unsupervised Cardinality Estimation
Zongheng Yang
Eric Liang
Amog Kamsetty
Chenggang Wu
Yan Duan
Peter Chen
Pieter Abbeel
J. M. Hellerstein
S. Krishnan
Ion Stoica
29
203
0
10 May 2019
Language Modeling with Deep Transformers
Language Modeling with Deep Transformers
Kazuki Irie
Albert Zeyer
Ralf Schluter
Hermann Ney
KELM
48
171
0
10 May 2019
Improving Discrete Latent Representations With Differentiable
  Approximation Bridges
Improving Discrete Latent Representations With Differentiable Approximation Bridges
Jason Ramapuram
Russ Webb
DRL
19
9
0
09 May 2019
Unified Language Model Pre-training for Natural Language Understanding
  and Generation
Unified Language Model Pre-training for Natural Language Understanding and Generation
Li Dong
Nan Yang
Wenhui Wang
Furu Wei
Xiaodong Liu
Yu-Chiang Frank Wang
Jianfeng Gao
M. Zhou
H. Hon
ELM
AI4CE
80
1,551
0
08 May 2019
Show, Price and Negotiate: A Negotiator with Online Value Look-Ahead
Show, Price and Negotiate: A Negotiator with Online Value Look-Ahead
Amin Parvaneh
Ehsan Abbasnejad
Qi Wu
Javen Qinfeng Shi
Anton van den Hengel
OffRL
29
5
0
07 May 2019
Neural Architecture Refinement: A Practical Way for Avoiding Overfitting
  in NAS
Neural Architecture Refinement: A Practical Way for Avoiding Overfitting in NAS
Yangzhou Jiang
Cong Zhao
Zeyang Dou
Lei Pang
14
5
0
07 May 2019
Taming Pretrained Transformers for Extreme Multi-label Text
  Classification
Taming Pretrained Transformers for Extreme Multi-label Text Classification
Wei-Cheng Chang
Hsiang-Fu Yu
Kai Zhong
Yiming Yang
Inderjit Dhillon
27
20
0
07 May 2019
Investigating the Successes and Failures of BERT for Passage Re-Ranking
Investigating the Successes and Failures of BERT for Passage Re-Ranking
Harshith Padigela
Hamed Zamani
W. Bruce Croft
27
47
0
05 May 2019
Learning to Denoise Distantly-Labeled Data for Entity Typing
Learning to Denoise Distantly-Labeled Data for Entity Typing
Yasumasa Onoe
Greg Durrett
32
57
0
04 May 2019
ASER: A Large-scale Eventuality Knowledge Graph
ASER: A Large-scale Eventuality Knowledge Graph
Hongming Zhang
Xin Liu
Haojie Pan
Yangqiu Song
C. Leung
SLR
32
159
0
01 May 2019
Deep Learning for Audio Signal Processing
Deep Learning for Audio Signal Processing
Hendrik Purwins
Yue Liu
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
34
587
0
30 Apr 2019
Very Deep Self-Attention Networks for End-to-End Speech Recognition
Very Deep Self-Attention Networks for End-to-End Speech Recognition
Ngoc-Quan Pham
T. Nguyen
Jan Niehues
Markus Müller
Sebastian Stüker
A. Waibel
28
161
0
30 Apr 2019
Segmentation is All You Need
Segmentation is All You Need
Zehua Cheng
Yuxiang Wu
Zhenghua Xu
Thomas Lukasiewicz
Weiyan Wang
33
20
0
30 Apr 2019
Towards Efficient Model Compression via Learned Global Ranking
Towards Efficient Model Compression via Learned Global Ranking
Ting-Wu Chin
Ruizhou Ding
Cha Zhang
Diana Marculescu
19
170
0
28 Apr 2019
Improved Conditional VRNNs for Video Prediction
Improved Conditional VRNNs for Video Prediction
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
DRL
23
161
0
27 Apr 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
31
227
0
25 Apr 2019
Probing What Different NLP Tasks Teach Machines about Function Word
  Comprehension
Probing What Different NLP Tasks Teach Machines about Function Word Comprehension
Najoung Kim
Roma Patel
Adam Poliak
Alex Jinpeng Wang
Patrick Xia
...
Alexis Ross
Tal Linzen
Benjamin Van Durme
Samuel R. Bowman
Ellie Pavlick
28
106
0
25 Apr 2019
Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference
Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference
Nikita Kitaev
Dan Klein
38
21
0
22 Apr 2019
Understanding Roles and Entities: Datasets and Models for Natural
  Language Inference
Understanding Roles and Entities: Datasets and Models for Natural Language Inference
Arindam Mitra
Ishan Shrivastava
Chitta Baral
28
2
0
22 Apr 2019
Poly-encoders: Transformer Architectures and Pre-training Strategies for
  Fast and Accurate Multi-sentence Scoring
Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring
Samuel Humeau
Kurt Shuster
Marie-Anne Lachaux
Jason Weston
38
280
0
22 Apr 2019
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for
  Natural Language Understanding
Improving Multi-Task Deep Neural Networks via Knowledge Distillation for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
FedML
13
181
0
20 Apr 2019
Language Models with Transformers
Language Models with Transformers
Chenguang Wang
Mu Li
Alex Smola
20
121
0
20 Apr 2019
Mask-Predict: Parallel Decoding of Conditional Masked Language Models
Mask-Predict: Parallel Decoding of Conditional Masked Language Models
Marjan Ghazvininejad
Omer Levy
Yinhan Liu
Luke Zettlemoyer
MoE
27
35
0
19 Apr 2019
An Evaluation of Transfer Learning for Classifying Sales Engagement
  Emails at Large Scale
An Evaluation of Transfer Learning for Classifying Sales Engagement Emails at Large Scale
Yong Liu
Pavel A. Dmitriev
Yifei Huang
Andrew Brooks
Li Dong
21
4
0
19 Apr 2019
ERNIE: Enhanced Representation through Knowledge Integration
ERNIE: Enhanced Representation through Knowledge Integration
Yu Sun
Shuohuan Wang
Yukun Li
Shikun Feng
Xuyi Chen
Han Zhang
Xin Tian
Danxiang Zhu
Hao Tian
Hua Wu
79
896
0
19 Apr 2019
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT
Shijie Wu
Mark Dredze
VLM
SSeg
44
670
0
19 Apr 2019
Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
Christine Basta
Marta R. Costa-jussá
Noe Casas
27
189
0
18 Apr 2019
DocBERT: BERT for Document Classification
DocBERT: BERT for Document Classification
Ashutosh Adhikari
Achyudh Ram
Raphael Tang
Jimmy J. Lin
LLMAG
VLM
15
296
0
17 Apr 2019
Document Expansion by Query Prediction
Document Expansion by Query Prediction
Rodrigo Nogueira
Wei Yang
Jimmy J. Lin
Kyunghyun Cho
54
409
0
17 Apr 2019
Amobee at SemEval-2019 Tasks 5 and 6: Multiple Choice CNN Over
  Contextual Embedding
Amobee at SemEval-2019 Tasks 5 and 6: Multiple Choice CNN Over Contextual Embedding
A. Rozental
Dadi Biton
17
14
0
17 Apr 2019
Natural Language Semantics With Pictures: Some Language & Vision
  Datasets and Potential Uses for Computational Semantics
Natural Language Semantics With Pictures: Some Language & Vision Datasets and Potential Uses for Computational Semantics
David Schlangen
33
6
0
15 Apr 2019
Pre-training of Context-aware Item Representation for Next Basket
  Recommendation
Pre-training of Context-aware Item Representation for Next Basket Recommendation
Jingxuan Yang
Jun Xu
Jianzhuo Tong
Sheng Gao
Jun Guo
Jirong Wen
15
7
0
14 Apr 2019
Membership Inference Attacks on Sequence-to-Sequence Models: Is My Data
  In Your Machine Translation System?
Membership Inference Attacks on Sequence-to-Sequence Models: Is My Data In Your Machine Translation System?
Sorami Hisamoto
Matt Post
Kevin Duh
MIACV
SLR
36
106
0
11 Apr 2019
Deep Neural Networks Ensemble for Detecting Medication Mentions in
  Tweets
Deep Neural Networks Ensemble for Detecting Medication Mentions in Tweets
D. Weissenbacher
A. Sarker
A. Klein
K. O’Connor
Arjun Magge Ranganatha
G. Gonzalez-Hernandez
21
47
0
10 Apr 2019
Jointly Measuring Diversity and Quality in Text Generation Models
Jointly Measuring Diversity and Quality in Text Generation Models
Ehsan Montahaei
Danial Alihosseini
M. Baghshah
27
76
0
08 Apr 2019
UM-IU@LING at SemEval-2019 Task 6: Identifying Offensive Tweets Using
  BERT and SVMs
UM-IU@LING at SemEval-2019 Task 6: Identifying Offensive Tweets Using BERT and SVMs
Jian Zhu
Zuoyu Tian
Sandra Kübler
VLM
27
39
0
06 Apr 2019
Evaluating Coherence in Dialogue Systems using Entailment
Evaluating Coherence in Dialogue Systems using Entailment
Nouha Dziri
Ehsan Kamalloo
K. Mathewson
Osmar Zaiane
14
94
0
06 Apr 2019
Publicly Available Clinical BERT Embeddings
Publicly Available Clinical BERT Embeddings
Emily Alsentzer
John R. Murphy
Willie Boag
W. Weng
Di Jin
Tristan Naumann
Matthew B. A. McDermott
AI4MH
48
1,933
0
06 Apr 2019
Gender Bias in Contextualized Word Embeddings
Gender Bias in Contextualized Word Embeddings
Jieyu Zhao
Tianlu Wang
Mark Yatskar
Ryan Cotterell
Vicente Ordonez
Kai-Wei Chang
FaML
42
417
0
05 Apr 2019
PoMo: Generating Entity-Specific Post-Modifiers in Context
PoMo: Generating Entity-Specific Post-Modifiers in Context
Jun Seok Kang
IV RobertL.Logan
Zewei Chu
Yang Chen
Dheeru Dua
Kevin Gimpel
Sameer Singh
Niranjan Balasubramanian
34
11
0
05 Apr 2019
A Literature Study of Embeddings on Source Code
A Literature Study of Embeddings on Source Code
Zimin Chen
Monperrus Martin
49
82
0
05 Apr 2019
NL-FIIT at SemEval-2019 Task 9: Neural Model Ensemble for Suggestion
  Mining
NL-FIIT at SemEval-2019 Task 9: Neural Model Ensemble for Suggestion Mining
Samuel Pecar
Marian Simko
Maria Bielikova
14
7
0
05 Apr 2019
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence
  Labeling
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling
Xiaochuang Han
Jacob Eisenstein
22
20
0
04 Apr 2019
Multi-Context Term Embeddings: the Use Case of Corpus-based Term Set
  Expansion
Multi-Context Term Embeddings: the Use Case of Corpus-based Term Set Expansion
Jonathan Mamou
Oren Pereg
Moshe Wasserblat
Ido Dagan
21
0
0
04 Apr 2019
Probing Biomedical Embeddings from Language Models
Probing Biomedical Embeddings from Language Models
Qiao Jin
Bhuwan Dhingra
William W. Cohen
Xinghua Lu
29
116
0
03 Apr 2019
75 Languages, 1 Model: Parsing Universal Dependencies Universally
75 Languages, 1 Model: Parsing Universal Dependencies Universally
Dan Kondratyuk
Milan Straka
39
263
0
03 Apr 2019
Structural Scaffolds for Citation Intent Classification in Scientific
  Publications
Structural Scaffolds for Citation Intent Classification in Scientific Publications
Arman Cohan
Bridger Waleed Ammar
Madeleine van Zuylen
Field Cady
27
249
0
02 Apr 2019
A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence
  Representations
A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations
Mingda Chen
Qingming Tang
Sam Wiseman
Kevin Gimpel
DRL
25
76
0
02 Apr 2019
Recent Advances in Natural Language Inference: A Survey of Benchmarks,
  Resources, and Approaches
Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches
Shane Storks
Qiaozi Gao
J. Chai
26
128
0
02 Apr 2019
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
Yang You
Jing Li
Sashank J. Reddi
Jonathan Hseu
Sanjiv Kumar
Srinadh Bhojanapalli
Xiaodan Song
J. Demmel
Kurt Keutzer
Cho-Jui Hsieh
ODL
28
985
0
01 Apr 2019
Previous
123...370371372373374
Next