ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,508 papers shown
Title
UM-IU@LING at SemEval-2019 Task 6: Identifying Offensive Tweets Using
  BERT and SVMs
UM-IU@LING at SemEval-2019 Task 6: Identifying Offensive Tweets Using BERT and SVMs
Jian Zhu
Zuoyu Tian
Sandra Kübler
VLM
75
39
0
06 Apr 2019
Modeling Point Clouds with Self-Attention and Gumbel Subset Sampling
Modeling Point Clouds with Self-Attention and Gumbel Subset Sampling
Jiancheng Yang
Qiang Zhang
Bingbing Ni
Linguo Li
Jinxian Liu
Mengdie Zhou
Qi Tian
3DPC
95
382
0
06 Apr 2019
Evaluating Coherence in Dialogue Systems using Entailment
Evaluating Coherence in Dialogue Systems using Entailment
Nouha Dziri
Ehsan Kamalloo
K. Mathewson
Osmar Zaiane
105
97
0
06 Apr 2019
ThisIsCompetition at SemEval-2019 Task 9: BERT is unstable for
  out-of-domain samples
ThisIsCompetition at SemEval-2019 Task 9: BERT is unstable for out-of-domain samples
Cheoneum Park
Juae Kim
Hyeon-gu Lee
Reinald Kim Amplayo
H. Kim
Jungyun Seo
Changki Lee
62
12
0
06 Apr 2019
Publicly Available Clinical BERT Embeddings
Publicly Available Clinical BERT Embeddings
Emily Alsentzer
John R. Murphy
Willie Boag
W. Weng
Di Jin
Tristan Naumann
Matthew B. A. McDermott
AI4MH
233
2,002
0
06 Apr 2019
Gender Bias in Contextualized Word Embeddings
Gender Bias in Contextualized Word Embeddings
Jieyu Zhao
Tianlu Wang
Mark Yatskar
Ryan Cotterell
Vicente Ordonez
Kai-Wei Chang
FaML
129
421
0
05 Apr 2019
An Unsupervised Autoregressive Model for Speech Representation Learning
An Unsupervised Autoregressive Model for Speech Representation Learning
Yu-An Chung
Wei-Ning Hsu
Hao Tang
James R. Glass
SSL
116
409
0
05 Apr 2019
Information Aggregation for Multi-Head Attention with
  Routing-by-Agreement
Information Aggregation for Multi-Head Attention with Routing-by-Agreement
Jian Li
Baosong Yang
Zi-Yi Dou
Xing Wang
Michael R. Lyu
Zhaopeng Tu
80
46
0
05 Apr 2019
CLEARumor at SemEval-2019 Task 7: ConvoLving ELMo Against Rumors
CLEARumor at SemEval-2019 Task 7: ConvoLving ELMo Against Rumors
Ipek Baris
Lukas Schmelzeisen
Steffen Staab
26
15
0
05 Apr 2019
A Literature Study of Embeddings on Source Code
A Literature Study of Embeddings on Source Code
Zimin Chen
Monperrus Martin
113
83
0
05 Apr 2019
NL-FIIT at SemEval-2019 Task 9: Neural Model Ensemble for Suggestion
  Mining
NL-FIIT at SemEval-2019 Task 9: Neural Model Ensemble for Suggestion Mining
Samuel Pecar
Marian Simko
Maria Bielikova
21
7
0
05 Apr 2019
An Attentive Survey of Attention Models
An Attentive Survey of Attention Models
S. Chaudhari
Varun Mithal
Gungor Polatkan
R. Ramanath
200
666
0
05 Apr 2019
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence
  Labeling
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling
Xiaochuang Han
Jacob Eisenstein
58
20
0
04 Apr 2019
In Other News: A Bi-style Text-to-speech Model for Synthesizing
  Newscaster Voice with Limited Data
In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
N. Prateek
Mateusz Lajszczak
Roberto Barra-Chicote
Thomas Drugman
Jaime Lorenzo-Trueba
Thomas Merritt
S. Ronanki
Trevor Wood
87
30
0
04 Apr 2019
Multi-Context Term Embeddings: the Use Case of Corpus-based Term Set
  Expansion
Multi-Context Term Embeddings: the Use Case of Corpus-based Term Set Expansion
Jonathan Mamou
Oren Pereg
Moshe Wasserblat
Ido Dagan
33
0
0
04 Apr 2019
Composition of Sentence Embeddings:Lessons from Statistical Relational
  Learning
Composition of Sentence Embeddings:Lessons from Statistical Relational Learning
Damien Sileo
Tim Van de Cruys
Camille Pradel
Philippe Muller
CoGeAI4TS
46
0
0
04 Apr 2019
BERT Post-Training for Review Reading Comprehension and Aspect-based
  Sentiment Analysis
BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis
Hu Xu
Bing-Quan Liu
Lei Shu
Philip S. Yu
102
701
0
03 Apr 2019
Probing Biomedical Embeddings from Language Models
Probing Biomedical Embeddings from Language Models
Qiao Jin
Bhuwan Dhingra
William W. Cohen
Xinghua Lu
84
116
0
03 Apr 2019
CAN-NER: Convolutional Attention Network for Chinese Named Entity
  Recognition
CAN-NER: Convolutional Attention Network for Chinese Named Entity Recognition
Yuying Zhu
Guoxin Wang
Börje F. Karlsson
102
119
0
03 Apr 2019
75 Languages, 1 Model: Parsing Universal Dependencies Universally
75 Languages, 1 Model: Parsing Universal Dependencies Universally
Dan Kondratyuk
Milan Straka
121
264
0
03 Apr 2019
Modeling Vocabulary for Big Code Machine Learning
Modeling Vocabulary for Big Code Machine Learning
Hlib Babii
Andrea Janes
Romain Robbes
36
22
0
03 Apr 2019
VideoBERT: A Joint Model for Video and Language Representation Learning
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
Austin Myers
Carl Vondrick
Kevin Patrick Murphy
Cordelia Schmid
VLMSSL
92
1,253
0
03 Apr 2019
The Verbal and Non Verbal Signals of Depression -- Combining Acoustics,
  Text and Visuals for Estimating Depression Level
The Verbal and Non Verbal Signals of Depression -- Combining Acoustics, Text and Visuals for Estimating Depression Level
Syed Arbaaz Qureshi
Mohammed Hasanuzzaman
S. Saha
G. Dias
33
17
0
02 Apr 2019
Structural Scaffolds for Citation Intent Classification in Scientific
  Publications
Structural Scaffolds for Citation Intent Classification in Scientific Publications
Arman Cohan
Bridger Waleed Ammar
Madeleine van Zuylen
Field Cady
88
253
0
02 Apr 2019
Neural Vector Conceptualization for Word Vector Space Interpretation
Neural Vector Conceptualization for Word Vector Space Interpretation
Robert Schwarzenberg
Lisa Raithel
David Harbecke
LLMSVVLM
54
10
0
02 Apr 2019
Temporal and Aspectual Entailment
Temporal and Aspectual Entailment
Thomas Kober
Sander Bijl de Vroe
Mark Steedman
61
16
0
02 Apr 2019
Habitat: A Platform for Embodied AI Research
Habitat: A Platform for Embodied AI Research
Manolis Savva
Abhishek Kadian
Oleksandr Maksymets
Yili Zhao
Erik Wijmans
...
Jia-Wei Liu
V. Koltun
Jitendra Malik
Devi Parikh
Dhruv Batra
LM&Ro
145
1,424
0
02 Apr 2019
A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence
  Representations
A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations
Mingda Chen
Qingming Tang
Sam Wiseman
Kevin Gimpel
DRL
97
76
0
02 Apr 2019
Recent Advances in Natural Language Inference: A Survey of Benchmarks,
  Resources, and Approaches
Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches
Shane Storks
Qiaozi Gao
J. Chai
102
132
0
02 Apr 2019
PAWS: Paraphrase Adversaries from Word Scrambling
PAWS: Paraphrase Adversaries from Word Scrambling
Yuan Zhang
Jason Baldridge
Luheng He
85
545
0
01 Apr 2019
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
Yang You
Jing Li
Sashank J. Reddi
Jonathan Hseu
Sanjiv Kumar
Srinadh Bhojanapalli
Xiaodan Song
J. Demmel
Kurt Keutzer
Cho-Jui Hsieh
ODL
341
1,001
0
01 Apr 2019
Using Similarity Measures to Select Pretraining Data for NER
Using Similarity Measures to Select Pretraining Data for NER
Xiang Dai
Sarvnaz Karimi
Ben Hachey
Cécile Paris
93
51
0
01 Apr 2019
ANA at SemEval-2019 Task 3: Contextual Emotion detection in
  Conversations through hierarchical LSTMs and BERT
ANA at SemEval-2019 Task 3: Contextual Emotion detection in Conversations through hierarchical LSTMs and BERT
Chenyang Huang
Amine Trabelsi
Osmar R. Zaïane
72
74
0
30 Mar 2019
Interpreting Black Box Models via Hypothesis Testing
Interpreting Black Box Models via Hypothesis Testing
Collin Burns
Jesse Thomason
Wesley Tansey
FAtt
80
9
0
29 Mar 2019
Integrating Semantic Knowledge to Tackle Zero-shot Text Classification
Integrating Semantic Knowledge to Tackle Zero-shot Text Classification
Jingqing Zhang
Piyawat Lertvittayakumjorn
Yike Guo
VLM
104
118
0
29 Mar 2019
Towards Knowledge-Based Personalized Product Description Generation in
  E-commerce
Towards Knowledge-Based Personalized Product Description Generation in E-commerce
Qibin Chen
Junyang Lin
Yichang Zhang
Hongxia Yang
Jingren Zhou
Jie Tang
79
99
0
29 Mar 2019
Informed Machine Learning -- A Taxonomy and Survey of Integrating
  Knowledge into Learning Systems
Informed Machine Learning -- A Taxonomy and Survey of Integrating Knowledge into Learning Systems
Laura von Rueden
S. Mayer
Katharina Beckh
B. Georgiev
Sven Giesselbach
...
Rajkumar Ramamurthy
Michal Walczak
Jochen Garcke
Christian Bauckhage
Jannis Schuecker
160
655
0
29 Mar 2019
Distilling Task-Specific Knowledge from BERT into Simple Neural Networks
Distilling Task-Specific Knowledge from BERT into Simple Neural Networks
Raphael Tang
Yao Lu
Linqing Liu
Lili Mou
Olga Vechtomova
Jimmy J. Lin
80
421
0
28 Mar 2019
Train, Sort, Explain: Learning to Diagnose Translation Models
Train, Sort, Explain: Learning to Diagnose Translation Models
Robert Schwarzenberg
David Harbecke
Vivien Macketanz
Eleftherios Avramidis
Sebastian Möller
62
7
0
28 Mar 2019
Mining Discourse Markers for Unsupervised Sentence Representation
  Learning
Mining Discourse Markers for Unsupervised Sentence Representation Learning
Damien Sileo
Tim Van de Cruys
Camille Pradel
Philippe Muller
97
69
0
28 Mar 2019
Wasserstein Dependency Measure for Representation Learning
Wasserstein Dependency Measure for Representation Learning
Sherjil Ozair
Corey Lynch
Yoshua Bengio
Aaron van den Oord
Sergey Levine
P. Sermanet
SSLDRL
146
119
0
28 Mar 2019
Small Data Challenges in Big Data Era: A Survey of Recent Progress on
  Unsupervised and Semi-Supervised Methods
Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods
Guo-Jun Qi
Jiebo Luo
SSL
61
246
0
27 Mar 2019
On Attribution of Recurrent Neural Network Predictions via Additive
  Decomposition
On Attribution of Recurrent Neural Network Predictions via Additive Decomposition
Mengnan Du
Ninghao Liu
Fan Yang
Shuiwang Ji
Helen Zhou
FAtt
71
51
0
27 Mar 2019
ner and pos when nothing is capitalized
ner and pos when nothing is capitalized
Stephen D. Mayhew
Tatiana Tsygankova
Dan Roth
58
30
0
27 Mar 2019
Simple Applications of BERT for Ad Hoc Document Retrieval
Simple Applications of BERT for Ad Hoc Document Retrieval
Wei Yang
Haotian Zhang
Jimmy J. Lin
111
198
0
26 Mar 2019
Interoperability and machine-to-machine translation model with mappings
  to machine learning tasks
Interoperability and machine-to-machine translation model with mappings to machine learning tasks
Jacob Nilsson
Fredrik Sandin
J. Delsing
AI4CE
63
18
0
26 Mar 2019
SciBERT: A Pretrained Language Model for Scientific Text
SciBERT: A Pretrained Language Model for Scientific Text
Iz Beltagy
Kyle Lo
Arman Cohan
262
2,998
0
26 Mar 2019
On Measuring Social Biases in Sentence Encoders
On Measuring Social Biases in Sentence Encoders
Chandler May
Alex Jinpeng Wang
Shikha Bordia
Samuel R. Bowman
Rachel Rudinger
131
608
0
25 Mar 2019
Fine-tune BERT for Extractive Summarization
Fine-tune BERT for Extractive Summarization
Yang Liu
74
487
0
25 Mar 2019
Knowledge Aware Conversation Generation with Explainable Reasoning over
  Augmented Graphs
Knowledge Aware Conversation Generation with Explainable Reasoning over Augmented Graphs
Zhibin Liu
Zheng-Yu Niu
Hua Wu
Haifeng Wang
96
17
0
25 Mar 2019
Previous
123...465466467...469470471
Next