ResearchTrend.AI

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
arXiv: 1810.04805

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 18,282 papers shown
BERT-based Ranking for Biomedical Entity Normalization
Zongcheng Ji
Qiang Wei
Hua Xu
OOD
MedIm
24
121
0
09 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
114
3,634
0
06 Aug 2019
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations
Aarne Talman
Antti Suni
H. Çelikkanat
Sofoklis Kakouros
Jörg Tiedemann
M. Vainio
30
31
0
06 Aug 2019
Triplet Based Embedding Distance and Similarity Learning for Text-independent Speaker Verification
Zongze Ren
Zhiyong Chen
Shugong Xu
13
8
0
06 Aug 2019
Theme-Aware Aesthetic Distribution Prediction With Full-Resolution Photographs
Gengyun Jia
Peipei Li
Ran He
27
12
0
04 Aug 2019
MSnet: A BERT-based Network for Gendered Pronoun Resolution
Zili Wang
21
4
0
01 Aug 2019
DuTongChuan: Context-aware Translation Model for Simultaneous Interpreting
Hao Xiong
Ruiqing Zhang
Chuanqiang Zhang
Zhongjun He
Hua Wu
Haifeng Wang
41
25
0
30 Jul 2019
Neural Mention Detection
Juntao Yu
Bernd Bohnet
Massimo Poesio
40
17
0
29 Jul 2019
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
S. Rothe
Shashi Narayan
Aliaksei Severyn
SILM
73
433
0
29 Jul 2019
VIANA: Visual Interactive Annotation of Argumentation
F. Sperrle
Rita Sevastjanova
Rebecca Kehlbeck
Mennatallah El-Assady
31
25
0
29 Jul 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
Cheng Zhang
Wei-Lun Chao
D. Xuan
23
50
0
28 Jul 2019
DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks
Simon Wiedemann
H. Kirchhoffer
Stefan Matlage
Paul Haase
Arturo Marbán
...
Ahmed Osman
D. Marpe
H. Schwarz
Thomas Wiegand
Wojciech Samek
54
93
0
27 Jul 2019
DLGNet: A Transformer-based Model for Dialogue Response Generation
O. Olabiyi
Erik T. Mueller
16
12
0
26 Jul 2019
Automatically Learning Construction Injury Precursors from Text
Henrietta Baker
Matthew R. Hallowell
A. Tixier
44
101
0
26 Jul 2019
Investigating Self-Attention Network for Chinese Word Segmentation
Leilei Gan
Yue Zhang
21
11
0
26 Jul 2019
Visual Interaction with Deep Learning Models through Collaborative Semantic Inference
Sebastian Gehrmann
Hendrik Strobelt
Robert Krüger
Hanspeter Pfister
Alexander M. Rush
HAI
21
57
0
24 Jul 2019
SpanBERT: Improving Pre-training by Representing and Predicting Spans
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
27
1,945
0
24 Jul 2019
Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation
KyungHyun Kim
Donghyun Kwak
Hanock Kwak
Young-Jin Park
Sangkwon Sim
Jae-Han Cho
Minkyu Kim
Jihun Kwon
Nako Sung
Jung-Woo Ha
13
19
0
24 Jul 2019
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology
Aditi Chaudhary
Elizabeth Salesky
G. Bhat
David R. Mortensen
J. Carbonell
Yulia Tsvetkov
26
4
0
23 Jul 2019
EmotionX-HSU: Adopting Pre-trained BERT for Emotion Classification
Li Luo
Yue Wang
39
26
0
23 Jul 2019
BEHRT: Transformer for Electronic Health Records
Yikuan Li
Shishir Rao
J. R. A. Solares
A. Hassaine
D. Canoy
Yajie Zhu
K. Rahimi
G. Salimi-Khorshidi
OOD
33
446
0
22 Jul 2019
Emotion Detection in Text: Focusing on Latent Representation
Armin Seyeditabari
N. Tabari
Shafie Gholizadeh
Wlodek Zadrozny
25
14
0
22 Jul 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
25
133
0
22 Jul 2019
ELI5: Long Form Question Answering
Angela Fan
Yacine Jernite
Ethan Perez
David Grangier
Jason Weston
Michael Auli
AI4MH
ELM
43
600
0
22 Jul 2019
GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification
Jie Zhou
Xu Han
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
21
198
0
22 Jul 2019
ER-AE: Differentially Private Text Generation for Authorship Anonymization
Haohan Bo
Steven H. H. Ding
Benjamin C. M. Fung
Farkhund Iqbal
DeLMO
39
38
0
20 Jul 2019
What is this Article about? Extreme Summarization with Topic-aware Convolutional Neural Networks
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
34
18
0
19 Jul 2019
Structure-Invariant Testing for Machine Translation
Pinjia He
Clara Meister
Z. Su
27
104
0
19 Jul 2019
WriterForcing: Generating more interesting story endings
Prakhar Gupta
Vinayshekhar Bannihatti Kumar
Mukul Bhutani
A. Black
31
18
0
18 Jul 2019
Joint Learning of Named Entity Recognition and Entity Linking
Pedro Henrique Martins
Zita Marinho
André F. T. Martins
64
93
0
18 Jul 2019
Deep Neural Models for Medical Concept Normalization in User-Generated Texts
Z. Miftahutdinov
E. Tutubalina
MedIm
16
44
0
18 Jul 2019
What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions?
Miryam de Lhoneux
Sara Stymne
Joakim Nivre
12
3
0
18 Jul 2019
Probing Neural Network Comprehension of Natural Language Arguments
Timothy Niven
Hung-Yu Kao
AAML
45
453
0
17 Jul 2019
Fake News Detection as Natural Language Inference
Kai-Chou Yang
Timothy Niven
Hung-Yu Kao
18
35
0
17 Jul 2019
DeepTrax: Embedding Graphs of Financial Transactions
C. Bayan Bruss
Anish Khazane
Jonathan Rider
R. Serpe
Antonia Gogoglou
Keegan E. Hines
AIFin
GNN
32
43
0
16 Jul 2019
Multi-modal Sentiment Analysis using Deep Canonical Correlation Analysis
Zhongkai Sun
P. Sarma
W. Sethares
E. Bucy
24
23
0
15 Jul 2019
Myers-Briggs Personality Classification and Personality-Specific Language Generation Using Pre-trained Language Models
Sedrick Scott Keh
Immensee Cheng
24
49
0
15 Jul 2019
A Novel User Representation Paradigm for Making Personalized Candidate Retrieval
Zheng Liu
Yu Xing
Jianxun Lian
Defu Lian
Ziyao Li
Xing Xie
32
3
0
15 Jul 2019
TWEETQA: A Social Media Focused Question Answering Dataset
Wenhan Xiong
Jiawei Wu
Hong Wang
Vivek Kulkarni
Mo Yu
Shiyu Chang
Xiaoxiao Guo
William Yang Wang
26
75
0
14 Jul 2019
Task Selection Policies for Multitask Learning
John Glover
Chris Hokamp
OffRL
29
7
0
14 Jul 2019
Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation
Marcin Junczys-Dowmunt
15
156
0
14 Jul 2019
The University of Edinburgh's Submissions to the WMT19 News Translation Task
Rachel Bawden
Nikolay Bogoychev
Ulrich Germann
Roman Grundkiewicz
Faheem Kirefu
Antonio Valerio Miceli Barone
Alexandra Birch
22
32
0
12 Jul 2019
R-Transformer: Recurrent Neural Network Enhanced Transformer
Z. Wang
Yao Ma
Zitao Liu
Jiliang Tang
ViT
21
105
0
12 Jul 2019
LakhNES: Improving multi-instrumental music generation with cross-domain pre-training
Chris Donahue
H. H. Mao
Yiting Li
G. Cottrell
Julian McAuley
41
117
0
10 Jul 2019
Sparse Networks from Scratch: Faster Training without Losing Performance
Tim Dettmers
Luke Zettlemoyer
20
334
0
10 Jul 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding
Kevin Clark
Minh-Thang Luong
Urvashi Khandelwal
Christopher D. Manning
Quoc V. Le
30
228
0
10 Jul 2019
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing
Jian Guo
He He
Tong He
Leonard Lausen
Mu Li
...
Hang Zhang
Zhi-Li Zhang
Zhongyue Zhang
Shuai Zheng
Yi Zhu
VLM
BDL
29
194
0
09 Jul 2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Wei-Ning Hsu
David Harwath
James R. Glass
SSL
18
32
0
09 Jul 2019
To Tune or Not To Tune? How About the Best of Both Worlds?
Ran A. Wang
Haibo Su
Chunye Wang
Kailin Ji
J. Ding
VLM
36
17
0
09 Jul 2019
Incorporating Query Term Independence Assumption for Efficient Retrieval and Ranking using Deep Neural Networks
Bhaskar Mitra
Corby Rosset
D. Hawking
Nick Craswell
Fernando Diaz
Emine Yilmaz
14
30
0
08 Jul 2019