ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 18,335 papers shown
Title
In Defense of Grid Features for Visual Question Answering
In Defense of Grid Features for Visual Question Answering
Huaizu Jiang
Ishan Misra
Marcus Rohrbach
Erik Learned-Miller
Xinlei Chen
OOD
ObjD
23
318
0
10 Jan 2020
Linking Social Media Posts to News with Siamese Transformers
Linking Social Media Posts to News with Siamese Transformers
Jacob Danovitch
24
2
0
10 Jan 2020
Theory In, Theory Out: The uses of social theory in machine learning for
  social science
Theory In, Theory Out: The uses of social theory in machine learning for social science
J. Radford
K. Joseph
16
44
0
09 Jan 2020
Multiplex Word Embeddings for Selectional Preference Acquisition
Multiplex Word Embeddings for Selectional Preference Acquisition
Hongming Zhang
Jiaxin Bai
Yan Song
Kun Xu
Changlong Yu
Yangqiu Song
Wilfred Ng
Dong Yu
24
17
0
09 Jan 2020
On Interpretability of Artificial Neural Networks: A Survey
On Interpretability of Artificial Neural Networks: A Survey
Fenglei Fan
Jinjun Xiong
Mengzhou Li
Ge Wang
AAML
AI4CE
45
301
0
08 Jan 2020
An Exploration of Embodied Visual Exploration
An Exploration of Embodied Visual Exploration
Santhosh Kumar Ramakrishnan
Dinesh Jayaraman
Kristen Grauman
LM&Ro
37
98
0
07 Jan 2020
Missing-Class-Robust Domain Adaptation by Unilateral Alignment for Fault
  Diagnosis
Missing-Class-Robust Domain Adaptation by Unilateral Alignment for Fault Diagnosis
Qin Wang
Gabriel Michau
Olga Fink
25
55
0
07 Jan 2020
Attention over Parameters for Dialogue Systems
Attention over Parameters for Dialogue Systems
Andrea Madotto
Zhaojiang Lin
Chien-Sheng Wu
Jamin Shin
Pascale Fung
35
20
0
07 Jan 2020
Language Models Are An Effective Patient Representation Learning
  Technique For Electronic Health Record Data
Language Models Are An Effective Patient Representation Learning Technique For Electronic Health Record Data
E. Steinberg
Kenneth Jung
Jason Alan Fries
Conor K. Corbin
Stephen R. Pfohl
N. Shah
29
103
0
06 Jan 2020
Social Media Attributions in the Context of Water Crisis
Social Media Attributions in the Context of Water Crisis
Rupak Sarkar
Hirak Sarkar
Sayantan Mahinder
Ashiqur R. KhudaBukhsh
24
10
0
06 Jan 2020
Exploring Benefits of Transfer Learning in Neural Machine Translation
Exploring Benefits of Transfer Learning in Neural Machine Translation
Tom Kocmi
35
17
0
06 Jan 2020
A Survey on Machine Reading Comprehension Systems
A Survey on Machine Reading Comprehension Systems
Razieh Baradaran
Razieh Ghiasi
Hossein Amirkhani
FaML
18
85
0
06 Jan 2020
Stance Detection Benchmark: How Robust Is Your Stance Detection?
Stance Detection Benchmark: How Robust Is Your Stance Detection?
Benjamin Schiller
Johannes Daxenberger
Iryna Gurevych
21
96
0
06 Jan 2020
Computationally Efficient NER Taggers with Combined Embeddings and Constrained Decoding
Brian Lester
Daniel Pressel
Amy Hemmeter
Sagnik Ray Choudhury
26
3
0
05 Jan 2020
Empirical Studies on the Properties of Linear Regions in Deep Neural
  Networks
Empirical Studies on the Properties of Linear Regions in Deep Neural Networks
Xiao Zhang
Dongrui Wu
21
38
0
04 Jan 2020
Adapting Deep Learning for Sentiment Classification of Code-Switched
  Informal Short Text
Adapting Deep Learning for Sentiment Classification of Code-Switched Informal Short Text
M. Shakeel
Asim Karim
20
10
0
04 Jan 2020
Two-Level Transformer and Auxiliary Coherence Modeling for Improved Text
  Segmentation
Two-Level Transformer and Auxiliary Coherence Modeling for Improved Text Segmentation
Goran Glavaš
Swapna Somasundaran
VLM
30
56
0
03 Jan 2020
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling
  and Denoising
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising
Ziyi Yang
Chenguang Zhu
R. Gmyr
Michael Zeng
Xuedong Huang
Eric Darve
23
61
0
03 Jan 2020
A Deep Learning Approach to Diagnosing Multiple Sclerosis from
  Smartphone Data
A Deep Learning Approach to Diagnosing Multiple Sclerosis from Smartphone Data
Patrick Schwab
W. Karlen
41
24
0
02 Jan 2020
Stacked DeBERT: All Attention in Incomplete Data for Text Classification
Stacked DeBERT: All Attention in Incomplete Data for Text Classification
Gwenaelle Cunha Sergio
Minho Lee
27
30
0
01 Jan 2020
Deep Attentive Ranking Networks for Learning to Order Sentences
Deep Attentive Ranking Networks for Learning to Order Sentences
Pawan Kumar
Dhanajit Brahma
H. Karnick
Piyush Rai
21
45
0
31 Dec 2019
LayoutLM: Pre-training of Text and Layout for Document Image
  Understanding
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Yiheng Xu
Minghao Li
Lei Cui
Shaohan Huang
Furu Wei
Ming Zhou
71
686
0
31 Dec 2019
oLMpics -- On what Language Model Pre-training Captures
oLMpics -- On what Language Model Pre-training Captures
Alon Talmor
Yanai Elazar
Yoav Goldberg
Jonathan Berant
LRM
34
300
0
31 Dec 2019
AraNet: A Deep Learning Toolkit for Arabic Social Media
AraNet: A Deep Learning Toolkit for Arabic Social Media
Muhammad Abdul-Mageed
Chiyu Zhang
A. Hashemi
El Moatez Billah Nagoudi
GNN
27
32
0
30 Dec 2019
Semi-Supervised Learning with Normalizing Flows
Semi-Supervised Learning with Normalizing Flows
Pavel Izmailov
Polina Kirichenko
Marc Finzi
A. Wilson
DRL
BDL
40
111
0
30 Dec 2019
AutoDiscern: Rating the Quality of Online Health Information with
  Hierarchical Encoder Attention-based Neural Networks
AutoDiscern: Rating the Quality of Online Health Information with Hierarchical Encoder Attention-based Neural Networks
Laura Kinkead
Ahmed Allam
Michael Krauthammer
25
19
0
30 Dec 2019
Machine Learning from a Continuous Viewpoint
Machine Learning from a Continuous Viewpoint
E. Weinan
Chao Ma
Lei Wu
33
102
0
30 Dec 2019
ORB: An Open Reading Benchmark for Comprehensive Evaluation of Machine
  Reading Comprehension
ORB: An Open Reading Benchmark for Comprehensive Evaluation of Machine Reading Comprehension
Dheeru Dua
Ananth Gottumukkala
Alon Talmor
Sameer Singh
Matt Gardner
23
10
0
29 Dec 2019
Towards Deep Federated Defenses Against Malware in Cloud Ecosystems
Towards Deep Federated Defenses Against Malware in Cloud Ecosystems
Josh Payne
A. Kundu
FedML
26
10
0
27 Dec 2019
Encoding word order in complex embeddings
Encoding word order in complex embeddings
Benyou Wang
Donghao Zhao
Christina Lioma
Qiuchi Li
Peng Zhang
J. Simonsen
19
111
0
27 Dec 2019
Text Classification for Azerbaijani Language Using Machine Learning and
  Embedding
Text Classification for Azerbaijani Language Using Machine Learning and Embedding
U. Suleymanov
Behnam Kiani Kalejahi
Elkhan Amrahov
Rashid Badirkhanli
8
9
0
26 Dec 2019
Explicit Sparse Transformer: Concentrated Attention Through Explicit
  Selection
Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection
Guangxiang Zhao
Junyang Lin
Zhiyuan Zhang
Xuancheng Ren
Qi Su
Xu Sun
22
108
0
25 Dec 2019
Multi-Graph Transformer for Free-Hand Sketch Recognition
Multi-Graph Transformer for Free-Hand Sketch Recognition
Peng Xu
Chaitanya K. Joshi
Xavier Bresson
ViT
27
85
0
24 Dec 2019
Probing the phonetic and phonological knowledge of tones in Mandarin TTS
  models
Probing the phonetic and phonological knowledge of tones in Mandarin TTS models
Jian Zhu
26
8
0
23 Dec 2019
A Multimodal Target-Source Classifier with Attention Branches to
  Understand Ambiguous Instructions for Fetching Daily Objects
A Multimodal Target-Source Classifier with Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects
A. Magassouba
K. Sugiura
Hisashi Kawai
38
9
0
23 Dec 2019
Harnessing Evolution of Multi-Turn Conversations for Effective Answer
  Retrieval
Harnessing Evolution of Multi-Turn Conversations for Effective Answer Retrieval
Mohammad Aliannejadi
Manajit Chakraborty
E. A. Ríssola
Fabio Crestani
33
48
0
22 Dec 2019
Learning to Impute: A General Framework for Semi-supervised Learning
Learning to Impute: A General Framework for Semi-supervised Learning
Wei-Hong Li
Chuan-Sheng Foo
Hakan Bilen
SSL
24
9
0
22 Dec 2019
Are Transformers universal approximators of sequence-to-sequence
  functions?
Are Transformers universal approximators of sequence-to-sequence functions?
Chulhee Yun
Srinadh Bhojanapalli
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
28
336
0
20 Dec 2019
End-to-end Named Entity Recognition and Relation Extraction using
  Pre-trained Language Models
End-to-end Named Entity Recognition and Relation Extraction using Pre-trained Language Models
John Giorgi
Xindi Wang
Nicola Sahar
W. Shin
Gary D. Bader
Bo Wang
27
36
0
20 Dec 2019
SberQuAD -- Russian Reading Comprehension Dataset: Description and
  Analysis
SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis
Pavel Efimov
Andrey Chertok
Leonid Boytsov
Pavel Braslavski
75
59
0
20 Dec 2019
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language
  Model
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model
Wenhan Xiong
Jingfei Du
William Yang Wang
Veselin Stoyanov
SSL
KELM
52
201
0
20 Dec 2019
BERTje: A Dutch BERT Model
BERTje: A Dutch BERT Model
Wietse de Vries
Andreas van Cranenburgh
Arianna Bisazza
Tommaso Caselli
Gertjan van Noord
Malvina Nissim
VLM
SSeg
28
291
0
19 Dec 2019
Neural Simile Recognition with Cyclic Multitask Learning and Local
  Attention
Neural Simile Recognition with Cyclic Multitask Learning and Local Attention
Jiali Zeng
Linfeng Song
Jinsong Su
Jun Xie
Wei Song
Jiebo Luo
13
23
0
19 Dec 2019
Fashion Outfit Complementary Item Retrieval
Fashion Outfit Complementary Item Retrieval
Yen-Liang Lin
Son N. Tran
Larry S. Davis
27
84
0
19 Dec 2019
Optimization for deep learning: theory and algorithms
Optimization for deep learning: theory and algorithms
Ruoyu Sun
ODL
38
168
0
19 Dec 2019
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive
  Summarization
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
87
2,019
0
18 Dec 2019
MedCAT -- Medical Concept Annotation Tool
MedCAT -- Medical Concept Annotation Tool
Z. Kraljevic
D. Bean
Aurelie Mascio
Lukasz Roguski
A. Folarin
A. Roberts
R. Bendayan
Richard J. B. Dobson
MedIm
26
29
0
18 Dec 2019
Transfer learning in hybrid classical-quantum neural networks
Transfer learning in hybrid classical-quantum neural networks
A. Mari
T. Bromley
J. Izaac
Maria Schuld
N. Killoran
27
282
0
17 Dec 2019
Meshed-Memory Transformer for Image Captioning
Meshed-Memory Transformer for Image Captioning
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
16
869
0
17 Dec 2019
A Multi-task Learning Model for Chinese-oriented Aspect Polarity
  Classification and Aspect Term Extraction
A Multi-task Learning Model for Chinese-oriented Aspect Polarity Classification and Aspect Term Extraction
Heng Yang
Biqing Zeng
Jianhao Yang
Youwei Song
Ruyang Xu
40
133
0
17 Dec 2019
Previous
123...349350351...365366367
Next