ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,754 papers shown
Title
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0
L. Gris
Edresson Casanova
F. S. Oliveira
A. S. Soares
A. Júnior
46
17
0
23 Jul 2021
Powering Effective Climate Communication with a Climate Knowledge Base
Powering Effective Climate Communication with a Climate Knowledge Base
Kameron B. Rodrigues
Shweta Khushu
Mukut Mukherjee
Andrew Banister
Anthony Hevia
Sampath Duddu
Nikita Bhutani
41
0
0
23 Jul 2021
A Differentiable Language Model Adversarial Attack on Text Classifiers
A Differentiable Language Model Adversarial Attack on Text Classifiers
I. Fursov
Alexey Zaytsev
Pavel Burnyshev
Ekaterina Dmitrieva
Nikita Klyuchnikov
A. Kravchenko
Ekaterina Artemova
Evgeny Burnaev
SILM
77
15
0
23 Jul 2021
Modeling Bilingual Conversational Characteristics for Neural Chat
  Translation
Modeling Bilingual Conversational Characteristics for Neural Chat Translation
Yunlong Liang
Fandong Meng
Jinan Xu
Jinan Xu
Jie Zhou
65
28
0
23 Jul 2021
Improving Early Sepsis Prediction with Multi Modal Learning
Improving Early Sepsis Prediction with Multi Modal Learning
Fred Qin
V. Madan
Ujjwal Ratan
Zohar Karnin
Vishaal Kapoor
Parminder Bhatia
Taha A. Kass-Hout
74
5
0
23 Jul 2021
Emotion analysis and detection during COVID-19
Emotion analysis and detection during COVID-19
Tiberiu Sosea
Chau Minh Pham
Alexander Tekle
Cornelia Caragea
Junjie Li
70
14
0
23 Jul 2021
VisDA-2021 Competition Universal Domain Adaptation to Improve
  Performance on Out-of-Distribution Data
VisDA-2021 Competition Universal Domain Adaptation to Improve Performance on Out-of-Distribution Data
D. Bashkirova
Dan Hendrycks
Donghyun Kim
Samarth Mishra
Kate Saenko
Kuniaki Saito
Piotr Teterwak
Ben Usman
OOD
73
21
0
23 Jul 2021
Graph-Based Learning for Stock Movement Prediction with Textual and
  Relational Data
Graph-Based Learning for Stock Movement Prediction with Textual and Relational Data
Qinkai Chen
C. Robert
AIFin
92
25
0
22 Jul 2021
DeepTitle -- Leveraging BERT to generate Search Engine Optimized
  Headlines
DeepTitle -- Leveraging BERT to generate Search Engine Optimized Headlines
Cristian Anastasiu
Hanna Behnke
Sarah Lück
Viktor Malesevic
Aamna Najmi
Javier Poveda-Panter
101
3
0
22 Jul 2021
Did the Cat Drink the Coffee? Challenging Transformers with Generalized
  Event Knowledge
Did the Cat Drink the Coffee? Challenging Transformers with Generalized Event Knowledge
Paolo Pedinotti
Giulia Rambelli
Emmanuele Chersoni
Enrico Santus
Alessandro Lenci
P. Blache
55
27
0
22 Jul 2021
On the Certified Robustness for Ensemble Models and Beyond
On the Certified Robustness for Ensemble Models and Beyond
Zhuolin Yang
Linyi Li
Xiaojun Xu
B. Kailkhura
Tao Xie
Yue Liu
AAML
106
50
0
22 Jul 2021
Query2Label: A Simple Transformer Way to Multi-Label Classification
Query2Label: A Simple Transformer Way to Multi-Label Classification
Shilong Liu
Lei Zhang
Xiao Yang
Hang Su
Jun Zhu
75
193
0
22 Jul 2021
To Ship or Not to Ship: An Extensive Evaluation of Automatic Metrics for
  Machine Translation
To Ship or Not to Ship: An Extensive Evaluation of Automatic Metrics for Machine Translation
Tom Kocmi
C. Federmann
Roman Grundkiewicz
Marcin Junczys-Dowmunt
Hitokazu Matsushita
Arul Menezes
99
210
0
22 Jul 2021
Evaluation of contextual embeddings on less-resourced languages
Evaluation of contextual embeddings on less-resourced languages
Matej Ulvcar
Alevs vZagar
C. S. Armendariz
Andravz Repar
Senja Pollak
Matthew Purver
Marko Robnik-vSikonja
68
11
0
22 Jul 2021
Target-Oriented Fine-tuning for Zero-Resource Named Entity Recognition
Target-Oriented Fine-tuning for Zero-Resource Named Entity Recognition
Ying Zhang
Fandong Meng
Jinan Xu
Jinan Xu
Jie Zhou
84
10
0
22 Jul 2021
Tsformer: Time series Transformer for tourism demand forecasting
Tsformer: Time series Transformer for tourism demand forecasting
Siyuan Yi
Xing Chen
Chuanming Tang
AI4TS
31
2
0
22 Jul 2021
Back-Translated Task Adaptive Pretraining: Improving Accuracy and
  Robustness on Text Classification
Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification
Junghoon Lee
Jounghee Kim
Pilsung Kang
VLM
74
5
0
22 Jul 2021
Spinning Sequence-to-Sequence Models with Meta-Backdoors
Eugene Bagdasaryan
Vitaly Shmatikov
SILMAAML
93
8
0
22 Jul 2021
Theoretical foundations and limits of word embeddings: what types of
  meaning can they capture?
Theoretical foundations and limits of word embeddings: what types of meaning can they capture?
Alina Arseniev-Koehler
70
21
0
22 Jul 2021
Multi-Stream Transformers
Multi-Stream Transformers
Andrey Kravchenko
Anna Rumshisky
AI4CE
29
0
0
21 Jul 2021
Small-Text: Active Learning for Text Classification in Python
Small-Text: Active Learning for Text Classification in Python
Christopher Schröder
Lydia Muller
A. Niekler
Martin Potthast
CLIPVLMAI4CE
128
28
0
21 Jul 2021
A Review of Some Techniques for Inclusion of Domain-Knowledge into Deep
  Neural Networks
A Review of Some Techniques for Inclusion of Domain-Knowledge into Deep Neural Networks
T. Dash
Sharad Chitlangia
Aditya Ahuja
A. Srinivasan
122
133
0
21 Jul 2021
CycleMLP: A MLP-like Architecture for Dense Prediction
CycleMLP: A MLP-like Architecture for Dense Prediction
Shoufa Chen
Enze Xie
Chongjian Ge
Runjian Chen
Ding Liang
Ping Luo
151
235
0
21 Jul 2021
Generative Models for Security: Attacks, Defenses, and Opportunities
Generative Models for Security: Attacks, Defenses, and Opportunities
L. A. Bauer
Vincent Bindschaedler
114
4
0
21 Jul 2021
Improved Text Classification via Contrastive Adversarial Training
Improved Text Classification via Contrastive Adversarial Training
Lin Pan
Chung-Wei Hang
Avirup Sil
Saloni Potdar
AAML
67
92
0
21 Jul 2021
CATE: CAusality Tree Extractor from Natural Language Requirements
CATE: CAusality Tree Extractor from Natural Language Requirements
Noah Jadallah
Jannik Fischbach
Julian Frattini
Andreas Vogelsang
44
4
0
21 Jul 2021
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with
  Minimal Supervision
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with Minimal Supervision
Zhongyang Li
Xiao Ding
Kuo Liao
Bing Qin
Ting Liu
CML
107
19
0
21 Jul 2021
Interactive Storytelling for Children: A Case-study of Design and
  Development Considerations for Ethical Conversational AI
Interactive Storytelling for Children: A Case-study of Design and Development Considerations for Ethical Conversational AI
J. Chubb
S. Missaoui
S. Concannon
Liam Maloney
James Alfred Walker
58
35
0
20 Jul 2021
BoningKnife: Joint Entity Mention Detection and Typing for Nested NER
  via prior Boundary Knowledge
BoningKnife: Joint Entity Mention Detection and Typing for Nested NER via prior Boundary Knowledge
Huiqiang Jiang
Guoxin Wang
Weile Chen
Chengxi Zhang
Börje F. Karlsson
35
5
0
20 Jul 2021
Follow Your Path: a Progressive Method for Knowledge Distillation
Follow Your Path: a Progressive Method for Knowledge Distillation
Wenxian Shi
Yuxuan Song
Hao Zhou
Bohan Li
Lei Li
60
15
0
20 Jul 2021
Sequence Model with Self-Adaptive Sliding Window for Efficient Spoken
  Document Segmentation
Sequence Model with Self-Adaptive Sliding Window for Efficient Spoken Document Segmentation
Qinglin Zhang
Qian Chen
Yali Li
Jiaqing Liu
Wen Wang
165
16
0
20 Jul 2021
Generative Video Transformer: Can Objects be the Words?
Generative Video Transformer: Can Objects be the Words?
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
ViT
114
34
0
20 Jul 2021
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Qiushi Huang
Tom Ko
Lilian H. Y. Tang
Xubo Liu
Boyong Wu
70
23
0
19 Jul 2021
Just Train Twice: Improving Group Robustness without Training Group
  Information
Just Train Twice: Improving Group Robustness without Training Group Information
Emmy Liu
Behzad Haghgoo
Annie S. Chen
Aditi Raghunathan
Pang Wei Koh
Shiori Sagawa
Percy Liang
Chelsea Finn
OOD
137
563
0
19 Jul 2021
Image Fusion Transformer
Image Fusion Transformer
VS Vibashan
Jeya Maria Jose Valanarasu
Poojan Oza
Vishal M. Patel
ViT
87
123
0
19 Jul 2021
OODformer: Out-Of-Distribution Detection Transformer
OODformer: Out-Of-Distribution Detection Transformer
Rajat Koner
Poulami Sinhamahapatra
Karsten Roscher
Stephan Günnemann
Volker Tresp
ViT
64
40
0
19 Jul 2021
Clinical Relation Extraction Using Transformer-based Models
Clinical Relation Extraction Using Transformer-based Models
Xi Yang
Zehao Yu
Yi Guo
Jiang Bian
Yonghui Wu
LM&MAMedIm
65
20
0
19 Jul 2021
Epistemic Neural Networks
Epistemic Neural Networks
Ian Osband
Zheng Wen
M. Asghari
Vikranth Dwaracherla
M. Ibrahimi
Xiyuan Lu
Benjamin Van Roy
UQCVBDL
142
109
0
19 Jul 2021
Adaptive Transfer Learning on Graph Neural Networks
Adaptive Transfer Learning on Graph Neural Networks
Xueting Han
Zhenhuan Huang
Bang An
Jing Bai
120
57
0
19 Jul 2021
Stock Movement Prediction with Financial News using Contextualized
  Embedding from BERT
Stock Movement Prediction with Financial News using Contextualized Embedding from BERT
Qinkai Chen
AIFin
52
19
0
19 Jul 2021
Structural Watermarking to Deep Neural Networks via Network Channel
  Pruning
Structural Watermarking to Deep Neural Networks via Network Channel Pruning
Xiangyu Zhao
Yinzhe Yao
Hanzhou Wu
Xinpeng Zhang
AAML
128
25
0
19 Jul 2021
Constructing Multi-Modal Dialogue Dataset by Replacing Text with
  Semantically Relevant Images
Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images
Nyoungwoo Lee
Suwon Shin
Jaegul Choo
Ho‐Jin Choi
S. Myaeng
62
27
0
19 Jul 2021
Video Crowd Localization with Multi-focus Gaussian Neighborhood
  Attention and a Large-Scale Benchmark
Video Crowd Localization with Multi-focus Gaussian Neighborhood Attention and a Large-Scale Benchmark
Haopeng Li
Lingbo Liu
Kunlin Yang
Shinan Liu
Junyuan Gao
Bin Zhao
Rui Zhang
Jun Hou
150
16
0
19 Jul 2021
CHEF: A Cheap and Fast Pipeline for Iteratively Cleaning Label
  Uncertainties (Technical Report)
CHEF: A Cheap and Fast Pipeline for Iteratively Cleaning Label Uncertainties (Technical Report)
Yinjun Wu
James Weimer
S. Davidson
72
4
0
19 Jul 2021
Bridging the Gap between Language Model and Reading Comprehension:
  Unsupervised MRC via Self-Supervision
Bridging the Gap between Language Model and Reading Comprehension: Unsupervised MRC via Self-Supervision
Ning Bian
Xianpei Han
Bo Chen
Hongyu Lin
Xianpei Han
Le Sun
SSLLRM
96
5
0
19 Jul 2021
Argument Linking: A Survey and Forecast
Argument Linking: A Survey and Forecast
William Gantt
69
3
0
18 Jul 2021
Stock price prediction using BERT and GAN
Stock price prediction using BERT and GAN
Priyank Sonkiya
Vikas Bajpai
Anukriti Bansal
AIFin
63
39
0
18 Jul 2021
Pre-trained Language Models as Prior Knowledge for Playing Text-based
  Games
Pre-trained Language Models as Prior Knowledge for Playing Text-based Games
Ishika Singh
Gargi Singh
Ashutosh Modi
OffRLAI4CE
105
29
0
18 Jul 2021
AS-MLP: An Axial Shifted MLP Architecture for Vision
AS-MLP: An Axial Shifted MLP Architecture for Vision
Dongze Lian
Zehao Yu
Xing Sun
Shenghua Gao
133
192
0
18 Jul 2021
A Survey on Data-driven Software Vulnerability Assessment and
  Prioritization
A Survey on Data-driven Software Vulnerability Assessment and Prioritization
T. H. Le
Huaming Chen
Muhammad Ali Babar
104
86
0
18 Jul 2021
Previous
123...317318319...474475476
Next