ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,780 papers shown
Title
Detecting Propaganda on the Sentence Level during the COVID-19 Pandemic
Detecting Propaganda on the Sentence Level during the COVID-19 Pandemic
Rong-Ching Chang
Chu-Hsing Lin
18
1
0
31 Jul 2021
CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale
  Attention
CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention
Wenxiao Wang
Lulian Yao
Long Chen
Binbin Lin
Deng Cai
Xiaofei He
Wei Liu
227
273
0
31 Jul 2021
On The State of Data In Computer Vision: Human Annotations Remain
  Indispensable for Developing Deep Learning Models
On The State of Data In Computer Vision: Human Annotations Remain Indispensable for Developing Deep Learning Models
Z. Emam
Andrew Kondrich
Sasha Harrison
Felix Lau
Yushi Wang
Aerin Kim
E. Branson
VLM
54
13
0
31 Jul 2021
Structural Guidance for Transformer Language Models
Structural Guidance for Transformer Language Models
Peng Qian
Tahira Naseem
R. Levy
Ramón Fernández Astudillo
121
31
0
30 Jul 2021
The History of Speech Recognition to the Year 2030
The History of Speech Recognition to the Year 2030
Awni Y. Hannun
AI4TS
123
21
0
30 Jul 2021
MTVR: Multilingual Moment Retrieval in Videos
MTVR: Multilingual Moment Retrieval in Videos
Jie Lei
Tamara L. Berg
Joey Tianyi Zhou
77
11
0
30 Jul 2021
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Andrew Jaegle
Sebastian Borgeaud
Jean-Baptiste Alayrac
Carl Doersch
Catalin Ionescu
...
Olivier J. Hénaff
M. Botvinick
Andrew Zisserman
Oriol Vinyals
João Carreira
MLLMVLMGNN
211
585
0
30 Jul 2021
Automatic Claim Review for Climate Science via Explanation Generation
Automatic Claim Review for Climate Science via Explanation Generation
Shraey Bhatia
Jey Han Lau
Timothy Baldwin
39
5
0
30 Jul 2021
EmailSum: Abstractive Email Thread Summarization
EmailSum: Abstractive Email Thread Summarization
Shiyue Zhang
Asli Celikyilmaz
Jianfeng Gao
Joey Tianyi Zhou
84
39
0
30 Jul 2021
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval
  via Cross-modal Pretraining
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining
Xunlin Zhan
Yangxin Wu
Xiao Dong
Yunchao Wei
Minlong Lu
Yichi Zhang
Hang Xu
Xiaodan Liang
ViT
97
67
0
30 Jul 2021
Talk2Data: A Natural Language Interface for Exploratory Visual Analysis
  via Question Decomposition
Talk2Data: A Natural Language Interface for Exploratory Visual Analysis via Question Decomposition
Yi Guo
Danqing Shi
Mingjuan Guo
Yanqiu Wu
Qing Chen
Nana Cao
77
11
0
30 Jul 2021
Self-Supervised Transformer for Sparse and Irregularly Sampled
  Multivariate Clinical Time-Series
Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series
Sindhu Tipirneni
Chandan K. Reddy
AI4TS
72
111
0
29 Jul 2021
Rethinking and Improving Relative Position Encoding for Vision
  Transformer
Rethinking and Improving Relative Position Encoding for Vision Transformer
Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
ViT
126
340
0
29 Jul 2021
A Unified Efficient Pyramid Transformer for Semantic Segmentation
A Unified Efficient Pyramid Transformer for Semantic Segmentation
Fangrui Zhu
Yi Zhu
Li Zhang
Chongruo Wu
Yanwei Fu
Mu Li
ViT
105
30
0
29 Jul 2021
On the combined effect of class imbalance and concept complexity in deep
  learning
On the combined effect of class imbalance and concept complexity in deep learning
Kushankur Ghosh
C. Bellinger
Roberto Corizzo
Bartosz Krawczyk
Nathalie Japkowicz
57
8
0
29 Jul 2021
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Xianrui Zheng
Chao Zhang
P. Woodland
34
49
0
29 Jul 2021
SeqScore: Addressing Barriers to Reproducible Named Entity Recognition
  Evaluation
SeqScore: Addressing Barriers to Reproducible Named Entity Recognition Evaluation
Chester Palen-Michel
Nolan Holley
Constantine Lignos
70
12
0
29 Jul 2021
PPT Fusion: Pyramid Patch Transformerfor a Case Study in Image Fusion
PPT Fusion: Pyramid Patch Transformerfor a Case Study in Image Fusion
Yu Fu
Tianyang Xu
Xiaojun Wu
J. Kittler
ViT
72
40
0
29 Jul 2021
Multimodal Co-learning: Challenges, Applications with Datasets, Recent
  Advances and Future Directions
Multimodal Co-learning: Challenges, Applications with Datasets, Recent Advances and Future Directions
Anil Rahate
Rahee Walambe
S. Ramanna
K. Kotecha
121
143
0
29 Jul 2021
Video Generation from Text Employing Latent Path Construction for
  Temporal Modeling
Video Generation from Text Employing Latent Path Construction for Temporal Modeling
Amir Mazaheri
M. Shah
75
8
0
29 Jul 2021
Term Expansion and FinBERT fine-tuning for Hypernym and Synonym Ranking
  of Financial Terms
Term Expansion and FinBERT fine-tuning for Hypernym and Synonym Ranking of Financial Terms
Ankush Chopra
Sohom Ghosh
39
7
0
29 Jul 2021
UIBert: Learning Generic Multimodal Representations for UI Understanding
UIBert: Learning Generic Multimodal Representations for UI Understanding
Chongyang Bai
Xiaoxue Zang
Ying Xu
Srinivas Sunkara
Abhinav Rastogi
Jindong Chen
Blaise Agüera y Arcas
95
95
0
29 Jul 2021
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient
  Pre-trained Language Models
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models
Yichun Yin
Cheng Chen
Lifeng Shang
Xin Jiang
Xiao Chen
Qun Liu
VLM
71
50
0
29 Jul 2021
Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal
  Sentiment Analysis
Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment Analysis
Wei Han
Hui Chen
Alexander Gelbukh
Amir Zadeh
Louis-Philippe Morency
Soujanya Poria
70
185
0
28 Jul 2021
Domain-matched Pre-training Tasks for Dense Retrieval
Domain-matched Pre-training Tasks for Dense Retrieval
Barlas Oğuz
Kushal Lakhotia
Anchit Gupta
Patrick Lewis
Vladimir Karpukhin
...
Xilun Chen
Sebastian Riedel
Wen-tau Yih
Sonal Gupta
Yashar Mehdad
RALM
87
67
0
28 Jul 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods
  in Natural Language Processing
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLMSyDa
449
4,053
0
28 Jul 2021
Towards Robustness Against Natural Language Word Substitutions
Towards Robustness Against Natural Language Word Substitutions
Xinshuai Dong
Anh Tuan Luu
Rongrong Ji
Hong Liu
SILMAAML
169
115
0
28 Jul 2021
Sentiment Analysis of the COVID-related r/Depression Posts
Sentiment Analysis of the COVID-related r/Depression Posts
Zihan Chen
Marina Sokolova
136
4
0
28 Jul 2021
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving
Zhenwen Liang
Jipeng Zhang
Lei Wang
Wei Qin
Yunshi Lan
Jie Shao
Xiangliang Zhang
AIMat
85
66
0
28 Jul 2021
Multi-Scale Feature and Metric Learning for Relation Extraction
Multi-Scale Feature and Metric Learning for Relation Extraction
Mi Zhang
T. Qian
68
0
0
28 Jul 2021
Predicting the Future from First Person (Egocentric) Vision: A Survey
Predicting the Future from First Person (Egocentric) Vision: A Survey
Ivan Rodin
Antonino Furnari
Dimitrios Mavroeidis
G. Farinella
EgoV
104
44
0
28 Jul 2021
XFL: Naming Functions in Binaries with Extreme Multi-label Learning
XFL: Naming Functions in Binaries with Extreme Multi-label Learning
James Patrick-Evans
Moritz Dannehl
Johannes Kinder
82
12
0
28 Jul 2021
Predicting Patch Correctness Based on the Similarity of Failing Test
  Cases
Predicting Patch Correctness Based on the Similarity of Failing Test Cases
Haoye Tian
Yinghua Li
Weiguo Pian
Abdoul Kader Kaboré
Kui Liu
Andrew Habib
Jacques Klein
Tegawende F. Bissyande
65
31
0
28 Jul 2021
Arabic aspect sentiment polarity classification using BERT
Arabic aspect sentiment polarity classification using BERT
Mohammed M. Abdelgwad
T. H. Soliman
A. Taloba
58
34
0
28 Jul 2021
Homogeneous Architecture Augmentation for Neural Predictor
Homogeneous Architecture Augmentation for Neural Predictor
Yuqiao Liu
Yehui Tang
Yizhou Sun
77
26
0
28 Jul 2021
Is Object Detection Necessary for Human-Object Interaction Recognition?
Is Object Detection Necessary for Human-Object Interaction Recognition?
Ying Jin
Yinpeng Chen
Lijuan Wang
Jianfeng Wang
Pei Yu
Zicheng Liu
Lei Li
80
7
0
27 Jul 2021
Exceeding the Limits of Visual-Linguistic Multi-Task Learning
Exceeding the Limits of Visual-Linguistic Multi-Task Learning
Cameron R. Wolfe
Keld T. Lundgaard
VLM
80
2
0
27 Jul 2021
A Case Study on Sampling Strategies for Evaluating Neural Sequential
  Item Recommendation Models
A Case Study on Sampling Strategies for Evaluating Neural Sequential Item Recommendation Models
Alexander Dallmann
Daniel Zoller
Andreas Hotho
71
58
0
27 Jul 2021
Dataset Distillation with Infinitely Wide Convolutional Networks
Dataset Distillation with Infinitely Wide Convolutional Networks
Timothy Nguyen
Roman Novak
Lechao Xiao
Jaehoon Lee
DD
120
237
0
27 Jul 2021
A Physiologically-Adapted Gold Standard for Arousal during Stress
A Physiologically-Adapted Gold Standard for Arousal during Stress
Alice Baird
Lukas Stappen
Lukas Christ
Lea Schumann
Eva-Maria Messner
Björn W. Schuller
31
3
0
27 Jul 2021
gaBERT -- an Irish Language Model
gaBERT -- an Irish Language Model
James Barry
Joachim Wagner
Lauren Cassidy
Alan Cowap
Teresa Lynn
Abigail Walsh
Mícheál J. Ó Meachair
Jennifer Foster
65
18
0
27 Jul 2021
Unsupervised Domain Adaptation for Hate Speech Detection Using a Data
  Augmentation Approach
Unsupervised Domain Adaptation for Hate Speech Detection Using a Data Augmentation Approach
Sheikh Muhammad Sarwar
Vanessa Murdock
96
22
0
27 Jul 2021
Cross-lingual Transferring of Pre-trained Contextualized Language Models
Cross-lingual Transferring of Pre-trained Contextualized Language Models
Zuchao Li
Kevin Parnow
Hai Zhao
Zhuosheng Zhang
Rui Wang
Masao Utiyama
Eiichiro Sumita
57
8
0
27 Jul 2021
Measuring daily-life fear perception change: a computational study in
  the context of COVID-19
Measuring daily-life fear perception change: a computational study in the context of COVID-19
Y. Chai
J. Palacios
Jianghao Wang
Yichun Fan
Siqi Zheng
28
4
0
27 Jul 2021
PiSLTRc: Position-informed Sign Language Transformer with Content-aware
  Convolution
PiSLTRc: Position-informed Sign Language Transformer with Content-aware Convolution
Pan Xie
Mengyi Zhao
Xiaohui Hu
ViTSLR
99
35
0
27 Jul 2021
Dual Slot Selector via Local Reliability Verification for Dialogue State
  Tracking
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking
Jinyu Guo
Kai Shuang
Jijie Li
Zihan Wang
80
18
0
27 Jul 2021
Language Grounding with 3D Objects
Language Grounding with 3D Objects
Jesse Thomason
Mohit Shridhar
Yonatan Bisk
Chris Paxton
Luke Zettlemoyer
LM&Ro
96
53
0
26 Jul 2021
From Implicit to Explicit feedback: A deep neural network for modeling
  sequential behaviours and long-short term preferences of online users
From Implicit to Explicit feedback: A deep neural network for modeling sequential behaviours and long-short term preferences of online users
Quyen Tran
Lam C. Tran
Linh Chu Hai
Ngo Van Linh
Khoat Than
HAI
39
13
0
26 Jul 2021
Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Yuren Cong
Wentong Liao
H. Ackermann
Bodo Rosenhahn
M. Yang
ViT
74
129
0
26 Jul 2021
Contextual Transformer Networks for Visual Recognition
Contextual Transformer Networks for Visual Recognition
Yehao Li
Ting Yao
Yingwei Pan
Tao Mei
ViT
108
495
0
26 Jul 2021
Previous
123...316317318...474475476
Next