ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,694 papers shown
Title
Interactive query expansion for professional search applications
Interactive query expansion for professional search applications
Tony Russell-Rose
Phil Gooch
Udo Kruschwitz
28
7
0
25 Jun 2021
Probing Inter-modality: Visual Parsing with Self-Attention for
  Vision-Language Pre-training
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
Hongwei Xue
Yupan Huang
Bei Liu
Houwen Peng
Jianlong Fu
Houqiang Li
Jiebo Luo
101
89
0
25 Jun 2021
A Picture May Be Worth a Hundred Words for Visual Question Answering
A Picture May Be Worth a Hundred Words for Visual Question Answering
Yusuke Hirota
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
Ittetsu Taniguchi
Takao Onoye
ViT
35
4
0
25 Jun 2021
JNLP Team: Deep Learning Approaches for Legal Processing Tasks in COLIEE
  2021
JNLP Team: Deep Learning Approaches for Legal Processing Tasks in COLIEE 2021
Nguyen Ha Thanh
Phuong Minh Nguyen
Thi-Hai-Yen Vuong
Quan Minh Bui
Chau Nguyen
Binh Dang
Vu Tran
Minh Le Nguyen
Ken Satoh
AILaw
53
16
0
25 Jun 2021
ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text
  Processing
ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing
Nguyen Ha Thanh
Vu Tran
Phuong Minh Nguyen
Thi-Hai-Yen Vuong
Quan Minh Bui
Chau Nguyen
Binh Dang
Minh Le Nguyen
Kenji Satoh
AILaw
52
10
0
25 Jun 2021
To the Point: Efficient 3D Object Detection in the Range Image with
  Graph Convolution Kernels
To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels
Yuning Chai
Pei Sun
Jiquan Ngiam
Weiyue Wang
Benjamin Caine
Vijay Vasudevan
Xiao Zhang
Drago Anguelov
3DPC
116
69
0
25 Jun 2021
Domain-Specific Pretraining for Vertical Search: Case Study on
  Biomedical Literature
Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature
Yu Wang
Jinchao Li
Tristan Naumann
Chenyan Xiong
Hao Cheng
...
Yang Qin
Eric Horvitz
Paul N. Bennett
Jianfeng Gao
Hoifung Poon
OOD
85
14
0
25 Jun 2021
VOGUE: Answer Verbalization through Multi-Task Learning
VOGUE: Answer Verbalization through Multi-Task Learning
Endri Kacupaj
Shyamnath Premnadh
Kuldeep Singh
Jens Lehmann
M. Maleshkova
62
7
0
24 Jun 2021
Multitask Learning for Citation Purpose Classification
Multitask Learning for Citation Purpose Classification
Alexander X. Oesterling
Angikar Ghosal
Haoyang Yu
Rui Xin
Yasa Baig
Lesia Semenova
Cynthia Rudin
31
7
0
24 Jun 2021
Towards Understanding and Mitigating Social Biases in Language Models
Towards Understanding and Mitigating Social Biases in Language Models
Paul Pu Liang
Chiyu Wu
Louis-Philippe Morency
Ruslan Salakhutdinov
118
399
0
24 Jun 2021
Learning Language and Multimodal Privacy-Preserving Markers of Mood from
  Mobile Data
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data
Paul Pu Liang
Terrance Liu
Anna Cai
Michal Muszynski
Ryo Ishii
Nicholas B. Allen
Randy P. Auerbach
David Brent
Ruslan Salakhutdinov
Louis-Philippe Morency
89
18
0
24 Jun 2021
FitVid: Overfitting in Pixel-Level Video Prediction
FitVid: Overfitting in Pixel-Level Video Prediction
Mohammad Babaeizadeh
M. Saffar
Suraj Nair
Sergey Levine
Chelsea Finn
D. Erhan
VLM
115
84
0
24 Jun 2021
VOLO: Vision Outlooker for Visual Recognition
VOLO: Vision Outlooker for Visual Recognition
Li-xin Yuan
Qibin Hou
Zihang Jiang
Jiashi Feng
Shuicheng Yan
ViT
141
328
0
24 Jun 2021
A Transformer-based Cross-modal Fusion Model with Adversarial Training
  for VQA Challenge 2021
A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021
Keda Lu
Bo Fang
Kuan-Yu Chen
ViT
47
2
0
24 Jun 2021
Autoformer: Decomposition Transformers with Auto-Correlation for
  Long-Term Series Forecasting
Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting
Haixu Wu
Jiehui Xu
Jianmin Wang
Mingsheng Long
AI4TS
146
2,384
0
24 Jun 2021
AIT-QA: Question Answering Dataset over Complex Tables in the Airline
  Industry
AIT-QA: Question Answering Dataset over Complex Tables in the Airline Industry
Yannis Katsis
Saneem A. Chemmengath
Vishwajeet Kumar
Samarth Bharadwaj
Mustafa Canim
...
A. Gliozzo
FeiFei Pan
Jaydeep Sen
Karthik Sankaranarayanan
Soumen Chakrabarti
LMTDRALM
81
38
0
24 Jun 2021
MatchVIE: Exploiting Match Relevancy between Entities for Visual
  Information Extraction
MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Guozhi Tang
Lele Xie
Lianwen Jin
Jiapeng Wang
Jingdong Chen
Zhen Xu
Qianying Wang
Yaqiang Wu
Hui Li
103
29
0
24 Jun 2021
Accelerating variational quantum algorithms with multiple quantum
  processors
Accelerating variational quantum algorithms with multiple quantum processors
Yuxuan Du
Yan Qian
Dacheng Tao
55
8
0
24 Jun 2021
Modeling Diagnostic Label Correlation for Automatic ICD Coding
Modeling Diagnostic Label Correlation for Automatic ICD Coding
Shang-Chi Tsai
Chaorui Huang
Yun-Nung Chen
65
16
0
24 Jun 2021
Where is the disease? Semi-supervised pseudo-normality synthesis from an
  abnormal image
Where is the disease? Semi-supervised pseudo-normality synthesis from an abnormal image
Yuanqi Du
Quan Quan
Hu Han
S. Kevin Zhou
GANMedIm
52
3
0
24 Jun 2021
Label Disentanglement in Partition-based Extreme Multilabel
  Classification
Label Disentanglement in Partition-based Extreme Multilabel Classification
Xuanqing Liu
Wei-Cheng Chang
Hsiang-Fu Yu
Cho-Jui Hsieh
Inderjit S. Dhillon
63
11
0
24 Jun 2021
An Automated Knowledge Mining and Document Classification System with
  Multi-model Transfer Learning
An Automated Knowledge Mining and Document Classification System with Multi-model Transfer Learning
J. Chong
Zhiyuan Chen
Mei Shin Oh
28
2
0
24 Jun 2021
Discovering novel drug-supplement interactions using a dietary
  supplements knowledge graph generated from the biomedical literature
Discovering novel drug-supplement interactions using a dietary supplements knowledge graph generated from the biomedical literature
Dalton Schutte
J. Vasilakes
A. Bompelli
Yuqi Zhou
M. Fiszman
Hua Xu
H. Kilicoglu
J. Bishop
T. Adam
Rui Zhang
19
2
0
24 Jun 2021
An Efficient Group-based Search Engine Marketing System for E-Commerce
An Efficient Group-based Search Engine Marketing System for E-Commerce
Cheng Jie
Da Xu
Zigeng Wang
Lu Wang
Wei Shen
61
1
0
24 Jun 2021
Fairness via Representation Neutralization
Fairness via Representation Neutralization
Mengnan Du
Subhabrata Mukherjee
Guanchu Wang
Ruixiang Tang
Ahmed Hassan Awadallah
Helen Zhou
90
81
0
23 Jun 2021
Charformer: Fast Character Transformers via Gradient-based Subword
  Tokenization
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization
Yi Tay
Vinh Q. Tran
Sebastian Ruder
Jai Gupta
Hyung Won Chung
Dara Bahri
Zhen Qin
Simon Baumgartner
Cong Yu
Donald Metzler
175
162
0
23 Jun 2021
Extreme Multi-label Learning for Semantic Matching in Product Search
Extreme Multi-label Learning for Semantic Matching in Product Search
Wei-Cheng Chang
Daniel Jiang
Hsiang-Fu Yu
C. Teo
Jiong Zhang
...
Qie Hu
Nikhil Shandilya
Vyacheslav Ievgrafov
Japinder Singh
Inderjit S. Dhillon
90
60
0
23 Jun 2021
Clinical Named Entity Recognition using Contextualized Token
  Representations
Clinical Named Entity Recognition using Contextualized Token Representations
Yichao Zhou
C. Ju
J. H. Caufield
Kevin J. Shih
Calvin Yu‐Chian Chen
Yizhou Sun
Kai-Wei Chang
Peipei Ping
Wei Wang
115
11
0
23 Jun 2021
Stable, Fast and Accurate: Kernelized Attention with Relative Positional
  Encoding
Stable, Fast and Accurate: Kernelized Attention with Relative Positional Encoding
Shengjie Luo
Shanda Li
Tianle Cai
Di He
Dinglan Peng
Shuxin Zheng
Guolin Ke
Liwei Wang
Tie-Yan Liu
97
50
0
23 Jun 2021
BERT-based Multi-Task Model for Country and Province Level Modern
  Standard Arabic and Dialectal Arabic Identification
BERT-based Multi-Task Model for Country and Province Level Modern Standard Arabic and Dialectal Arabic Identification
Abdellah El Mekki
Abdelkader El Mahdaouy
Kabil Essefar
Nabil El Mamoun
Ismail Berrada
A. Khoumsi
46
18
0
23 Jun 2021
Deep Multi-Task Model for Sarcasm Detection and Sentiment Analysis in
  Arabic Language
Deep Multi-Task Model for Sarcasm Detection and Sentiment Analysis in Arabic Language
Abdelkader El Mahdaouy
Abdellah El Mekki
Kabil Essefar
Nabil El Mamoun
Ismail Berrada
A. Khoumsi
52
37
0
23 Jun 2021
From Canonical Correlation Analysis to Self-supervised Graph Neural
  Networks
From Canonical Correlation Analysis to Self-supervised Graph Neural Networks
Hengrui Zhang
Qitian Wu
Junchi Yan
David Wipf
Philip S. Yu
SSL
115
223
0
23 Jun 2021
Classifying Textual Data with Pre-trained Vision Models through Transfer
  Learning and Data Transformations
Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data Transformations
Charaf Eddine Benarab
CLIPVLM
52
2
0
23 Jun 2021
Mixtures of Deep Neural Experts for Automated Speech Scoring
Mixtures of Deep Neural Experts for Automated Speech Scoring
Sara Papi
E. Trentin
R. Gretter
M. Matassoni
D. Falavigna
MoE
38
10
0
23 Jun 2021
Reinforcement Learning-based Dialogue Guided Event Extraction to Exploit
  Argument Relations
Reinforcement Learning-based Dialogue Guided Event Extraction to Exploit Argument Relations
Qian Li
Hao Peng
Jianxin Li
Hongzhi Zhang
Yuanxing Ning
Lihong Wang
Philip S. Yu
Ziyi Wang
61
26
0
23 Jun 2021
Learning from Pseudo Lesion: A Self-supervised Framework for COVID-19
  Diagnosis
Learning from Pseudo Lesion: A Self-supervised Framework for COVID-19 Diagnosis
Zhongliang Li
Zhihao Jin
Xuechen Li
Linlin Shen
SSLMedIm
49
1
0
23 Jun 2021
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU
  Tensor Cores
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores
Boyuan Feng
Yuke Wang
Tong Geng
Ang Li
Yufei Ding
MQ
74
37
0
23 Jun 2021
NodePiece: Compositional and Parameter-Efficient Representations of
  Large Knowledge Graphs
NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs
Mikhail Galkin
E. Denis
Jiapeng Wu
William L. Hamilton
OCL
83
90
0
23 Jun 2021
Probabilistic Attention for Interactive Segmentation
Probabilistic Attention for Interactive Segmentation
Prasad Gabbur
Manjot Bilkhu
J. Movellan
105
13
0
23 Jun 2021
Revisiting Deep Learning Models for Tabular Data
Revisiting Deep Learning Models for Tabular Data
Yu. V. Gorishniy
Ivan Rubachev
Valentin Khrulkov
Artem Babenko
LMTD
157
784
0
22 Jun 2021
On Adversarial Robustness of Synthetic Code Generation
On Adversarial Robustness of Synthetic Code Generation
Mrinal Anand
Pratik Kayal
M. Singh
132
5
0
22 Jun 2021
Learn to Resolve Conversational Dependency: A Consistency Training
  Framework for Conversational Question Answering
Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering
Gangwoo Kim
Hyunjae Kim
Jungsoo Park
Jaewoo Kang
103
38
0
22 Jun 2021
SENT: Sentence-level Distant Relation Extraction via Negative Training
SENT: Sentence-level Distant Relation Extraction via Negative Training
Ruotian Ma
Tao Gui
Linyang Li
Qi Zhang
Yaqian Zhou
Xuanjing Huang
53
32
0
22 Jun 2021
DocFormer: End-to-End Transformer for Document Understanding
DocFormer: End-to-End Transformer for Document Understanding
Srikar Appalaraju
Bhavan A. Jasani
Bhargava Urala Kota
Yusheng Xie
R. Manmatha
ViT
121
281
0
22 Jun 2021
Key-Sparse Transformer for Multimodal Speech Emotion Recognition
Key-Sparse Transformer for Multimodal Speech Emotion Recognition
Weidong Chen
Xiaofen Xing
Xiangmin Xu
Jichen Yang
Jianxin Pang
83
52
0
22 Jun 2021
Graph Routing between Capsules
Graph Routing between Capsules
Yang Li
Wei Zhao
Min Zhang
Suhang Wang
Steffen Eger
GNN
37
14
0
22 Jun 2021
BARTScore: Evaluating Generated Text as Text Generation
BARTScore: Evaluating Generated Text as Text Generation
Weizhe Yuan
Graham Neubig
Pengfei Liu
222
851
0
22 Jun 2021
A Comprehensive Comparison of Pre-training Language Models
A Comprehensive Comparison of Pre-training Language Models
Tonglei Guo
VLMELM
102
3
0
22 Jun 2021
Incremental Deep Neural Network Learning using Classification Confidence
  Thresholding
Incremental Deep Neural Network Learning using Classification Confidence Thresholding
Justin Leo
Jugal Kalita
CLL
65
18
0
21 Jun 2021
Hi-BEHRT: Hierarchical Transformer-based model for accurate prediction
  of clinical events using multimodal longitudinal electronic health records
Hi-BEHRT: Hierarchical Transformer-based model for accurate prediction of clinical events using multimodal longitudinal electronic health records
Yikuan Li
M. Mamouei
G. Salimi-Khorshidi
Shishir Rao
A. Hassaine
D. Canoy
Thomas Lukasiewicz
K. Rahimi
98
83
0
21 Jun 2021
Previous
123...322323324...472473474
Next