ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,802 papers shown
Title
Logic Explained Networks
Logic Explained Networks
Gabriele Ciravegna
Pietro Barbiero
Francesco Giannini
Marco Gori
Pietro Lio
Marco Maggini
S. Melacci
90
70
0
11 Aug 2021
Representation Learning for Remote Sensing: An Unsupervised Sensor
  Fusion Approach
Representation Learning for Remote Sensing: An Unsupervised Sensor Fusion Approach
Aidan M. Swope
X. Rudelis
Kyle T. Story
SSL
132
20
0
11 Aug 2021
Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report
  Generation With Alternate Learning
Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning
Guangyi Liu
Yinghong Liao
Fuyu Wang
Bin Zhang
Lu Zhang
...
Xiang Wan
Shaolin Li
Zhen Li
Shuixing Zhang
Shuguang Cui
121
59
0
11 Aug 2021
Learning Oculomotor Behaviors from Scanpath
Learning Oculomotor Behaviors from Scanpath
Beibin Li
Nicholas Nuechterlein
E. Barney
Claire E. Foster
Minah Kim
...
Li Feng
Quan Wang
P. Ventola
Linda G. Shapiro
Frederick Shic
57
5
0
11 Aug 2021
A Transformer-based Math Language Model for Handwritten Math Expression
  Recognition
A Transformer-based Math Language Model for Handwritten Math Expression Recognition
Quang Huy Ung
C. Nguyen
Hung Tuan Nguyen
Thanh-Nghia Truong
M. Nakagawa
24
9
0
11 Aug 2021
Perturbing Inputs for Fragile Interpretations in Deep Natural Language
  Processing
Perturbing Inputs for Fragile Interpretations in Deep Natural Language Processing
Sanchit Sinha
Hanjie Chen
Arshdeep Sekhon
Yangfeng Ji
Yanjun Qi
AAMLFAtt
79
42
0
11 Aug 2021
SoK: How Robust is Image Classification Deep Neural Network
  Watermarking? (Extended Version)
SoK: How Robust is Image Classification Deep Neural Network Watermarking? (Extended Version)
Nils Lukas
Edward Jiang
Xinda Li
Florian Kerschbaum
AAML
119
92
0
11 Aug 2021
A Study of Social and Behavioral Determinants of Health in Lung Cancer
  Patients Using Transformers-based Natural Language Processing Models
A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models
Zehao Yu
Xi Yang
Chong Dang
Songzi Wu
P. Adekkanattu
...
T. George
William R. Hogan
Yi Guo
Jiang Bian
Yonghui Wu
61
38
0
10 Aug 2021
BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease
  Diagnosis
BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis
Masoud Monajatipoor
Mozhdeh Rouhsedaghat
Liunian Harold Li
Aichi Chien
C.-C. Jay Kuo
Fabien Scalzo
Kai-Wei Chang
LM&MAMedIm
60
31
0
10 Aug 2021
Embodied BERT: A Transformer Model for Embodied, Language-guided Visual
  Task Completion
Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion
Alessandro Suglia
Qiaozi Gao
Jesse Thomason
Govind Thattai
Gaurav Sukhatme
LM&Ro
133
78
0
10 Aug 2021
Post-hoc Interpretability for Neural NLP: A Survey
Post-hoc Interpretability for Neural NLP: A Survey
Andreas Madsen
Siva Reddy
A. Chandar
XAI
131
234
0
10 Aug 2021
Binary Complex Neural Network Acceleration on FPGA
Binary Complex Neural Network Acceleration on FPGA
Hongwu Peng
Shangli Zhou
Scott Weitze
Jiaxin Li
Sahidul Islam
...
Wei Zhang
M. Song
Mimi Xie
Hang Liu
Caiwen Ding
MQ
63
20
0
10 Aug 2021
Headed-Span-Based Projective Dependency Parsing
Headed-Span-Based Projective Dependency Parsing
Aaron Courville
Kewei Tu
76
14
0
10 Aug 2021
Automated Audio Captioning using Transfer Learning and Reconstruction
  Latent Space Similarity Regularization
Automated Audio Captioning using Transfer Learning and Reconstruction Latent Space Similarity Regularization
Andrew Koh
Fuzhao Xue
Chng Eng Siong
68
20
0
10 Aug 2021
Differentiable Subset Pruning of Transformer Heads
Differentiable Subset Pruning of Transformer Heads
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
134
57
0
10 Aug 2021
Learning Canonical 3D Object Representation for Fine-Grained Recognition
Learning Canonical 3D Object Representation for Fine-Grained Recognition
Sunghun Joung
Seungryong Kim
Minsu Kim
Ig-Jae Kim
Kwanghoon Sohn
85
10
0
10 Aug 2021
Hope Speech detection in under-resourced Kannada language
Hope Speech detection in under-resourced Kannada language
Adeep Hande
R. Priyadharshini
Anbukkarasi Sampath
K. Thamburaj
Prabakaran Chandran
Bharathi Raja Chakravarthi
69
29
0
10 Aug 2021
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code
  Representation
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation
Xin Wang
Yasheng Wang
Fei Mi
Pingyi Zhou
Yao Wan
Xiao Liu
Li Li
Hao Wu
Jin Liu
Xin Jiang
142
118
0
10 Aug 2021
BROS: A Pre-trained Language Model Focusing on Text and Layout for
  Better Key Information Extraction from Documents
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents
Teakgyu Hong
Donghyun Kim
Mingi Ji
Wonseok Hwang
Daehyun Nam
Sungrae Park
VLM
127
154
0
10 Aug 2021
Lifelong Intent Detection via Multi-Strategy Rebalancing
Lifelong Intent Detection via Multi-Strategy Rebalancing
Qingbin Liu
Xiaoyan Yu
Shizhu He
Kang Liu
Jun Zhao
CLLOffRL
50
16
0
10 Aug 2021
Making Transformers Solve Compositional Tasks
Making Transformers Solve Compositional Tasks
Santiago Ontañón
Joshua Ainslie
Vaclav Cvicek
Zachary Kenneth Fisher
116
74
0
09 Aug 2021
COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer
  Reviews
COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer Reviews
Shruti Singh
M. Singh
Pawan Goyal
57
8
0
09 Aug 2021
Kori: Interactive Synthesis of Text and Charts in Data Documents
Kori: Interactive Synthesis of Text and Charts in Data Documents
Shahid Latif
Zhengzhong Zhou
Yoon Kim
Fabian Beck
N. Kim
53
63
0
09 Aug 2021
Aspect-based Sentiment Analysis in Document -- FOMC Meeting Minutes on
  Economic Projection
Aspect-based Sentiment Analysis in Document -- FOMC Meeting Minutes on Economic Projection
Yifei Wang
27
2
0
09 Aug 2021
Multi-modal Retrieval of Tables and Texts Using Tri-encoder Models
Multi-modal Retrieval of Tables and Texts Using Tri-encoder Models
Bogdan Kostić
Julian Risch
Timo Moller
RALM
184
23
0
09 Aug 2021
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language
  Models
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
Zheyuan Liu
Cristian Rodriguez-Opazo
Damien Teney
Stephen Gould
VLM
86
207
0
09 Aug 2021
A Neural Approach for Detecting Morphological Analogies
A Neural Approach for Detecting Morphological Analogies
Safa Alsaidi
Amandine Decker
Puthineath Lay
Esteban Marquer
Pierre-Alexandre Murena
Miguel Couceiro
67
20
0
09 Aug 2021
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture
  Search
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search
Xiangning Xie
Yuqiao Liu
Yanan Sun
Gary G. Yen
Bing Xue
Mengjie Zhang
117
19
0
09 Aug 2021
Disentangling Hate in Online Memes
Disentangling Hate in Online Memes
Rui Cao
Ziqing Fan
Roy Ka-wei Lee
Wen-Haw Chong
Jing Jiang
65
81
0
09 Aug 2021
Learning Joint Embedding with Modality Alignments for Cross-Modal
  Retrieval of Recipes and Food Images
Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images
Zhongwei Xie
Ling Liu
Lin Li
Luo Zhong
59
10
0
09 Aug 2021
Efficacy of BERT embeddings on predicting disaster from Twitter data
Efficacy of BERT embeddings on predicting disaster from Twitter data
Ashis Kumar Chanda
92
13
0
08 Aug 2021
#StayHome or #Marathon? Social Media Enhanced Pandemic Surveillance on
  Spatial-temporal Dynamic Graphs
#StayHome or #Marathon? Social Media Enhanced Pandemic Surveillance on Spatial-temporal Dynamic Graphs
Yichao Zhou
Jyun-Yu Jiang
Xiusi Chen
Wei Wang
102
8
0
08 Aug 2021
Online Evolutionary Batch Size Orchestration for Scheduling Deep
  Learning Workloads in GPU Clusters
Online Evolutionary Batch Size Orchestration for Scheduling Deep Learning Workloads in GPU Clusters
Chen Sun
Shenggui Li
Jinyue Wang
Jun Yu
119
48
0
08 Aug 2021
Unifying Heterogeneous Electronic Health Records Systems via Text-Based
  Code Embedding
Unifying Heterogeneous Electronic Health Records Systems via Text-Based Code Embedding
Kyunghoon Hur
Jiyoung Lee
Jungwoo Oh
Wesley Price
Young-Hak Kim
Edward Choi
103
19
0
08 Aug 2021
Improving Similar Language Translation With Transfer Learning
Improving Similar Language Translation With Transfer Learning
Ife Adebara
Muhammad Abdul-Mageed
67
1
0
07 Aug 2021
Compositional Generalization in Multilingual Semantic Parsing over
  Wikidata
Compositional Generalization in Multilingual Semantic Parsing over Wikidata
Ruixiang Cui
Rahul Aralikatte
Heather Lent
Daniel Hershcovich
90
11
0
07 Aug 2021
Detecting Propaganda Techniques in Memes
Detecting Propaganda Techniques in Memes
Dimitar Dimitrov
Bishr Bin Ali
Shaden Shaar
Firoj Alam
Fabrizio Silvestri
Hamed Firooz
Preslav Nakov
Giovanni Da San Martino
92
95
0
07 Aug 2021
PSViT: Better Vision Transformer via Token Pooling and Attention Sharing
PSViT: Better Vision Transformer via Token Pooling and Attention Sharing
Boyu Chen
Peixia Li
Baopu Li
Chuming Li
Lei Bai
Chen Lin
Ming Sun
Junjie Yan
Wanli Ouyang
ViT
129
35
0
07 Aug 2021
Vision Transformer for femur fracture classification
Vision Transformer for femur fracture classification
L. Tanzi
A. Audisio
G. Cirrincione
A. Aprato
E. Vezzetti
MedIm
90
65
0
07 Aug 2021
Controllable Summarization with Constrained Markov Decision Process
Controllable Summarization with Constrained Markov Decision Process
Hou Pong Chan
Lu Wang
Irwin King
258
22
0
07 Aug 2021
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling
  for Self-Supervised Speech Pre-Training
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training
Yu-An Chung
Yu Zhang
Wei Han
Chung-Cheng Chiu
James Qin
Ruoming Pang
Yonghui Wu
SSLVLM
113
429
0
07 Aug 2021
VitaLITy: Promoting Serendipitous Discovery of Academic Literature with
  Transformers & Visual Analytics
VitaLITy: Promoting Serendipitous Discovery of Academic Literature with Transformers & Visual Analytics
Arpit Narechania
Alireza Karduni
Ryan Wesslen
Emily Wall
82
25
0
07 Aug 2021
What Matters in Learning from Offline Human Demonstrations for Robot
  Manipulation
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
301
519
0
06 Aug 2021
Cross-lingual Capsule Network for Hate Speech Detection in Social Media
Cross-lingual Capsule Network for Hate Speech Detection in Social Media
Aiqi Jiang
A. Zubiaga
57
14
0
06 Aug 2021
Detecting Requirements Smells With Deep Learning: Experiences,
  Challenges and Future Work
Detecting Requirements Smells With Deep Learning: Experiences, Challenges and Future Work
Mohammad Kasra Habib
Stefan Wagner
Daniel Graziotin
UQCV
62
10
0
06 Aug 2021
Transferring Knowledge Distillation for Multilingual Social Event Detection
Jiaqian Ren
Hao Peng
Lei Jiang
Hongzhi Zhang
Yongxin Tong
Lihong Wang
X. Bai
Bo Wang
Qiang Yang
103
12
0
06 Aug 2021
SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection
SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection
Aiqi Jiang
Xiaohan Yang
Yang Liu
A. Zubiaga
93
76
0
06 Aug 2021
Deriving Disinformation Insights from Geolocalized Twitter Callouts
Deriving Disinformation Insights from Geolocalized Twitter Callouts
David Tuxworth
Dimosthenis Antypas
Luis Espinosa-Anke
Jose Camacho-Collados
Alun D. Preece
David Rogers
38
0
0
06 Aug 2021
An Empirical Study on End-to-End Singing Voice Synthesis with
  Encoder-Decoder Architectures
An Empirical Study on End-to-End Singing Voice Synthesis with Encoder-Decoder Architectures
Dengfeng Ke
Yuxing Lu
Xudong Liu
Yanyan Xu
Jing Sun
Cheng-Hao Cai
55
0
0
06 Aug 2021
StrucTexT: Structured Text Understanding with Multi-Modal Transformers
StrucTexT: Structured Text Understanding with Multi-Modal Transformers
Yulin Li
Yuxi Qian
Yuchen Yu
Xiameng Qin
Chengquan Zhang
Yan Liu
Kun Yao
Junyu Han
Jingtuo Liu
Errui Ding
113
117
0
06 Aug 2021
Previous
123...314315316...475476477
Next