ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,865 papers shown
Title
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action
  Recognition
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
101
78
0
20 Aug 2021
Discriminative Region-based Multi-Label Zero-Shot Learning
Discriminative Region-based Multi-Label Zero-Shot Learning
Sanath Narayan
Akshita Gupta
Salman Khan
Fahad Shahbaz Khan
Ling Shao
M. Shah
VLM
128
48
0
20 Aug 2021
Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code
  Contributions
Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions
Hammond Pearce
Baleegh Ahmad
Benjamin Tan
Brendan Dolan-Gavitt
Ramesh Karri
SILM
144
431
0
20 Aug 2021
Extracting Radiological Findings With Normalized Anatomical Information
  Using a Span-Based BERT Relation Extraction Model
Extracting Radiological Findings With Normalized Anatomical Information Using a Span-Based BERT Relation Extraction Model
K. Lybarger
Aashka Damani
Martin Gunn
Özlem Uzuner
Meliha Yetisgen-Yildiz
MedIm
61
4
0
20 Aug 2021
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Pierre-Louis Guhur
Makarand Tapaswi
Shizhe Chen
Ivan Laptev
Cordelia Schmid
LM&Ro
59
144
0
20 Aug 2021
GEDIT: Geographic-Enhanced and Dependency-Guided Tagging for Joint POI
  and Accessibility Extraction at Baidu Maps
GEDIT: Geographic-Enhanced and Dependency-Guided Tagging for Joint POI and Accessibility Extraction at Baidu Maps
Yibo Sun
Jizhou Huang
Chunyuan Yuan
M. Fan
Haifeng Wang
Ming Liu
Bing Qin
58
13
0
20 Aug 2021
Fastformer: Additive Attention Can Be All You Need
Fastformer: Additive Attention Can Be All You Need
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
119
121
0
20 Aug 2021
Knowledge Perceived Multi-modal Pretraining in E-commerce
Knowledge Perceived Multi-modal Pretraining in E-commerce
Yushan Zhu
Huaixiao Tou
Wen Zhang
Ganqiang Ye
Hui Chen
Ningyu Zhang
Huajun Chen
102
33
0
20 Aug 2021
Type Anywhere You Want: An Introduction to Invisible Mobile Keyboard
Type Anywhere You Want: An Introduction to Invisible Mobile Keyboard
Sahng-Min Yoo
Ue-Hwan Kim
Yewon Hwang
Jong-Hwan Kim
OffRL
29
2
0
20 Aug 2021
Twitter User Representation Using Weakly Supervised Graph Embedding
Twitter User Representation Using Weakly Supervised Graph Embedding
Tunazzina Islam
Dan Goldwasser
59
8
0
20 Aug 2021
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with
  Structured Semantics for Medical Text Mining
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining
Taolin Zhang
Zerui Cai
Chengyu Wang
Minghui Qiu
Bite Yang
Xiaofeng He
AI4MH
78
54
0
20 Aug 2021
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling
Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling
Xiaopeng Lu
Zhenhua Fan
Yansen Wang
Jean Oh
Carolyn Rose
92
27
0
20 Aug 2021
Detection of Illicit Drug Trafficking Events on Instagram: A Deep
  Multimodal Multilabel Learning Approach
Detection of Illicit Drug Trafficking Events on Instagram: A Deep Multimodal Multilabel Learning Approach
Chuanbo Hu
Minglei Yin
Bin Liu
Xin Li
Yanfang Ye
51
15
0
19 Aug 2021
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text
  Models
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Jianmo Ni
Gustavo Hernández Ábrego
Noah Constant
Ji Ma
Keith B. Hall
Daniel Cer
Yinfei Yang
305
569
0
19 Aug 2021
PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers
PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers
Xumin Yu
Yongming Rao
Ziyi Wang
Zuyan Liu
Jiwen Lu
Jie Zhou
ViT
109
437
0
19 Aug 2021
Fine-grained Semantics-aware Representation Enhancement for
  Self-supervised Monocular Depth Estimation
Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation
Hyun-Joo Jung
Eunhyeok Park
S. Yoo
MDE
79
111
0
19 Aug 2021
ImageBART: Bidirectional Context with Multinomial Diffusion for
  Autoregressive Image Synthesis
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
119
162
0
19 Aug 2021
Do Vision Transformers See Like Convolutional Neural Networks?
Do Vision Transformers See Like Convolutional Neural Networks?
M. Raghu
Thomas Unterthiner
Simon Kornblith
Chiyuan Zhang
Alexey Dosovitskiy
ViT
154
971
0
19 Aug 2021
Causal Attention for Unbiased Visual Recognition
Causal Attention for Unbiased Visual Recognition
Tan Wang
Chan Zhou
Qianru Sun
Hanwang Zhang
OODCML
112
114
0
19 Aug 2021
How Hateful are Movies? A Study and Prediction on Movie Subtitles
How Hateful are Movies? A Study and Prediction on Movie Subtitles
Niklas von Boguszewski
Sana Moin
Anirban Bhowmick
Seid Muhie Yimam
Christian Biemann
40
4
0
19 Aug 2021
DESYR: Definition and Syntactic Representation Based Claim Detection on
  the Web
DESYR: Definition and Syntactic Representation Based Claim Detection on the Web
Megha Sundriyal
Parantak Singh
Md. Shad Akhtar
Shubhashis Sengupta
Tanmoy Chakraborty
63
10
0
19 Aug 2021
Category-Level 6D Object Pose Estimation via Cascaded Relation and
  Recurrent Reconstruction Networks
Category-Level 6D Object Pose Estimation via Cascaded Relation and Recurrent Reconstruction Networks
Jiaze Wang
Kai-xiang Chen
Qi Dou
3DPC
147
103
0
19 Aug 2021
Improving Semi-Supervised Learning for Remaining Useful Lifetime
  Estimation Through Self-Supervision
Improving Semi-Supervised Learning for Remaining Useful Lifetime Estimation Through Self-Supervision
Tilman Krokotsch
M. Knaak
C. Gühmann
46
22
0
19 Aug 2021
Contrastive Language-Image Pre-training for the Italian Language
Contrastive Language-Image Pre-training for the Italian Language
Federico Bianchi
Giuseppe Attanasio
Raphael Pisoni
Silvia Terragni
Gabriele Sarti
S. Lakshmi
VLMCLIP
90
30
0
19 Aug 2021
UNIQORN: Unified Question Answering over RDF Knowledge Graphs and
  Natural Language Text
UNIQORN: Unified Question Answering over RDF Knowledge Graphs and Natural Language Text
Soumajit Pramanik
Jesujoba Oluwadara Alabi
Rishiraj Saha Roy
Gerhard Weikum
RALM
134
34
0
19 Aug 2021
A Multi-input Multi-output Transformer-based Hybrid Neural Network for
  Multi-class Privacy Disclosure Detection
A Multi-input Multi-output Transformer-based Hybrid Neural Network for Multi-class Privacy Disclosure Detection
Nuhil Mehdy
Hoda Mehrpouyan
43
5
0
19 Aug 2021
Neural Operator: Learning Maps Between Function Spaces
Neural Operator: Learning Maps Between Function Spaces
Nikola B. Kovachki
Zong-Yi Li
Burigede Liu
Kamyar Azizzadenesheli
K. Bhattacharya
Andrew M. Stuart
Anima Anandkumar
AI4CE
199
454
0
19 Aug 2021
QUEACO: Borrowing Treasures from Weakly-labeled Behavior Data for Query
  Attribute Value Extraction
QUEACO: Borrowing Treasures from Weakly-labeled Behavior Data for Query Attribute Value Extraction
Danqing Zhang
Zheng Li
Tianyu Cao
Chen Luo
Tony Wu
Hanqing Lu
Yiwei Song
Bing Yin
Tuo Zhao
Qiang Yang
82
20
0
19 Aug 2021
Augmenting Slot Values and Contexts for Spoken Language Understanding
  with Pretrained Models
Augmenting Slot Values and Contexts for Spoken Language Understanding with Pretrained Models
Haitao Lin
Lu Xiang
Yu Zhou
Jiajun Zhang
Chengqing Zong
48
2
0
19 Aug 2021
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive
  Machine Translation
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation
Pan Xie
Zexian Li
Xiaohui Hu
44
11
0
19 Aug 2021
Exploiting Multi-Object Relationships for Detecting Adversarial Attacks
  in Complex Scenes
Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes
Mingjun Yin
Shasha Li
Zikui Cai
Chengyu Song
M. Salman Asif
Amit K. Roy-Chowdhury
S. Krishnamurthy
AAML
75
20
0
19 Aug 2021
Integrating Dialog History into End-to-End Spoken Language Understanding
  Systems
Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Jatin Ganhotra
Samuel Thomas
H. Kuo
Sachindra Joshi
G. Saon
Zoltán Tüske
Brian Kingsbury
77
10
0
18 Aug 2021
The Multi-Modal Video Reasoning and Analyzing Competition
The Multi-Modal Video Reasoning and Analyzing Competition
Haoran Peng
He Huang
Li Xu
Tianjiao Li
Jing Liu
...
Yuanzhong Liu
Tao He
Fuwei Zhang
Xianbin Liu
Tao Lin
63
2
0
18 Aug 2021
SHAQ: Single Headed Attention with Quasi-Recurrence
SHAQ: Single Headed Attention with Quasi-Recurrence
Nashwin Bharwani
Warren Kushner
Sangeet Dandona
Ben Schreiber
37
0
0
18 Aug 2021
An Analysis Of Entire Space Multi-Task Models For Post-Click Conversion
  Prediction
An Analysis Of Entire Space Multi-Task Models For Post-Click Conversion Prediction
Conor O'Brien
Kin Sum Liu
James Neufeld
Rafael Barreto
Jonathan J. Hunt
61
15
0
18 Aug 2021
AdapterHub Playground: Simple and Flexible Few-Shot Learning with
  Adapters
AdapterHub Playground: Simple and Flexible Few-Shot Learning with Adapters
Tilman Beck
Bela Bohlender
Christina Viehmann
Vincent Hane
Yanik Adamson
Jaber Khuri
Jonas Brossmann
Jonas Pfeiffer
Iryna Gurevych
79
16
0
18 Aug 2021
Towards Interpreting Zoonotic Potential of Betacoronavirus Sequences
  With Attention
Towards Interpreting Zoonotic Potential of Betacoronavirus Sequences With Attention
Kahini Wadhawan
Payel Das
Barbara A. Han
Ilya R. Fischhoff
Adrian C. Castellanos
A. Varsani
Kush R. Varshney
29
4
0
18 Aug 2021
Joint Multiple Intent Detection and Slot Filling via Self-distillation
Joint Multiple Intent Detection and Slot Filling via Self-distillation
Lisong Chen
Peilin Zhou
Yuexian Zou
VLM
59
31
0
18 Aug 2021
DeepCVA: Automated Commit-level Vulnerability Assessment with Deep
  Multi-task Learning
DeepCVA: Automated Commit-level Vulnerability Assessment with Deep Multi-task Learning
T. H. Le
David Hin
Roland Croft
Muhammad Ali Babar
64
56
0
18 Aug 2021
SIFN: A Sentiment-aware Interactive Fusion Network for Review-based Item
  Recommendation
SIFN: A Sentiment-aware Interactive Fusion Network for Review-based Item Recommendation
Kai Zhang
Hao Qian
Qi Liu
Qing Cui
Jun Zhou
Jianhui Ma
Enhong Chen
63
13
0
18 Aug 2021
Identifying Illicit Drug Dealers on Instagram with Large-scale
  Multimodal Data Fusion
Identifying Illicit Drug Dealers on Instagram with Large-scale Multimodal Data Fusion
Chuanbo Hu
Minglei Yin
Bin Liu
Xin Li
Yanfang Ye
69
10
0
18 Aug 2021
Self-Supervised Visual Representations Learning by Contrastive Mask
  Prediction
Self-Supervised Visual Representations Learning by Contrastive Mask Prediction
Yucheng Zhao
Guangting Wang
Chong Luo
Wenjun Zeng
Zhengjun Zha
ISegSSL
94
47
0
18 Aug 2021
Data Pricing in Machine Learning Pipelines
Data Pricing in Machine Learning Pipelines
Zicun Cong
Xuan Luo
J. Pei
Feida Zhu
Yong Zhang
65
49
0
18 Aug 2021
Modulating Language Models with Emotions
Modulating Language Models with Emotions
Ruibo Liu
Jason W. Wei
Chenyan Jia
Soroush Vosoughi
65
23
0
17 Aug 2021
RandomRooms: Unsupervised Pre-training from Synthetic Shapes and
  Randomized Layouts for 3D Object Detection
RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection
Yongming Rao
Benlin Liu
Yi Wei
Jiwen Lu
Cho-Jui Hsieh
Jie Zhou
3DPC
123
52
0
17 Aug 2021
Toward a `Standard Model' of Machine Learning
Toward a `Standard Model' of Machine Learning
Zhiting Hu
Eric Xing
106
12
0
17 Aug 2021
A Game Interface to Study Semantic Grounding in Text-Based Models
A Game Interface to Study Semantic Grounding in Text-Based Models
Timothee Mickus
Mathieu Constant
Denis Paperno
18
0
0
17 Aug 2021
Learning C to x86 Translation: An Experiment in Neural Compilation
Learning C to x86 Translation: An Experiment in Neural Compilation
Jordi Armengol-Estapé
Michael F. P. O'Boyle
64
13
0
17 Aug 2021
Fact-Tree Reasoning for N-ary Question Answering over Knowledge Graphs
Fact-Tree Reasoning for N-ary Question Answering over Knowledge Graphs
Yao Zhang
Peiyao Li
Hongru Liang
Adam Jatowt
Zhenglu Yang
ReLM
62
5
0
17 Aug 2021
MigrationsKB: A Knowledge Base of Public Attitudes towards Migrations
  and their Driving Factors
MigrationsKB: A Knowledge Base of Public Attitudes towards Migrations and their Driving Factors
Yiyi Chen
Harald Sack
Mehwish Alam
41
3
0
17 Aug 2021
Previous
123...313314315...476477478
Next