ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,799 papers shown
Title
Applying the Information Bottleneck Principle to Prosodic Representation
  Learning
Applying the Information Bottleneck Principle to Prosodic Representation Learning
Guangyan Zhang
Ying Qin
Daxin Tan
Tan Lee
79
4
0
05 Aug 2021
Decoupled Transformer for Scalable Inference in Open-domain Question
  Answering
Decoupled Transformer for Scalable Inference in Open-domain Question Answering
Haytham ElFadeel
Stanislav Peshterliev
115
1
0
05 Aug 2021
Video Contrastive Learning with Global Context
Video Contrastive Learning with Global Context
Haofei Kuang
Yi Zhu
Zhi-Li Zhang
Xinyu Li
Joseph Tighe
Sören Schwertfeger
C. Stachniss
Mu Li
SSLAI4TS
93
61
0
05 Aug 2021
Using a Collated Cybersecurity Dataset for Machine Learning and
  Artificial Intelligence
Using a Collated Cybersecurity Dataset for Machine Learning and Artificial Intelligence
Erik Hemberg
Una-May O’Reilly
52
10
0
05 Aug 2021
EENLP: Cross-lingual Eastern European NLP Index
EENLP: Cross-lingual Eastern European NLP Index
Alexey Tikhonov
Alex Malkhasov
A. Manoshin
George-Andrei Dima
Réka Cserháti
Md. Sadek Hossain Asif
Matt Sárdi
84
2
0
05 Aug 2021
Knowledge Distillation from BERT Transformer to Speech Transformer for
  Intent Classification
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
Yiding Jiang
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
103
26
0
05 Aug 2021
Imperceptible Adversarial Examples by Spatial Chroma-Shift
Imperceptible Adversarial Examples by Spatial Chroma-Shift
A. Aydin
Deniz Sen
Berat Tuna Karli
Oguz Hanoglu
A. Temi̇zel
AAML
62
16
0
05 Aug 2021
Token Shift Transformer for Video Classification
Token Shift Transformer for Video Classification
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
87
119
0
05 Aug 2021
Fast Convergence of DETR with Spatially Modulated Co-Attention
Fast Convergence of DETR with Spatially Modulated Co-Attention
Peng Gao
Minghang Zheng
Xiaogang Wang
Jifeng Dai
Hongsheng Li
ViT
102
308
0
05 Aug 2021
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D
  Visual Grounding
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Dailan He
Yusheng Zhao
Junyu Luo
Tianrui Hui
Shaofei Huang
Aixi Zhang
Si Liu
ViT
81
95
0
05 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum
  Learning for Image Captioning
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
152
68
0
05 Aug 2021
Understand me, if you refer to Aspect Knowledge: Knowledge-aware Gated
  Recurrent Memory Network
Understand me, if you refer to Aspect Knowledge: Knowledge-aware Gated Recurrent Memory Network
Bowen Xing
Ivor W. Tsang
46
16
0
05 Aug 2021
FMMformer: Efficient and Flexible Transformer via Decomposed Near-field
  and Far-field Attention
FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention
T. Nguyen
Vai Suliafu
Stanley J. Osher
Long Chen
Bao Wang
72
36
0
05 Aug 2021
Robust Transfer Learning with Pretrained Language Models through
  Adapters
Robust Transfer Learning with Pretrained Language Models through Adapters
Wenjuan Han
Bo Pang
Ying Nian Wu
74
56
0
05 Aug 2021
Automatic Detection of COVID-19 Vaccine Misinformation with Graph Link
  Prediction
Automatic Detection of COVID-19 Vaccine Misinformation with Graph Link Prediction
Maxwell Weinzierl
S. Harabagiu
104
29
0
04 Aug 2021
Boosting Few-shot Semantic Segmentation with Transformers
Boosting Few-shot Semantic Segmentation with Transformers
Guolei Sun
Yun-Hai Liu
Christos Sakaridis
Luc Van Gool
ViT
63
9
0
04 Aug 2021
Curriculum learning for language modeling
Curriculum learning for language modeling
Daniel Fernando Campos
59
33
0
04 Aug 2021
The Potential of Using Vision Videos for CrowdRE: Video Comments as a
  Source of Feedback
The Potential of Using Vision Videos for CrowdRE: Video Comments as a Source of Feedback
Oliver Karras
Eklekta Kristo
J. Klünder
20
8
0
04 Aug 2021
Question-controlled Text-aware Image Captioning
Question-controlled Text-aware Image Captioning
Anwen Hu
Shizhe Chen
Qin Jin
76
15
0
04 Aug 2021
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt
  Verbalizer for Text Classification
Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification
Shengding Hu
Ning Ding
Huadong Wang
Zhiyuan Liu
Jingang Wang
Juan-Zi Li
Wei Wu
Maosong Sun
VLM
110
373
0
04 Aug 2021
Log-based Anomaly Detection Without Log Parsing
Log-based Anomaly Detection Without Log Parsing
Van-Hoang Le
Hongyu Zhang
87
189
0
04 Aug 2021
How to Query Language Models?
How to Query Language Models?
Leonard Adolphs
Shehzaad Dhuliawala
Thomas Hofmann
KELM
86
15
0
04 Aug 2021
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence
  Pretraining
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining
Machel Reid
Mikel Artetxe
VLM
105
28
0
04 Aug 2021
Quality Evaluation of the Low-Resource Synthetically Generated
  Code-Mixed Hinglish Text
Quality Evaluation of the Low-Resource Synthetically Generated Code-Mixed Hinglish Text
Vivek Srivastava
M. Singh
69
12
0
04 Aug 2021
Controlled Text Generation as Continuous Optimization with Multiple
  Constraints
Controlled Text Generation as Continuous Optimization with Multiple Constraints
Sachin Kumar
Eric Malmi
Aliaksei Severyn
Yulia Tsvetkov
BDLAI4CE
115
79
0
04 Aug 2021
Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain
  Management
Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management
Cécile Logé
Emily L. Ross
D. Dadey
Saahil Jain
A. Saporta
A. Ng
Pranav Rajpurkar
144
23
0
03 Aug 2021
Improving Counterfactual Generation for Fair Hate Speech Detection
Improving Counterfactual Generation for Fair Hate Speech Detection
Aida Mostafazadeh Davani
Ali Omrani
Brendan Kennedy
M. Atari
Xiang Ren
Morteza Dehghani
70
11
0
03 Aug 2021
Linking Common Vulnerabilities and Exposures to the MITRE ATT&CK
  Framework: A Self-Distillation Approach
Linking Common Vulnerabilities and Exposures to the MITRE ATT&CK Framework: A Self-Distillation Approach
Benjamin Ampel
Sagar Samtani
Steven Ullman
Hsinchun Chen
70
39
0
03 Aug 2021
Vision Transformer with Progressive Sampling
Vision Transformer with Progressive Sampling
Xiaoyu Yue
Shuyang Sun
Zhanghui Kuang
Meng Wei
Philip Torr
Wayne Zhang
Dahua Lin
ViT
94
85
0
03 Aug 2021
Exploiting BERT For Multimodal Target Sentiment Classification Through
  Input Space Translation
Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation
Zaid Khan
Y. Fu
81
140
0
03 Aug 2021
Grounding Representation Similarity with Statistical Testing
Grounding Representation Similarity with Statistical Testing
Frances Ding
Jean-Stanislas Denain
Jacob Steinhardt
87
30
0
03 Aug 2021
Large-Scale Differentially Private BERT
Large-Scale Differentially Private BERT
Rohan Anil
Badih Ghazi
Vineet Gupta
Ravi Kumar
Pasin Manurangsi
96
139
0
03 Aug 2021
ExBERT: An External Knowledge Enhanced BERT for Natural Language
  Inference
ExBERT: An External Knowledge Enhanced BERT for Natural Language Inference
Amit Gajbhiye
Noura Al Moubayed
S. Bradley
65
10
0
03 Aug 2021
sarcasm detection and quantification in arabic tweets
sarcasm detection and quantification in arabic tweets
Bashar Talafha
Muhy Eddin Za'ter
Samer Suleiman
M. Al-Ayyoub
M. Al-Kabi
46
10
0
03 Aug 2021
Cycle-Consistent Inverse GAN for Text-to-Image Synthesis
Cycle-Consistent Inverse GAN for Text-to-Image Synthesis
Hao Wang
Guosheng Lin
Guosheng Lin
Chunyan Miao
103
48
0
03 Aug 2021
Where do Models go Wrong? Parameter-Space Saliency Maps for
  Explainability
Where do Models go Wrong? Parameter-Space Saliency Maps for Explainability
Roman Levin
Manli Shu
Eitan Borgnia
Furong Huang
Micah Goldblum
Tom Goldstein
FAttAAML
60
11
0
03 Aug 2021
Dialogue Summarization with Supporting Utterance Flow Modeling and Fact
  Regularization
Dialogue Summarization with Supporting Utterance Flow Modeling and Fact Regularization
Wang Chen
Pijian Li
Hou Pong Chan
Irwin King
HILMAI4TS
61
10
0
03 Aug 2021
CanvasVAE: Learning to Generate Vector Graphic Documents
CanvasVAE: Learning to Generate Vector Graphic Documents
Kota Yamaguchi
GAN
65
65
0
03 Aug 2021
Representation learning for neural population activity with Neural Data
  Transformers
Representation learning for neural population activity with Neural Data Transformers
Joel Ye
C. Pandarinath
AI4TSAI4CE
241
57
0
02 Aug 2021
Changes in European Solidarity Before and During COVID-19: Evidence from
  a Large Crowd- and Expert-Annotated Twitter Dataset
Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset
A. Ils
Dan Liu
Daniela Grunow
Steffen Eger
85
9
0
02 Aug 2021
A Survey of Human-in-the-loop for Machine Learning
A Survey of Human-in-the-loop for Machine Learning
Xingjiao Wu
Luwei Xiao
Yixuan Sun
Junhang Zhang
Tianlong Ma
Liangbo He
SyDa
137
533
0
02 Aug 2021
Communication-Efficient Federated Learning via Predictive Coding
Communication-Efficient Federated Learning via Predictive Coding
Kai Yue
Richeng Jin
Chau-Wai Wong
H. Dai
FedML
78
14
0
02 Aug 2021
Self-supervised Answer Retrieval on Clinical Notes
Self-supervised Answer Retrieval on Clinical Notes
Paul Grundmann
Sebastian Arnold
Alexander Loser
RALMMedIm
60
2
0
02 Aug 2021
Efficient Deep Feature Calibration for Cross-Modal Joint Embedding
  Learning
Efficient Deep Feature Calibration for Cross-Modal Joint Embedding Learning
Zhongwei Xie
Ling Liu
Lin Li
Luo Zhong
29
2
0
02 Aug 2021
Transfer Learning for Mining Feature Requests and Bug Reports from
  Tweets and App Store Reviews
Transfer Learning for Mining Feature Requests and Bug Reports from Tweets and App Store Reviews
Pablo Restrepo Henao
Jannik Fischbach
Dominik Spies
Julian Frattini
Andreas Vogelsang
45
26
0
02 Aug 2021
From LSAT: The Progress and Challenges of Complex Reasoning
From LSAT: The Progress and Challenges of Complex Reasoning
Siyuan Wang
Zhongkun Liu
Wanjun Zhong
Ming Zhou
Zhongyu Wei
Zhumin Chen
Nan Duan
ELM
95
46
0
02 Aug 2021
Semi-Supervising Learning, Transfer Learning, and Knowledge Distillation
  with SimCLR
Semi-Supervising Learning, Transfer Learning, and Knowledge Distillation with SimCLR
Khoi Duc Minh Nguyen
Y. Nguyen
Bao Le
57
5
0
02 Aug 2021
Congested Crowd Instance Localization with Dilated Convolutional Swin
  Transformer
Congested Crowd Instance Localization with Dilated Convolutional Swin Transformer
Junyuan Gao
Maoguo Gong
Xuelong Li
ViT
112
47
0
02 Aug 2021
Logic-Consistency Text Generation from Semantic Parses
Logic-Consistency Text Generation from Semantic Parses
Chang Shu
Yusen Zhang
Xiangyu Dong
Peng Shi
Tao Yu
Rui Zhang
110
34
0
02 Aug 2021
MuSiQue: Multihop Questions via Single-hop Question Composition
MuSiQue: Multihop Questions via Single-hop Question Composition
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
LRM
164
282
0
02 Aug 2021
Previous
123...315316317...474475476
Next