ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 1,211 papers shown
VISTANet: VIsual Spoken Textual Additive Net for Interpretable Multimodal Emotion Recognition
Puneet Kumar
Sarthak Malik
Balasubramanian Raman
Xiaobai Li
49
2
0
24 Aug 2022
Toward Interpretable Sleep Stage Classification Using Cross-Modal Transformers
Jathurshan Pradeepkumar
Mithunjha Anandakumar
Vinith Kugathasan
Dhinesh Suntharalingham
S. L. Kappel
A. D. Silva
Chamira U. S. Edussooriya
36
31
0
15 Aug 2022
Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation
Qiming Bao
A. Peng
Tim Hartill
N. Tan
Zhenyun Deng
Michael Witbrock
Jiamou Liu
ReLM
OOD
NAI
LRM
42
13
0
28 Jul 2022
Why is constrained neural language generation particularly challenging?
Cristina Garbacea
Qiaozhu Mei
69
14
0
11 Jun 2022
Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction
Jun Chen
Ming Hu
Boyang Albert Li
Mohamed Elhoseiny
63
36
0
01 Jun 2022
Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation
Li Mingzhe
Xiexiong Lin
Preslav Nakov
Jinxiong Chang
Qishen Zhang
...
Taifeng Wang
Zhongyi Liu
Wei Chu
Dongyan Zhao
Rui Yan
58
11
0
26 May 2022
New Intent Discovery with Pre-training and Contrastive Learning
Yuwei Zhang
Haode Zhang
Li-Ming Zhan
Albert Y. S. Lam
SSL
VLM
64
42
0
25 May 2022
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds
Yu Zhang
Yu Meng
Xuan Wang
Sheng Wang
Jiawei Han
78
14
0
04 May 2022
Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability
Yoshinari Fujinuma
Jordan L. Boyd-Graber
Katharina Kann
AAML
69
23
0
21 Mar 2022
Are You Robert or RoBERTa? Deceiving Online Authorship Attribution Models Using Neural Text Generators
Keenan I. Jones
Jason R. C. Nurse
Shujun Li
DeLMO
36
19
0
18 Mar 2022
Hyperdecoders: Instance-specific decoders for multi-task NLP
Hamish Ivison
Matthew E. Peters
AI4CE
51
20
0
15 Mar 2022
Differential equation and probability inspired graph neural networks for latent variable learning
Zhuangwei Shi
29
3
0
28 Feb 2022
Vision-Language Pre-Training with Triple Contrastive Learning
Jinyu Yang
Jiali Duan
Son N. Tran
Yi Xu
Sampath Chanda
Liqun Chen
Belinda Zeng
Trishul Chilimbi
Junzhou Huang
VLM
60
290
0
21 Feb 2022
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
VLM
CLIP
111
1,009
0
09 Oct 2021
Transformer-based deep imitation learning for dual-arm robot manipulation
Heecheol Kim
Yoshiyuki Ohmura
Yasuo Kuniyoshi
40
48
0
01 Aug 2021
TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up
Yi Ding
Shiyu Chang
Zhangyang Wang
ViT
43
389
0
14 Feb 2021
Representation Matters: Offline Pretraining for Sequential Decision Making
Mengjiao Yang
Ofir Nachum
SSL
OffRL
35
119
0
11 Feb 2021
EfficientQA : a RoBERTa Based Phrase-Indexed Question-Answering System
Sofian Chaybouti
Achraf Saghe
A. Shabou
RALM
52
8
0
06 Jan 2021
Pretrained Language Models for Dialogue Generation with Multiple Input Sources
Yu Cao
Wei Bi
Meng Fang
Dacheng Tao
33
29
0
15 Oct 2020
How Have We Reacted To The COVID-19 Pandemic? Analyzing Changing Indian Emotions Through The Lens of Twitter
Rajdeep Mukherjee
S. Poddar
Atharva Naik
Soham Dasgupta
29
5
0
20 Aug 2020
Towards a Decomposable Metric for Explainable Evaluation of Text Generation from AMR
Juri Opitz
Anette Frank
76
35
0
20 Aug 2020
S^3-Rec: Self-Supervised Learning for Sequential Recommendation with Mutual Information Maximization
Kun Zhou
Haibo Wang
Wayne Xin Zhao
Yutao Zhu
Sirui Wang
Fuzheng Zhang
Zhongyuan Wang
Ji-Rong Wen
43
797
0
18 Aug 2020
VizCommender: Computing Text-Based Similarity in Visualization Repositories for Content-Based Recommendations
Michael Oppermann
R. Kincaid
T. Munzner
49
44
0
18 Aug 2020
MIDAS: Multi-agent Interaction-aware Decision-making with Adaptive Strategies for Urban Autonomous Navigation
Xiaoyi Chen
Pratik Chaudhari
46
4
0
17 Aug 2020
Computer-Generated Music for Tabletop Role-Playing Games
Lucas N. Ferreira
Levi H. S. Lelis
E. Whitehead
50
43
0
16 Aug 2020
DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act Recognition and Sentiment Classification
Libo Qin
Wanxiang Che
Yangming Li
Minheng Ni
Ting Liu
42
94
0
16 Aug 2020
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
Ming Tao
Hao Tang
Fei Wu
Xiaoyuan Jing
Bingkun Bao
Changsheng Xu
54
211
0
13 Aug 2020
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models
Ian Tenney
James Wexler
Jasmijn Bastings
Tolga Bolukbasi
Andy Coenen
...
Ellen Jiang
Mahima Pushkarna
Carey Radebaugh
Emily Reif
Ann Yuan
VLM
63
192
0
12 Aug 2020
An Automated, End-to-End Framework for Modeling Attacks From Vulnerability Descriptions
Hodaya Binyamini
Ron Bitton
Masaki Inokuchi
T. Yagyu
Yuval Elovici
A. Shabtai
43
11
0
10 Aug 2020
FireBERT: Hardening BERT-based classifiers against adversarial attack
Gunnar Mein
Kevin Hartman
Andrew Morris
SILM
AAML
39
0
0
10 Aug 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
39
39
0
07 Aug 2020
Match$^2$: A Matching over Matching Model for Similar Question Identification
Zizhen Wang
Yixing Fan
Jiafeng Guo
Liu Yang
Ruqing Zhang
Yanyan Lan
Xueqi Cheng
Hui Jiang
Xiaozhao Wang
65
15
0
21 Jun 2020
Simple and Principled Uncertainty Estimation with Deterministic Deep Learning via Distance Awareness
Jeremiah Zhe Liu
Zi Lin
Shreyas Padhy
Dustin Tran
Tania Bedrax-Weiss
Balaji Lakshminarayanan
UQCV
BDL
60
443
0
17 Jun 2020
CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization
A. Esteva
Anuprit Kale
Romain Paulus
Kazuma Hashimoto
Wenpeng Yin
Dragomir R. Radev
R. Socher
55
64
0
17 Jun 2020
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
Itay Hubara
Yury Nahshan
Y. Hanani
Ron Banner
Daniel Soudry
MQ
49
124
0
14 Jun 2020
Ensemble Distillation for Robust Model Fusion in Federated Learning
Tao R. Lin
Lingjing Kong
Sebastian U. Stich
Martin Jaggi
FedML
28
1,026
0
12 Jun 2020
Ansor: Generating High-Performance Tensor Programs for Deep Learning
Lianmin Zheng
Chengfan Jia
Minmin Sun
Zhao Wu
Cody Hao Yu
...
Jun Yang
Danyang Zhuo
Koushik Sen
Joseph E. Gonzalez
Ion Stoica
72
386
0
11 Jun 2020
Revisiting Few-sample BERT Fine-tuning
Tianyi Zhang
Felix Wu
Arzoo Katiyar
Kilian Q. Weinberger
Yoav Artzi
51
444
0
10 Jun 2020
AMEIR: Automatic Behavior Modeling, Interaction Exploration and MLP Investigation in the Recommender System
Pengyu Zhao
Kecheng Xiao
Yuanxing Zhang
Kaigui Bian
Wei Yan
33
16
0
10 Jun 2020
TableQA: a Large-Scale Chinese Text-to-SQL Dataset for Table-Aware SQL Generation
Ningyuan Sun
Xuefeng Yang
Yunfeng Liu
LMTD
29
34
0
10 Jun 2020
DeepVar: An End-to-End Deep Learning Approach for Genomic Variant Recognition in Biomedical Literature
Chaoran Cheng
Fei Tan
Zhi Wei
35
7
0
05 Jun 2020
IMUTube: Automatic Extraction of Virtual on-body Accelerometry from Video for Human Activity Recognition
Hyeokhyen Kwon
C. Tong
H. Haresamudram
Yan Gao
G. Abowd
Nicholas D. Lane
Thomas Ploetz
33
83
0
29 May 2020
An Effective Transition-based Model for Discontinuous NER
Xiang Dai
Sarvnaz Karimi
Ben Hachey
Cécile Paris
BDL
MU
MedIm
35
79
0
28 Apr 2020
Fine-tuning Multi-hop Question Answering with Hierarchical Graph Network
Guanming Xiong
36
0
0
20 Apr 2020
Neuroevolution of Self-Interpretable Agents
Yujin Tang
Duong Nguyen
David R Ha
41
111
0
18 Mar 2020
Transformer Networks for Trajectory Forecasting
Francesco Giuliari
Irtiza Hasan
Marco Cristani
Fabio Galasso
120
375
0
18 Mar 2020
PowerNorm: Rethinking Batch Normalization in Transformers
Sheng Shen
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
BDL
29
16
0
17 Mar 2020
PO-EMO: Conceptualization, Annotation, and Modeling of Aesthetic Emotions in German and English Poetry
T. Haider
Steffen Eger
Evgeny Kim
Roman Klinger
Winfried Menninghaus
21
32
0
17 Mar 2020
Recent Advances and Challenges in Task-oriented Dialog System
Zheng Zhang
Ryuichi Takanobu
Qi Zhu
Minlie Huang
Xiaoyan Zhu
LLMAG
74
176
0
17 Mar 2020
Review-guided Helpful Answer Identification in E-commerce
Wenxuan Zhang
Wai Lam
Yang Deng
Jing Ma
38
20
0
13 Mar 2020