ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM SSL SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,885 papers shown
Title
Harms of Gender Exclusivity and Challenges in Non-Binary Representation in Language Technologies
Sunipa Dev
Masoud Monajatipoor
Anaelia Ovalle
Arjun Subramonian
J. M. Phillips
Kai-Wei Chang
166
177
0
27 Aug 2021
EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa
Taewoon Kim
Piek Vossen
103
102
0
26 Aug 2021
A New Sentence Ordering Method Using BERT Pretrained Model
Melika Golestani
S. Z. Razavi
Heshaam Faili
61
2
0
26 Aug 2021
Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization
Chujie Zheng
Kunpeng Zhang
Harry J. Wang
Ling Fan
Zhe Wang
60
7
0
26 Aug 2021
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Muhammad Zubair Irshad
Niluthpol Chowdhury Mithun
Zachary Seymour
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
LM&Ro
84
51
0
26 Aug 2021
HAN: Higher-order Attention Network for Spoken Language Understanding
Dongsheng Chen
Zhiqi Huang
Yuexian Zou
54
1
0
26 Aug 2021
Similar Scenes arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning
Guodun Li
Yuchen Zhai
Zehao Lin
Yin Zhang
117
21
0
26 Aug 2021
A Survey on Automated Fact-Checking
Zhijiang Guo
Michael Schlichtkrull
Andreas Vlachos
144
498
0
26 Aug 2021
Alleviating Exposure Bias via Contrastive Learning for Abstractive Text Summarization
Shichao Sun
Wenjie Li
70
26
0
26 Aug 2021
Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts
Ashutosh Baheti
Maarten Sap
Alan Ritter
Mark O. Riedl
92
91
0
26 Aug 2021
SLIM: Explicit Slot-Intent Mapping with BERT for Joint Multi-Intent Detection and Slot Filling
Fengyu Cai
Wanhao Zhou
Fei Mi
Boi Faltings
70
19
0
26 Aug 2021
Data Augmentation for Low-Resource Named Entity Recognition Using Backtranslation
Usama Yaseen
Stefan Langer
MedIm
58
15
0
26 Aug 2021
Rethinking Why Intermediate-Task Fine-Tuning Works
Ting-Yun Chang
Chi-Jen Lu
LRM
98
30
0
26 Aug 2021
AR-BERT: Aspect-relation enhanced Aspect-level Sentiment Classification with Multi-modal Explanations
Sk Mainul Islam
Sourangshu Bhattacharya
62
12
0
26 Aug 2021
MCML: A Novel Memory-based Contrastive Meta-Learning Method for Few Shot Slot Tagging
Hongru Wang
Zezhong Wang
Gabriel Pui Cheong Fung
Kam-Fai Wong
OffRL CLL
100
10
0
26 Aug 2021
Retrieval Augmented Code Generation and Summarization
Md. Rizwan Parvez
W. Ahmad
Saikat Chakraborty
Baishakhi Ray
Kai-Wei Chang
76
192
0
26 Aug 2021
LayoutReader: Pre-training of Text and Layout for Reading Order Detection
Zilong Wang
Yiheng Xu
Lei Cui
Jingbo Shang
Furu Wei
95
76
0
26 Aug 2021
Shifted Chunk Transformer for Spatio-Temporal Representational Learning
Xuefan Zha
Wentao Zhu
Tingxun Lv
Sen Yang
Ji Liu
AI4TS ViT
92
27
0
26 Aug 2021
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models
Yiming Cui
Weinan Zhang
Wanxiang Che
Ting Liu
Zhigang Chen
Shijin Wang
LRM
47
9
0
26 Aug 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
81
24
0
26 Aug 2021
Auxiliary Task Update Decomposition: The Good, The Bad and The Neutral
Lucio Dery
Yann N. Dauphin
David Grangier
MoMe
79
29
0
25 Aug 2021
Lightweight Self-Attentive Sequential Recommendation
Yang Li
Tong Chen
Pengfei Zhang
Hongzhi Yin
HAI AI4TS
81
109
0
25 Aug 2021
What do pre-trained code models know about code?
Anjan Karmakar
Romain Robbes
ELM
91
91
0
25 Aug 2021
Ontology-Enhanced Slot Filling
Yuhao Ding
Yik-Cheung Tam
33
0
0
25 Aug 2021
Exploring the Promises of Transformer-Based LMs for the Representation of Normative Claims in the Legal Domain
Reto Gubelmann
Peter Hongler
Siegfried Handschuh
AILaw
28
0
0
25 Aug 2021
Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training
Yuqing Song
Shizhe Chen
Qin Jin
Wei Luo
Jun Xie
Fei Huang
103
20
0
25 Aug 2021
A Framework for Learning Ante-hoc Explainable Models via Concepts
Anirban Sarkar
Deepak Vijaykeerthy
Anindya Sarkar
V. Balasubramanian
LRM BDL
93
51
0
25 Aug 2021
Viola: A Topic Agnostic Generate-and-Rank Dialogue System
Hyundong Justin Cho
Basel Shbita
K. Shenoy
Shuai Liu
Nikhil Patel
Hitesh Pindikanti
Jennifer Lee
Jonathan May
66
2
0
25 Aug 2021
Social Norm Bias: Residual Harms of Fairness-Aware Algorithms
Myra Cheng
Maria De-Arteaga
Lester W. Mackey
Adam Tauman Kalai
FaML
111
9
0
25 Aug 2021
Using BERT Encoding and Sentence-Level Language Model for Sentence Ordering
Melika Golestani
S. Z. Razavi
Zeinab Borhanifard
Farnaz Tahmasebian
H. Faili
39
7
0
24 Aug 2021
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts
Charangan Vasantharajan
Uthayasanker Thayasivam
57
39
0
24 Aug 2021
The Word is Mightier than the Label: Learning without Pointillistic Labels using Data Programming
Chufan Gao
Mononito Goswami
30
0
0
24 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLM MLLM
183
801
0
24 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
108
12
0
24 Aug 2021
Relation Extraction from Tables using Artificially Generated Metadata
Gaurav Singh
Siffi Singh
Joshua Wong
Amir Saffari
30
2
0
24 Aug 2021
Graph Neural Networks: Methods, Applications, and Opportunities
Lilapati Waikhom
Ripon Patgiri
GNN
100
42
0
24 Aug 2021
Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel Přibáň
J. Steinberger
70
11
0
24 Aug 2021
Weakly Supervised Cross-platform Teenager Detection with Adversarial BERT
Peiling Yi
A. Zubiaga
44
1
0
24 Aug 2021
Prompt-Learning for Fine-Grained Entity Typing
Ning Ding
Yulin Chen
Xu Han
Guangwei Xu
Pengjun Xie
Haitao Zheng
Zhiyuan Liu
Juan-Zi Li
Hong-Gee Kim
95
159
0
24 Aug 2021
Detection of Criminal Texts for the Polish State Border Guard
Artur Nowakowski
K. Jassem
56
1
0
24 Aug 2021
Support-Set Based Cross-Supervision for Video Grounding
Xinpeng Ding
N. Wang
Shiwei Zhang
De Cheng
Xiaomeng Li
Ziyuan Huang
Mingqian Tang
Xinbo Gao
88
42
0
24 Aug 2021
sigmoidF1: A Smooth F1 Score Surrogate Loss for Multilabel Classification
Gabriel Bénédict
Vincent Koops
Daan Odijk
Maarten de Rijke
102
33
0
24 Aug 2021
Recurrent multiple shared layers in Depth for Neural Machine Translation
Guoliang Li
Yiyang Li
MoE
48
1
0
23 Aug 2021
Using Neighborhood Context to Improve Information Extraction from Visual Documents Captured on Mobile Phones
Kalpa Gunaratna
Vijay Srinivasan
Sandeep Nama
Hongxia Jin
61
5
0
23 Aug 2021
Legal Search in Case Law and Statute Law
Julien Rossi
Evangelos Kanoulas
AILaw ELM
162
8
0
23 Aug 2021
High Performance GPU Code Generation for Matrix-Matrix Multiplication using MLIR: Some Early Results
Navdeep Katel
Vivek Khandelwal
Uday Bondhugula
41
7
0
23 Aug 2021
Event Extraction by Associating Event Types and Argument Roles
Qian Li
Shu Guo
Hongzhi Zhang
Jianxin Li
Shuaiyi Nie
Lihong Wang
Xiaohan Dong
Hao Peng
79
16
0
23 Aug 2021
TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment
Jianwei Yang
Yonatan Bisk
Jianfeng Gao
123
140
0
23 Aug 2021
Modeling Dynamics of Facial Behavior for Mental Health Assessment
Minh Tran
Ellen R. Bradley
Michelle Matvey
J. Woolley
M. Soleymani
CVBM
45
3
0
23 Aug 2021
Fluent: An AI Augmented Writing Tool for People who Stutter
Bhavya Ghai
Klaus Mueller
73
16
0
23 Aug 2021