ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
v1v2 (latest)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLMSSLSSeg
ArXiv (abs)PDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,708 papers shown
Title
Learning Syntactic Dense Embedding with Correlation Graph for Automatic
  Readability Assessment
Learning Syntactic Dense Embedding with Correlation Graph for Automatic Readability Assessment
Xinying Qiu
Yuan Chen
Hanwu Chen
J. Nie
Yuming Shen
D. Lu
74
18
0
09 Jul 2021
A Survey on Low-Resource Neural Machine Translation
A Survey on Low-Resource Neural Machine Translation
Rui Wang
Xu Tan
Renqian Luo
Tao Qin
Tie-Yan Liu
3DV
98
61
0
09 Jul 2021
Measuring and Improving Model-Moderator Collaboration using Uncertainty
  Estimation
Measuring and Improving Model-Moderator Collaboration using Uncertainty Estimation
Ian D Kivlichan
Zi Lin
J. Liu
Lucy Vasserman
60
20
0
09 Jul 2021
A Systematic Survey of Text Worlds as Embodied Natural Language
  Environments
A Systematic Survey of Text Worlds as Embodied Natural Language Environments
Peter Alexander Jansen
LM&Ro
87
23
0
08 Jul 2021
Improved Language Identification Through Cross-Lingual Self-Supervised
  Learning
Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Andros Tjandra
Diptanu Gon Choudhury
Frank Zhang
Kritika Singh
Alexis Conneau
Alexei Baevski
Assaf Sela
Yatharth Saraf
Michael Auli
VLMSSL
109
36
0
08 Jul 2021
Learning Vision-Guided Quadrupedal Locomotion End-to-End with
  Cross-Modal Transformers
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers
Ruihan Yang
Minghao Zhang
Nicklas Hansen
Huazhe Xu
Xiaolong Wang
OffRL
101
108
0
08 Jul 2021
CANDLE: Decomposing Conditional and Conjunctive Queries for
  Task-Oriented Dialogue Systems
CANDLE: Decomposing Conditional and Conjunctive Queries for Task-Oriented Dialogue Systems
Aadesh Gupta
Kaustubh D. Dhole
Rahul Tarway
S. Prabhakar
A. Shrivastava
49
1
0
08 Jul 2021
A Review of Bangla Natural Language Processing Tasks and the Utility of
  Transformer Models
A Review of Bangla Natural Language Processing Tasks and the Utility of Transformer Models
Firoj Alam
Md. Arid Hasan
Tanvirul Alam
A. Khan
Janntatul Tajrin
Naira Khan
Shammur A. Chowdhury
LM&MA
81
27
0
08 Jul 2021
Fuzzy-Rough Nearest Neighbour Approaches for Emotion Detection in Tweets
Fuzzy-Rough Nearest Neighbour Approaches for Emotion Detection in Tweets
Olha Kaminska
Chris Cornelis
Véronique Hoste
36
9
0
08 Jul 2021
COMBO: a new module for EUD parsing
COMBO: a new module for EUD parsing
Mateusz Klimaszewski
Alina Wróblewska
MoE
54
5
0
08 Jul 2021
Deep Structural Point Process for Learning Temporal Interaction Networks
Deep Structural Point Process for Learning Temporal Interaction Networks
Jiangxia Cao
Xixun Lin
Xin Cong
Shu Guo
Hengzhu Tang
Tingwen Liu
Bin Wang
BDL3DPC
65
12
0
08 Jul 2021
Unsupervised Proxy Selection for Session-based Recommender Systems
Unsupervised Proxy Selection for Session-based Recommender Systems
Junsu Cho
SeongKu Kang
Dongmin Hyun
Hwanjo Yu
100
22
0
08 Jul 2021
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time
  Series Imputation
CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation
Y. Tashiro
Jiaming Song
Yang Song
Stefano Ermon
BDLDiffM
101
559
0
07 Jul 2021
Differentiable Random Access Memory using Lattices
Differentiable Random Access Memory using Lattices
Adam P. Goucher
R. Troll
15
0
0
07 Jul 2021
Anticipating Safety Issues in E2E Conversational AI: Framework and
  Tooling
Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling
Emily Dinan
Gavin Abercrombie
A. S. Bergman
Shannon L. Spruit
Dirk Hovy
Y-Lan Boureau
Verena Rieser
97
109
0
07 Jul 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
Junha Roh
Karthik Desingh
Ali Farhadi
Dieter Fox
109
95
0
07 Jul 2021
Long Short-Term Transformer for Online Action Detection
Long Short-Term Transformer for Online Action Detection
Mingze Xu
Yuanjun Xiong
Hao Chen
Xinyu Li
Wei Xia
Zhuowen Tu
Stefano Soatto
ViT
154
137
0
07 Jul 2021
Pragmatic Image Compression for Human-in-the-Loop Decision-Making
Pragmatic Image Compression for Human-in-the-Loop Decision-Making
S. Reddy
Anca Dragan
Sergey Levine
OffRL
86
13
0
07 Jul 2021
Evaluating Large Language Models Trained on Code
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELMALM
307
5,702
0
07 Jul 2021
M-FAC: Efficient Matrix-Free Approximations of Second-Order Information
M-FAC: Efficient Matrix-Free Approximations of Second-Order Information
Elias Frantar
Eldar Kurtic
Dan Alistarh
92
59
0
07 Jul 2021
DORA: Toward Policy Optimization for Task-oriented Dialogue System with
  Efficient Context
DORA: Toward Policy Optimization for Task-oriented Dialogue System with Efficient Context
Hyunmin Jeon
G. G. Lee
OffRL
63
12
0
07 Jul 2021
A Survey on Data Augmentation for Text Classification
A Survey on Data Augmentation for Text Classification
Markus Bayer
M. Kaufhold
Christian A. Reuter
150
355
0
07 Jul 2021
MedGPT: Medical Concept Prediction from Clinical Narratives
MedGPT: Medical Concept Prediction from Clinical Narratives
Z. Kraljevic
Anthony Shek
D. Bean
R. Bendayan
J. Teo
Richard J. B. Dobson
LM&MAAI4TSMedIm
88
40
0
07 Jul 2021
Learning Vision Transformer with Squeeze and Excitation for Facial
  Expression Recognition
Learning Vision Transformer with Squeeze and Excitation for Facial Expression Recognition
Mouath Aouayeb
W. Hamidouche
Catherine Soladié
K. Kpalma
Renaud Séguier
ViT
93
59
0
07 Jul 2021
Android Security using NLP Techniques: A Review
Android Security using NLP Techniques: A Review
Sevil Sen
Burcu Can
AAML
49
4
0
07 Jul 2021
Efficient Transformer for Direct Speech Translation
Efficient Transformer for Direct Speech Translation
Belen Alastruey
Gerard I. Gállego
Marta R. Costa-jussá
56
7
0
07 Jul 2021
Structured Denoising Diffusion Models in Discrete State-Spaces
Structured Denoising Diffusion Models in Discrete State-Spaces
Jacob Austin
Daniel D. Johnson
Jonathan Ho
Daniel Tarlow
Rianne van den Berg
DiffM
299
952
0
07 Jul 2021
Neural Natural Language Processing for Unstructured Data in Electronic
  Health Records: a Review
Neural Natural Language Processing for Unstructured Data in Electronic Health Records: a Review
Irene Li
Jessica Pan
Jeremy Goldwasser
Neha Verma
Wai Pan Wong
...
Matthew Zhang
David Chang
R. Taylor
H. Krumholz
Dragomir R. Radev
BDL
86
160
0
07 Jul 2021
Deep Extrapolation for Attribute-Enhanced Generation
Deep Extrapolation for Attribute-Enhanced Generation
Alvin Chan
Ali Madani
Ben Krause
Nikhil Naik
118
26
0
07 Jul 2021
DISCO : efficient unsupervised decoding for discrete natural language
  problems via convex relaxation
DISCO : efficient unsupervised decoding for discrete natural language problems via convex relaxation
Anish Acharya
Rudrajit Das
62
0
0
07 Jul 2021
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior
  for Joint Image-Text Modeling
PhotoChat: A Human-Human Dialogue Dataset with Photo Sharing Behavior for Joint Image-Text Modeling
Xiaoxue Zang
Lijuan Liu
Maria Wang
Yang Song
Hao Zhang
Jindong Chen
VLM
103
60
0
06 Jul 2021
Rethinking Positional Encoding
Rethinking Positional Encoding
Jianqiao Zheng
Sameera Ramasinghe
Simon Lucey
87
52
0
06 Jul 2021
Transfer Learning for Improving Results on Russian Sentiment Datasets
Transfer Learning for Improving Results on Russian Sentiment Datasets
A. Golubev
Natalia Loukachevitch
54
5
0
06 Jul 2021
On Robustness of Lane Detection Models to Physical-World Adversarial
  Attacks in Autonomous Driving
On Robustness of Lane Detection Models to Physical-World Adversarial Attacks in Autonomous Driving
Takami Sato
Qi Alfred Chen
AAMLELM
84
6
0
06 Jul 2021
Learning Disentangled Representation Implicitly via Transformer for
  Occluded Person Re-Identification
Learning Disentangled Representation Implicitly via Transformer for Occluded Person Re-Identification
Mengxi Jia
Xinhua Cheng
Shijian Lu
Jian Zhang
ViT
99
138
0
06 Jul 2021
Leveraging Clinical Context for User-Centered Explainability: A Diabetes
  Use Case
Leveraging Clinical Context for User-Centered Explainability: A Diabetes Use Case
Shruthi Chari
Prithwish Chakraborty
Mohamed F. Ghalwash
Oshani Seneviratne
Elif Eyigoz
Daniel Gruen
Fernando Jose Suarez Saiz
Ching-Hua Chen
Pablo Meyer Rojas
D. McGuinness
16
1
0
06 Jul 2021
Feature Fusion Vision Transformer for Fine-Grained Visual Categorization
Feature Fusion Vision Transformer for Fine-Grained Visual Categorization
Jun Wang
Xiaohan Yu
Yongsheng Gao
ViT
104
109
0
06 Jul 2021
Mind Your Outliers! Investigating the Negative Impact of Outliers on
  Active Learning for Visual Question Answering
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering
Siddharth Karamcheti
Ranjay Krishna
Li Fei-Fei
Christopher D. Manning
102
92
0
06 Jul 2021
What Helps Transformers Recognize Conversational Structure? Importance
  of Context, Punctuation, and Labels in Dialog Act Recognition
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition
Piotr Żelasko
R. Pappagari
Najim Dehak
60
14
0
05 Jul 2021
Weakly Supervised Named Entity Tagging with Learnable Logical Rules
Weakly Supervised Named Entity Tagging with Learnable Logical Rules
Jiacheng Li
Haibo Ding
Jingbo Shang
Julian McAuley
Zhe Feng
NAI
91
37
0
05 Jul 2021
Sarcasm Detection: A Comparative Study
Sarcasm Detection: A Comparative Study
Hamed Yaghoobian
H. Arabnia
Khaled Rasheed
59
23
0
05 Jul 2021
Experiments with adversarial attacks on text genres
Experiments with adversarial attacks on text genres
Mikhail Lepekhin
S. Sharoff
43
2
0
05 Jul 2021
Vision Xformers: Efficient Attention for Image Classification
Vision Xformers: Efficient Attention for Image Classification
Pranav Jeevan
Amit Sethi
ViT
70
13
0
05 Jul 2021
Long-Short Transformer: Efficient Transformers for Language and Vision
Long-Short Transformer: Efficient Transformers for Language and Vision
Chen Zhu
Ming-Yu Liu
Chaowei Xiao
Mohammad Shoeybi
Tom Goldstein
Anima Anandkumar
Bryan Catanzaro
ViTVLM
132
133
0
05 Jul 2021
TransformerFusion: Monocular RGB Scene Reconstruction using Transformers
TransformerFusion: Monocular RGB Scene Reconstruction using Transformers
Aljavz Bovzivc
Pablo Rodríguez Palafox
Justus Thies
Angela Dai
Matthias Nießner
ViT
104
138
0
05 Jul 2021
Do Different Tracking Tasks Require Different Appearance Models?
Do Different Tracking Tasks Require Different Appearance Models?
Zhongdao Wang
Hengshuang Zhao
Yali Li
Shengjin Wang
Philip Torr
Luca Bertinetto
111
86
0
05 Jul 2021
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language
  Understanding and Generation
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Yu Sun
Shuohuan Wang
Shikun Feng
Siyu Ding
Chao Pang
...
Ouyang Xuan
Dianhai Yu
Hao Tian
Hua Wu
Haifeng Wang
128
475
0
05 Jul 2021
Test-Time Personalization with a Transformer for Human Pose Estimation
Test-Time Personalization with a Transformer for Human Pose Estimation
Yizhuo Li
Miao Hao
Zonglin Di
N. B. Gundavarapu
Xiaolong Wang
ViT
98
48
0
05 Jul 2021
A Survey on Deep Learning Event Extraction: Approaches and Applications
A Survey on Deep Learning Event Extraction: Approaches and Applications
Qian Li
Jianxin Li
Shuaiyi Nie
Shiyao Cui
Hongzhi Zhang
...
Hao Peng
Shu Guo
Lihong Wang
Amin Beheshti
Philip S. Yu
117
48
0
05 Jul 2021
Semi-supervised Learning for Dense Object Detection in Retail Scenes
Semi-supervised Learning for Dense Object Detection in Retail Scenes
Jaydeep Chauhan
Srikrishna Varadarajan
Muktabh Mayank Srivastava
69
2
0
05 Jul 2021
Previous
123...319320321...473474475
Next