ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 1,211 papers shown
Title
Making Large Language Models Better Knowledge Miners for Online Marketing with Progressive Prompting Augmentation
Making Large Language Models Better Knowledge Miners for Online Marketing with Progressive Prompting Augmentation
Chunjing Gan
Dan Yang
Binbin Hu
Ziqi Liu
Yue Shen
Qing Cui
Jinjie Gu
Jun Zhou
Guannan Zhang
49
5
0
08 Dec 2023
Jointly spatial-temporal representation learning for individual trajectories
Jointly spatial-temporal representation learning for individual trajectories
Fei Huang
Jianrong Lv
Yang Yue
AI4TS
111
1
0
07 Dec 2023
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
Arun V. Reddy
William Paul
Corban Rivera
Ketul Shah
Celso M. de Melo
Rama Chellappa
58
4
0
05 Dec 2023
Prompting Disentangled Embeddings for Knowledge Graph Completion with Pre-trained Language Model
Prompting Disentangled Embeddings for Knowledge Graph Completion with Pre-trained Language Model
Yuxia Geng
Jiaoyan Chen
Yuhang Zeng
Zhuo Chen
Wen Zhang
Jeff Z. Pan
Yuxiang Wang
Xiaoliang Xu
66
2
0
04 Dec 2023
Optimizing Context-Enhanced Relational Joins
Optimizing Context-Enhanced Relational Joins
Viktor Sanca
Manos Chatzakis
Anastasia Ailamaki
41
2
0
03 Dec 2023
ResNLS: An Improved Model for Stock Price Forecasting
ResNLS: An Improved Model for Stock Price Forecasting
Yuanzhe Jia
Ali Anaissi
Basem Suleiman
AI4TS
AIFin
69
4
0
02 Dec 2023
PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction
PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction
Lei Guan
Dongsheng Li
Jiye Liang
Wenjian Wang
Wenjian Wang
Xicheng Lu
49
1
0
01 Dec 2023
Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains
Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains
Rohan Myer Krishnan
Zitian Tang
Zhiqiu Yu
Chen Sun
78
1
0
30 Nov 2023
Meta Co-Training: Two Views are Better than One
Meta Co-Training: Two Views are Better than One
Jay C. Rothenberger
Dimitrios I. Diochnos
VLM
83
2
0
29 Nov 2023
Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs
Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs
Shengyin Sun
Yuxiang Ren
Chen Ma
Xuecang Zhang
154
21
0
24 Nov 2023
Image Super-Resolution with Text Prompt Diffusion
Image Super-Resolution with Text Prompt Diffusion
Zheng Chen
Yulun Zhang
Jinjin Gu
Xin Yuan
Linghe Kong
Guihai Chen
Xiaokang Yang
DiffM
62
20
0
24 Nov 2023
MultiDelete for Multimodal Machine Unlearning
MultiDelete for Multimodal Machine Unlearning
Jiali Cheng
Hadi Amiri
MU
71
7
0
18 Nov 2023
Hijacking Large Language Models via Adversarial In-Context Learning
Hijacking Large Language Models via Adversarial In-Context Learning
Yao Qiang
Xiangyu Zhou
Saleh Zare Zade
Prashant Khanduri
Dongxiao Zhu
69
34
0
16 Nov 2023
Divergences between Language Models and Human Brains
Divergences between Language Models and Human Brains
Yuchen Zhou
Emmy Liu
Graham Neubig
Michael J. Tarr
Leila Wehbe
51
2
0
15 Nov 2023
Generalizable Imitation Learning Through Pre-Trained Representations
Generalizable Imitation Learning Through Pre-Trained Representations
Wei-Di Chang
F. Hogan
David Meger
Gregory Dudek
Gregory Dudek
46
1
0
15 Nov 2023
Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval
Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval
Junyang Chen
Hanjiang Lai
VLM
58
15
0
13 Nov 2023
AI-accelerated Discovery of Altermagnetic Materials
AI-accelerated Discovery of Altermagnetic Materials
Ze-Feng Gao
Shuai Qu
Bocheng Zeng
Yang Liu
Ji-Rong Wen
Hao Sun
Peng-Jie Guo
Zhong-Yi Lu
35
27
0
08 Nov 2023
Uncovering Intermediate Variables in Transformers using Circuit Probing
Uncovering Intermediate Variables in Transformers using Circuit Probing
Michael A. Lepori
Thomas Serre
Ellie Pavlick
92
7
0
07 Nov 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
73
4
0
06 Nov 2023
Joint Learning of Local and Global Features for Aspect-based Sentiment Classification
Joint Learning of Local and Global Features for Aspect-based Sentiment Classification
Hao Niu
Yun Xiong
Xiaosu Wang
Philip S. Yu
112
0
0
02 Nov 2023
Advances in Embodied Navigation Using Large Language Models: A Survey
Advances in Embodied Navigation Using Large Language Models: A Survey
Jinzhou Lin
Han Gao
Xuxiang Feng
Rongtao Xu
Changwei Wang
Man Zhang
Li Guo
Shibiao Xu
LM&Ro
LLMAG
95
9
0
01 Nov 2023
A Tractable Inference Perspective of Offline RL
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Hoang Trung-Dung
Guy Van den Broeck
Yitao Liang
OffRL
70
1
0
31 Oct 2023
Grid Jigsaw Representation with CLIP: A New Perspective on Image Clustering
Grid Jigsaw Representation with CLIP: A New Perspective on Image Clustering
Zijie Song
Zhenzhen Hu
Richang Hong
SSL
61
0
0
27 Oct 2023
netFound: Foundation Model for Network Security
netFound: Foundation Model for Network Security
Satyandra Guthula
Navya Battula
Roman Beltiukov
Wenbo Guo
Arpit Gupta
Inder Monga
52
15
0
25 Oct 2023
FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering
FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering
Md Rafi Ur Rashid
Vishnu Asutosh Dasu
Kang Gu
Najrin Sultana
Shagufta Mehnaz
AAML
FedML
68
11
0
24 Oct 2023
Exploring the Impact of Corpus Diversity on Financial Pretrained Language Models
Exploring the Impact of Corpus Diversity on Financial Pretrained Language Models
Jaeyoung Choe
Keonwoong Noh
Nayeon Kim
Seyun Ahn
Woohwan Jung
76
4
0
20 Oct 2023
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Jirui Qi
Raquel Fernández
Arianna Bisazza
KELM
HILM
63
68
0
16 Oct 2023
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
67
15
0
15 Oct 2023
Semi-Supervised End-To-End Contrastive Learning For Time Series Classification
Semi-Supervised End-To-End Contrastive Learning For Time Series Classification
Hui Cai
Xiang Zhang
Xiaofeng Liu
AI4TS
44
0
0
13 Oct 2023
Unsupervised Log Anomaly Detection with Few Unique Tokens
Unsupervised Log Anomaly Detection with Few Unique Tokens
Antonin Sulc
Annika Eichler
T. Wilksen
50
0
0
13 Oct 2023
Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text
Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text
Chanho Park
Chengsong Lu
Mingjie Chen
Thomas Hain
77
3
0
12 Oct 2023
AdaMesh: Personalized Facial Expressions and Head Poses for Adaptive Speech-Driven 3D Facial Animation
AdaMesh: Personalized Facial Expressions and Head Poses for Adaptive Speech-Driven 3D Facial Animation
Liyang Chen
Weihong Bao
Shunwei Lei
Boshi Tang
Zhiyong Wu
Shiyin Kang
Haozhi Huang
Helen M. Meng
49
1
0
11 Oct 2023
ParFam -- (Neural Guided) Symbolic Regression Based on Continuous Global Optimization
ParFam -- (Neural Guided) Symbolic Regression Based on Continuous Global Optimization
Philipp Scholl
Katharina Bieker
Hillary Hauger
Gitta Kutyniok
60
5
0
09 Oct 2023
On the Evolution of Knowledge Graphs: A Survey and Perspective
On the Evolution of Knowledge Graphs: A Survey and Perspective
Xuhui Jiang
Chengjin Xu
Yinghan Shen
Xun Sun
Lumingyuan Tang
Saizhuo Wang
Zhongwu Chen
Yuanzhuo Wang
Jian Guo
62
8
0
07 Oct 2023
Generating Less Certain Adversarial Examples Improves Robust Generalization
Generating Less Certain Adversarial Examples Improves Robust Generalization
Minxing Zhang
Michael Backes
Xiao Zhang
AAML
65
1
0
06 Oct 2023
URLOST: Unsupervised Representation Learning without Stationarity or Topology
URLOST: Unsupervised Representation Learning without Stationarity or Topology
Zeyu Yun
Juexiao Zhang
Bruno A. Olshausen
Yann LeCun
69
1
0
06 Oct 2023
CLEVRER-Humans: Describing Physical and Causal Events the Human Way
CLEVRER-Humans: Describing Physical and Causal Events the Human Way
Jiayuan Mao
Xuelin Yang
Xikun Zhang
Noah D. Goodman
Jiajun Wu
NAI
39
22
0
05 Oct 2023
DataDAM: Efficient Dataset Distillation with Attention Matching
DataDAM: Efficient Dataset Distillation with Attention Matching
A. Sajedi
Samir Khaki
Ehsan Amjadian
Lucy Z. Liu
Y. Lawryshyn
Konstantinos N. Plataniotis
DD
74
64
0
29 Sep 2023
Asynchronous Graph Generator
Asynchronous Graph Generator
Christopher P. Ley
Felipe Tobar
AI4TS
61
0
0
29 Sep 2023
Ragas: Automated Evaluation of Retrieval Augmented Generation
Ragas: Automated Evaluation of Retrieval Augmented Generation
ES Shahul
Jithin James
Luis Espinosa-Anke
Steven Schockaert
100
186
0
26 Sep 2023
KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation
KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation
Haotian Li
Lingzhi Wang
Yuliang Wei
Richard Y. D. Xu
Bailing Wang
Bailing Wang
73
2
0
26 Sep 2023
TouchUp-G: Improving Feature Representation through Graph-Centric Finetuning
TouchUp-G: Improving Feature Representation through Graph-Centric Finetuning
Jing Zhu
Xiang Song
V. Ioannidis
Danai Koutra
Christos Faloutsos
97
14
0
25 Sep 2023
Interpretability-Aware Vision Transformer
Interpretability-Aware Vision Transformer
Yao Qiang
Chengyin Li
Prashant Khanduri
D. Zhu
ViT
104
7
0
14 Sep 2023
Measuring Catastrophic Forgetting in Cross-Lingual Transfer Paradigms: Exploring Tuning Strategies
Measuring Catastrophic Forgetting in Cross-Lingual Transfer Paradigms: Exploring Tuning Strategies
Boshko Koloski
Blaž Škrlj
Marko Robnik-Šikonja
Senja Pollak
CLL
47
2
0
12 Sep 2023
Motif-aware Attribute Masking for Molecular Graph Pre-training
Motif-aware Attribute Masking for Molecular Graph Pre-training
Eric Inae
Gang Liu
Meng Jiang
AI4CE
56
14
0
08 Sep 2023
FLM-101B: An Open LLM and How to Train It with $100K Budget
FLM-101B: An Open LLM and How to Train It with 100KBudget100K Budget100KBudget
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng Zhang
Aixin Sun
Yequan Wang
73
22
0
07 Sep 2023
Certifying LLM Safety against Adversarial Prompting
Certifying LLM Safety against Adversarial Prompting
Aounon Kumar
Chirag Agarwal
Suraj Srinivas
Aaron Jiaxun Li
Soheil Feizi
Himabindu Lakkaraju
AAML
42
182
0
06 Sep 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
104
31
0
27 Aug 2023
Exploring Large Language Models for Knowledge Graph Completion
Exploring Large Language Models for Knowledge Graph Completion
Liang Yao
Jiazhen Peng
Chengsheng Mao
Yuan Luo
52
38
0
26 Aug 2023
Evolution of ESG-focused DLT Research: An NLP Analysis of the Literature
Evolution of ESG-focused DLT Research: An NLP Analysis of the Literature
Walter Hernandez Cruz
K. Tylinski
Alastair Moore
Niall Roche
Nikhil Vadgama
Horst Treiblmaier
J. Shangguan
Paolo Tasca
Jiahua Xu
47
2
0
23 Aug 2023
Previous
123...202122232425
Next