ResearchTrend.AI

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM, SSL, SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 23,491 papers shown
Into the Unknown: Applying Inductive Spatial-Semantic Location Embeddings for Predicting Individuals' Mobility Beyond Visited Places
Xinglei Wang
Tao Cheng
Stephen Law
Zichao Zeng
Ilya Ilyankou
Junyuan Liu
Lu Yin
Weiming Huang
Natchapon Jongwiriyanurak
HAI
29
0
0
17 Jun 2025
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM
Yujun Wang
Jinhe Bi
Yunpu Ma
Soeren Pirk
MLLM
53
0
0
17 Jun 2025
When Does Meaning Backfire? Investigating the Role of AMRs in NLI
Junghyun Min
Xiulin Yang
Shira Wein
LLMSV
44
0
0
17 Jun 2025
Adverse Event Extraction from Discharge Summaries: A New Dataset, Annotation Scheme, and Initial Findings
Imane Guellil
Salomé Andres
Atul Anand
Bruce Guthrie
Huayu Zhang
Abul Hasan
Honghan Wu
Beatrice Alex
17
0
0
17 Jun 2025
Don't throw the baby out with the bathwater: How and why deep learning for ARC
Jack Cole
Mohamed Osman
LRM
43
0
0
17 Jun 2025
NetRoller: Interfacing General and Specialized Models for End-to-End Autonomous Driving
Ren Xin
Hongji Liu
Xiaodong Mei
Wenru Liu
Maosheng Ye
Zhili Chen
Jun Ma
34
0
0
17 Jun 2025
Fretting-Transformer: Encoder-Decoder Model for MIDI to Tablature Transcription
Anna Hamberger
Sebastian Murgul
Jochen Schmidt
Michael Heizmann
31
0
0
17 Jun 2025
Adjustment for Confounding using Pre-Trained Representations
Rickmer Schulte
David Rügamer
Thomas Nagler
CML, BDL
30
0
0
17 Jun 2025
ODD: Overlap-aware Estimation of Model Performance under Distribution Shift
Aayush Mishra
Anqi Liu
24
0
0
17 Jun 2025
Understand the Implication: Learning to Think for Pragmatic Understanding
S. Sravanthi
Kishan Maharaj
Sravani Gunnu
Abhijit Mishra
Pushpak Bhattacharyya
ReLM, LRM
29
0
0
16 Jun 2025
Detecting Hard-Coded Credentials in Software Repositories via LLMs
Chidera Biringa
Gökhan Kul
33
0
0
16 Jun 2025
Equitable Electronic Health Record Prediction with FAME: Fairness-Aware Multimodal Embedding
Nikkie Hooman
Zhongjie Wu
Eric C. Larson
Mehak Gupta
32
0
0
16 Jun 2025
ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection
Shang-Chi Tsai
Seiya Kawano
Angel García Contreras
Koichiro Yoshino
Yun-Nung Chen
LM&Ro
43
2
0
16 Jun 2025
DeSPITE: Exploring Contrastive Deep Skeleton-Pointcloud-IMU-Text Embeddings for Advanced Point Cloud Human Activity Understanding
Thomas Kreutz
M. Mühlhäuser
Alejandro Sánchez Guinea
43
0
0
16 Jun 2025
Assessing the Limits of In-Context Learning beyond Functions using Partially Ordered Relation
Debanjan Dutta
Faizanuddin Ansari
Swagatam Das
22
0
0
16 Jun 2025
Watermarking LLM-Generated Datasets in Downstream Tasks
Y. Liu
Tianshuo Cong
Michael Backes
Zheng Li
Yang Zhang
WaLM
47
0
0
16 Jun 2025
Hierarchical Multi-Positive Contrastive Learning for Patent Image Retrieval
Kshitij Kavimandan
Angelos Nalmpantis
Emma Beauxis-Aussalet
Robert-Jan Sips
40
0
0
16 Jun 2025
BOW: Bottlenecked Next Word Exploration
Ming Shen
Zhikun Xu
Xiao Ye
Jacob Dineen
Ben Zhou
OffRL, LRM
35
0
0
16 Jun 2025
Document-Level Tabular Numerical Cross-Checking: A Coarse-to-Fine Approach
Chaoxu Pang
Yixuan Cao
Ganbin Zhou
Hongwei Bran Li
Ping Luo
LMTD
44
0
0
16 Jun 2025
Sketched Sum-Product Networks for Joins
Brian Tsan
Abylay Amanbayev
Asoke Datta
Florin Rusu
31
0
0
16 Jun 2025
Abstract, Align, Predict: Zero-Shot Stance Detection via Cognitive Inductive Reasoning
Jun Ma
Fuqiang Niu
Dong Li
Jinzhou Cao
Genan Dai
Bowen Zhang
LRM
19
0
0
16 Jun 2025
Discrete Diffusion in Large Language and Multimodal Models: A Survey
Runpeng Yu
Qi Li
Xinchao Wang
DiffM, AI4CE
49
0
0
16 Jun 2025
FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design
Kai Lan
Jiayong Zhu
Jiangtong Li
Dawei Cheng
Guang-Sheng Chen
Changjun Jiang
LRM
30
0
0
16 Jun 2025
Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence
Yibo Yang
Sihao Liu
Chuan Rao
Bang An
Tiancheng Shen
Philip Torr
Ming-Hsuan Yang
Bernard Ghanem
31
0
0
16 Jun 2025
Flexible-length Text Infilling for Discrete Diffusion Models
Andrew Zhang
Anushka Sivakumar
Chiawei Tang
Chris Thomas
DiffM
28
0
0
16 Jun 2025
Large Language Models Enhanced by Plug and Play Syntactic Knowledge for Aspect-based Sentiment Analysis
Yuanhe Tian
Xu Li
Wei Wang
Guoqing Jin
Pengsen Cheng
Yan Song
KELM
30
0
0
15 Jun 2025
SoK: The Privacy Paradox of Large Language Models: Advancements, Privacy Risks, and Mitigation
Yashothara Shanmugarasa
Ming Ding
M. Chamikara
Thierry Rakotoarivelo
PILM, AILaw
84
0
0
15 Jun 2025
Enhancing Clinical Models with Pseudo Data for De-identification
Paul Landes
Aaron J Chaise
Tarak Nandi
Ravi K. Madduri
18
0
0
15 Jun 2025
Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models?
Daman Deep Singh
Ramanuj Bhattacharjee
Abhijnan Chakraborty
27
0
0
15 Jun 2025
Magnetoencephalography (MEG) Based Non-Invasive Chinese Speech Decoding
Zhihong Jia
Hongbin Wang
Yuanzhong Shen
Feng Hu
Jiayu An
Kai Shu
Dongrui Wu
28
1
0
15 Jun 2025
From Experts to a Generalist: Toward General Whole-Body Control for Humanoid Robots
Yuxuan Wang
Ming Yang
Weishuai Zeng
Yu Zhang
Xinrun Xu
Haobin Jiang
Ziluo Ding
Zongqing Lu
36
0
0
15 Jun 2025
Assessing the Role of Data Quality in Training Bilingual Language Models
Skyler Seto
Maartje ter Hoeve
Maureen de Seyssel
David Grangier
24
0
0
15 Jun 2025
Assessing the Performance Gap Between Lexical and Semantic Models for Information Retrieval With Formulaic Legal Language
Larissa Mori
Carlos Sousa de Oliveira
Yuehwern Yih
Mario Ventresca
AILaw, RALM, ELM
42
0
0
15 Jun 2025
CliniDial: A Naturally Occurring Multimodal Dialogue Dataset for Team Reflection in Action During Clinical Operation
Naihao Deng
Kapotaksha Das
Rada Mihalcea
Vitaliy Popov
M. Abouelenien
25
0
0
15 Jun 2025
Transforming Chatbot Text: A Sequence-to-Sequence Approach
Natesh Reddy
Mark Stamp
DeLMO, SILM
42
0
0
15 Jun 2025
JEBS: A Fine-grained Biomedical Lexical Simplification Task
William Xia
Ishita Unde
Brian D. Ondov
Dina Demner-Fushman
30
0
0
15 Jun 2025
Unleashing Diffusion and State Space Models for Medical Image Segmentation
Rong Wu
Ziqi Chen
Liming Zhong
Heng Li
Hai Shu
MedIm
41
0
0
15 Jun 2025
Profiling News Media for Factuality and Bias Using LLMs and the Fact-Checking Methodology of Human Experts
Zain Muhammad Mujahid
Dilshod Azizov
Maha Tufail Agro
Preslav Nakov
15
0
0
14 Jun 2025
A Pluggable Multi-Task Learning Framework for Sentiment-Aware Financial Relation Extraction
Jinming Luo
Hailin Wang
24
0
0
14 Jun 2025
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks
Zhou Chen
Zhiqiang Wei
Yuqi Bai
Xue Xiong
Jianmin Wu
3DV
19
0
0
14 Jun 2025
Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation
Xiangyan Chen
Yujian Gan
Matthew Purver
HILM
35
0
0
14 Jun 2025
Training-free LLM Merging for Multi-task Learning
Zichuan Fu
Xian Wu
Y. X. R. Wang
Wanyu Wang
Shanshan Ye
Hongzhi Yin
Yi-Ju Chang
Yefeng Zheng
Xiangyu Zhao
MoMe
22
0
0
14 Jun 2025
INTERPOS: Interaction Rhythm Guided Positional Morphing for Mobile App Recommender Systems
M. H. Maqbool
Moghis Fereidouni
Umar Farooq
A.B. Siddique
H. Foroosh
AI4TS
19
0
0
14 Jun 2025
Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics
Asifullah Khan
Muhammad Zaeem Khan
Saleha Jamshed
Sadia Ahmad
Aleesha Zainab
Kaynat Khatib
Faria Bibi
Abdul Rehman
OffRL, LRM
35
0
0
14 Jun 2025
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation
Zhuocheng Zhang
Yang Feng
Min Zhang
34
0
0
14 Jun 2025
A Survey of Foundation Models for IoT: Taxonomy and Criteria-Based Analysis
Hui Wei
Dong Yoon Lee
Shubham Rohal
Zhizhang Hu
Shiwei Fang
Shijia Pan
40
0
0
13 Jun 2025
Improving Causal Interventions in Amnesic Probing with Mean Projection or LEACE
Alicja Dobrzeniecka
Antske Fokkens
Pia Sommerauer
17
0
0
13 Jun 2025
Generative or Discriminative? Revisiting Text Classification in the Era of Transformers
Siva Rajesh Kasa
Karan Gupta
Sumegh Roychowdhury
Ashutosh Kumar
Yaswanth Biruduraju
Santhosh Kumar Kasa
Nikhil Pattisapu
Arindam Bhattacharya
Shailendra Agarwal
Vijay Huddar
27
0
0
13 Jun 2025
Large Language Models for History, Philosophy, and Sociology of Science: Interpretive Uses, Methodological Challenges, and Critical Perspectives
Arno Simons
Michael Zichert
Adrian Wüthrich
34
0
0
13 Jun 2025
Brewing Knowledge in Context: Distillation Perspectives on In-Context Learning
Chengye Li
Haiyun Liu
Yuanxi Li
24
0
0
13 Jun 2025