ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 17,997 papers shown
Title
Enhancing Keyphrase Extraction from Academic Articles Using Section Structure Information
Enhancing Keyphrase Extraction from Academic Articles Using Section Structure Information
Chengzhi Zhang
Xinyi Yan
Lei Zhao
Yingyi Zhang
14
0
0
20 May 2025
Interpretable Dual-Stream Learning for Local Wind Hazard Prediction in Vulnerable Communities
Interpretable Dual-Stream Learning for Local Wind Hazard Prediction in Vulnerable Communities
Mahmuda Akhter Nishu
Chenyu Huang
Milad Roohi
Xin Zhong
7
0
0
20 May 2025
Multi-Armed Bandits Meet Large Language Models
Multi-Armed Bandits Meet Large Language Models
Djallel Bouneffouf
Raphael Feraud
7
0
0
19 May 2025
Unlocking Non-Invasive Brain-to-Text
Unlocking Non-Invasive Brain-to-Text
Dulhan Jayalath
Gilad Landau
Oiwi Parker Jones
12
0
0
19 May 2025
An Empirical Study of Many-to-Many Summarization with Large Language Models
An Empirical Study of Many-to-Many Summarization with Large Language Models
Jiaan Wang
Fandong Meng
Zengkui Sun
Yunlong Liang
Yuxuan Cao
Jiarong Xu
Haoxiang Shi
Jie Zhou
22
0
0
19 May 2025
JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
Jieying Xue
Phuong Minh Nguyen
Minh Le Nguyen
Xin Liu
7
0
0
19 May 2025
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Yunseok Jang
Yeda Song
Sungryull Sohn
Lajanugen Logeswaran
Tiange Luo
Dong-Ki Kim
Kyunghoon Bae
Honglak Lee
VGen
12
0
0
19 May 2025
Attention-based clustering
Attention-based clustering
Rodrigo Maulen-Soto
Claire Boyer
Pierre Marion
17
0
0
19 May 2025
Think Before You Attribute: Improving the Performance of LLMs Attribution Systems
Think Before You Attribute: Improving the Performance of LLMs Attribution Systems
João Eduardo Batista
Emil Vatai
Mohamed Wahib
33
0
0
19 May 2025
Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training
Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training
Shane Bergsma
Nolan Dey
Gurpreet Gosal
Gavia Gray
Daria Soboleva
Joel Hestness
24
0
0
19 May 2025
Enhancing LLMs for Time Series Forecasting via Structure-Guided Cross-Modal Alignment
Enhancing LLMs for Time Series Forecasting via Structure-Guided Cross-Modal Alignment
Siming Sun
Kai Zhang
Xuejun Jiang
Wenchao Meng
Qinmin Yang
AI4TS
12
0
0
19 May 2025
ProDS: Preference-oriented Data Selection for Instruction Tuning
ProDS: Preference-oriented Data Selection for Instruction Tuning
Wenya Guo
Zhengkun Zhang
Xumeng Liu
Ying Zhang
Ziyu Lu
Haoze Zhu
Xubo Liu
Ruxue Yan
12
0
0
19 May 2025
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
Seanie Lee
Sangwoo Park
Dong Bok Lee
Dominik Wagner
Haebin Seong
Tobias Bocklet
Juho Lee
Sung Ju Hwang
FedML
12
0
0
19 May 2025
Sinusoidal Initialization, Time for a New Start
Sinusoidal Initialization, Time for a New Start
Alberto Fernández-Hernández
Jose I. Mestre
Manuel F. Dolz
Jose Duato
Enrique S. Quintana-Ortí
ODL
AI4CE
19
0
0
19 May 2025
MSVIT: Improving Spiking Vision Transformer Using Multi-scale Attention Fusion
MSVIT: Improving Spiking Vision Transformer Using Multi-scale Attention Fusion
Wei Hua
Chenlin Zhou
Jibin Wu
Yansong Chua
Yangyang Shu
7
0
0
19 May 2025
Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
Yuxin Lin
Yinglin Zheng
Ming Zeng
Wangzheng Shi
19
0
0
19 May 2025
GDPRShield: AI-Powered GDPR Support for Software Developers in Small and Medium-Sized Enterprises
GDPRShield: AI-Powered GDPR Support for Software Developers in Small and Medium-Sized Enterprises
Tharaka Wijesundara
Mathew Warren
Nalin Arachchilage
12
0
0
19 May 2025
SMOTExT: SMOTE meets Large Language Models
SMOTExT: SMOTE meets Large Language Models
Mateusz Bystroński
Mikołaj Hołysz
Grzegorz Piotrowski
Nitesh Chawla
Tomasz Kajdanowicz
17
0
0
19 May 2025
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
Fei Xie
Jiahao Nie
Yujin Tang
W. Zhang
Hongshen Zhao
Mamba
13
0
0
19 May 2025
DD-Ranking: Rethinking the Evaluation of Dataset Distillation
DD-Ranking: Rethinking the Evaluation of Dataset Distillation
Zekai Li
Xinhao Zhong
Samir Khaki
Zhiyuan Liang
Yuhao Zhou
...
Konstantinos N Plataniotis
Zhangyang Wang
Bo Zhao
Yang You
Kai Wang
DD
23
0
0
19 May 2025
CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling
CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling
Aditeya Baral
Allen George Ajith
Roshan Nayak
Mrityunjay Abhijeet Bhanja
17
0
0
19 May 2025
SPKLIP: Aligning Spike Video Streams with Natural Language
SPKLIP: Aligning Spike Video Streams with Natural Language
Yongchang Gao
Meiling Jin
Zhaofei Yu
Tiejun Huang
Guozhang Chen
CLIP
VLM
22
0
0
19 May 2025
To Bias or Not to Bias: Detecting bias in News with bias-detector
To Bias or Not to Bias: Detecting bias in News with bias-detector
Himel Ghosh
Ahmed Mosharafa
Georg Groh
14
0
0
19 May 2025
LLM-Based Compact Reranking with Document Features for Scientific Retrieval
LLM-Based Compact Reranking with Document Features for Scientific Retrieval
Runchu Tian
Xueqiang Xu
Bowen Jin
SeongKu Kang
Jiawei Han
12
0
0
19 May 2025
Krikri: Advancing Open Large Language Models for Greek
Krikri: Advancing Open Large Language Models for Greek
Dimitris Roussis
Leon Voukoutis
Georgios Paraskevopoulos
Sokratis Sofianopoulos
Prokopis Prokopidis
Vassilis Papavasileiou
Athanasios Katsamanis
Stelios Piperidis
Vassilis Katsouros
ALM
30
0
0
19 May 2025
Suicide Risk Assessment Using Multimodal Speech Features: A Study on the SW1 Challenge Dataset
Suicide Risk Assessment Using Multimodal Speech Features: A Study on the SW1 Challenge Dataset
Ambre Marie
Ilias Maoudj
Guillaume Dardenne
Gwenolé Quellec
12
0
0
19 May 2025
Rank, Chunk and Expand: Lineage-Oriented Reasoning for Taxonomy Expansion
Rank, Chunk and Expand: Lineage-Oriented Reasoning for Taxonomy Expansion
Sahil Mishra
Kumar Arjun
Tanmoy Chakraborty
12
0
0
19 May 2025
Contrastive Prompting Enhances Sentence Embeddings in LLMs through Inference-Time Steering
Contrastive Prompting Enhances Sentence Embeddings in LLMs through Inference-Time Steering
Zifeng Cheng
Zhonghui Wang
Yuchen Fu
Zhiwei Jiang
Yafeng Yin
Cong Wang
Qing Gu
17
0
0
19 May 2025
Re-identification of De-identified Documents with Autoregressive Infilling
Re-identification of De-identified Documents with Autoregressive Infilling
Lucas Georges Gabriel Charpentier
Pierre Lison
25
0
0
19 May 2025
Advancing Software Quality: A Standards-Focused Review of LLM-Based Assurance Techniques
Advancing Software Quality: A Standards-Focused Review of LLM-Based Assurance Techniques
Avinash Patil
17
0
0
19 May 2025
EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
Wenhao Zhu
Yuhang Xie
Guojie Song
Xin Zhang
19
0
0
19 May 2025
Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR
Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR
Xugang Lu
Peng Shen
Yu Tsao
Hisashi Kawai
OT
12
0
0
19 May 2025
AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database
AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database
Rong Bian
Yu Geng
Zijian Yang
Bing Cheng
17
0
0
19 May 2025
Enhancing Knowledge Graph Completion with GNN Distillation and Probabilistic Interaction Modeling
Enhancing Knowledge Graph Completion with GNN Distillation and Probabilistic Interaction Modeling
Lingzhi Wang
Pengcheng Huang
Haotian Li
Yuliang Wei
Guodong Xin
Rui Zhang
Donglin Zhang
Zhenzhou Ji
Wei Wang
12
0
0
18 May 2025
CompBench: Benchmarking Complex Instruction-guided Image Editing
CompBench: Benchmarking Complex Instruction-guided Image Editing
Bohan Jia
Wenxuan Huang
Yuntian Tang
Junbo Qiao
Jincheng Liao
...
Lin Chen
Fei Zhao
Zihan Wang
Yuan Xie
Shaohui Lin
CoGe
19
0
0
18 May 2025
A Survey of Attacks on Large Language Models
A Survey of Attacks on Large Language Models
Wenrui Xu
Keshab K. Parhi
AAML
ELM
29
0
0
18 May 2025
A Survey on Side Information-driven Session-based Recommendation: From a Data-centric Perspective
A Survey on Side Information-driven Session-based Recommendation: From a Data-centric Perspective
Xiaokun Zhang
Bo Xu
Chenliang Li
Bowei He
Hongfei Lin
Chen Ma
Fenglong Ma
22
0
0
18 May 2025
NeuroGen: Neural Network Parameter Generation via Large Language Models
NeuroGen: Neural Network Parameter Generation via Large Language Models
Jiaqi Wang
Yusen Zhang
Xi Li
12
0
0
18 May 2025
PartDexTOG: Generating Dexterous Task-Oriented Grasping via Language-driven Part Analysis
PartDexTOG: Generating Dexterous Task-Oriented Grasping via Language-driven Part Analysis
Weishang Wu
Yifei Shi
Zhizhong Chen
Zhipong Cai
7
0
0
18 May 2025
LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference
LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference
Guangyuan Ma
Yongliang Ma
Xuanrui Gou
Zhenpeng Su
Ming Zhou
Songlin Hu
RALM
27
0
0
18 May 2025
Relation Extraction or Pattern Matching? Unravelling the Generalisation Limits of Language Models for Biographical RE
Relation Extraction or Pattern Matching? Unravelling the Generalisation Limits of Language Models for Biographical RE
Varvara Arzt
Allan Hanbury
Michael Wiegand
Gábor Recski
Terra Blevins
15
0
0
18 May 2025
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models
Kai Tang
Jinhao You
Xiuqi Ge
Hanze Li
Yichen Guo
Xiande Huang
MLLM
12
0
0
18 May 2025
SRLoRA: Subspace Recomposition in Low-Rank Adaptation via Importance-Based Fusion and Reinitialization
SRLoRA: Subspace Recomposition in Low-Rank Adaptation via Importance-Based Fusion and Reinitialization
Haodong Yang
Lei Wang
Md Zakir Hossain
17
0
0
18 May 2025
Information Extraction from Visually Rich Documents using LLM-based Organization of Documents into Independent Textual Segments
Information Extraction from Visually Rich Documents using LLM-based Organization of Documents into Independent Textual Segments
Aniket Bhattacharyya
Anurag Tripathi
Ujjal Das
Archan Karmakar
Amit Pathak
Maneesh Gupta
9
0
0
18 May 2025
Truth Neurons
Truth Neurons
Haohang Li
Yupeng Cao
Yangyang Yu
Jordan W. Suchow
Zining Zhu
HILM
MILM
KELM
13
0
0
18 May 2025
MTIL: Encoding Full History with Mamba for Temporal Imitation Learning
MTIL: Encoding Full History with Mamba for Temporal Imitation Learning
Yulin Zhou
Yuankai Lin
Fanzhe Peng
Jiahui Chen
Zhuang Zhou
Kaiji Huang
Hua Yang
Zhouping Yin
Mamba
11
0
0
18 May 2025
Curriculum Abductive Learning
Curriculum Abductive Learning
Wen-Chao Hu
Qi-Jie Li
Lin Jia
Cunjing Ge
Yu-Feng Li
Yuan Jiang
Zhi-Hua Zhou
17
0
0
18 May 2025
MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark
MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark
Yiwei Ou
Xiaobin Ren
Ronggui Sun
Guansong Gao
Ziyi Jiang
Kaiqi Zhao
Manfredo Manfredini
2
0
0
18 May 2025
S-Crescendo: A Nested Transformer Weaving Framework for Scalable Nonlinear System in S-Domain Representation
S-Crescendo: A Nested Transformer Weaving Framework for Scalable Nonlinear System in S-Domain Representation
Junlang Huang
Hao Chen
Li Luo
Yong Cai
Lexin Zhang
Tianhao Ma
Yitian Zhang
Zhong Guan
12
0
0
17 May 2025
ELITE: Embedding-Less retrieval with Iterative Text Exploration
ELITE: Embedding-Less retrieval with Iterative Text Exploration
Zhenting Wang
Siyuan Gao
Rong Zhou
Hao Wang
Li Ning
RALM
3DV
14
0
0
17 May 2025
Previous
12345...358359360
Next