ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 16,719 papers shown
Title
Think Before You Attribute: Improving the Performance of LLMs Attribution Systems
Think Before You Attribute: Improving the Performance of LLMs Attribution Systems
João Eduardo Batista
Emil Vatai
M. Wahib
26
0
0
19 May 2025
Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals
Yuxin Lin
Yinglin Zheng
Ming Zeng
Wangzheng Shi
7
0
0
19 May 2025
Contrastive Prompting Enhances Sentence Embeddings in LLMs through Inference-Time Steering
Contrastive Prompting Enhances Sentence Embeddings in LLMs through Inference-Time Steering
Zifeng Cheng
Zhonghui Wang
Yuchen Fu
Zhiwei Jiang
Yafeng Yin
Cong Wang
Qing Gu
14
0
0
19 May 2025
CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling
CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling
Aditeya Baral
Allen George Ajith
Roshan Nayak
Mrityunjay Abhijeet Bhanja
7
0
0
19 May 2025
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
Fei Xie
Jiahao Nie
Yujin Tang
W. Zhang
Hongshen Zhao
Mamba
8
0
0
19 May 2025
EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
Wenhao Zhu
Yuhang Xie
Guojie Song
Xin Zhang
7
0
0
19 May 2025
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
Seanie Lee
Sangwoo Park
Dong Bok Lee
Dominik Wagner
Haebin Seong
Tobias Bocklet
Juho Lee
Sung Ju Hwang
FedML
7
0
0
19 May 2025
Sinusoidal Initialization, Time for a New Start
Sinusoidal Initialization, Time for a New Start
Alberto Fernández-Hernández
Jose I. Mestre
Manuel F. Dolz
Jose Duato
Enrique S. Quintana-Ortí
ODL
AI4CE
2
0
0
19 May 2025
SPKLIP: Aligning Spike Video Streams with Natural Language
SPKLIP: Aligning Spike Video Streams with Natural Language
Yongchang Gao
Meiling Jin
Zhaofei Yu
Tiejun Huang
Guozhang Chen
CLIP
VLM
2
0
0
19 May 2025
ProDS: Preference-oriented Data Selection for Instruction Tuning
ProDS: Preference-oriented Data Selection for Instruction Tuning
Wenya Guo
Zhengkun Zhang
Xumeng Liu
Ying Zhang
Ziyu Lu
Haoze Zhu
Xubo Liu
Ruxue Yan
7
0
0
19 May 2025
Re-identification of De-identified Documents with Autoregressive Infilling
Re-identification of De-identified Documents with Autoregressive Infilling
Lucas Georges Gabriel Charpentier
Pierre Lison
2
0
0
19 May 2025
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
Yunseok Jang
Yeda Song
Sungryull Sohn
Lajanugen Logeswaran
Tiange Luo
Dong-Ki Kim
Kyunghoon Bae
Honglak Lee
VGen
4
0
0
19 May 2025
GDPRShield: AI-Powered GDPR Support for Software Developers in Small and Medium-Sized Enterprises
GDPRShield: AI-Powered GDPR Support for Software Developers in Small and Medium-Sized Enterprises
Tharaka Wijesundara
Mathew Warren
Nalin Arachchilage
7
0
0
19 May 2025
An Empirical Study of Many-to-Many Summarization with Large Language Models
An Empirical Study of Many-to-Many Summarization with Large Language Models
Jiaan Wang
Fandong Meng
Zengkui Sun
Yunlong Liang
Yuxuan Cao
Jiarong Xu
Haoxiang Shi
Jie Zhou
9
0
0
19 May 2025
To Bias or Not to Bias: Detecting bias in News with bias-detector
To Bias or Not to Bias: Detecting bias in News with bias-detector
Himel Ghosh
Ahmed Mosharafa
Georg Groh
7
0
0
19 May 2025
MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark
MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark
Yiwei Ou
Xiaobin Ren
Ronggui Sun
Guansong Gao
Ziyi Jiang
Kaiqi Zhao
Manfredo Manfredini
2
0
0
18 May 2025
CompBench: Benchmarking Complex Instruction-guided Image Editing
CompBench: Benchmarking Complex Instruction-guided Image Editing
Bohan Jia
Wenxuan Huang
Yuntian Tang
Junbo Qiao
Jincheng Liao
...
Lin Chen
Fei Zhao
Zihan Wang
Yuan Xie
Shaohui Lin
CoGe
12
0
0
18 May 2025
Enhancing Knowledge Graph Completion with GNN Distillation and Probabilistic Interaction Modeling
Enhancing Knowledge Graph Completion with GNN Distillation and Probabilistic Interaction Modeling
Lingzhi Wang
Pengcheng Huang
Haotian Li
Yuliang Wei
Guodong Xin
Rui Zhang
Donglin Zhang
Zhenzhou Ji
Wei Wang
2
0
0
18 May 2025
PartDexTOG: Generating Dexterous Task-Oriented Grasping via Language-driven Part Analysis
PartDexTOG: Generating Dexterous Task-Oriented Grasping via Language-driven Part Analysis
Weishang Wu
Yifei Shi
Zhizhong Chen
Zhipong Cai
2
0
0
18 May 2025
A Survey on Side Information-driven Session-based Recommendation: From a Data-centric Perspective
A Survey on Side Information-driven Session-based Recommendation: From a Data-centric Perspective
Xiaokun Zhang
Bo Xu
Chenliang Li
Bowei He
Hongfei Lin
Chen Ma
Fenglong Ma
2
0
0
18 May 2025
Relation Extraction or Pattern Matching? Unravelling the Generalisation Limits of Language Models for Biographical RE
Relation Extraction or Pattern Matching? Unravelling the Generalisation Limits of Language Models for Biographical RE
Varvara Arzt
Allan Hanbury
Michael Wiegand
Gábor Recski
Terra Blevins
7
0
0
18 May 2025
NeuroGen: Neural Network Parameter Generation via Large Language Models
NeuroGen: Neural Network Parameter Generation via Large Language Models
Jiaqi Wang
Yusen Zhang
Xi Li
2
0
0
18 May 2025
A Survey of Attacks on Large Language Models
A Survey of Attacks on Large Language Models
Wenrui Xu
Keshab K. Parhi
AAML
ELM
12
0
0
18 May 2025
SRLoRA: Subspace Recomposition in Low-Rank Adaptation via Importance-Based Fusion and Reinitialization
SRLoRA: Subspace Recomposition in Low-Rank Adaptation via Importance-Based Fusion and Reinitialization
Haodong Yang
Lei Wang
Md Zakir Hossain
7
0
0
18 May 2025
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models
Kai Tang
Jinhao You
Xiuqi Ge
Hanze Li
Yichen Guo
Xiande Huang
MLLM
4
0
0
18 May 2025
Truth Neurons
Truth Neurons
Haohang Li
Yupeng Cao
Yangyang Yu
Jordan W. Suchow
Zining Zhu
HILM
MILM
KELM
3
0
0
18 May 2025
Curriculum Abductive Learning
Curriculum Abductive Learning
Wen-Chao Hu
Qi-Jie Li
Lin Jia
Cunjing Ge
Yu-Feng Li
Yuan Jiang
Zhi-Hua Zhou
6
0
0
18 May 2025
MTIL: Encoding Full History with Mamba for Temporal Imitation Learning
MTIL: Encoding Full History with Mamba for Temporal Imitation Learning
Yulin Zhou
Yuankai Lin
Fanzhe Peng
Jiahui Chen
Zhuang Zhou
Kaiji Huang
Hua Yang
Zhouping Yin
Mamba
6
0
0
18 May 2025
LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference
LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference
Guangyuan Ma
Yongliang Ma
Xuanrui Gou
Zhenpeng Su
Ming Zhou
Songlin Hu
RALM
12
0
0
18 May 2025
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
Fitsum Gaim
Hoyun Song
Huije Lee
Changgeon Ko
Eui Jun Hwang
Jong C. Park
4
0
0
17 May 2025
S-Crescendo: A Nested Transformer Weaving Framework for Scalable Nonlinear System in S-Domain Representation
S-Crescendo: A Nested Transformer Weaving Framework for Scalable Nonlinear System in S-Domain Representation
Junlang Huang
Hao Chen
Li Luo
Yong Cai
Lexin Zhang
Tianhao Ma
Yitian Zhang
Zhong Guan
2
0
0
17 May 2025
ELITE: Embedding-Less retrieval with Iterative Text Exploration
ELITE: Embedding-Less retrieval with Iterative Text Exploration
Zhenting Wang
Siyuan Gao
Rong Zhou
Hao Wang
Li Ning
RALM
3DV
2
0
0
17 May 2025
Towards Comprehensive Argument Analysis in Education: Dataset, Tasks, and Method
Towards Comprehensive Argument Analysis in Education: Dataset, Tasks, and Method
Yupei Ren
Xinyi Zhou
Ning Zhang
Shangqing Zhao
Man Lan
Xiaopeng Bai
2
0
0
17 May 2025
Personalized Author Obfuscation with Large Language Models
Personalized Author Obfuscation with Large Language Models
Mohammad Shokri
Sarah Ita Levitan
Rivka Levitan
2
0
0
17 May 2025
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing
Aybora Koksal
A. Aydin Alatan
LRM
2
0
0
17 May 2025
On Membership Inference Attacks in Knowledge Distillation
On Membership Inference Attacks in Knowledge Distillation
Ziyao Cui
Minxing Zhang
Jian Pei
2
0
0
17 May 2025
Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals
Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals
Dena F. Mujtaba
Nihar R. Mahapatra
2
0
0
17 May 2025
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research
Renqi Chen
Haoyang Su
Shixiang Tang
Zhenfei Yin
Qi Wu
Hui Li
Ye Sun
Nanqing Dong
Wanli Ouyang
Philip Torr
AI4CE
7
0
0
17 May 2025
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Josh Alman
Zhao Song
4
0
0
17 May 2025
PROBE: Proprioceptive Obstacle Detection and Estimation while Navigating in Clutter
PROBE: Proprioceptive Obstacle Detection and Estimation while Navigating in Clutter
Dhruv Metha Ramesh
Aravind Sivaramakrishnan
Shreesh Keskar
Kostas E. Bekris
Jingjin Yu
Abdeslam Boularias
2
0
0
17 May 2025
Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding Tasks
Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding Tasks
Chenlu Wang
Weimin Lyu
Ritwik Banerjee
2
0
0
17 May 2025
Generative and Contrastive Graph Representation Learning
Generative and Contrastive Graph Representation Learning
Jiali Chen
Avijit Mukherjee
SSL
2
0
0
17 May 2025
VITA: Versatile Time Representation Learning for Temporal Hyper-Relational Knowledge Graphs
VITA: Versatile Time Representation Learning for Temporal Hyper-Relational Knowledge Graphs
ChongIn Un
Yuhuan Lu
Tianyue Yang
Dingqi Yang
4
0
0
17 May 2025
EmoHopeSpeech: An Annotated Dataset of Emotions and Hope Speech in English
EmoHopeSpeech: An Annotated Dataset of Emotions and Hope Speech in English
Md Rafiul Biswas
Wajdi Zaghouani
2
0
0
17 May 2025
Enhanced Multimodal Hate Video Detection via Channel-wise and Modality-wise Fusion
Enhanced Multimodal Hate Video Detection via Channel-wise and Modality-wise Fusion
Yinghui Zhang
Tailin Chen
Yuchen Zhang
Zeyu Fu
2
0
0
17 May 2025
RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving
RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving
Zepeng Ding
Dixuan Wang
Ziqin Luo
Guochao Jiang
Deqing Yang
Jiaqing Liang
2
0
0
17 May 2025
Multimodal Event Detection: Current Approaches and Defining the New Playground through LLMs and VLMs
Multimodal Event Detection: Current Approaches and Defining the New Playground through LLMs and VLMs
Abhishek Dey
Aabha Bothera
Samhita Sarikonda
Rishav Aryan
Sanjay Kumar Podishetty
Akshay Havalgi
Gaurav Singh
Saurabh Srivastava
12
0
0
16 May 2025
On Next-Token Prediction in LLMs: How End Goals Determine the Consistency of Decoding Algorithms
On Next-Token Prediction in LLMs: How End Goals Determine the Consistency of Decoding Algorithms
Jacob Trauger
Ambuj Tewari
17
0
0
16 May 2025
Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification
Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification
Wenhao Qian
Zhenzhen Hu
Zijie Song
Jia Li
14
0
0
16 May 2025
Comparing Lexical and Semantic Vector Search Methods When Classifying Medical Documents
Comparing Lexical and Semantic Vector Search Methods When Classifying Medical Documents
Lee Harris
Philippe De Wilde
James Bentham
2
0
0
16 May 2025
1234...333334335
Next