ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 17,917 papers shown
Title
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings
Fitsum Gaim
Hoyun Song
Huije Lee
Changgeon Ko
Eui Jun Hwang
Jong C. Park
7
0
0
17 May 2025
VITA: Versatile Time Representation Learning for Temporal Hyper-Relational Knowledge Graphs
VITA: Versatile Time Representation Learning for Temporal Hyper-Relational Knowledge Graphs
ChongIn Un
Yuhuan Lu
Tianyue Yang
Dingqi Yang
4
0
0
17 May 2025
EmoHopeSpeech: An Annotated Dataset of Emotions and Hope Speech in English and Arabic
EmoHopeSpeech: An Annotated Dataset of Emotions and Hope Speech in English and Arabic
Md Rafiul Biswas
Wajdi Zaghouani
13
0
0
17 May 2025
ELITE: Embedding-Less retrieval with Iterative Text Exploration
ELITE: Embedding-Less retrieval with Iterative Text Exploration
Zhangyu Wang
Siyuan Gao
Rong Zhou
Hao Wang
Li Ning
RALM
3DV
14
0
0
17 May 2025
Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding Tasks
Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding Tasks
Chenlu Wang
Weimin Lyu
Ritwik Banerjee
17
0
0
17 May 2025
S-Crescendo: A Nested Transformer Weaving Framework for Scalable Nonlinear System in S-Domain Representation
S-Crescendo: A Nested Transformer Weaving Framework for Scalable Nonlinear System in S-Domain Representation
Junlang Huang
Hao Chen
Li Luo
Yong Cai
Lexin Zhang
Tianhao Ma
Yitian Zhang
Zhong Guan
9
0
0
17 May 2025
Towards Comprehensive Argument Analysis in Education: Dataset, Tasks, and Method
Towards Comprehensive Argument Analysis in Education: Dataset, Tasks, and Method
Yupei Ren
Xinyi Zhou
Ning Zhang
Shangqing Zhao
Man Lan
Xiaopeng Bai
12
0
0
17 May 2025
Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals
Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals
Dena F. Mujtaba
Nihar R. Mahapatra
9
0
0
17 May 2025
PROBE: Proprioceptive Obstacle Detection and Estimation while Navigating in Clutter
PROBE: Proprioceptive Obstacle Detection and Estimation while Navigating in Clutter
Dhruv Metha Ramesh
Aravind Sivaramakrishnan
Shreesh Keskar
Kostas E. Bekris
Jingjin Yu
Abdeslam Boularias
7
0
0
17 May 2025
Personalized Author Obfuscation with Large Language Models
Personalized Author Obfuscation with Large Language Models
Mohammad Shokri
Sarah Ita Levitan
Rivka Levitan
14
0
0
17 May 2025
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Josh Alman
Zhao Song
27
12
0
17 May 2025
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research
Renqi Chen
Haoyang Su
Shixiang Tang
Zhenfei Yin
Qi Wu
Hui Li
Ye Sun
Nanqing Dong
Wanli Ouyang
Philip Torr
AI4CE
19
0
0
17 May 2025
On Membership Inference Attacks in Knowledge Distillation
On Membership Inference Attacks in Knowledge Distillation
Ziyao Cui
Minxing Zhang
Jian Pei
12
0
0
17 May 2025
RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving
RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving
Zepeng Ding
Dixuan Wang
Ziqin Luo
Guochao Jiang
Deqing Yang
Jiaqing Liang
7
0
0
17 May 2025
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing
Aybora Koksal
A. Aydin Alatan
LRM
14
0
0
17 May 2025
Evaluating Design Decisions for Dual Encoder-based Entity Disambiguation
Evaluating Design Decisions for Dual Encoder-based Entity Disambiguation
Susanna Rücker
Alan Akbik
17
0
0
16 May 2025
DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning
DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
26
0
0
16 May 2025
JaxSGMC: Modular stochastic gradient MCMC in JAX
JaxSGMC: Modular stochastic gradient MCMC in JAX
Stephan Thaler
Paul Fuchs
Ana Cukarska
Julija Zavadlav
BDL
48
2
0
16 May 2025
Distilled Circuits: A Mechanistic Study of Internal Restructuring in Knowledge Distillation
Distilled Circuits: A Mechanistic Study of Internal Restructuring in Knowledge Distillation
Reilly Haskins
Benjamin Adams
16
0
0
16 May 2025
SECRET: Semi-supervised Clinical Trial Document Similarity Search
SECRET: Semi-supervised Clinical Trial Document Similarity Search
Trisha Das
Afrah Shafquat
Beigi Mandis
Jacob Aptekar
Jimeng Sun
17
0
0
16 May 2025
Predicting Student Dropout Risk With A Dual-Modal Abrupt Behavioral Changes Approach
Predicting Student Dropout Risk With A Dual-Modal Abrupt Behavioral Changes Approach
Jiabei Cheng
Zhen-Qun Yang
Jiannong Cao
Yu Yang
Xinzhe Zheng
14
0
0
16 May 2025
Modeling cognitive processes of natural reading with transformer-based Language Models
Modeling cognitive processes of natural reading with transformer-based Language Models
Bruno Bianchi
Fermín Travi
Juan E. Kamienkowski
17
0
0
16 May 2025
StRuCom: A Novel Dataset of Structured Code Comments in Russian
StRuCom: A Novel Dataset of Structured Code Comments in Russian
Maria Dziuba
Valentin Malykh
16
0
0
16 May 2025
On Next-Token Prediction in LLMs: How End Goals Determine the Consistency of Decoding Algorithms
On Next-Token Prediction in LLMs: How End Goals Determine the Consistency of Decoding Algorithms
Jacob Trauger
Ambuj Tewari
22
0
0
16 May 2025
Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures
Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures
Shun Inadumi
Nobuhiro Ueda
Koichiro Yoshino
ObjD
12
0
0
16 May 2025
Comparing Lexical and Semantic Vector Search Methods When Classifying Medical Documents
Comparing Lexical and Semantic Vector Search Methods When Classifying Medical Documents
Lee Harris
Philippe De Wilde
James Bentham
4
0
0
16 May 2025
Multimodal Event Detection: Current Approaches and Defining the New Playground through LLMs and VLMs
Multimodal Event Detection: Current Approaches and Defining the New Playground through LLMs and VLMs
Abhishek Dey
Aabha Bothera
Samhita Sarikonda
Rishav Aryan
Sanjay Kumar Podishetty
Akshay Havalgi
Gaurav Singh
Saurabh Srivastava
12
0
0
16 May 2025
On the Interconnections of Calibration, Quantification, and Classifier Accuracy Prediction under Dataset Shift
On the Interconnections of Calibration, Quantification, and Classifier Accuracy Prediction under Dataset Shift
Alejandro Moreo
9
0
0
16 May 2025
Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges
Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges
Yuan Zhang
Xinfeng Zhang
Xiaoming Qi Xinyu Wu
Feng Chen
Guanyu Yang
Huazhu Fu
MedIm
LM&MA
AI4CE
31
0
0
16 May 2025
GeoMM: On Geodesic Perspective for Multi-modal Learning
GeoMM: On Geodesic Perspective for Multi-modal Learning
Shibin Mei
Hang Wang
Bingbing Ni
22
0
0
16 May 2025
Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification
Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification
Wenhao Qian
Zhenzhen Hu
Zijie Song
Jia Li
17
0
0
16 May 2025
Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline
Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline
Hrishit Madhavi
Jacob Cherian
Yuvraj Khamkar
Dhananjay Bhagat
VLM
24
0
0
16 May 2025
Sybil-based Virtual Data Poisoning Attacks in Federated Learning
Sybil-based Virtual Data Poisoning Attacks in Federated Learning
Changxun Zhu
Qilong Wu
Lingjuan Lyu
Shibei Xue
AAML
FedML
28
0
0
15 May 2025
The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks
The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks
Benedikt Ebing
Goran Glavaš
32
0
0
15 May 2025
LDIR: Low-Dimensional Dense and Interpretable Text Embeddings with Relative Representations
LDIR: Low-Dimensional Dense and Interpretable Text Embeddings with Relative Representations
Yile Wang
Zhanyu Shen
Hui Huang
34
0
0
15 May 2025
Artificial Intelligence Bias on English Language Learners in Automatic Scoring
Artificial Intelligence Bias on English Language Learners in Automatic Scoring
Shuchen Guo
Yue Wang
Jichao Yu
Xuansheng Wu
Bilgehan Ayik
Field M. Watts
Ehsan Latif
Ninghao Liu
Lei Liu
Xiaoming Zhai
17
0
0
15 May 2025
Task-Core Memory Management and Consolidation for Long-term Continual Learning
Task-Core Memory Management and Consolidation for Long-term Continual Learning
Tianyu Huai
Jie Zhou
Yuxuan Cai
Qin Chen
Wen Wu
Xingjiao Wu
Xipeng Qiu
Liang He
CLL
33
0
0
15 May 2025
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
Jintian Shao
Hongyi Huang
Jiayi Wu
Beiwen Zhang
ZhiYu Wu
You Shan
MingKai Zheng
29
0
0
15 May 2025
From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models
From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models
Yidan Wang
Yubing Ren
Yanan Cao
Binxing Fang
32
0
0
15 May 2025
Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models
Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models
Annie Wong
Thomas Bäck
Aske Plaat
Niki van Stein
Anna V. Kononova
ReLM
ELM
LRM
50
0
0
15 May 2025
IMAGE-ALCHEMY: Advancing subject fidelity in personalised text-to-image generation
IMAGE-ALCHEMY: Advancing subject fidelity in personalised text-to-image generation
Amritanshu Tiwari
Cherish Puniani
Kaustubh Sharma
Ojasva Nema
DiffM
24
0
0
15 May 2025
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Yixuan Wang
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRL
LLMAG
LM&Ro
LRM
19
0
0
15 May 2025
Demystifying AI Agents: The Final Generation of Intelligence
Demystifying AI Agents: The Final Generation of Intelligence
Kevin J McNamara
Rhea Pritham Marpu
31
0
0
15 May 2025
ADALog: Adaptive Unsupervised Anomaly detection in Logs with Self-attention Masked Language Model
ADALog: Adaptive Unsupervised Anomaly detection in Logs with Self-attention Masked Language Model
Przemek Pospieszny
Wojciech Mormul
Karolina Szyndler
Sanjeev Kumar
12
0
0
15 May 2025
Interim Report on Human-Guided Adaptive Hyperparameter Optimization with Multi-Fidelity Sprints
Interim Report on Human-Guided Adaptive Hyperparameter Optimization with Multi-Fidelity Sprints
Michael Kamfonas
34
0
0
14 May 2025
Adversarial Suffix Filtering: a Defense Pipeline for LLMs
Adversarial Suffix Filtering: a Defense Pipeline for LLMs
David Khachaturov
Robert D. Mullins
AAML
31
0
0
14 May 2025
A Multi-Task Foundation Model for Wireless Channel Representation Using Contrastive and Masked Autoencoder Learning
A Multi-Task Foundation Model for Wireless Channel Representation Using Contrastive and Masked Autoencoder Learning
Berkay Guler
Giovanni Geraci
Hamid Jafarkhani
38
0
0
14 May 2025
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias
Brandon Smith
Mohamed Reda Bouadjenek
Tahsin Alamgir Kheya
Phillip Dawson
S. Aryal
ALM
ELM
36
0
0
14 May 2025
Analog Foundation Models
Analog Foundation Models
Julian Büchel
Iason Chalas
Giovanni Acampa
An Chen
Omobayode Fagbohungbe
Sidney Tsai
Kaoutar El Maghraoui
Manuel Le Gallo
Abbas Rahimi
Abu Sebastian
MQ
35
0
0
14 May 2025
LiDDA: Data Driven Attribution at LinkedIn
LiDDA: Data Driven Attribution at LinkedIn
John Bencina
Erkut Aykutlug
Yue Chen
Zerui Zhang
Stephanie Sorenson
Shao Tang
Changshuai Wei
19
0
0
14 May 2025
Previous
123456...357358359
Next