Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 23,491 papers shown
Title
Multi-encoder nnU-Net outperforms transformer models with self-supervised pretraining
Seyedeh Sahar Taheri Otaghsara
Reza Rahmanzadeh
ViT
75
0
0
01 Jul 2025
A Pre-trained Sequential Recommendation Framework: Popularity Dynamics for Zero-shot Transfer
Junting Wang
Praneet Rathi
Hari Sundaram
HAI
VLM
47
5
0
01 Jul 2025
Are Bias Evaluation Methods Biased ?
Lina Berrayana
Sean Rooney
Luis Garces-Erice
Ioana Giurgiu
ELM
30
0
0
20 Jun 2025
Mechanisms vs. Outcomes: Probing for Syntax Fails to Explain Performance on Targeted Syntactic Evaluations
Ananth Agarwal
Jasper Jian
Christopher D. Manning
Shikhar Murty
21
0
0
20 Jun 2025
Private Training & Data Generation by Clustering Embeddings
Felix Y. Zhou
Samson Zhou
Vahab Mirrokni
Alessandro Epasto
Vincent Cohen-Addad
24
0
0
20 Jun 2025
Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model
Side Liu
Jiang Ming
Guodong Zhou
Xinyi Liu
Jianming Fu
Guojun Peng
AAML
32
0
0
20 Jun 2025
A Simple Contrastive Framework Of Item Tokenization For Generative Recommendation
Penglong Zhai
Yifang Yuan
Fanyi Di
Jie Li
Y. Liu
Chen Li
Jie Huang
S. Wang
Yao Xu
X. Li
17
0
0
20 Jun 2025
Unpacking Generative AI in Education: Computational Modeling of Teacher and Student Perspectives in Social Media Discourse
Paulina DeVito
Akhil Vallala
Sean Mcmahon
Yaroslav Hinda
Benjamin Thaw
Hanqi Zhuang
Hari Kalva
19
0
0
19 Jun 2025
Subspace-Boosted Model Merging
Ronald Skorobogat
Karsten Roth
Mariana-Iuliana Georgescu
Zeynep Akata
MoMe
25
0
0
19 Jun 2025
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights
Zhiyuan Liang
Dongwen Tang
Yuhao Zhou
Xuanlei Zhao
Mingjia Shi
...
Damian Borth
Michael M. Bronstein
Yang You
Zhangyang Wang
Kai Wang
OffRL
25
0
0
19 Jun 2025
Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues
Myke C. Cohen
Zhe Su
Hsien-Te Kao
Daniel Nguyen
Spencer Lynch
Maarten Sap
Svitlana Volkova
26
0
0
19 Jun 2025
Knee-Deep in C-RASP: A Transformer Depth Hierarchy
Andy Yang
Michaël Cadilhac
David Chiang
22
0
0
19 Jun 2025
CRIA: A Cross-View Interaction and Instance-Adapted Pre-training Framework for Generalizable EEG Representations
Puchun Liu
Cheng Chen
Yubin He
Tong Zhang
10
0
0
19 Jun 2025
Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support
Sophie Chiang
Guy Laban
Hatice Gunes
48
0
0
19 Jun 2025
Bridging Brain with Foundation Models through Self-Supervised Learning
Hamdi Altaheri
Fakhri Karray
Md. Milon Islam
S M Taslim Uddin Raju
Amir-Hossein Karimi
28
0
0
19 Jun 2025
Relic: Enhancing Reward Model Generalization for Low-Resource Indic Languages with Few-Shot Examples
Soumya Suvra Ghosal
Vaibhav Singh
Akash Ghosh
Soumyabrata Pal
Subhadip Baidya
Sriparna Saha
Dinesh Manocha
22
0
0
19 Jun 2025
A Vietnamese Dataset for Text Segmentation and Multiple Choices Reading Comprehension
Toan Nguyen Hai
Ha Nguyen Viet
Truong Quan Xuan
Duc Do Minh
29
0
0
19 Jun 2025
PR-DETR: Injecting Position and Relation Prior for Dense Video Captioning
Yizhe Li
Sanping Zhou
Zheng Qin
Le Wang
ViT
20
0
0
19 Jun 2025
GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View
Fenghua Cheng
Jinxiang Wang
Sen Wang
Zi Huang
Xue Li
LRM
29
0
0
19 Jun 2025
LBMamba: Locally Bi-directional Mamba
Jingwei Zhang
Xi Han
Hong Qin
Mahdi S. Hosseini
Dimitris Samaras
Mamba
44
0
0
19 Jun 2025
Advancing Harmful Content Detection in Organizational Research: Integrating Large Language Models with Elo Rating System
Mustafa Akben
Aaron Satko
15
0
0
19 Jun 2025
Black-Box Privacy Attacks on Shared Representations in Multitask Learning
John Abascal
Nicolás Berrios
Alina Oprea
Jonathan R. Ullman
Adam D. Smith
Matthew Jagielski
MLAU
37
0
0
19 Jun 2025
Optimizing MoE Routers: Design, Implementation, and Evaluation in Transformer Models
Daniel Fidel Harvey
George Weale
Berk Yilmaz
MoE
16
0
0
19 Jun 2025
Self-Critique-Guided Curiosity Refinement: Enhancing Honesty and Helpfulness in Large Language Models via In-Context Learning
Duc Hieu Ho
Chenglin Fan
HILM
LRM
23
0
0
19 Jun 2025
A Brain-to-Population Graph Learning Framework for Diagnosing Brain Disorders
Qianqian Liao
Wuque Cai
Hongze Sun
Dongze Liu
Duo Chen
Dezhong Yao
Daqing Guo
22
0
0
19 Jun 2025
Optimizing Multilingual Text-To-Speech with Accents & Emotions
Pranav Pawar
Akshansh Dwivedi
Jenish Boricha
Himanshu Gohil
Aditya Dubey
12
0
0
19 Jun 2025
Towards Generalizable Generic Harmful Speech Datasets for Implicit Hate Speech Detection
Saad Almohaimeed
Saleh Almohaimeed
D. Turgut
Ladislau Bölöni
15
0
0
19 Jun 2025
MoR: Better Handling Diverse Queries with a Mixture of Sparse, Dense, and Human Retrievers
Jushaan Singh Kalra
Xinran Zhao
To Eun Kim
Fengyu Cai
Fernando Diaz
Tongshuang Wu
VLM
23
0
0
18 Jun 2025
SecFwT: Efficient Privacy-Preserving Fine-Tuning of Large Language Models Using Forward-Only Passes
Jinglong Luo
Zhuo Zhang
Yehong Zhang
Shiyu Liu
Ye Dong
Xun Zhou
Hui Wang
Yue Yu
Zenglin Xu
19
0
0
18 Jun 2025
Explainable speech emotion recognition through attentive pooling: insights from attention-based temporal localization
Tahitoa Leygue
Astrid Sabourin
Christian Bolzmacher
Sylvain Bouchigny
Margarita Anastassova
Quoc-Cuong Pham
12
0
0
18 Jun 2025
Enhancing Hyperbole and Metaphor Detection with Their Bidirectional Dynamic Interaction and Emotion Knowledge
Li Zheng
Sihang Wang
Hao Fei
Zuquan Peng
Fei Li
Jianming Fu
Chong Teng
Donghong Ji
17
0
0
18 Jun 2025
Intelligent Assistants for the Semiconductor Failure Analysis with LLM-Based Planning Agents
Aline Dobrovsky
Konstantin Schekotihin
Christian Burmer
LLMAG
24
0
0
18 Jun 2025
A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals
Andrea Cadeddu
Alessandro Chessa
Vincenzo De Leo
Gianni Fenu
Enrico Motta
Francesco Osborne
Diego Reforgiato Recupero
Angelo Salatino
Luca Secchi
26
0
0
18 Jun 2025
VLMInferSlow: Evaluating the Efficiency Robustness of Large Vision-Language Models as a Service
X. Wang
Tianliang Yao
S. Chen
Runqi Wang
Lei YE
Kuofeng Gao
Yi Huang
Yuan Yao
VLM
24
0
0
18 Jun 2025
Demystifying the Visual Quality Paradox in Multimodal Large Language Models
Shuo Xing
Lanqing guo
Hongyuan Hua
Seoyoung Lee
Peiran Li
Yufei Wang
Zhangyang Wang
Zhengzhong Tu
VLM
47
0
0
18 Jun 2025
Cohort Discovery: A Survey on LLM-Assisted Clinical Trial Recruitment
Shrestha Ghosh
Moritz Schneider
Carina Reinicke
Carsten Eickhoff
22
0
0
18 Jun 2025
Sequential Policy Gradient for Adaptive Hyperparameter Optimization
Zheng Li
Jerry Q. Cheng
Huanying Gu
OffRL
26
0
0
18 Jun 2025
Research on Graph-Retrieval Augmented Generation Based on Historical Text Knowledge Graphs
Yang Fan
Zhang Qi
Xing Wenqian
Liu Chang
Liu Liu
RALM
20
0
0
18 Jun 2025
SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models' Knowledge of Indian Culture
Arijit Maji
Raghvendra Kumar
Akash Ghosh
Anushka
Sriparna Saha
ELM
25
0
0
18 Jun 2025
MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning
Leonid Ivanov
Vasily Yuryev
Dmitry Yudin
19
0
0
18 Jun 2025
Targeted Lexical Injection: Unlocking Latent Cross-Lingual Alignment in Lugha-Llama via Early-Layer LoRA Fine-Tuning
Stanley Ngugi
37
0
0
18 Jun 2025
From LLMs to MLLMs to Agents: A Survey of Emerging Paradigms in Jailbreak Attacks and Defenses within LLM Ecosystem
Yanxu Mao
Tiehan Cui
Peipei Liu
Datao You
Hongsong Zhu
AAML
17
0
0
18 Jun 2025
ODD: Overlap-aware Estimation of Model Performance under Distribution Shift
Aayush Mishra
Anqi Liu
24
0
0
17 Jun 2025
Multi-Scale Finetuning for Encoder-based Time Series Foundation Models
Zhongzheng Qiao
Chenghao Liu
Y. Zhang
Ming Jin
Quang Pham
Qingsong Wen
P.N. Suganthan
Xudong Jiang
Savitha Ramasamy
AI4TS
AI4CE
24
0
0
17 Jun 2025
ELLIS Alicante at CQs-Gen 2025: Winning the critical thinking questions shared task: LLM-based question generation and selection
Lucile Favero
Daniel Frases
Juan Antonio Pérez-Ortiz
Tanja Kaser
Nuria Oliver
ELM
LRM
20
0
0
17 Jun 2025
Foundation Artificial Intelligence Models for Health Recognition Using Face Photographs (FAHR-Face)
Fridolin Haugg
Grace Lee
John He
Leonard Nürnberg
Dennis Bontempi
...
Christian Guthier
Benjamin H. Kann
Vadim N. Gladyshev
Hugo J. W. L. Aerts
Raymond H. Mak
22
0
0
17 Jun 2025
When Does Meaning Backfire? Investigating the Role of AMRs in NLI
Junghyun Min
Xiulin Yang
Shira Wein
LLMSV
44
0
0
17 Jun 2025
Into the Unknown: Applying Inductive Spatial-Semantic Location Embeddings for Predicting Individuals' Mobility Beyond Visited Places
Xinglei Wang
Tao Cheng
Stephen Law
Zichao Zeng
Ilya Ilyankou
Junyuan Liu
Lu Yin
Weiming Huang
Natchapon Jongwiriyanurak
HAI
29
0
0
17 Jun 2025
Adverse Event Extraction from Discharge Summaries: A New Dataset, Annotation Scheme, and Initial Findings
Imane Guellil
Salomé Andres
Atul Anand
Bruce Guthrie
Huayu Zhang
Abul Hasan
Honghan Wu
Beatrice Alex
17
0
0
17 Jun 2025
NetRoller: Interfacing General and Specialized Models for End-to-End Autonomous Driving
Ren Xin
Hongji Liu
Xiaodong Mei
Wenru Liu
Maosheng Ye
Zhili Chen
Jun Ma
34
0
0
17 Jun 2025
1
2
3
4
...
468
469
470
Next