arXiv:1907.11692
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 4,640 papers shown
AdUE: Improving uncertainty estimation head for LoRA adapters in LLMs
Artem Zabolotnyi
Roman Makarov
Mile Mitrovic
P. Proskura
Oleg Travkin
Roman Alferov
Alexey Zaytsev
UQCV
12
0
0
21 May 2025
Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning
Yukun Zhao
Lingyong Yan
Zhenyang Li
S. Wang
Zhumin Chen
Z. Ren
Dawei Yin
CLL
KELM
VLM
LRM
17
0
0
21 May 2025
Enhancing Abstractive Summarization of Scientific Papers Using Structure Information
Tong Bao
Heng Zhang
Chengzhi Zhang
17
1
0
20 May 2025
GDPRShield: AI-Powered GDPR Support for Software Developers in Small and Medium-Sized Enterprises
Tharaka Wijesundara
Mathew Warren
Nalin Arachchilage
12
0
0
19 May 2025
EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
Wenhao Zhu
Yuhang Xie
Guojie Song
Xin Zhang
19
0
0
19 May 2025
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
Seanie Lee
Sangwoo Park
Dong Bok Lee
Dominik Wagner
Haebin Seong
Tobias Bocklet
Juho Lee
Sung Ju Hwang
FedML
12
0
0
19 May 2025
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models
Kai Tang
Jinhao You
Xiuqi Ge
Hanze Li
Yichen Guo
Xiande Huang
MLLM
7
0
0
18 May 2025
Information Extraction from Visually Rich Documents using LLM-based Organization of Documents into Independent Textual Segments
Aniket Bhattacharyya
Anurag Tripathi
Ujjal Das
Archan Karmakar
Amit Pathak
Maneesh Gupta
9
0
0
18 May 2025
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Yuwei Zhang
Wenhao Yu
Shangbin Feng
Yifan Zhu
Letian Peng
Jayanth Srinivasa
Gaowen Liu
Jingbo Shang
KELM
7
0
0
18 May 2025
Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals
Dena F. Mujtaba
Nihar R. Mahapatra
9
0
0
17 May 2025
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
Yifei He
Siqi Zeng
Yuzheng Hu
Rui Yang
Tong Zhang
Han Zhao
MoMe
ALM
34
0
0
16 May 2025
On the Interconnections of Calibration, Quantification, and Classifier Accuracy Prediction under Dataset Shift
Alejandro Moreo
9
0
0
16 May 2025
Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications
Dominic Zaun Eu Jones
30
0
0
14 May 2025
Automated Detection of Clinical Entities in Lung and Breast Cancer Reports Using NLP Techniques
J. Moreno-Casanova
J.M. Auñón
A. Mártinez-Pérez
M.E. Pérez-Martínez
M.E. Gas-López
14
0
0
14 May 2025
A Scalable Unsupervised Framework for multi-aspect labeling of Multilingual and Multi-Domain Review Data
Jiin Park
Misuk Kim
24
0
0
14 May 2025
Analog Foundation Models
Julian Büchel
Iason Chalas
Giovanni Acampa
An Chen
Omobayode Fagbohungbe
Sidney Tsai
Kaoutar El Maghraoui
Manuel Le Gallo
Abbas Rahimi
Abu Sebastian
MQ
35
0
0
14 May 2025
Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
Chang Zong
Yueting Zhuang
Jian Shao
Weiming Lu
44
0
0
13 May 2025
LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models
Takumi Shibata
Yuichi Miyamura
39
0
0
13 May 2025
Large Language Models Meet Stance Detection: A Survey of Tasks, Methods, Applications, Challenges and Future Directions
Lata Pangtey
Anukriti Bhatnagar
Shubhi Bansal
Shahid Shafi Dar
Nagendra Kumar
34
0
0
13 May 2025
Next Word Suggestion using Graph Neural Network
Abisha Thapa Magar
Anup Shakya
GNN
30
0
0
13 May 2025
Exploiting Text Semantics for Few and Zero Shot Node Classification on Text-attributed Graph
Yuxiang Wang
Xiao Yan
Shiyu Jin
Quanqing Xu
Chuang Hu
Yuanyuan Zhu
Bo Du
Jia Wu
Wentao Zhang
36
0
0
13 May 2025
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification
Hajar Sakai
Sarah Lam
VLM
44
0
0
12 May 2025
Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue
Jannatun Naim
Jie Cao
Fareen Tasneem
Jennifer Jacobs
Brent Milne
James H. Martin
T. Sumner
36
0
0
12 May 2025
A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
Karahan Sarıtaş
Çağatay Yıldız
34
0
0
12 May 2025
TiSpell: A Semi-Masked Methodology for Tibetan Spelling Correction covering Multi-Level Error with Data Augmentation
Yutong Liu
Feng Xiao
Ziyue Zhang
Yongbin Yu
Cheng Huang
...
Thupten Tsering
Cheng Huang
Gadeng Luosang
Renzeng Duojie
Nyima Tashi
31
0
0
12 May 2025
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
Paul Primus
Florian Schmid
Gerhard Widmer
CLIP
AI4TS
VLM
36
0
0
12 May 2025
Comparative sentiment analysis of public perception: Monkeypox vs. COVID-19 behavioral insights
Mostafa Mohaimen Akand Faisal
Rabeya Amin Jhuma
35
0
0
12 May 2025
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation
Jiashuo Sun
Xianrui Zhong
Sizhe Zhou
Jiawei Han
RALM
31
0
0
12 May 2025
NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks
Shunyao Wang
Ming Cheng
Christina Dan Wang
AIFin
30
0
0
11 May 2025
Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking
Fabrice Harel-Canada
Boran Erol
Connor Choi
J. Liu
Gary Jiarui Song
Nanyun Peng
Amit Sahai
WaLM
30
0
0
11 May 2025
CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
Wenju Sun
Qingyong Li
Yangli-ao Geng
Boyang Li
MoMe
40
0
0
11 May 2025
A Vision-Language Foundation Model for Leaf Disease Identification
Khang Nguyen Quoc
Lan Le Thi Thu
Luyl-Da Quach
VLM
32
0
0
11 May 2025
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Min Li
Chun Yuan
ReLM
LRM
48
0
0
10 May 2025
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Lu Dong
Han Zhang
Hongjie Zhang
Yuanmin Huang
Z. Ling
Yu Qiao
Limin Wang
Yue Wang
AI4TS
48
0
0
10 May 2025
The Sound of Populism: Distinct Linguistic Features Across Populist Variants
Yu Wang
Runxi Yu
Zhongyuan Wang
Jing He
26
0
0
10 May 2025
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
Hang Gao
Chenhao Zhang
Tie Wang
Junsuo Zhao
Fengge Wu
Changwen Zheng
Huaping Liu
LRM
34
0
0
09 May 2025
QoSBERT: An Uncertainty-Aware Approach based on Pre-trained Language Models for Service Quality Prediction
Ziliang Wang
Xiaohong Zhang
Ze Shi Li
Meng Yan
18
0
0
09 May 2025
FLAM: Frame-Wise Language-Audio Modeling
Yusong Wu
Christos Tsirigotis
Ke Chen
Cheng-Zhi Anna Huang
Rameswar Panda
Oriol Nieto
Prem Seetharaman
Justin Salamon
55
0
0
08 May 2025
KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification
Qianbo Zang
Christophe Zgrzendek
Igor Tchappi
Afshin Khadangi
Johannes Sedlmeir
VLM
42
0
0
08 May 2025
QBR: A Question-Bank-Based Approach to Fine-Grained Legal Knowledge Retrieval for the General Public
Mingruo Yuan
Ben Kao
Tien-Hsuan Wu
AILaw
79
0
0
08 May 2025
X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
Hanxun Huang
Sarah Monazam Erfani
Yige Li
Xingjun Ma
James Bailey
AAML
55
0
0
08 May 2025
UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections
Fatima Haouari
Carolina Scarton
Nicolò Faggiani
Nikolaos Nikolaidis
Bonka Kotseva
Ibrahim Abu Farha
Jens Linge
Kalina Bontcheva
46
0
0
08 May 2025
CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis
Hao Zhen
Jidong J. Yang
43
0
0
08 May 2025
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
Muhammad Nabeel Asim
A. Rehman
Andreas Dengel
33
0
0
08 May 2025
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
Min Chen
Jinglei Cheng
Pingzhi Li
Haoran Wang
Tianlong Chen
Junyu Liu
LRM
51
0
0
08 May 2025
A Tale of Two Identities: An Ethical Audit of Human and AI-Crafted Personas
Pranav Narayanan Venkit
Jiayi Li
Yingfan Zhou
Sarah Rajtmajer
Shomir Wilson
34
0
0
07 May 2025
Integration of Large Language Models and Traditional Deep Learning for Social Determinants of Health Prediction
Paul Landes
Jimeng Sun
Adam Cross
36
0
0
06 May 2025
Prediction-powered estimators for finite population statistics in highly imbalanced textual data: Public hate crime estimation
Hannes Waldetoft
Jakob Torgander
Måns Magnusson
34
0
0
05 May 2025
Logits-Constrained Framework with RoBERTa for Ancient Chinese NER
Wenjie Hua
Shenghan Xu
19
0
0
05 May 2025
Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation
Gerard Pons
Besim Bilalli
Anna Queralt
43
1
0
05 May 2025