1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,734 papers shown
Title
On the Potential of Large Language Models to Solve Semantics-Aware Process Mining Tasks
Adrian Rebmann
Fabian David Schmidt
Goran Glavaš
Han van der Aa
LRM
61
0
0
29 Apr 2025
BrightCookies at SemEval-2025 Task 9: Exploring Data Augmentation for Food Hazard Classification
Foteini Papadopoulou
Osman Mutlu
Neris Özen
Bas H. M. van der Velden
I. Hendrickx
Ali Hürriyetoğlu
ViT
64
0
0
29 Apr 2025
Image deidentification in the XNAT ecosystem: use cases and solutions
Alex Michie
Simon J Doran
54
0
0
29 Apr 2025
Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare
Lovedeep Gondara
Jonathan Simkin
Graham Sayle
Shebnum Devji
Gregory Arbour
Raymond Ng
LM&MA
53
0
0
29 Apr 2025
Leveraging Generative AI Through Prompt Engineering and Rigorous Validation to Create Comprehensive Synthetic Datasets for AI Training in Healthcare
Polycarp Nalela
SyDa
53
0
0
29 Apr 2025
Revisiting the MIMIC-IV Benchmark: Experiments Using Language Models for Electronic Health Records
Jesus Lovon
Thouria Ben-Haddi
Jules Di Scala
José G. Moreno
L. Tamine
153
3
0
29 Apr 2025
Generative AI in Education: Student Skills and Lecturer Roles
Stefanie Krause
Ashish Dalvi
Syed Khubaib Zaidi
450
0
0
28 Apr 2025
Large Language Models are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks
Kang Yang
Xinjun Mao
Shangwen Wang
Yanjie Wang
Tanghaoran Zhang
Bo Lin
Yihao Qin
Zhang Zhang
Yao Lu
Kamal Al-Sabahi
ALM
285
1
0
28 Apr 2025
Towards Long Context Hallucination Detection
Siyi Liu
Kishaloy Halder
Zheng Qi
Wei Xiao
Nikolaos Pappas
Phu Mon Htut
Neha Anna John
Yassine Benajiba
Dan Roth
HILM
115
2
0
28 Apr 2025
Exploiting Inter-Sample Correlation and Intra-Sample Redundancy for Partially Relevant Video Retrieval
Junlong Ren
Gangjian Zhang
Yitao Hu
Jian Shu
Haoran Wang
102
0
0
28 Apr 2025
GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability
Sehyeong Jo
Gangjae Jang
Haesol Park
143
0
0
28 Apr 2025
ClimaEmpact: Domain-Aligned Small Language Models and Datasets for Extreme Weather Analytics
Deeksha Varshney
Keane Ong
Rui Mao
Min Zhang
G. Mengaldo
70
1
0
27 Apr 2025
Attention to Detail: Fine-Scale Feature Preservation-Oriented Geometric Pre-training for AI-Driven Surrogate Modeling
Yu-hsuan Chen
Jing Bi
Cyril Ngo Ngoc
Victor Oancea
Jonathan Cagan
Levent Burak Kara
AI4CE
90
0
0
27 Apr 2025
HoloDx: Knowledge- and Data-Driven Multimodal Diagnosis of Alzheimer's Disease
Qiuhui Chen
Jintao Wang
Gang Wang
Yi Hong
80
0
0
27 Apr 2025
Detect, Explain, Escalate: Low-Carbon Dialogue Breakdown Management for LLM-Powered Agents
Abdellah Ghassel
Xianzhi Li
Xiaodan Zhu
141
0
0
26 Apr 2025
The Influence of Text Variation on User Engagement in Cross-Platform Content Sharing
Yibo Hu
Yiqiao Jin
Meng Ye
Ajay Divakaran
Srijan Kumar
65
0
0
26 Apr 2025
Pushing the boundary on Natural Language Inference
Pablo Miralles-González
Javier Huertas-Tato
Alejandro Martín
David Camacho
LRM
227
0
0
25 Apr 2025
Aligning Language Models for Icelandic Legal Text Summarization
Þórir Hrafn Harðarson
Hrafn Loftsson
Stefán Ólafsson
AILaw
AI4TS
ELM
130
0
0
25 Apr 2025
TLoRA: Tri-Matrix Low-Rank Adaptation of Large Language Models
Tanvir Islam
AI4CE
199
0
0
25 Apr 2025
The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes
Wencong You
Daniel Lowd
94
0
0
24 Apr 2025
Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection
Haokai Zhang
Shengtao Zhang
Zijian Cai
Heng Wang
Ruixuan Zhu
Zinan Zeng
Minnan Luo
138
0
0
24 Apr 2025
HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models
Junxuan Zhang
Jiadong Wang
Haoyang Li
Lidan Shou
Ke Chen
Gang Chen
Qin Xie
Guiming Xie
Xuejian Gong
47
0
0
24 Apr 2025
Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation
Guojia An
Jie Zou
Jiwei Wei
Chaoning Zhang
Fuming Sun
Yang Yang
420
1
0
24 Apr 2025
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification
Alexander Shvets
60
0
0
23 Apr 2025
MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores
Fengwei Zhou
Jiafei Song
Wenjin Jason Li
Gengjian Xue
Zhikang Zhao
Yichao Lu
Bailin Na
63
1
0
23 Apr 2025
Out-of-the-Box Conditional Text Embeddings from Large Language Models
Kosuke Yamada
Peinan Zhang
59
1
0
23 Apr 2025
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking
Jabez Magomere
Elena Kochkina
Samuel Mensah
Simerjot Kaur
Charese Smiley
108
1
0
22 Apr 2025
Sentiment Analysis in Software Engineering: Evaluating Generative Pre-trained Transformers
KM Khalid Saifullah
Faiaz Azmain
Habiba Hye
47
0
0
22 Apr 2025
The Language of Attachment: Modeling Attachment Dynamics in Psychotherapy
Frederik Bredgaard
Martin Lund Trinhammer
Elisa Bassignana
17
0
0
22 Apr 2025
Performance Evaluation of Emotion Classification in Japanese Using RoBERTa and DeBERTa
Yoichi Takenaka
62
0
0
22 Apr 2025
Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows
Haohe Liu
Thomas Deacon
Wenwu Wang
Matt Paradis
Mark D. Plumbley
63
0
0
22 Apr 2025
LLM-based Semantic Augmentation for Harmful Content Detection
Elyas Meguellati
Assaad Zeghina
S. Sadiq
Gianluca Demartini
141
0
0
22 Apr 2025
llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length
Issa Sugiura
Kouta Nakayama
Yusuke Oda
68
1
0
22 Apr 2025
Kill two birds with one stone: generalized and robust AI-generated text detection via dynamic perturbations
Yunlong Zhou
Juan Wen
Wanli Peng
Yiming Xue
Ziwei Zhang
Zhengxian Wu
AAML
68
0
0
22 Apr 2025
DRAGON: Distributional Rewards Optimize Diffusion Generative Models
Yatong Bai
Jonah Casebeer
Somayeh Sojoudi
Nicholas J. Bryan
DiffM
VLM
111
1
0
21 Apr 2025
LLMs as Data Annotators: How Close Are We to Human Performance
Muhammad Uzair Ul Haq
Davide Rigoni
A. Sperduti
71
1
0
21 Apr 2025
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren
Yihong Liu
Hinrich Schütze
69
0
0
21 Apr 2025
The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks
Joan C. Timoneda
DiffM
SyDa
63
0
0
21 Apr 2025
Leveraging Language Models for Automated Patient Record Linkage
Mohammad Beheshti
Lovedeep Gondara
Iris Zachary
59
0
0
21 Apr 2025
Visualizing Public Opinion on X: A Real-Time Sentiment Dashboard Using VADER and DistilBERT
Yanampally Abhiram Reddy
Siddhi Agarwal
Vikram Parashar
Arshiya Arora
33
0
0
21 Apr 2025
Feeding LLM Annotations to BERT Classifiers at Your Own Risk
Yucheng Lu
Kazimier Smith
57
0
0
21 Apr 2025
VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform
Xingyu Lu
Tianke Zhang
Chang Meng
Xinyu Wang
Jinpeng Wang
...
Hai-Tao Zheng
Fan Yang
Yan Li
Di Zhang
Kun Gai
OffRL
87
0
0
21 Apr 2025
Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends
Jiaxin Guo
Xiaoyu Chen
Zhiqiang Rao
Jinlong Yang
Zongyao Li
Hengchao Shang
Daimeng Wei
Hao Yang
79
0
0
21 Apr 2025
ViQA-COVID: COVID-19 Machine Reading Comprehension Dataset for Vietnamese
H. Phung
Ngoc C. Lê
Van-Chien Nguyen
Hang Thi Nguyen
Thuy Phuong Thi Nguyen
224
2
0
21 Apr 2025
Biased by Design: Leveraging Inherent AI Biases to Enhance Critical Thinking of News Readers
L. Zavolokina
Kilian Sprenkamp
Zoya Katashinskaya
Daniel Gordon Jones
80
0
0
20 Apr 2025
Bias Analysis and Mitigation through Protected Attribute Detection and Regard Classification
Takuma Udagawa
Yang Zhao
H. Kanayama
Bishwaranjan Bhattacharjee
63
0
0
19 Apr 2025
Probing the Subtle Ideological Manipulation of Large Language Models
Demetris Paschalides
G. Pallis
M. Dikaiakos
56
0
0
19 Apr 2025
Long-context Non-factoid Question Answering in Indic Languages
Ritwik Mishra
R. Shah
Ponnurangam Kumaraguru
76
0
0
18 Apr 2025
Q-FAKER: Query-free Hard Black-box Attack via Controlled Generation
CheolWon Na
YunSeok Choi
Jee-Hyong Lee
AAML
71
0
0
18 Apr 2025
Few-Shot Referring Video Single- and Multi-Object Segmentation via Cross-Modal Affinity with Instance Sequence Matching
Heng Liu
Guanghui Li
Mingqi Gao
Xiantong Zhen
Feng Zheng
Yansen Wang
VOS
147
0
0
18 Apr 2025