Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 4,640 papers shown
Title
An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification
Utsav Nareti
S. Chattopadhyay
Prolay Mallick
Suraj Kumar
Ayush Vikas Daga
Chandranath Adak
Adarsh Wase
Arjab Roy
25
0
0
05 May 2025
OODTE: A Differential Testing Engine for the ONNX Optimizer
Nikolaos Louloudakis
Ajitha Rajan
46
0
0
03 May 2025
Embedding based retrieval for long tail search queries in ecommerce
Akshay Kekuda
Yuyang Zhang
Arun Udayashankar
RALM
39
0
0
03 May 2025
Towards High-Fidelity Synthetic Multi-platform Social Media Datasets via Large Language Models
Henry Tari
Nojus Sereiva
Rishabh Kaushal
T. Bertaglia
Adriana Iamnitchi
35
0
0
02 May 2025
Multi-agents based User Values Mining for Recommendation
L. Chen
Wei Yuan
Tong Chen
Xiangyu Zhao
Nguyen Quoc Viet Hung
Hongzhi Yin
OffRL
55
0
0
02 May 2025
Dual-Forecaster: A Multimodal Time Series Model Integrating Descriptive and Predictive Texts
Wenfa Wu
Guanyu Zhang
Zheng Tan
Yi Wang
Hongsheng Qi
AI4TS
57
1
0
02 May 2025
Emotions in the Loop: A Survey of Affective Computing for Emotional Support
Karishma Hegde
Hemadri Jayalath
32
1
0
02 May 2025
One Search Fits All: Pareto-Optimal Eco-Friendly Model Selection
Filippo Betello
Antonio Purificato
Vittoria Vineis
Gabriele Tolomei
Fabrizio Silvestri
48
0
0
02 May 2025
Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods
Mahdi Dhaini
Ege Erdogan
Nils Feldhus
Gjergji Kasneci
54
0
0
02 May 2025
Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
Cong Xu
Wenbin Liang
Mo Yu
Anan Liu
Kaipeng Zhang
Lizhuang Ma
Yufei Guo
Jun Wang
Wentao Zhang
MQ
57
0
0
01 May 2025
Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors
Xinyu Ding
Lexuan Chen
Siyu Liao
Zhongfeng Wang
52
0
0
01 May 2025
Computational Identification of Regulatory Statements in EU Legislation
Gijs Jan Brandsma
Jens Blom-Hansen
Christiaan Meijer
Kody Moodley
AILaw
63
0
0
01 May 2025
Block Circulant Adapter for Large Language Models
Xinyu Ding
Meiqi Wang
Siyu Liao
Zhongfeng Wang
40
0
0
01 May 2025
Synergy-CLIP: Extending CLIP with Multi-modal Integration for Robust Representation Learning
Sangyeon Cho
Jangyeong Jeon
Mingi Kim
Junyeong Kim
CLIP
VLM
89
0
0
30 Apr 2025
Image deidentification in the XNAT ecosystem: use cases and solutions
Alex Michie
Simon J Doran
32
0
0
29 Apr 2025
A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning
Jieming Bian
Yuanzhe Peng
Lei Wang
Yin Huang
Jie Xu
FedML
65
0
0
29 Apr 2025
Revisiting the MIMIC-IV Benchmark: Experiments Using Language Models for Electronic Health Records
Jesus Lovon
Thouria Ben-Haddi
Jules Di Scala
José G. Moreno
L. Tamine
60
2
0
29 Apr 2025
BrightCookies at SemEval-2025 Task 9: Exploring Data Augmentation for Food Hazard Classification
Foteini Papadopoulou
Osman Mutlu
Neris Özen
Bas H. M. van der Velden
I. Hendrickx
Ali Hürriyetoǧlu
ViT
39
0
0
29 Apr 2025
Exploiting Inter-Sample Correlation and Intra-Sample Redundancy for Partially Relevant Video Retrieval
Junlong Ren
Gangjian Zhang
Yitao Hu
Jian Shu
Haoran Wang
29
0
0
28 Apr 2025
GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability
Sehyeong Jo
Gangjae Jang
Haesol Park
32
0
0
28 Apr 2025
Generative AI in Education: Student Skills and Lecturer Roles
Stefanie Krause
Ashish Dalvi
Syed Khubaib Zaidi
231
0
0
28 Apr 2025
Towards Long Context Hallucination Detection
Siyi Liu
Kishaloy Halder
Zheng Qi
Wei Xiao
Nikolaos Pappas
Phu Mon Htut
Neha Anna John
Yassine Benajiba
Dan Roth
HILM
77
0
0
28 Apr 2025
Large Language Models are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks
Kang Yang
Xinjun Mao
Shangwen Wang
Yanjie Wang
Tanghaoran Zhang
Bo Lin
Yihao Qin
Zhang Zhang
Yao Lu
Kamal Al-Sabahi
ALM
170
1
0
28 Apr 2025
Attention to Detail: Fine-Scale Feature Preservation-Oriented Geometric Pre-training for AI-Driven Surrogate Modeling
Yu-Hsuan Chen
Jing Bi
Cyril Ngo Ngoc
Victor Oancea
Jonathan Cagan
L. Kara
AI4CE
33
0
0
27 Apr 2025
ClimaEmpact: Domain-Aligned Small Language Models and Datasets for Extreme Weather Analytics
Deeksha Varshney
Keane Ong
Rui Mao
Min Zhang
G. Mengaldo
47
0
0
27 Apr 2025
HoloDx: Knowledge- and Data-Driven Multimodal Diagnosis of Alzheimer's Disease
Qiuhui Chen
Jintao Wang
Gang Wang
Yi Hong
52
0
0
27 Apr 2025
The Influence of Text Variation on User Engagement in Cross-Platform Content Sharing
Yibo Hu
Yiqiao Jin
Meng Ye
Ajay Divakaran
Srijan Kumar
22
0
0
26 Apr 2025
Towards Robust Dialogue Breakdown Detection: Addressing Disruptors in Large Language Models with Self-Guided Reasoning
Abdellah Ghassel
Xianzhi Li
Xiaodan Zhu
51
0
0
26 Apr 2025
TLoRA: Tri-Matrix Low-Rank Adaptation of Large Language Models
Tanvir Islam
AI4CE
50
0
0
25 Apr 2025
Aligning Language Models for Icelandic Legal Text Summarization
Þórir Hrafn Harðarson
Hrafn Loftsson
Stefán Ólafsson
AILaw
AI4TS
ELM
82
0
0
25 Apr 2025
Pushing the boundary on Natural Language Inference
Pablo Miralles-González
Javier Huertas-Tato
Alejandro Martín
David Camacho
LRM
49
0
0
25 Apr 2025
The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes
Wencong You
Daniel Lowd
39
0
0
24 Apr 2025
Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification
Alexander Shvets
21
0
0
23 Apr 2025
Sentiment Analysis in Software Engineering: Evaluating Generative Pre-trained Transformers
KM Khalid Saifullah
Faiaz Azmain
Habiba Hye
2
0
0
22 Apr 2025
llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length
Issa Sugiura
Kouta Nakayama
Yusuke Oda
34
1
0
22 Apr 2025
LLM-based Semantic Augmentation for Harmful Content Detection
Elyas Meguellati
Assaad Zeghina
S. Sadiq
Gianluca Demartini
38
0
0
22 Apr 2025
ViQA-COVID: COVID-19 Machine Reading Comprehension Dataset for Vietnamese
H. Phung
Ngoc C. Lê
Van-Chien Nguyen
Hang Thi Nguyen
Thuy Phuong Thi Nguyen
77
1
0
21 Apr 2025
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren
Yihong Liu
Hinrich Schütze
36
0
0
21 Apr 2025
Bias Analysis and Mitigation through Protected Attribute Detection and Regard Classification
Takuma Udagawa
Yang Zhao
H. Kanayama
Bishwaranjan Bhattacharjee
33
0
0
19 Apr 2025
Probabilistic Stability Guarantees for Feature Attributions
Helen Jin
Anton Xue
Weiqiu You
Surbhi Goel
Eric Wong
29
0
0
18 Apr 2025
Transferrable Surrogates in Expressive Neural Architecture Search Spaces
Shiwen Qin
Gabriela Kadlecová
Martin Pilát
Shay B. Cohen
Roman Neruda
Elliot J. Crowley
Jovita Lukasik
Linus Ericsson
AI4CE
175
0
0
17 Apr 2025
Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models
S. Bhagat
Ibne Farabi Shihab
Anuj Sharma
32
0
0
17 Apr 2025
WildFireCan-MMD: A Multimodal Dataset for Classification of User-Generated Content During Wildfires in Canada
Braeden Sherritt
Isar Nejadgholi
Marzieh Amini
VLM
44
0
0
17 Apr 2025
C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset
Fuqiang Niu
Yuqing Yang
Xianghua Fu
Genan Dai
Bowen Zhang
30
0
0
14 Apr 2025
Improving Multimodal Hateful Meme Detection Exploiting LMM-Generated Knowledge
Maria Tzelepi
Vasileios Mezaris
34
0
0
14 Apr 2025
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt
Aaron Mueller
Leshem Choshen
E. Wilcox
Chengxu Zhuang
...
Rafael Mosquera
Bhargavi Paranjape
Adina Williams
Tal Linzen
Ryan Cotterell
49
109
0
10 Apr 2025
Understanding Users' Security and Privacy Concerns and Attitudes Towards Conversational AI Platforms
Mutahar Ali
Arjun Arunasalam
Habiba Farrukh
SILM
59
0
0
09 Apr 2025
LLM-based Automated Grading with Human-in-the-Loop
Hang Li
Yucheng Chu
Kaiqi Yang
Yasemin Copur-Gencturk
Jiliang Tang
AI4Ed
ELM
64
0
0
07 Apr 2025
REFORMER: A ChatGPT-Driven Data Synthesis Framework Elevating Text-to-SQL Models
Shenyang Liu
Saleh Almohaimeed
Liqiang Wang
35
0
0
06 Apr 2025
Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles
Chen Wei Kuo
Kevin Chu
Nouar Aldahoul
Hazem Ibrahim
Talal Rahwan
Yasir Zaki
SyDa
63
0
0
04 Apr 2025
Previous
1
2
3
4
5
...
91
92
93
Next