ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXiv (abs)PDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,734 papers shown
Title
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
Paul Primus
Florian Schmid
Gerhard Widmer
CLIPAI4TSVLM
56
0
0
12 May 2025
Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue
Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue
Jannatun Naim
Jie Cao
Fareen Tasneem
Jennifer Jacobs
Brent Milne
James H. Martin
T. Sumner
68
0
0
12 May 2025
Domain Regeneration: How well do LLMs match syntactic properties of text domains?
Domain Regeneration: How well do LLMs match syntactic properties of text domains?
Da Ju
Hagen Blix
Adina Williams
DeLMO
88
0
0
12 May 2025
A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
Karahan Sarıtaş
Çağatay Yıldız
57
0
0
12 May 2025
Comparative sentiment analysis of public perception: Monkeypox vs. COVID-19 behavioral insights
Comparative sentiment analysis of public perception: Monkeypox vs. COVID-19 behavioral insights
Mostafa Mohaimen Akand Faisal
Rabeya Amin Jhuma
72
0
0
12 May 2025
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation
Jimeng Sun
Xianrui Zhong
Sizhe Zhou
Jiawei Han
RALM
75
0
0
12 May 2025
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification
Hajar Sakai
Sarah Lam
VLM
113
0
0
12 May 2025
CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
Wenju Sun
Qingyong Li
Yangli-ao Geng
Boyang Li
MoMe
115
2
0
11 May 2025
A Vision-Language Foundation Model for Leaf Disease Identification
A Vision-Language Foundation Model for Leaf Disease Identification
Khang Nguyen Quoc
Lan Le Thi Thu
Luyl-Da Quach
VLM
118
0
0
11 May 2025
Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence
Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence
Yu Qiao
Huy Q. Le
Avi Deb Raha
Phuong-Nam Tran
Apurba Adhikary
Mengchun Zhang
Loc X. Nguyen
Eui-nam Huh
Dusit Niyato
Choong Seon Hong
AI4CE
161
1
0
11 May 2025
NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks
NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks
Shunyao Wang
Ming Cheng
Christina Dan Wang
AIFin
79
0
0
11 May 2025
Evaluating Reasoning LLMs for Suicide Screening with the Columbia-Suicide Severity Rating Scale
Evaluating Reasoning LLMs for Suicide Screening with the Columbia-Suicide Severity Rating Scale
Avinash Patil
Siru Tao
Amardeep Gedhu
AI4MHLRMELM
59
0
0
11 May 2025
Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking
Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking
Fabrice Harel-Canada
Boran Erol
Connor Choi
J. Liu
Gary Jiarui Song
Nanyun Peng
Amit Sahai
WaLM
80
0
0
11 May 2025
The Sound of Populism: Distinct Linguistic Features Across Populist Variants
The Sound of Populism: Distinct Linguistic Features Across Populist Variants
Yu Wang
Runxi Yu
Zhongyuan Wang
Jing He
45
0
0
10 May 2025
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Min Li
Chun Yuan
ReLMLRM
65
0
0
10 May 2025
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Lu Dong
Han Zhang
Hongjie Zhang
Yuanmin Huang
Z. Ling
Yu Qiao
Limin Wang
Yun Wang
AI4TS
209
0
0
10 May 2025
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
Hang Gao
Chenhao Zhang
Tie Wang
Junsuo Zhao
Fengge Wu
Changwen Zheng
Huaping Liu
LRM
198
0
0
09 May 2025
QoSBERT: An Uncertainty-Aware Approach based on Pre-trained Language Models for Service Quality Prediction
QoSBERT: An Uncertainty-Aware Approach based on Pre-trained Language Models for Service Quality Prediction
Ziliang Wang
Xiaohong Zhang
Ze Shi Li
Meng Yan
46
0
0
09 May 2025
DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding
DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding
Henry Zheng
Hao Shi
Qihang Peng
Yong Xien Chng
Rui Huang
Yepeng Weng
Zhongchao Shi
Gao Huang
106
2
0
08 May 2025
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
Min Chen
Jinglei Cheng
Pingzhi Li
Haoran Wang
Tianlong Chen
Junyu Liu
LRM
124
0
0
08 May 2025
CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis
CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis
Hao Zhen
Jidong J. Yang
61
0
0
08 May 2025
UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections
UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections
Fatima Haouari
Carolina Scarton
Nicolò Faggiani
Nikolaos Nikolaidis
Bonka Kotseva
Ibrahim Abu Farha
Jens Linge
Kalina Bontcheva
95
0
0
08 May 2025
X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
Hanxun Huang
Sarah Monazam Erfani
Yige Li
Xingjun Ma
James Bailey
AAML
155
1
0
08 May 2025
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
Muhammad Nabeel Asim
A. Rehman
Andreas Dengel
73
0
0
08 May 2025
FLAM: Frame-Wise Language-Audio Modeling
FLAM: Frame-Wise Language-Audio Modeling
Yusong Wu
Christos Tsirigotis
Ke Chen
Cheng-Zhi Anna Huang
Rameswar Panda
Oriol Nieto
Prem Seetharaman
Justin Salamon
85
1
0
08 May 2025
QBR: A Question-Bank-Based Approach to Fine-Grained Legal Knowledge Retrieval for the General Public
QBR: A Question-Bank-Based Approach to Fine-Grained Legal Knowledge Retrieval for the General Public
Mingruo Yuan
Ben Kao
Tien-Hsuan Wu
AILaw
109
0
0
08 May 2025
Visual Affordances: Enabling Robots to Understand Object Functionality
Visual Affordances: Enabling Robots to Understand Object Functionality
Tommaso Apicella
Alessio Xompero
Andrea Cavallaro
128
0
0
08 May 2025
KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification
KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification
Qianbo Zang
Christophe Zgrzendek
Igor Tchappi
Afshin Khadangi
Johannes Sedlmeir
VLM
80
0
0
08 May 2025
A Tale of Two Identities: An Ethical Audit of Human and AI-Crafted Personas
A Tale of Two Identities: An Ethical Audit of Human and AI-Crafted Personas
Pranav Narayanan Venkit
Jiayi Li
Yingfan Zhou
Sarah Rajtmajer
Shomir Wilson
65
1
0
07 May 2025
Integration of Large Language Models and Traditional Deep Learning for Social Determinants of Health Prediction
Integration of Large Language Models and Traditional Deep Learning for Social Determinants of Health Prediction
Paul Landes
Jimeng Sun
Adam Cross
63
0
0
06 May 2025
An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification
An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification
Utsav Nareti
S. Chattopadhyay
Prolay Mallick
Suraj Kumar
Ayush Vikas Daga
Chandranath Adak
Adarsh Wase
Arjab Roy
161
1
0
05 May 2025
Prediction-powered estimators for finite population statistics in highly imbalanced textual data: Public hate crime estimation
Prediction-powered estimators for finite population statistics in highly imbalanced textual data: Public hate crime estimation
Hannes Waldetoft
Jakob Torgander
Måns Magnusson
59
1
0
05 May 2025
Logits-Constrained Framework with RoBERTa for Ancient Chinese NER
Logits-Constrained Framework with RoBERTa for Ancient Chinese NER
Wenjie Hua
Shenghan Xu
66
0
0
05 May 2025
Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation
Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation
Gerard Pons
Besim Bilalli
Anna Queralt
133
2
0
05 May 2025
Embedding based retrieval for long tail search queries in ecommerce
Embedding based retrieval for long tail search queries in ecommerce
Akshay Kekuda
Yuyang Zhang
Arun Udayashankar
RALM
196
0
0
03 May 2025
MISE: Meta-knowledge Inheritance for Social Media-Based Stressor Estimation
MISE: Meta-knowledge Inheritance for Social Media-Based Stressor Estimation
Xin Wang
Ling Feng
Huijun Zhang
Lei Cao
Kaisheng Zeng
Qi Li
Yang Ding
Yi Dai
David A. Clifton
114
0
0
03 May 2025
OODTE: A Differential Testing Engine for the ONNX Optimizer
OODTE: A Differential Testing Engine for the ONNX Optimizer
Nikolaos Louloudakis
Ajitha Rajan
86
0
0
03 May 2025
Dual-Forecaster: A Multimodal Time Series Model Integrating Descriptive and Predictive Texts
Dual-Forecaster: A Multimodal Time Series Model Integrating Descriptive and Predictive Texts
Wenfa Wu
Guanyu Zhang
Zheng Tan
Yi Wang
Hongsheng Qi
AI4TS
108
2
0
02 May 2025
Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods
Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods
Mahdi Dhaini
Ege Erdogan
Nils Feldhus
Gjergji Kasneci
102
0
0
02 May 2025
Emotions in the Loop: A Survey of Affective Computing for Emotional Support
Emotions in the Loop: A Survey of Affective Computing for Emotional Support
Karishma Hegde
Hemadri Jayalath
80
1
0
02 May 2025
Multi-agents based User Values Mining for Recommendation
Multi-agents based User Values Mining for Recommendation
Lawrence Yunliang Chen
Wei Yuan
Tong Chen
Xiangyu Zhao
Nguyen Quoc Viet Hung
Hongzhi Yin
OffRL
129
0
0
02 May 2025
Towards High-Fidelity Synthetic Multi-platform Social Media Datasets via Large Language Models
Towards High-Fidelity Synthetic Multi-platform Social Media Datasets via Large Language Models
Henry Tari
Nojus Sereiva
Rishabh Kaushal
T. Bertaglia
Adriana Iamnitchi
80
0
0
02 May 2025
One Search Fits All: Pareto-Optimal Eco-Friendly Model Selection
One Search Fits All: Pareto-Optimal Eco-Friendly Model Selection
Filippo Betello
Antonio Purificato
Vittoria Vineis
Gabriele Tolomei
Fabrizio Silvestri
80
0
0
02 May 2025
Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
Cong Xu
Wenbin Liang
Mo Yu
Anan Liu
Kai Zhang
Lizhuang Ma
Jiangming Wang
Jun Wang
Weinan Zhang
Wei Zhang
MQ
80
0
0
01 May 2025
Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors
Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors
Xinyu Ding
Lexuan Chen
Siyu Liao
Zhongfeng Wang
122
0
0
01 May 2025
Block Circulant Adapter for Large Language Models
Block Circulant Adapter for Large Language Models
Xinyu Ding
Meiqi Wang
Siyu Liao
Zhongfeng Wang
74
1
0
01 May 2025
Computational Identification of Regulatory Statements in EU Legislation
Computational Identification of Regulatory Statements in EU Legislation
Gijs Jan Brandsma
Jens Blom-Hansen
Christiaan Meijer
Kody Moodley
AILaw
117
0
0
01 May 2025
Investigating the Effect of Parallel Data in the Cross-Lingual Transfer for Vision-Language Encoders
Investigating the Effect of Parallel Data in the Cross-Lingual Transfer for Vision-Language Encoders
Andrei-Alexandru Manea
Jindřich Libovický
VLM
123
0
0
30 Apr 2025
Synergy-CLIP: Extending CLIP with Multi-modal Integration for Robust Representation Learning
Synergy-CLIP: Extending CLIP with Multi-modal Integration for Robust Representation Learning
Sangyeon Cho
Jangyeong Jeon
Mingi Kim
Junyeong Kim
CLIPVLM
241
0
0
30 Apr 2025
Leveraging Generative AI Through Prompt Engineering and Rigorous Validation to Create Comprehensive Synthetic Datasets for AI Training in Healthcare
Leveraging Generative AI Through Prompt Engineering and Rigorous Validation to Create Comprehensive Synthetic Datasets for AI Training in Healthcare
Polycarp Nalela
SyDa
53
0
0
29 Apr 2025
Previous
123...567...213214215
Next