ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXivPDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 4,640 papers shown
Title
AdUE: Improving uncertainty estimation head for LoRA adapters in LLMs
AdUE: Improving uncertainty estimation head for LoRA adapters in LLMs
Artem Zabolotnyi
Roman Makarov
Mile Mitrovic
P. Proskura
Oleg Travkin
Roman Alferov
Alexey Zaytsev
UQCV
12
0
0
21 May 2025
Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning
Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning
Yukun Zhao
Lingyong Yan
Zhenyang Li
S. Wang
Zhumin Chen
Z. Ren
Dawei Yin
CLL
KELM
VLM
LRM
17
0
0
21 May 2025
Enhancing Abstractive Summarization of Scientific Papers Using Structure Information
Enhancing Abstractive Summarization of Scientific Papers Using Structure Information
Tong Bao
Heng Zhang
Chengzhi Zhang
17
1
0
20 May 2025
GDPRShield: AI-Powered GDPR Support for Software Developers in Small and Medium-Sized Enterprises
GDPRShield: AI-Powered GDPR Support for Software Developers in Small and Medium-Sized Enterprises
Tharaka Wijesundara
Mathew Warren
Nalin Arachchilage
12
0
0
19 May 2025
EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
Wenhao Zhu
Yuhang Xie
Guojie Song
Xin Zhang
19
0
0
19 May 2025
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
FedSVD: Adaptive Orthogonalization for Private Federated Learning with LoRA
Seanie Lee
Sangwoo Park
Dong Bok Lee
Dominik Wagner
Haebin Seong
Tobias Bocklet
Juho Lee
Sung Ju Hwang
FedML
12
0
0
19 May 2025
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models
Mitigating Hallucinations via Inter-Layer Consistency Aggregation in Large Vision-Language Models
Kai Tang
Jinhao You
Xiuqi Ge
Hanze Li
Yichen Guo
Xiande Huang
MLLM
7
0
0
18 May 2025
Information Extraction from Visually Rich Documents using LLM-based Organization of Documents into Independent Textual Segments
Information Extraction from Visually Rich Documents using LLM-based Organization of Documents into Independent Textual Segments
Aniket Bhattacharyya
Anurag Tripathi
Ujjal Das
Archan Karmakar
Amit Pathak
Maneesh Gupta
9
0
0
18 May 2025
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Yuwei Zhang
Wenhao Yu
Shangbin Feng
Yifan Zhu
Letian Peng
Jayanth Srinivasa
Gaowen Liu
Jingbo Shang
KELM
7
0
0
18 May 2025
Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals
Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals
Dena F. Mujtaba
Nihar R. Mahapatra
9
0
0
17 May 2025
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
Yifei He
Siqi Zeng
Yuzheng Hu
Rui Yang
Tong Zhang
Han Zhao
MoMe
ALM
34
0
0
16 May 2025
On the Interconnections of Calibration, Quantification, and Classifier Accuracy Prediction under Dataset Shift
On the Interconnections of Calibration, Quantification, and Classifier Accuracy Prediction under Dataset Shift
Alejandro Moreo
9
0
0
16 May 2025
Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications
Ornithologist: Towards Trustworthy "Reasoning" about Central Bank Communications
Dominic Zaun Eu Jones
30
0
0
14 May 2025
Automated Detection of Clinical Entities in Lung and Breast Cancer Reports Using NLP Techniques
Automated Detection of Clinical Entities in Lung and Breast Cancer Reports Using NLP Techniques
J. Moreno-Casanova
J.M. Auñón
A. Mártinez-Pérez
M.E. Pérez-Martínez
M.E. Gas-López
14
0
0
14 May 2025
A Scalable Unsupervised Framework for multi-aspect labeling of Multilingual and Multi-Domain Review Data
A Scalable Unsupervised Framework for multi-aspect labeling of Multilingual and Multi-Domain Review Data
Jiin Park
Misuk Kim
24
0
0
14 May 2025
Analog Foundation Models
Analog Foundation Models
Julian Büchel
Iason Chalas
Giovanni Acampa
An Chen
Omobayode Fagbohungbe
Sidney Tsai
Kaoutar El Maghraoui
Manuel Le Gallo
Abbas Rahimi
Abu Sebastian
MQ
35
0
0
14 May 2025
Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
Chang Zong
Yueting Zhuang
Jian Shao
Weiming Lu
44
0
0
13 May 2025
LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models
LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models
Takumi Shibata
Yuichi Miyamura
39
0
0
13 May 2025
Large Language Models Meet Stance Detection: A Survey of Tasks, Methods, Applications, Challenges and Future Directions
Large Language Models Meet Stance Detection: A Survey of Tasks, Methods, Applications, Challenges and Future Directions
Lata Pangtey
Anukriti Bhatnagar
Shubhi Bansal
Shahid Shafi Dar
Nagendra Kumar
34
0
0
13 May 2025
Next Word Suggestion using Graph Neural Network
Next Word Suggestion using Graph Neural Network
Abisha Thapa Magar
Anup Shakya
GNN
30
0
0
13 May 2025
Exploiting Text Semantics for Few and Zero Shot Node Classification on Text-attributed Graph
Exploiting Text Semantics for Few and Zero Shot Node Classification on Text-attributed Graph
Yuxiang Wang
Xiao Yan
Shiyu Jin
Quanqing Xu
Chuang Hu
Yuanyuan Zhu
Bo Du
Jia Wu
Wentao Zhang
36
0
0
13 May 2025
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification
Hajar Sakai
Sarah Lam
VLM
44
0
0
12 May 2025
Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue
Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue
Jannatun Naim
Jie Cao
Fareen Tasneem
Jennifer Jacobs
Brent Milne
James H. Martin
T. Sumner
36
0
0
12 May 2025
A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
Karahan Sarıtaş
Çağatay Yıldız
34
0
0
12 May 2025
TiSpell: A Semi-Masked Methodology for Tibetan Spelling Correction covering Multi-Level Error with Data Augmentation
TiSpell: A Semi-Masked Methodology for Tibetan Spelling Correction covering Multi-Level Error with Data Augmentation
Yutong Liu
Feng Xiao
Ziyue Zhang
Yongbin Yu
Cheng Huang
...
Thupten Tsering
Cheng Huang
Gadeng Luosang
Renzeng Duojie
Nyima Tashi
31
0
0
12 May 2025
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
Paul Primus
Florian Schmid
Gerhard Widmer
CLIP
AI4TS
VLM
36
0
0
12 May 2025
Comparative sentiment analysis of public perception: Monkeypox vs. COVID-19 behavioral insights
Comparative sentiment analysis of public perception: Monkeypox vs. COVID-19 behavioral insights
Mostafa Mohaimen Akand Faisal
Rabeya Amin Jhuma
35
0
0
12 May 2025
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation
Jiashuo Sun
Xianrui Zhong
Sizhe Zhou
Jiawei Han
RALM
31
0
0
12 May 2025
NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks
NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks
Shunyao Wang
Ming Cheng
Christina Dan Wang
AIFin
30
0
0
11 May 2025
Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking
Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking
Fabrice Harel-Canada
Boran Erol
Connor Choi
J. Liu
Gary Jiarui Song
Nanyun Peng
Amit Sahai
WaLM
30
0
0
11 May 2025
CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
Wenju Sun
Qingyong Li
Yangli-ao Geng
Boyang Li
MoMe
40
0
0
11 May 2025
A Vision-Language Foundation Model for Leaf Disease Identification
A Vision-Language Foundation Model for Leaf Disease Identification
Khang Nguyen Quoc
Lan Le Thi Thu
Luyl-Da Quach
VLM
32
0
0
11 May 2025
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Min Li
Chun Yuan
ReLM
LRM
48
0
0
10 May 2025
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining
Lu Dong
Han Zhang
Hongjie Zhang
Yuanmin Huang
Z. Ling
Yu Qiao
Limin Wang
Yue Wang
AI4TS
48
0
0
10 May 2025
The Sound of Populism: Distinct Linguistic Features Across Populist Variants
The Sound of Populism: Distinct Linguistic Features Across Populist Variants
Yu Wang
Runxi Yu
Zhongyuan Wang
Jing He
26
0
0
10 May 2025
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
Hang Gao
Chenhao Zhang
Tie Wang
Junsuo Zhao
Fengge Wu
Changwen Zheng
Huaping Liu
LRM
34
0
0
09 May 2025
QoSBERT: An Uncertainty-Aware Approach based on Pre-trained Language Models for Service Quality Prediction
QoSBERT: An Uncertainty-Aware Approach based on Pre-trained Language Models for Service Quality Prediction
Ziliang Wang
Xiaohong Zhang
Ze Shi Li
Meng Yan
18
0
0
09 May 2025
FLAM: Frame-Wise Language-Audio Modeling
FLAM: Frame-Wise Language-Audio Modeling
Yusong Wu
Christos Tsirigotis
Ke Chen
Cheng-Zhi Anna Huang
Rameswar Panda
Oriol Nieto
Prem Seetharaman
Justin Salamon
55
0
0
08 May 2025
KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification
KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification
Qianbo Zang
Christophe Zgrzendek
Igor Tchappi
Afshin Khadangi
Johannes Sedlmeir
VLM
42
0
0
08 May 2025
QBR: A Question-Bank-Based Approach to Fine-Grained Legal Knowledge Retrieval for the General Public
QBR: A Question-Bank-Based Approach to Fine-Grained Legal Knowledge Retrieval for the General Public
Mingruo Yuan
Ben Kao
Tien-Hsuan Wu
AILaw
79
0
0
08 May 2025
X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP
Hanxun Huang
Sarah Monazam Erfani
Yige Li
Xingjun Ma
James Bailey
AAML
55
0
0
08 May 2025
UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections
UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections
Fatima Haouari
Carolina Scarton
Nicolò Faggiani
Nikolaos Nikolaidis
Bonka Kotseva
Ibrahim Abu Farha
Jens Linge
Kalina Bontcheva
46
0
0
08 May 2025
CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis
CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis
Hao Zhen
Jidong J. Yang
43
0
0
08 May 2025
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
Faiza Hassan
Summra Saleem
Kashif Javed
Muhammad Nabeel Asim
A. Rehman
Andreas Dengel
33
0
0
08 May 2025
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
GroverGPT-2: Simulating Grover's Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization
Min Chen
Jinglei Cheng
Pingzhi Li
Haoran Wang
Tianlong Chen
Junyu Liu
LRM
51
0
0
08 May 2025
A Tale of Two Identities: An Ethical Audit of Human and AI-Crafted Personas
A Tale of Two Identities: An Ethical Audit of Human and AI-Crafted Personas
Pranav Narayanan Venkit
Jiayi Li
Yingfan Zhou
Sarah Rajtmajer
Shomir Wilson
34
0
0
07 May 2025
Integration of Large Language Models and Traditional Deep Learning for Social Determinants of Health Prediction
Integration of Large Language Models and Traditional Deep Learning for Social Determinants of Health Prediction
Paul Landes
Jimeng Sun
Adam Cross
36
0
0
06 May 2025
Prediction-powered estimators for finite population statistics in highly imbalanced textual data: Public hate crime estimation
Prediction-powered estimators for finite population statistics in highly imbalanced textual data: Public hate crime estimation
Hannes Waldetoft
Jakob Torgander
Måns Magnusson
34
0
0
05 May 2025
Logits-Constrained Framework with RoBERTa for Ancient Chinese NER
Logits-Constrained Framework with RoBERTa for Ancient Chinese NER
Wenjie Hua
Shenghan Xu
19
0
0
05 May 2025
Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation
Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation
Gerard Pons
Besim Bilalli
Anna Queralt
43
1
0
05 May 2025
1234...919293
Next