ResearchTrend.AI
RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 10,734 papers shown
Title
HACo-Det: A Study Towards Fine-Grained Machine-Generated Text Detection under Human-AI Coauthoring
Zhixiong Su
Yichen Wang
Herun Wan
Zhaohan Zhang
Minnan Luo
DeLMO
57
0
0
03 Jun 2025
QKV Projections Require a Fraction of Their Memory
Malik Khalf
Yara Shamshoum
Nitzan Hodos
Yuval Sieradzki
Assaf Schuster
MQ
VLM
68
0
0
03 Jun 2025
Natural Language Processing to Enhance Deliberation in Political Online Discussions: A Survey
Maike Behrendt
Stefan Sylvius Wagner
Carina Weinmann
Marike Bormann
Mira Warne
Stefan Harmeling
55
0
0
03 Jun 2025
Exploiting LLMs for Automatic Hypothesis Assessment via a Logit-Based Calibrated Prior
Yue Gong
Raul Castro Fernandez
13
0
0
03 Jun 2025
MLorc: Momentum Low-rank Compression for Large Language Model Adaptation
Wei Shen
Zhang Yaxiang
Minhui Huang
Mengfan Xu
Jiawei Zhang
Cong Shen
AI4CE
58
0
0
02 Jun 2025
Dictionaries to the Rescue: Cross-Lingual Vocabulary Transfer for Low-Resource Languages Using Bilingual Dictionaries
Haruki Sakajo
Yusuke Ide
Justin Vasselli
Yusuke Sakai
Yingtao Tian
Hidetaka Kamigaito
Taro Watanabe
51
0
0
02 Jun 2025
Quantifying Misattribution Unfairness in Authorship Attribution
Pegah Alipoormolabashi
Ajay Patel
Niranjan Balasubramanian
21
0
0
02 Jun 2025
Statement-Tuning Enables Efficient Cross-lingual Generalization in Encoder-only Models
Ahmed Elshabrawy
Thanh-Nhi Nguyen
Yeeun Kang
Lihan Feng
Annant Jain
...
Jonibek Mansurov
Mohamed Fazli Mohamed Imam
Jesús-Germán Ortiz-Barajas
Rendi Chevi
Alham Fikri Aji
54
0
0
02 Jun 2025
CogniAlign: Word-Level Multimodal Speech Alignment with Gated Cross-Attention for Alzheimer's Detection
David Ortiz-Perez
Manuel Benavent-Lledo
Javier Rodriguez-Juan
José García Rodríguez
David Tomás
70
0
0
02 Jun 2025
LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification
Shuzhou Yuan
Ercong Nie
Lukas Kouba
Ashish Yashwanth Kangen
Helmut Schmid
Hinrich Schütze
Michael Färber
62
0
0
02 Jun 2025
Enhancing Interpretable Image Classification Through LLM Agents and Conditional Concept Bottleneck Models
Yiwen Jiang
Deval Mehta
Wei Feng
Zongyuan Ge
59
0
0
02 Jun 2025
Something Just Like TRuST : Toxicity Recognition of Span and Target
Berk Atil
Namrata Sureddy
R. Passonneau
27
0
0
02 Jun 2025
Domain Lexical Knowledge-based Word Embedding Learning for Text Classification under Small Data
Zixiao Zhu
Kezhi Mao
50
0
0
02 Jun 2025
SAM2-LOVE: Segment Anything Model 2 in Language-aided Audio-Visual Scenes
Yuji Wang
Haoran Xu
Yong-Jin Liu
Jiaze Li
Yansong Tang
89
1
0
02 Jun 2025
The State of Large Language Models for African Languages: Progress and Challenges
Kedir Yassin Hussen
W. Sewunetie
Abinew Ali Ayele
Sukairaj Hafiz Imam
Shamsuddeen Hassan Muhammad
Seid Muhie Yimam
38
0
0
02 Jun 2025
ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation
Xinyi Liu
Lipeng Ma
Yixuan Li
Weidong Yang
Qingyuan Zhou
Jiayi Song
Shuhao Li
Ben Fei
LRM
53
0
0
01 Jun 2025
LensCraft: Your Professional Virtual Cinematographer
Zahra Dehghanian
Morteza Abolghasemi
Hossein Azizinaghsh
Amir Vahedi
Hamid Beigy
Hamid R. Rabiee
VGen
42
0
0
01 Jun 2025
GIA-MIC: Multimodal Emotion Recognition with Gated Interactive Attention and Modality-Invariant Learning Constraints
Jiajun He
Jinyi Mi
Tomoki Toda
25
0
0
01 Jun 2025
LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning
Zihang Liu
Tianyu Pang
Oleg Balabanov
Chaoqun Yang
Tianjin Huang
L. Yin
Yaoqing Yang
Shiwei Liu
LRM
53
1
0
01 Jun 2025
AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
Yuyuan Liu
Yuanhong Chen
Chong Wang
Junlin Han
Junde Wu
Can Peng
Jingkun Chen
Yu Tian
Gustavo Carneiro
VLM
49
0
0
01 Jun 2025
FinBERT2: A Specialized Bidirectional Encoder for Bridging the Gap in Finance-Specific Deployment of Large Language Models
Xuan Xu
Fufang Wen
Beilin Chu
Zhibing Fu
Qinhong Lin
Jiaqi Liu
Binjie Fei
Zhongliang Yang
Linna Zhou
Yu Li
15
0
0
31 May 2025
The Hidden Language of Harm: Examining the Role of Emojis in Harmful Online Communication and Content Moderation
Yuhang Zhou
Yimin Xiao
Wei Ai
Ge Gao
25
0
0
31 May 2025
Exploring the Performance of Perforated Backpropagation through Further Experiments
Rorry Brenner
Evan Davis
Rushi Chaudhari
Rowan Morse
Jingyao Chen
Xirui Liu
Zhaoyi You
Laurent Itti
22
0
0
31 May 2025
Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization
Joschka Braun
Carsten Eickhoff
Seyed Ali Bahrainian
LLMSV
23
0
0
30 May 2025
Domain Pre-training Impact on Representations
César González-Gutiérrez
A. Quattoni
30
0
0
30 May 2025
PRISM: A Framework for Producing Interpretable Political Bias Embeddings with Political-Aware Cross-Encoder
Yiqun Sun
Qiang Huang
Anthony K. H. Tung
Jun Yu
35
0
0
30 May 2025
GradPower: Powering Gradients for Faster Language Model Pre-Training
Mingze Wang
Jinbo Wang
Jiaqi Zhang
Wei Wang
Peng Pei
Xunliang Cai
Weinan E
Lei Wu
56
0
0
30 May 2025
Structuring Radiology Reports: Challenging LLMs with Lightweight Models
Johannes Moll
Louisa Fay
Asfandyar Azhar
Sophie Ostmeier
Tim Lueth
S. Gatidis
Curtis P. Langlotz
Jean-Benoit Delbrouck
12
0
0
30 May 2025
LightSAM: Parameter-Agnostic Sharpness-Aware Minimization
Yifei Cheng
Li Shen
Hao Sun
Nan Yin
Xiaochun Cao
Enhong Chen
AAML
30
0
0
30 May 2025
SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training
Yehonathan Refael
Guy Smorodinsky
Tom Tirer
Ofir Lindenbaum
37
0
0
30 May 2025
LLMs for Argument Mining: Detection, Extraction, and Relationship Classification of pre-defined Arguments in Online Comments
Matteo Guida
Yulia Otmakhova
Eduard H. Hovy
Lea Frermann
20
1
0
29 May 2025
Evaluating AI capabilities in detecting conspiracy theories on YouTube
Leonardo La Rocca
Francesco Corso
Francesco Pierri
43
0
0
29 May 2025
Unsupervised Transcript-assisted Video Summarization and Highlight Detection
Spyros Barbakos
Charalampos Antoniadis
Gerasimos Potamianos
Gianluca Setti
OffRL
AI4TS
135
0
0
29 May 2025
MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection
Yixian Shen
Qi Bi
Jia-Hong Huang
Hongyi Zhu
Andy D. Pimentel
Anuj Pathania
24
0
0
29 May 2025
ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations
Yiming Lei
Zhizheng Yang
Zeming Liu
Haitao Leng
Shaoguo Liu
Tingting Gao
Qingjie Liu
Yunhong Wang
30
0
0
29 May 2025
Accelerating AllReduce with a Persistent Straggler
Arjun Devraj
Eric Ding
Abhishek Vijaya Kumar
Robert Kleinberg
Rachee Singh
56
0
0
29 May 2025
AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora
Jiaxin Bai
Wei Fan
Qi Hu
Qing Zong
Chunyang Li
...
Leijie Wu
Yi Ji
Gong Zhang
Renhai Chen
Yangqiu Song
55
0
0
29 May 2025
Detecting Stealthy Backdoor Samples based on Intra-class Distance for Large Language Models
Jinwen Chen
Hainan Zhang
Fei Sun
Qinnan Zhang
Sijia Wen
Ziwei Wang
Zhiming Zheng
AAML
22
0
0
29 May 2025
MAP: Revisiting Weight Decomposition for Low-Rank Adaptation
Chongjie Si
Zhiyi Shi
Yadao Wang
Xiaokang Yang
Susanto Rahardja
Wei Shen
62
0
0
29 May 2025
Case-Based Reasoning Enhances the Predictive Power of LLMs in Drug-Drug Interaction
Guangyi Liu
Yongqi Zhang
Xunyuan Liu
Quanming Yao
29
0
0
29 May 2025
Generating Diverse Training Samples for Relation Extraction with Large Language Models
Zexuan Li
Hongliang Dai
Piji Li
SyDa
24
0
0
29 May 2025
Stairway to Success: Zero-Shot Floor-Aware Object-Goal Navigation via LLM-Driven Coarse-to-Fine Exploration
Zeying Gong
Rong Li
Tianshuai Hu
Ronghe Qiu
Lingdong Kong
Lingfeng Zhang
Yiyi Ding
Leying Zhang
Junwei Liang
43
0
0
29 May 2025
Human Empathy as Encoder: AI-Assisted Depression Assessment in Special Education
Boning Zhao
20
0
0
29 May 2025
Precise In-Parameter Concept Erasure in Large Language Models
Yoav Gur-Arieh
Clara Suslik
Yihuai Hong
Fazl Barez
Mor Geva
KELM
MU
88
0
0
28 May 2025
Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning
Yongkang Liu
Xingle Xu
Ercong Nie
Zijing Wang
Shi Feng
Daling Wang
Qian Li
Hinrich Schütze
35
0
0
28 May 2025
Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging
Haobo Zhang
Jiayu Zhou
MoMe
54
0
0
28 May 2025
Comprehensive Evaluation on Lexical Normalization: Boundary-Aware Approaches for Unsegmented Languages
S. Higashiyama
Masao Utiyama
22
0
0
28 May 2025
Limited Generalizability in Argument Mining: State-Of-The-Art Models Learn Datasets, Not Arguments
Marc Feger
Katarina Boland
Stefan Dietze
30
0
0
28 May 2025
MObyGaze: a film dataset of multimodal objectification densely annotated by experts
Julie Tores
Elisa Ancarani
L. Sassatelli
Hui-Yin Wu
Clement Bergman
...
F. Precioso
Thierry Devars
Magali Guaresi
Virginie Julliard
Sarah Lecossais
DiffM
VGen
40
0
0
28 May 2025
Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs
Ngeyen Yinkfu
12
0
0
28 May 2025