ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.11692
  4. Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach

RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat
ArXivPDFHTML

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 4,654 papers shown
Title
Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA
Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA
Shuangyi Chen
Yuanxin Guo
Yue Ju
Harik Dalal
Ashish Khisti
53
2
0
03 Feb 2025
Training and Evaluating with Human Label Variation: An Empirical Study
Training and Evaluating with Human Label Variation: An Empirical Study
Kemal Kurniawan
Meladel Mistica
Timothy Baldwin
Jey Han Lau
70
0
0
03 Feb 2025
RandLoRA: Full-rank parameter-efficient fine-tuning of large models
RandLoRA: Full-rank parameter-efficient fine-tuning of large models
Paul Albert
Frederic Z. Zhang
Hemanth Saratchandran
Cristian Rodriguez-Opazo
Anton van den Hengel
Ehsan Abbasnejad
111
2
0
03 Feb 2025
Wizard of Shopping: Target-Oriented E-commerce Dialogue Generation with Decision Tree Branching
Wizard of Shopping: Target-Oriented E-commerce Dialogue Generation with Decision Tree Branching
Xuelong Li
Zhiyu Zoey Chen
J. Choi
Nikhita Vedula
B. Fetahu
Oleg Rokhlenko
S. Malmasi
83
2
0
03 Feb 2025
DEUCE: Dual-diversity Enhancement and Uncertainty-awareness for Cold-start Active Learning
DEUCE: Dual-diversity Enhancement and Uncertainty-awareness for Cold-start Active Learning
Jiaxin Guo
Cheng Chen
Shuzhen Li
Tianze Zhang
63
0
0
01 Feb 2025
Understanding Why Adam Outperforms SGD: Gradient Heterogeneity in Transformers
Understanding Why Adam Outperforms SGD: Gradient Heterogeneity in Transformers
Akiyoshi Tomihari
Issei Sato
ODL
61
1
0
31 Jan 2025
Towards Making Flowchart Images Machine Interpretable
Towards Making Flowchart Images Machine Interpretable
Shivalika Singh
Prajwal Gatti
Yogesh Kumar
Vikash Yadav
Anand Mishra
53
5
0
29 Jan 2025
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data
Deren Lei
Yaxi Li
Siyao Li
Mengya Hu
Rui Xu
Ken Archer
Mingyu Wang
Emily Ching
Alex Deng
SyDa
HILM
LRM
78
1
0
28 Jan 2025
Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement
Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement
Kei Katsumata
Motonari Kambara
Daichi Yashima
Ryosuke Korekata
Komei Sugiura
70
0
0
28 Jan 2025
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
Peiling Yi
A. Zubiaga
Yunfei Long
90
0
0
28 Jan 2025
Irony Detection, Reasoning and Understanding in Zero-shot Learning
Irony Detection, Reasoning and Understanding in Zero-shot Learning
Peiling Yi
Yuhan Xia
58
0
0
28 Jan 2025
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Rose E. Wang
Ana T. Ribeiro
Carly Robinson
Susanna Loeb
Dora Demszky
80
12
0
28 Jan 2025
Merino: Entropy-driven Design for Generative Language Models on IoT Devices
Merino: Entropy-driven Design for Generative Language Models on IoT Devices
Youpeng Zhao
Ming Lin
Huadong Tang
Qiang Wu
Jun Wang
86
0
0
28 Jan 2025
SCCD: A Session-based Dataset for Chinese Cyberbullying Detection
Qingpo Yang
Yakai Chen
Zihui Xu
Yu-ming Shang
Sanchuan Guo
Xi Zhang
44
1
0
28 Jan 2025
DepressionX: Knowledge Infused Residual Attention for Explainable Depression Severity Assessment
Yusif Ibrahimov
Tarique Anwar
Tommy Yuan
49
0
0
28 Jan 2025
BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models
BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models
Yibin Wang
Haizhou Shi
Ligong Han
Dimitris N. Metaxas
Hao Wang
BDL
UQLM
116
8
0
28 Jan 2025
Concept-Guided Chain-of-Thought Prompting for Pairwise Comparison Scoring of Texts with Large Language Models
Concept-Guided Chain-of-Thought Prompting for Pairwise Comparison Scoring of Texts with Large Language Models
Patrick Y. Wu
Jonathan Nagler
Joshua A. Tucker
Solomon Messing
LRM
57
2
0
28 Jan 2025
Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks
Ziwei Liu
Qi Zhang
Lifu Gao
41
0
0
28 Jan 2025
Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop Strategy
Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop Strategy
Tunazzina Islam
Dan Goldwasser
82
3
0
28 Jan 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Min Zhang
LM&MA
AILaw
107
155
0
28 Jan 2025
Towards Explainable Multimodal Depression Recognition for Clinical Interviews
Wenjie Zheng
Qiming Xie
Zengzhi Wang
Jianfei Yu
Rui Xia
62
0
0
28 Jan 2025
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation
Zijie Zhong
Hanwen Liu
Xiaoya Cui
Xiaofan Zhang
Zengchang Qin
92
7
0
28 Jan 2025
Survey: Understand the challenges of MachineLearning Experts using Named EntityRecognition Tools
Florian Freund
Philippe Tamla
Matthias Hemmje
47
1
0
27 Jan 2025
Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference
Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference
Go Kamoda
Benjamin Heinzerling
Tatsuro Inaba
Keito Kudo
Keisuke Sakaguchi
Kentaro Inui
MILM
38
2
0
27 Jan 2025
Faster Configuration Performance Bug Testing with Neural Dual-level Prioritization
Faster Configuration Performance Bug Testing with Neural Dual-level Prioritization
Youpeng Ma
Tao Chen
Ke Li
96
0
0
26 Jan 2025
Decentralized Low-Rank Fine-Tuning of Large Language Models
Sajjad Ghiasvand
Mahnoosh Alizadeh
Ramtin Pedarsani
ALM
71
0
0
26 Jan 2025
A Transformer-based Autoregressive Decoder Architecture for Hierarchical Text Classification
A Transformer-based Autoregressive Decoder Architecture for Hierarchical Text Classification
Younes Yousef
Lukas Galke
A. Scherp
51
0
0
23 Jan 2025
Unified CNNs and transformers underlying learning mechanism reveals multi-head attention modus vivendi
Unified CNNs and transformers underlying learning mechanism reveals multi-head attention modus vivendi
Ella Koresh
Ronit D. Gross
Yuval Meir
Yarden Tzach
Tal Halevi
Ido Kanter
ViT
49
0
0
22 Jan 2025
EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition
EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition
Hamid Nasiri
Peter Garraghan
41
1
0
21 Jan 2025
YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners' Perspectives
YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners' Perspectives
Nong Ming
Sachin Sharma
Jiho Noh
AI4Ed
44
0
0
20 Jan 2025
LLM supervised Pre-training for Multimodal Emotion Recognition in Conversations
LLM supervised Pre-training for Multimodal Emotion Recognition in Conversations
Soumya Dutta
Sriram Ganapathy
39
2
0
20 Jan 2025
Revisiting Language Models in Neural News Recommender Systems
Revisiting Language Models in Neural News Recommender Systems
Yuyue Zhao
Jin Huang
David Vos
Maarten de Rijke
KELM
186
0
0
20 Jan 2025
Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates
Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates
Kaifeng Lyu
Haoyu Zhao
Xinran Gu
Dingli Yu
Anirudh Goyal
Sanjeev Arora
ALM
82
46
0
20 Jan 2025
Can AI-Generated Text be Reliably Detected?
Can AI-Generated Text be Reliably Detected?
Vinu Sankar Sadasivan
Aounon Kumar
S. Balasubramanian
Wenxiao Wang
S. Feizi
DeLMO
81
368
0
20 Jan 2025
AIMA at SemEval-2024 Task 3: Simple Yet Powerful Emotion Cause Pair Analysis
AIMA at SemEval-2024 Task 3: Simple Yet Powerful Emotion Cause Pair Analysis
Alireza Ghahramani Kure
Mahshid Dehghani
Mohammad Mahdi Abootorabi
Nona Ghazizadeh
Seyed Arshan Dalili
Ehsaneddin Asgari
57
1
0
19 Jan 2025
AIMA at SemEval-2024 Task 10: History-Based Emotion Recognition in Hindi-English Code-Mixed Conversations
AIMA at SemEval-2024 Task 10: History-Based Emotion Recognition in Hindi-English Code-Mixed Conversations
Mohammad Mahdi Abootorabi
Nona Ghazizadeh
Seyed Arshan Dalili
Alireza Ghahramani Kure
Mahshid Dehghani
Ehsaneddin Asgari
56
2
0
19 Jan 2025
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Xinyi Wang
Na Zhao
Zhiyuan Han
Dan Guo
Xun Yang
56
1
0
17 Jan 2025
A Simple Graph Contrastive Learning Framework for Short Text Classification
A Simple Graph Contrastive Learning Framework for Short Text Classification
Yuqi Liu
Fausto Giunchiglia
Lan Huang
Ximing Li
Xiaoyue Feng
Renchu Guan
39
0
0
17 Jan 2025
AudioBERT: Audio Knowledge Augmented Language Model
AudioBERT: Audio Knowledge Augmented Language Model
Hyunjong Ok
Suho Yoo
Jaeho Lee
AuLLM
RALM
VLM
53
0
0
17 Jan 2025
ReFactor GNNs: Revisiting Factorisation-based Models from a Message-Passing Perspective
ReFactor GNNs: Revisiting Factorisation-based Models from a Message-Passing Perspective
Yihong Chen
Pushkar Mishra
Luca Franceschi
Pasquale Minervini
Pontus Stenetorp
Sebastian Riedel
67
20
0
17 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
105
19
0
17 Jan 2025
Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring
Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring
Buse Sibel Korkmaz
Rahul Nair
Elizabeth M. Daly
Evangelos Anagnostopoulos
Christos Varytimidis
Antonio del Rio Chanona
42
0
0
13 Jan 2025
Event Argument Extraction with Enriched Prompts
Event Argument Extraction with Enriched Prompts
Chen Liang
46
0
0
12 Jan 2025
Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences
Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences
Liu Yu
Ludie Guo
Ping Kuang
Fan Zhou
44
0
0
12 Jan 2025
A Hessian-informed hyperparameter optimization for differential learning rate
A Hessian-informed hyperparameter optimization for differential learning rate
Shiyun Xu
Zhiqi Bu
Yiliang Zhang
Ian Barnett
46
1
0
12 Jan 2025
Correcting Annotator Bias in Training Data: Population-Aligned Instance Replication (PAIR)
Correcting Annotator Bias in Training Data: Population-Aligned Instance Replication (PAIR)
Stephanie Eckman
Bolei Ma
Christoph Kern
Rob Chew
Yun Xue
Frauke Kreuter
41
0
0
12 Jan 2025
Aggregating Low Rank Adapters in Federated Fine-tuning
Aggregating Low Rank Adapters in Federated Fine-tuning
Evelyn Trautmann
Ian Hales
Martin F. Volk
AI4CE
FedML
44
0
0
10 Jan 2025
CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech
CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech
Madhurananda Pahar
Fuxiang Tao
B. Mirheidari
Nathan Pevy
Rebecca Bright
...
Lise Sproson
Dorota Braun
Caitlin Illingworth
D. Blackburn
H. Christensen
38
0
0
10 Jan 2025
AdaPRL: Adaptive Pairwise Regression Learning with Uncertainty Estimation for Universal Regression Tasks
AdaPRL: Adaptive Pairwise Regression Learning with Uncertainty Estimation for Universal Regression Tasks
Fuhang Liang
Rucong Xu
Deng Lin
OOD
38
0
0
10 Jan 2025
Multi-Task Model Merging via Adaptive Weight Disentanglement
Multi-Task Model Merging via Adaptive Weight Disentanglement
Feng Xiong
Runxi Cheng
Wang Chen
Zhanqiu Zhang
Yiwen Guo
Chun Yuan
Ruifeng Xu
MoMe
106
4
0
10 Jan 2025
Previous
123456...929394
Next