Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 4,654 papers shown
Title
Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA
Shuangyi Chen
Yuanxin Guo
Yue Ju
Harik Dalal
Ashish Khisti
53
2
0
03 Feb 2025
Training and Evaluating with Human Label Variation: An Empirical Study
Kemal Kurniawan
Meladel Mistica
Timothy Baldwin
Jey Han Lau
70
0
0
03 Feb 2025
RandLoRA: Full-rank parameter-efficient fine-tuning of large models
Paul Albert
Frederic Z. Zhang
Hemanth Saratchandran
Cristian Rodriguez-Opazo
Anton van den Hengel
Ehsan Abbasnejad
111
2
0
03 Feb 2025
Wizard of Shopping: Target-Oriented E-commerce Dialogue Generation with Decision Tree Branching
Xuelong Li
Zhiyu Zoey Chen
J. Choi
Nikhita Vedula
B. Fetahu
Oleg Rokhlenko
S. Malmasi
83
2
0
03 Feb 2025
DEUCE: Dual-diversity Enhancement and Uncertainty-awareness for Cold-start Active Learning
Jiaxin Guo
Cheng Chen
Shuzhen Li
Tianze Zhang
63
0
0
01 Feb 2025
Understanding Why Adam Outperforms SGD: Gradient Heterogeneity in Transformers
Akiyoshi Tomihari
Issei Sato
ODL
61
1
0
31 Jan 2025
Towards Making Flowchart Images Machine Interpretable
Shivalika Singh
Prajwal Gatti
Yogesh Kumar
Vikash Yadav
Anand Mishra
53
5
0
29 Jan 2025
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data
Deren Lei
Yaxi Li
Siyao Li
Mengya Hu
Rui Xu
Ken Archer
Mingyu Wang
Emily Ching
Alex Deng
SyDa
HILM
LRM
78
1
0
28 Jan 2025
Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement
Kei Katsumata
Motonari Kambara
Daichi Yashima
Ryosuke Korekata
Komei Sugiura
70
0
0
28 Jan 2025
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
Peiling Yi
A. Zubiaga
Yunfei Long
90
0
0
28 Jan 2025
Irony Detection, Reasoning and Understanding in Zero-shot Learning
Peiling Yi
Yuhan Xia
58
0
0
28 Jan 2025
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Rose E. Wang
Ana T. Ribeiro
Carly Robinson
Susanna Loeb
Dora Demszky
80
12
0
28 Jan 2025
Merino: Entropy-driven Design for Generative Language Models on IoT Devices
Youpeng Zhao
Ming Lin
Huadong Tang
Qiang Wu
Jun Wang
86
0
0
28 Jan 2025
SCCD: A Session-based Dataset for Chinese Cyberbullying Detection
Qingpo Yang
Yakai Chen
Zihui Xu
Yu-ming Shang
Sanchuan Guo
Xi Zhang
44
1
0
28 Jan 2025
DepressionX: Knowledge Infused Residual Attention for Explainable Depression Severity Assessment
Yusif Ibrahimov
Tarique Anwar
Tommy Yuan
49
0
0
28 Jan 2025
BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models
Yibin Wang
Haizhou Shi
Ligong Han
Dimitris N. Metaxas
Hao Wang
BDL
UQLM
116
8
0
28 Jan 2025
Concept-Guided Chain-of-Thought Prompting for Pairwise Comparison Scoring of Texts with Large Language Models
Patrick Y. Wu
Jonathan Nagler
Joshua A. Tucker
Solomon Messing
LRM
57
2
0
28 Jan 2025
Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks
Ziwei Liu
Qi Zhang
Lifu Gao
41
0
0
28 Jan 2025
Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop Strategy
Tunazzina Islam
Dan Goldwasser
82
3
0
28 Jan 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Min Zhang
LM&MA
AILaw
107
155
0
28 Jan 2025
Towards Explainable Multimodal Depression Recognition for Clinical Interviews
Wenjie Zheng
Qiming Xie
Zengzhi Wang
Jianfei Yu
Rui Xia
62
0
0
28 Jan 2025
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation
Zijie Zhong
Hanwen Liu
Xiaoya Cui
Xiaofan Zhang
Zengchang Qin
92
7
0
28 Jan 2025
Survey: Understand the challenges of MachineLearning Experts using Named EntityRecognition Tools
Florian Freund
Philippe Tamla
Matthias Hemmje
47
1
0
27 Jan 2025
Weight-based Analysis of Detokenization in Language Models: Understanding the First Stage of Inference Without Inference
Go Kamoda
Benjamin Heinzerling
Tatsuro Inaba
Keito Kudo
Keisuke Sakaguchi
Kentaro Inui
MILM
38
2
0
27 Jan 2025
Faster Configuration Performance Bug Testing with Neural Dual-level Prioritization
Youpeng Ma
Tao Chen
Ke Li
96
0
0
26 Jan 2025
Decentralized Low-Rank Fine-Tuning of Large Language Models
Sajjad Ghiasvand
Mahnoosh Alizadeh
Ramtin Pedarsani
ALM
71
0
0
26 Jan 2025
A Transformer-based Autoregressive Decoder Architecture for Hierarchical Text Classification
Younes Yousef
Lukas Galke
A. Scherp
51
0
0
23 Jan 2025
Unified CNNs and transformers underlying learning mechanism reveals multi-head attention modus vivendi
Ella Koresh
Ronit D. Gross
Yuval Meir
Yarden Tzach
Tal Halevi
Ido Kanter
ViT
49
0
0
22 Jan 2025
EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition
Hamid Nasiri
Peter Garraghan
41
1
0
21 Jan 2025
YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners' Perspectives
Nong Ming
Sachin Sharma
Jiho Noh
AI4Ed
44
0
0
20 Jan 2025
LLM supervised Pre-training for Multimodal Emotion Recognition in Conversations
Soumya Dutta
Sriram Ganapathy
39
2
0
20 Jan 2025
Revisiting Language Models in Neural News Recommender Systems
Yuyue Zhao
Jin Huang
David Vos
Maarten de Rijke
KELM
186
0
0
20 Jan 2025
Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates
Kaifeng Lyu
Haoyu Zhao
Xinran Gu
Dingli Yu
Anirudh Goyal
Sanjeev Arora
ALM
82
46
0
20 Jan 2025
Can AI-Generated Text be Reliably Detected?
Vinu Sankar Sadasivan
Aounon Kumar
S. Balasubramanian
Wenxiao Wang
S. Feizi
DeLMO
81
368
0
20 Jan 2025
AIMA at SemEval-2024 Task 3: Simple Yet Powerful Emotion Cause Pair Analysis
Alireza Ghahramani Kure
Mahshid Dehghani
Mohammad Mahdi Abootorabi
Nona Ghazizadeh
Seyed Arshan Dalili
Ehsaneddin Asgari
57
1
0
19 Jan 2025
AIMA at SemEval-2024 Task 10: History-Based Emotion Recognition in Hindi-English Code-Mixed Conversations
Mohammad Mahdi Abootorabi
Nona Ghazizadeh
Seyed Arshan Dalili
Alireza Ghahramani Kure
Mahshid Dehghani
Ehsaneddin Asgari
56
2
0
19 Jan 2025
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Xinyi Wang
Na Zhao
Zhiyuan Han
Dan Guo
Xun Yang
56
1
0
17 Jan 2025
A Simple Graph Contrastive Learning Framework for Short Text Classification
Yuqi Liu
Fausto Giunchiglia
Lan Huang
Ximing Li
Xiaoyue Feng
Renchu Guan
39
0
0
17 Jan 2025
AudioBERT: Audio Knowledge Augmented Language Model
Hyunjong Ok
Suho Yoo
Jaeho Lee
AuLLM
RALM
VLM
53
0
0
17 Jan 2025
ReFactor GNNs: Revisiting Factorisation-based Models from a Message-Passing Perspective
Yihong Chen
Pushkar Mishra
Luca Franceschi
Pasquale Minervini
Pontus Stenetorp
Sebastian Riedel
67
20
0
17 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
105
19
0
17 Jan 2025
Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring
Buse Sibel Korkmaz
Rahul Nair
Elizabeth M. Daly
Evangelos Anagnostopoulos
Christos Varytimidis
Antonio del Rio Chanona
42
0
0
13 Jan 2025
Event Argument Extraction with Enriched Prompts
Chen Liang
46
0
0
12 Jan 2025
Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences
Liu Yu
Ludie Guo
Ping Kuang
Fan Zhou
44
0
0
12 Jan 2025
A Hessian-informed hyperparameter optimization for differential learning rate
Shiyun Xu
Zhiqi Bu
Yiliang Zhang
Ian Barnett
46
1
0
12 Jan 2025
Correcting Annotator Bias in Training Data: Population-Aligned Instance Replication (PAIR)
Stephanie Eckman
Bolei Ma
Christoph Kern
Rob Chew
Yun Xue
Frauke Kreuter
41
0
0
12 Jan 2025
Aggregating Low Rank Adapters in Federated Fine-tuning
Evelyn Trautmann
Ian Hales
Martin F. Volk
AI4CE
FedML
44
0
0
10 Jan 2025
CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech
Madhurananda Pahar
Fuxiang Tao
B. Mirheidari
Nathan Pevy
Rebecca Bright
...
Lise Sproson
Dorota Braun
Caitlin Illingworth
D. Blackburn
H. Christensen
38
0
0
10 Jan 2025
AdaPRL: Adaptive Pairwise Regression Learning with Uncertainty Estimation for Universal Regression Tasks
Fuhang Liang
Rucong Xu
Deng Lin
OOD
38
0
0
10 Jan 2025
Multi-Task Model Merging via Adaptive Weight Disentanglement
Feng Xiong
Runxi Cheng
Wang Chen
Zhanqiu Zhang
Yiwen Guo
Chun Yuan
Ruifeng Xu
MoMe
106
4
0
10 Jan 2025
Previous
1
2
3
4
5
6
...
92
93
94
Next