Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 10,734 papers shown
Title
Precise In-Parameter Concept Erasure in Large Language Models
Yoav Gur-Arieh
Clara Suslik
Yihuai Hong
Fazl Barez
Mor Geva
KELM
MU
92
0
0
28 May 2025
LMCD: Language Models are Zeroshot Cognitive Diagnosis Learners
Yu He
Zihan Yao
Chentao Song
Tianyu Qi
Jun Liu
Ming Li
Qing Huang
AI4Ed
46
0
0
27 May 2025
The Feasibility of Topic-Based Watermarking on Academic Peer Reviews
Alexander Nemecek
Yuzhou Jiang
Erman Ayday
WaLM
43
0
0
27 May 2025
AutoSGD: Automatic Learning Rate Selection for Stochastic Gradient Descent
Nikola Surjanovic
Alexandre Bouchard-Côté
Trevor Campbell
32
0
0
27 May 2025
Test-Time Learning for Large Language Models
Jinwu Hu
Zhitian Zhang
Guohao Chen
Xutao Wen
Chao Shuai
Wei Luo
Bin Xiao
Yuanqing Li
Mingkui Tan
55
0
0
27 May 2025
VeriTrail: Closed-Domain Hallucination Detection with Traceability
Dasha Metropolitansky
Jonathan Larson
HILM
59
0
0
27 May 2025
Def-DTS: Deductive Reasoning for Open-domain Dialogue Topic Segmentation
Seungmin Lee
Yongsang Yoo
Minhwa Jung
Min Song
LRM
30
0
0
27 May 2025
CoRI: Synthesizing Communication of Robot Intent for Physical Human-Robot Interaction
Junxiang Wang
Emek Barış Küçüktabak
Rana Soltani Zarrin
Zackory Erickson
25
0
0
26 May 2025
MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning
Yuanxin Zhuang
Dazhong Shen
Ying Sun
43
1
0
26 May 2025
ETS: Open Vocabulary Electroencephalography-To-Text Decoding and Sentiment Classification
Mohamed Masry
Mohamed Amen
Mohamed Elzyat
Mohamed Hamed
Norhan Magdy
Maram Khaled
17
0
0
26 May 2025
Conversation Kernels: A Flexible Mechanism to Learn Relevant Context for Online Conversation Understanding
Vibhor Agarwal
Arjoo Gupta
Suparna De
Nishanth R. Sastry
39
0
0
26 May 2025
How to Improve the Robustness of Closed-Source Models on NLI
Joe Stacey
Lisa Alazraki
Aran Ubhi
Beyza Ermis
Aaron Mueller
Marek Rei
42
0
0
26 May 2025
DCG-SQL: Enhancing In-Context Learning for Text-to-SQL with Deep Contextual Schema Link Graph
Jihyung Lee
Jin-Seop Lee
Jaehoon Lee
YunSeok Choi
Jee-Hyong Lee
29
0
0
26 May 2025
Transformers in Protein: A Survey
Xiaowen Ling
Zhiqiang Li
Yanbin Wang
Zhuhong You
ViT
MedIm
AI4CE
138
0
0
26 May 2025
Evaluating Large Language Models for Code Review
Umut Cihan
Arda İçöz
Vahid Haratian
Eray Tüzün
ALM
24
0
0
26 May 2025
DISRetrieval: Harnessing Discourse Structure for Long Document Retrieval
H. Chen
Yi Yang
Yinghui Li
Meishan Zhang
Min Zhang
RALM
13
0
0
26 May 2025
Discrete Markov Bridge
Hengli Li
Yuxuan Wang
Song-Chun Zhu
Ying Nian Wu
Zilong Zheng
DiffM
69
0
0
26 May 2025
Learning Extrapolative Sequence Transformations from Markov Chains
Sophia Hager
Aleem Khan
Andrew Wang
Nicholas Andrews
BDL
36
0
0
26 May 2025
MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language Models
Zhongzhan Huang
Guoming Ling
Shanshan Zhong
Hefeng Wu
Liang Lin
31
0
0
26 May 2025
Detection of Suicidal Risk on Social Media: A Hybrid Model
Zaihan Yang
Ryan Leonard
Hien Tran
Rory Driscoll
Chadbourne Davis
13
0
0
26 May 2025
Rethinking the Understanding Ability across LLMs through Mutual Information
Shaojie Wang
Sirui Ding
Na Zou
37
0
0
25 May 2025
POQD: Performance-Oriented Query Decomposer for Multi-vector retrieval
Yaoyang Liu
Junlin Li
Yinjun Wu
Zhen Chen
67
0
0
25 May 2025
CrosGrpsABS: Cross-Attention over Syntactic and Semantic Graphs for Aspect-Based Sentiment Analysis in a Low-Resource Language
Md. Mithun Hossain
Md. Shakil Hossain
Sudipto Chaki
Md. Rajib Hossain
M. S. Rahman
A. B. M. Shawkat Ali
41
0
0
25 May 2025
Towards Harmonized Uncertainty Estimation for Large Language Models
Rui Li
Jing Long
Muge Qi
Heming Xia
Lei Sha
Peiyi Wang
Zhifang Sui
UQCV
66
0
0
25 May 2025
Hypercube-RAG: Hypercube-Based Retrieval-Augmented Generation for In-domain Scientific Question-Answering
Jimeng Shi
Sizhe Zhou
Bowen Jin
Wei Hu
Shaowen Wang
Giri Narasimhan
Jiawei Han
51
0
0
25 May 2025
KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning
Zhendong Mi
Qitao Tan
Xiaodong Yu
Zining Zhu
Geng Yuan
Shaoyi Huang
206
0
0
24 May 2025
Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation
Alexander Shabalin
Viacheslav Meshchaninov
Dmitry Vetrov
44
0
0
24 May 2025
PromptWise: Online Learning for Cost-Aware Prompt Assignment in Generative Models
Xiaoyan Hu
Lauren Pick
Ho-fung Leung
Farzan Farnia
40
1
0
24 May 2025
MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention
Can Yaras
Alec S. Xu
Pierre Abillama
Changwoo Lee
Laura Balzano
32
0
0
24 May 2025
DialogXpert: Driving Intelligent and Emotion-Aware Conversations through Online Value-Based Reinforcement Learning with LLM Priors
Tazeek Bin Abdur Rakib
Ambuj Mehrish
Lay-Ki Soon
Wern Han Lim
Soujanya Poria
OffRL
50
0
0
23 May 2025
Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation
Zhihua Liu
Amrutha Saseendran
Lei Tong
Xilin He
Fariba Yousefi
...
Dino Oglic
Tom Diethe
Philip Teare
Huiyu Zhou
Chen Jin
VLM
358
0
0
23 May 2025
Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?
Junyan Zhang
Yiming Huang
Shuliang Liu
Yubo Gao
Xuming Hu
166
0
0
23 May 2025
Learning Shared Representations from Unpaired Data
Amitai Yacobi
Nir Ben-Ari
Ronen Talmon
Uri Shaham
SSL
80
0
0
23 May 2025
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations
Alan Arazi
Eilam Shapira
Roi Reichart
LMTD
205
0
0
23 May 2025
Locality-Sensitive Hashing for Efficient Hard Negative Sampling in Contrastive Learning
Fabian Deuser
Philipp Hausenblas
Hannah Schieber
Daniel Roth
Martin Werner
Norbert Oswald
205
0
0
23 May 2025
PLUMAGE: Probabilistic Low rank Unbiased Min Variance Gradient Estimator for Efficient Large Model Training
Matan Haroush
Daniel Soudry
189
0
0
23 May 2025
LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing
Dario Di Palma
Alessandro De Bellis
Giovanni Servedio
Vito Walter Anelli
Fedelucio Narducci
Tommaso Di Noia
MILM
64
0
0
22 May 2025
Bayesian Optimization for Enhanced Language Models: Optimizing Acquisition Functions
Zishuo Bao
Yibo Liu
Changyutao Qiu
214
0
0
22 May 2025
Nested Named Entity Recognition as Single-Pass Sequence Labeling
Alberto Muñoz-Ortiz
David Vilares
Caio COrro
Carlos Gómez-Rodríguez
53
0
0
22 May 2025
MPL: Multiple Programming Languages with Large Language Models for Information Extraction
Bo Li
Gexiang Fang
Wei Ye
Zhenghua Xu
Jinglei Zhang
Hao Cheng
Shikun Zhang
54
0
0
22 May 2025
Transfer of Structural Knowledge from Synthetic Languages
Mikhail Budnikov
Ivan Yamshchikov
66
0
0
21 May 2025
Small Language Models in the Real World: Insights from Industrial Text Classification
Lujun Li
Lama Sleem
Niccolo Gentile
Geoffrey Nichil
Radu State
LLMAG
218
0
0
21 May 2025
Visual Perturbation and Adaptive Hard Negative Contrastive Learning for Compositional Reasoning in Vision-Language Models
Xin Huang
Ruibin Li
Tong Jia
Wei Zheng
Ya Wang
VLM
CoGe
132
0
0
21 May 2025
PsyScam: A Benchmark for Psychological Techniques in Real-World Scams
Shang Ma
Tianyi Ma
Jiahao Liu
Wei Song
Zhenkai Liang
Xusheng Xiao
Yanfang Ye
174
0
0
21 May 2025
EcomScriptBench: A Multi-task Benchmark for E-commerce Script Planning via Step-wise Intention-Driven Product Association
Weiqi Wang
Limeng Cui
Xin Liu
Sreyashi Nag
Wenju Xu
...
Y. Gao
Haiyang Zhang
Qi He
Shuiwang Ji
Yangqiu Song
132
3
0
21 May 2025
A Qualitative Investigation into LLM-Generated Multilingual Code Comments and Automatic Evaluation Metrics
Jonathan Katzy
Yongcheng Huang
Gopal-Raj Panchu
Maksym Ziemlewski
Paris Loizides
Sander Vermeulen
Arie van Deursen
Maliheh Izadi
ELM
57
1
0
21 May 2025
Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning
Yukun Zhao
Lingyong Yan
Zhenyang Li
Shuaiqiang Wang
Zhumin Chen
Zhaochun Ren
Dawei Yin
CLL
KELM
VLM
LRM
75
0
0
21 May 2025
Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
Hao Fang
Jiawei Kong
Tianqu Zhuang
Yixiang Qiu
Kuofeng Gao
Bin Chen
Shu-Tao Xia
Yaowei Wang
Min Zhang
125
0
0
21 May 2025
AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection
Jiatao Li
Mao Ye
Cheng Peng
Xunjian Yin
Xiaojun Wan
57
0
0
21 May 2025
AdUE: Improving uncertainty estimation head for LoRA adapters in LLMs
Artem Zabolotnyi
Roman Makarov
Mile Mitrovic
P. Proskura
Oleg Travkin
Roman Alferov
Alexey Zaytsev
UQCV
71
0
0
21 May 2025
Previous
1
2
3
4
5
...
213
214
215
Next