RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 4,630 papers shown
Towards Symmetric Low-Rank Adapters
Tales Panoutsos
Rodrygo L. T. Santos
Flavio Figueiredo
33
0
0
29 Mar 2025
The Challenge of Achieving Attributability in Multilingual Table-to-Text Generation with Question-Answer Blueprints
Aden Haussmann
LMTD
62
0
0
29 Mar 2025
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation
Jiakai Tang
Sunhao Dai
Teng Shi
Jun Xu
X. Chen
Wen Chen
Wu Jian
Yuning Jiang
LRM
77
5
0
28 Mar 2025
Retrieving Time-Series Differences Using Natural Language Queries
Kota Dohi
Tomoya Nishida
Harsh Purohit
Takashi Endo
Y. Kawaguchi
AI4TS
48
0
0
27 Mar 2025
EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit Dialogues
Yuhan Liu
Yunbo Long
LLMAG
62
0
0
27 Mar 2025
From Deep Learning to LLMs: A survey of AI in Quantitative Investment
Bokai Cao
Saizhuo Wang
Xinyi Lin
Xiaojun Wu
Haohan Zhang
L. Ni
Jian Guo
AIFin
59
1
0
27 Mar 2025
Explainable ICD Coding via Entity Linking
Leonor Barreiros
I. Coutinho
Gonçalo M. Correia
Bruno Martins
63
0
0
26 Mar 2025
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation
Rongyu Zhang
Menghang Dong
Yuan Zhang
Liang Heng
Xiaowei Chi
Gaole Dai
Li Du
Dan Wang
Yuan Du
MoE
92
0
0
26 Mar 2025
"Is There Anything Else?": Examining Administrator Influence on Linguistic Features from the Cookie Theft Picture Description Cognitive Test
Changye Li
Zhecheng Sheng
T. Cohen
Serguei V. S. Pakhomov
AAML
56
0
0
25 Mar 2025
CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement
Gaifan Zhang
Yi Zhou
Danushka Bollegala
219
0
0
21 Mar 2025
Model Hubs and Beyond: Analyzing Model Popularity, Performance, and Documentation
Pritam Kadasi
Sriman Reddy
Srivathsa Vamsi Chaturvedula
Rudranshu Sen
Agnish Saha
Soumavo Sikdar
Sayani Sarkar
Suhani Mittal
Rohit Jindal
Mayank Singh
53
0
0
19 Mar 2025
SemEval-2025 Task 1: AdMIRe -- Advancing Multimodal Idiomaticity Representation
Thomas Pickard
Aline Villavicencio
Maggie Mi
Wei He
Dylan Phelps
Carolina Scarton
86
1
0
19 Mar 2025
ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models
Alexey Karev
Dong Xu
58
0
0
18 Mar 2025
High-entropy Advantage in Neural Networks' Generalizability
Entao Yang
Jiahui Geng
Yue Shang
Ge Zhang
AI4CE
66
0
0
17 Mar 2025
Progressive Human Motion Generation Based on Text and Few Motion Frames
Ling-an Zeng
Gaojie Wu
Ancong Wu
Jian-Fang Hu
Wei-Shi Zheng
64
1
0
17 Mar 2025
MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network
Vrushank Ahire
Kunal Shah
Mudasir Nazir Khan
Nikhil Pakhale
L. Sookha
M. A. Ganaie
Abhinav Dhall
83
0
0
16 Mar 2025
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
Hao Mark Chen
S. Hu
Wayne Luk
Timothy M. Hospedales
Hongxiang Fan
MoMe
77
0
0
16 Mar 2025
Learning to Inference Adaptively for Multimodal Large Language Models
Zhuoyan Xu
Khoi Duc Nguyen
Preeti Mukherjee
Saurabh Bagchi
Somali Chaterji
Yingyu Liang
Yin Li
LRM
52
1
0
13 Mar 2025
MERGE -- A Bimodal Dataset for Static Music Emotion Recognition
Pedro Lima Louro
Hugo Redinho
Ricardo Santos
Ricardo Malheiro
R. Panda
Rui Pedro Paiva
MoMe
75
3
0
13 Mar 2025
Sentiment Analysis in SemEval: A Review of Sentiment Identification Approaches
Bousselham EL HADDAOUI
R. Chiheb
R. Faizi
A. E. Afia
49
0
0
13 Mar 2025
OASST-ETC Dataset: Alignment Signals from Eye-tracking Analysis of LLM Responses
Angela Lopez-Cardona
Sebastian Idesis
Miguel Barreda-Ángeles
Sergi Abadal
Ioannis Arapakis
51
0
0
13 Mar 2025
Cognitive-Mental-LLM: Evaluating Reasoning in Large Language Models for Mental Health Prediction via Online Text
Avinash Patil
Amardeep Gedhu
AI4MH
LRM
43
2
0
13 Mar 2025
Probabilistic Reasoning with LLMs for k-anonymity Estimation
Jonathan Zheng
Sauvik Das
Alan Ritter
Wei-ping Xu
62
0
0
12 Mar 2025
Introducing Verification Task of Set Consistency with Set-Consistency Energy Networks
Mooho Song
Hyeryung Son
Jay-Yoon Lee
52
0
0
12 Mar 2025
Who Are You Behind the Screen? Implicit MBTI and Gender Detection Using Artificial Intelligence
Kourosh Shahnazari
Seyed Moein Ayyoubzadeh
46
0
0
12 Mar 2025
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng
Yucong Luo
Jie Ouyang
Qiang Liu
Huijie Liu
...
Bohou Zhang
Jiawei Cao
Jie Ma
Daoyu Wang
Enhong Chen
3DV
76
3
0
11 Mar 2025
Fair Text Classification via Transferable Representations
Thibaud Leteno
Michael Perrot
Charlotte Laclau
Antoine Gourru
Christophe Gravier
FaML
88
0
0
10 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
174
2
0
10 Mar 2025
Gender Encoding Patterns in Pretrained Language Model Representations
Mahdi Zakizadeh
Mohammad Taher Pilehvar
48
0
0
09 Mar 2025
Heterogeneous bimodal attention fusion for speech emotion recognition
Jiachen Luo
Huy Phan
Lin Wang
Joshua Reiss
44
0
0
09 Mar 2025
Bimodal Connection Attention Fusion for Speech Emotion Recognition
Jiachen Luo
Huy Phan
Lin Wang
Joshua D. Reiss
51
0
0
08 Mar 2025
Exploiting Edited Large Language Models as General Scientific Optimizers
Qitan Lv
T. Liu
Haoyu Wang
46
0
0
08 Mar 2025
CeTAD: Towards Certified Toxicity-Aware Distance in Vision Language Models
Xiangyu Yin
Jiaxu Liu
Zhen Chen
Jinwei Hu
Yi Dong
Xiaowei Huang
Wenjie Ruan
AAML
50
0
0
08 Mar 2025
Evaluating Discourse Cohesion in Pre-trained Language Models
Jie He
Wanqiu Long
Deyi Xiong
ELM
71
2
0
08 Mar 2025
EuroBERT: Scaling Multilingual Encoders for European Languages
Nicolas Boizard
Hippolyte Gisserot-Boukhlef
Duarte M. Alves
André F. T. Martins
Ayoub Hammal
...
Maxime Peyrard
Nuno M. Guerreiro
Patrick Fernandes
Ricardo Rei
Pierre Colombo
187
2
0
07 Mar 2025
TGEA: An error-annotated dataset and benchmark tasks for text generation from pretrained language models
Jie He
Bo Peng
Yi-Lun Liao
Qun Liu
Deyi Xiong
68
8
0
06 Mar 2025
LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue
Sangyeop Kim
S. Park
Jaewon Jung
Jinseok Kim
Sungzoon Cho
47
0
0
06 Mar 2025
Sarcasm Detection as a Catalyst: Improving Stance Detection with Cross-Target Capabilities
Gibson Nkhata
Shi Yin Hong
Susan Gauch
58
0
0
05 Mar 2025
MoSE: Hierarchical Self-Distillation Enhances Early Layer Embeddings
Andrea Gurioli
Federico Pennino
João Monteiro
Maurizio Gabbrielli
51
0
0
04 Mar 2025
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking
Nam V. Nguyen
Dien X. Tran
Thanh T. Tran
Anh T. Hoang
Tai V. Duong
Di T. Le
Phuc-Lu Le
42
0
0
02 Mar 2025
Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition
Yifei Duan
Raphael Shang
Deng Liang
Yongqiang Cai
87
0
0
28 Feb 2025
Lotus at SemEval-2025 Task 11: RoBERTa with Llama-3 Generated Explanations for Multi-Label Emotion Classification
Niloofar Ranjbar
Hamed Baghbani
40
1
0
27 Feb 2025
FedMentalCare: Towards Privacy-Preserving Fine-Tuned LLMs to Analyze Mental Health Status Using Federated Learning Framework
S M Sarwar
AI4MH
51
0
0
27 Feb 2025
DreamNet: A Multimodal Framework for Semantic and Emotional Analysis of Sleep Narratives
Tapasvi Panchagnula
42
0
0
26 Feb 2025
CAMEx: Curvature-aware Merging of Experts
Dung V. Nguyen
Minh H. Nguyen
Luc Q. Nguyen
R. Teo
T. Nguyen
Linh Duy Tran
MoMe
104
2
0
26 Feb 2025
Clip-TTS: Contrastive Text-content and Mel-spectrogram, A High-Quality Text-to-Speech Method based on Contextual Semantic Understanding
Tianyun Liu
CLIP
VLM
68
0
0
26 Feb 2025
From Small to Large Language Models: Revisiting the Federalist Papers
So Won Jeong
Veronika Rockova
42
0
0
25 Feb 2025
Data-Constrained Synthesis of Training Data for De-Identification
Thomas Vakili
Aron Henriksson
Hercules Dalianis
SyDa
49
0
0
24 Feb 2025
MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
Benedikt Alkin
Lukas Miklautz
Sepp Hochreiter
Johannes Brandstetter
VLM
78
8
0
24 Feb 2025
Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models
Jonathan Bourne
77
0
0
24 Feb 2025