ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09543
  4. Cited By
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with
  Gradient-Disentangled Embedding Sharing

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

18 November 2021
Pengcheng He
Jianfeng Gao
Weizhu Chen
ArXivPDFHTML

Papers citing "DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing"

50 / 664 papers shown
Title
SRLoRA: Subspace Recomposition in Low-Rank Adaptation via Importance-Based Fusion and Reinitialization
SRLoRA: Subspace Recomposition in Low-Rank Adaptation via Importance-Based Fusion and Reinitialization
Haodong Yang
Lei Wang
Md Zakir Hossain
12
0
0
18 May 2025
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
Yuwei Zhang
W. Yu
Shangbin Feng
Yifan Zhu
Letian Peng
Jayanth Srinivasa
Gaowen Liu
Jingbo Shang
KELM
7
0
0
18 May 2025
CAPTURE: Context-Aware Prompt Injection Testing and Robustness Enhancement
CAPTURE: Context-Aware Prompt Injection Testing and Robustness Enhancement
Gauri Kholkar
Ratinder Ahuja
SILM
2
0
0
18 May 2025
Memory-Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation
Memory-Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation
Fei Wu
Jia Hu
Geyong Min
Shiqiang Wang
22
0
0
16 May 2025
Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures
Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures
Shun Inadumi
Nobuhiro Ueda
Koichiro Yoshino
ObjD
12
0
0
16 May 2025
The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks
The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks
Benedikt Ebing
Goran Glavas
32
0
0
15 May 2025
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias
Brandon Smith
Mohamed Reda Bouadjenek
Tahsin Alamgir Kheya
Phillip Dawson
S. Aryal
ALM
ELM
26
0
0
14 May 2025
Communication-Efficient Federated Fine-Tuning of Language Models via Dynamic Update Schedules
Communication-Efficient Federated Fine-Tuning of Language Models via Dynamic Update Schedules
Michail Theologitis
V. Samoladas
Antonios Deligiannakis
34
0
0
07 May 2025
TartuNLP at SemEval-2025 Task 5: Subject Tagging as Two-Stage Information Retrieval
TartuNLP at SemEval-2025 Task 5: Subject Tagging as Two-Stage Information Retrieval
Aleksei Dorkin
Kairit Sirts
56
1
0
30 Apr 2025
X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation
X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation
Guy Hadad
Haggai Roitman
Yotam Eshel
Bracha Shapira
Lior Rokach
BDL
VLM
LRM
47
0
0
29 Apr 2025
Bi-directional Model Cascading with Proxy Confidence
Bi-directional Model Cascading with Proxy Confidence
David Warren
Mark Dras
49
0
0
27 Apr 2025
Span-Level Hallucination Detection for LLM-Generated Answers
Span-Level Hallucination Detection for LLM-Generated Answers
Passant Elchafei
Mervet Abu-Elkheir
HILM
LRM
74
1
0
25 Apr 2025
Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection
Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection
Haokai Zhang
Shengtao Zhang
Zijian Cai
Heng Wang
Ruixuan Zhu
Zinan Zeng
Minnan Luo
54
0
0
24 Apr 2025
QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining
QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining
Fengze Liu
Weidong Zhou
Binbin Liu
Zhimiao Yu
Yifan Zhang
...
Yifeng Yu
Bingni Zhang
Xiaohuan Zhou
Taifeng Wang
Yong Cao
66
1
0
23 Apr 2025
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking
FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking
Jabez Magomere
Elena Kochkina
Samuel Mensah
Simerjot Kaur
Charese Smiley
30
1
0
22 Apr 2025
CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs
CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs
Yingming Zheng
Xiaoliang Liu
Peng Wu
Li Pan
LRM
38
0
0
21 Apr 2025
Natural Fingerprints of Large Language Models
Natural Fingerprints of Large Language Models
Teppei Suzuki
Ryokan Ri
Sho Takase
33
0
0
21 Apr 2025
Template-Based Financial Report Generation in Agentic and Decomposed Information Retrieval
Template-Based Financial Report Generation in Agentic and Decomposed Information Retrieval
Yong-En Tian
Yu-Chien Tang
Kuang-Da Wang
An-Zi Yen
Wen-Chih Peng
AIFin
49
0
0
19 Apr 2025
Towards Characterizing Subjectivity of Individuals through Modeling Value Conflicts and Trade-offs
Towards Characterizing Subjectivity of Individuals through Modeling Value Conflicts and Trade-offs
Younghun Lee
Dan Goldwasser
LLMAG
202
0
0
17 Apr 2025
Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models
Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models
S. Bhagat
Ibne Farabi Shihab
Anuj Sharma
32
0
0
17 Apr 2025
Robust and Fine-Grained Detection of AI Generated Texts
Robust and Fine-Grained Detection of AI Generated Texts
Ram Mohan Rao Kadiyala
Siddartha Pullakhandam
Kanwal Mehreen
Drishti Sharma
Siddhant Gupta
...
Arvind Reddy Bobbili
Suraj Telugara Chandrashekhar
Modabbir Adeeb
Srinadh Vura
Hamza Farooq
DeLMO
55
0
0
16 Apr 2025
TD-Suite: All Batteries Included Framework for Technical Debt Classification
TD-Suite: All Batteries Included Framework for Technical Debt Classification
Karthik Shivashankar
Antonio Martini
29
0
0
15 Apr 2025
Resampling Benchmark for Efficient Comprehensive Evaluation of Large Vision-Language Models
Resampling Benchmark for Efficient Comprehensive Evaluation of Large Vision-Language Models
Teppei Suzuki
Keisuke Ozawa
VLM
46
0
0
14 Apr 2025
Myanmar XNLI: Building a Dataset and Exploring Low-resource Approaches to Natural Language Inference with Myanmar
Myanmar XNLI: Building a Dataset and Exploring Low-resource Approaches to Natural Language Inference with Myanmar
Aung Kyaw Htet
Mark Dras
39
1
0
13 Apr 2025
ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance
ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance
Wissam Antoun
B. Sagot
Djamé Seddah
MQ
40
0
0
11 Apr 2025
Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation
Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation
Alireza Salemi
Chris Samarinas
Hamed Zamani
36
0
0
10 Apr 2025
Efficient Tuning of Large Language Models for Knowledge-Grounded Dialogue Generation
Efficient Tuning of Large Language Models for Knowledge-Grounded Dialogue Generation
Bo Zhang
Hui Ma
Dailin Li
Jian Ding
Jian Wang
Bo Xu
Hongfei Lin
KELM
44
0
0
10 Apr 2025
SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog
SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog
Jennifer D’Souza
Sameer Sadruddin
Holger Israel
Mathias Begoin
Diana Slawig
65
5
0
09 Apr 2025
Defending Deep Neural Networks against Backdoor Attacks via Module Switching
Defending Deep Neural Networks against Backdoor Attacks via Module Switching
Weijun Li
Ansh Arora
Xuanli He
Mark Dras
Qiongkai Xu
AAML
MoMe
53
0
0
08 Apr 2025
Multi-Sense Embeddings for Language Models and Knowledge Distillation
Multi-Sense Embeddings for Language Models and Knowledge Distillation
Qitong Wang
Mohammed J. Zaki
Georgios Kollias
Vasileios Kalantzis
KELM
31
0
0
08 Apr 2025
Adapting Large Language Models for Multi-Domain Retrieval-Augmented-Generation
Adapting Large Language Models for Multi-Domain Retrieval-Augmented-Generation
Alexandre Misrahi
Nadezhda Chirkova
Maxime Louis
Vassilina Nikoulina
RALM
85
0
0
03 Apr 2025
HERA: Hybrid Edge-cloud Resource Allocation for Cost-Efficient AI Agents
HERA: Hybrid Edge-cloud Resource Allocation for Cost-Efficient AI Agents
Shiyi Liu
Haiying Shen
Shuai Che
Mahdi Ghandi
Mingqin Li
LLMAG
53
0
0
01 Apr 2025
Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations
Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations
Chongjie Si
Zhiyi Shi
Xuehui Wang
Yichen Xiao
Xiaokang Yang
Wei-Ming Shen
AI4CE
68
0
0
01 Apr 2025
IHC-LLMiner: Automated extraction of tumour immunohistochemical profiles from PubMed abstracts using large language models
IHC-LLMiner: Automated extraction of tumour immunohistochemical profiles from PubMed abstracts using large language models
Yunsoo Kim
Michal W. S. Ong
Daniel W. Rogalsky
Manuel Rodriguez-Justo
Honghan Wu
Adam P. Levine
40
0
0
01 Apr 2025
GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition
GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition
A. Yazdani
Ihor Stepanov
Douglas Teodoro
VLM
AI4CE
44
0
0
01 Apr 2025
Do LLMs Surpass Encoders for Biomedical NER?
Do LLMs Surpass Encoders for Biomedical NER?
Motasem S Obeidat
Md Sultan al Nahian
R. Kavuluru
46
0
0
01 Apr 2025
Enhancing Domain-Specific Encoder Models with LLM-Generated Data: How to Leverage Ontologies, and How to Do Without Them
Enhancing Domain-Specific Encoder Models with LLM-Generated Data: How to Leverage Ontologies, and How to Do Without Them
Marc Felix Brinner
Tarek Al Mustafa
Sina Zarrieß
39
0
0
27 Mar 2025
VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models
VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models
Suhas G Hegde
S. K
Aruna Tiwari
59
0
0
25 Mar 2025
Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages
Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages
Tadesse Destaw Belay
Dawit Ketema Gete
A. Ayele
Olga Kolesnikova
Grigori Sidorov
Seid Muhie Yimam
37
1
0
24 Mar 2025
CoKe: Customizable Fine-Grained Story Evaluation via Chain-of-Keyword Rationalization
CoKe: Customizable Fine-Grained Story Evaluation via Chain-of-Keyword Rationalization
Brihi Joshi
Sriram Venkatapathy
Mohit Bansal
Nanyun Peng
Haw-Shiuan Chang
LRM
51
0
0
21 Mar 2025
AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models
AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models
Boshra Khalili
Andrew W.Smyth
ELM
59
0
0
20 Mar 2025
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Longteng Guo
Zhihua Wei
Jiaheng Liu
LM&Ro
78
1
0
18 Mar 2025
OSCAR: Online Soft Compression And Reranking
OSCAR: Online Soft Compression And Reranking
Maxime Louis
Thibault Formal
Hervé Déjean
S. Clinchant
33
0
0
17 Mar 2025
Modeling Subjectivity in Cognitive Appraisal with Language Models
Yuxiang Zhou
Hainiu Xu
Desmond C. Ong
Petr Slovak
Yulan He
41
0
0
14 Mar 2025
Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout
Shilong Wang
Jianchun Liu
Hongli Xu
Jiaming Yan
Xianjun Gao
61
1
0
13 Mar 2025
GRITHopper: Decomposition-Free Multi-Hop Dense Retrieval
Justus-Jonas Erker
Nils Reimers
Iryna Gurevych
63
0
0
10 Mar 2025
Detection Avoidance Techniques for Large Language Models
Sinclair Schneider
Florian Steuber
João A. G. Schneider
Gabi Dreo Rodosek
DeLMO
83
0
0
10 Mar 2025
A Graph-based Verification Framework for Fact-Checking
Yani Huang
Richong Zhang
Zhijie Nie
J. Chen
Xuefeng Zhang
39
0
0
10 Mar 2025
Quantum-PEFT: Ultra parameter-efficient fine-tuning
Toshiaki Koike-Akino
F. Tonin
Yongtao Wu
Frank Zhengqing Wu
Leyla Naz Candogan
V. Cevher
MQ
54
3
0
07 Mar 2025
EuroBERT: Scaling Multilingual Encoders for European Languages
EuroBERT: Scaling Multilingual Encoders for European Languages
Nicolas Boizard
Hippolyte Gisserot-Boukhlef
Duarte M. Alves
André F. T. Martins
Ayoub Hammal
...
Maxime Peyrard
Nuno M. Guerreiro
Patrick Fernandes
Ricardo Rei
Pierre Colombo
152
1
0
07 Mar 2025
1234...121314
Next