ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09543
  4. Cited By
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with
  Gradient-Disentangled Embedding Sharing

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

18 November 2021
Pengcheng He
Jianfeng Gao
Weizhu Chen
ArXivPDFHTML

Papers citing "DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing"

50 / 664 papers shown
Title
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based
  Encoder For Legal Violation Detection and Resolution
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution
Shikha Bordia
AILaw
47
0
0
30 Oct 2024
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models
Yiming Li
Xiaogeng Liu
SILM
42
5
0
30 Oct 2024
Toxicity of the Commons: Curating Open-Source Pre-Training Data
Toxicity of the Commons: Curating Open-Source Pre-Training Data
Catherine Arnett
Eliot Jones
Ivan P. Yamshchikov
Pierre-Carl Langlais
36
2
0
29 Oct 2024
Are BabyLMs Second Language Learners?
Are BabyLMs Second Language Learners?
Lukas Edman
Lisa Bylinina
Faeze Ghorbanpour
Alexander Fraser
22
0
0
28 Oct 2024
uOttawa at LegalLens-2024: Transformer-based Classification Experiments
uOttawa at LegalLens-2024: Transformer-based Classification Experiments
Nima Meghdadi
Diana Inkpen
AILaw
26
0
0
28 Oct 2024
KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and
  Knowledge Distillation
KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation
Rambod Azimi
Rishav Rishav
M. Teichmann
Samira Ebrahimi Kahou
ALM
31
0
0
28 Oct 2024
GeoLoRA: Geometric integration for parameter efficient fine-tuning
GeoLoRA: Geometric integration for parameter efficient fine-tuning
Steffen Schotthöfer
Emanuele Zangrando
Gianluca Ceruti
Francesco Tudisco
J. Kusch
AI4CE
31
1
0
24 Oct 2024
Improving Pinterest Search Relevance Using Large Language Models
Improving Pinterest Search Relevance Using Large Language Models
Han Wang
Mukuntha Narayanan Sundararaman
Onur Gungor
Yu Xu
Krishna Kamath
Rakesh Chalasani
Kurchi Subhra Hazra
Jinfeng Rao
LRM
30
1
0
22 Oct 2024
RKadiyala at SemEval-2024 Task 8: Black-Box Word-Level Text Boundary
  Detection in Partially Machine Generated Texts
RKadiyala at SemEval-2024 Task 8: Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts
Ram Mohan Rao Kadiyala
DeLMO
26
2
0
22 Oct 2024
A Statistical Analysis of LLMs' Self-Evaluation Using Proverbs
A Statistical Analysis of LLMs' Self-Evaluation Using Proverbs
Ryosuke Sonoda
Ramya Srinivasan
61
1
0
22 Oct 2024
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training
  and Fine-tuning
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning
Arijit Das
26
1
0
21 Oct 2024
Redefining Proactivity for Information Seeking Dialogue
Redefining Proactivity for Information Seeking Dialogue
Jing Yang Lee
Seokhwan Kim
Kartik Mehta
Jiun-Yu Kao
Yu-Hsiang Lin
Arpit Gupta
30
0
0
20 Oct 2024
ChitroJera: A Regionally Relevant Visual Question Answering Dataset for
  Bangla
ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla
Deeparghya Dutta Barua
Md Sakib Ul Rahman Sourove
Md Farhan Ishmam
Fabiha Haider
Fariha Tanjim Shifat
Md Fahim
Md Farhad Alam
29
0
0
19 Oct 2024
Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts
Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts
German Gritsai
Anastasia Voznyuk
Andrey Grabovoy
Yury Chekhovich
DeLMO
80
1
0
18 Oct 2024
Breaking the Manual Annotation Bottleneck: Creating a Comprehensive
  Legal Case Criticality Dataset through Semi-Automated Labeling
Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling
Ronja Stern
Ken Kawamura
Matthias Sturmer
Ilias Chalkidis
Joel Niklaus
AILaw
ELM
43
1
0
17 Oct 2024
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
Nandan Thakur
Suleman Kazi
Ge Luo
Jimmy J. Lin
Amin Ahmad
VLM
RALM
28
7
0
17 Oct 2024
On the Risk of Evidence Pollution for Malicious Social Text Detection in
  the Era of LLMs
On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs
Herun Wan
Minnan Luo
Zhixiong Su
Guang Dai
Xiang Zhao
DeLMO
35
0
0
16 Oct 2024
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation
Xiaonan Jing
Srinivas Billa
Danny Godbout
HILM
45
0
0
16 Oct 2024
AIC CTU system at AVeriTeC: Re-framing automated fact-checking as a
  simple RAG task
AIC CTU system at AVeriTeC: Re-framing automated fact-checking as a simple RAG task
Herbert Ullrich
Tomás Mlynár
Jan Drchal
37
2
0
15 Oct 2024
Transformer-based Language Models for Reasoning in the Description Logic
  ALCQ
Transformer-based Language Models for Reasoning in the Description Logic ALCQ
Angelos Poulis
Eleni Tsalapati
Manolis Koubarakis
ReLM
LRM
29
1
0
12 Oct 2024
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language
  Model for Commonsense Reasoning
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning
Jiachun Li
Pengfei Cao
Chenhao Wang
Zhuoran Jin
Yubo Chen
Kang Liu
Xiaojian Jiang
Jiexin Xu
Jun Zhao
LRM
KELM
39
0
0
12 Oct 2024
Yesterday's News: Benchmarking Multi-Dimensional Out-of-Distribution
  Generalisation of Misinformation Detection Models
Yesterday's News: Benchmarking Multi-Dimensional Out-of-Distribution Generalisation of Misinformation Detection Models
Ivo Verhoeven
Pushkar Mishra
Ekaterina Shutova
30
0
0
12 Oct 2024
Solving the Challenge Set without Solving the Task: On Winograd Schemas
  as a Test of Pronominal Coreference Resolution
Solving the Challenge Set without Solving the Task: On Winograd Schemas as a Test of Pronominal Coreference Resolution
Ian Porada
Jackie C.K. Cheung
44
0
0
12 Oct 2024
Zero-shot Commonsense Reasoning over Machine Imagination
Zero-shot Commonsense Reasoning over Machine Imagination
Hyuntae Park
Yeachan Kim
Jun-Hyung Park
S. Lee
ReLM
VLM
LRM
29
1
0
12 Oct 2024
NoVo: Norm Voting off Hallucinations with Attention Heads in Large
  Language Models
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models
Zheng Yi Ho
Siyuan Liang
Sen Zhang
Yibing Zhan
Dacheng Tao
34
2
0
11 Oct 2024
A Target-Aware Analysis of Data Augmentation for Hate Speech Detection
A Target-Aware Analysis of Data Augmentation for Hate Speech Detection
Camilla Casula
Sara Tonelli
31
0
0
10 Oct 2024
MoDEM: Mixture of Domain Expert Models
MoDEM: Mixture of Domain Expert Models
Toby Simonds
Kemal Kurniawan
Jey Han Lau
MoE
31
1
0
09 Oct 2024
Parameter Efficient Fine-tuning via Explained Variance Adaptation
Parameter Efficient Fine-tuning via Explained Variance Adaptation
Fabian Paischer
Lukas Hauzenberger
Thomas Schmied
Benedikt Alkin
Marc Peter Deisenroth
Sepp Hochreiter
37
4
0
09 Oct 2024
QERA: an Analytical Framework for Quantization Error Reconstruction
QERA: an Analytical Framework for Quantization Error Reconstruction
Cheng Zhang
Jeffrey T. H. Wong
Can Xiao
George A. Constantinides
Yiren Zhao
MQ
47
2
0
08 Oct 2024
CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with
  Explanatory Argumentative Structures
CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures
Ekaterina Sviridova
Anar Yeginbergen
A. Estarrona
Elena Cabrio
S. Villata
Rodrigo Agerri
44
2
0
07 Oct 2024
Beyond Correlation: Interpretable Evaluation of Machine Translation
  Metrics
Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics
Stefano Perrella
Lorenzo Proietti
Pere-Lluís Huguet Cabot
Edoardo Barba
Roberto Navigli
23
3
0
07 Oct 2024
GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual,
  Cross-lingual and Multi-document News Summarization
GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization
Yangfan Ye
Xiachong Feng
Xiaocheng Feng
Weitao Ma
Libo Qin
Dongliang Xu
Qing Yang
Hongtao Liu
Bing Qin
37
2
0
05 Oct 2024
KidLM: Advancing Language Models for Children -- Early Insights and
  Future Directions
KidLM: Advancing Language Models for Children -- Early Insights and Future Directions
Mir Tafseer Nayeem
Davood Rafiei
ALM
39
3
0
04 Oct 2024
How Hard is this Test Set? NLI Characterization by Exploiting Training
  Dynamics
How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics
Adrian Cosma
Stefan Ruseti
Mihai Dascalu
Cornelia Caragea
21
2
0
04 Oct 2024
NL-Eye: Abductive NLI for Images
NL-Eye: Abductive NLI for Images
Mor Ventura
Michael Toker
Nitay Calderon
Zorik Gekhman
Yonatan Bitton
Roi Reichart
28
1
0
03 Oct 2024
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
Seanie Lee
Haebin Seong
Dong Bok Lee
Minki Kang
Xiaoyin Chen
Dominik Wagner
Yoshua Bengio
Juho Lee
Sung Ju Hwang
67
2
0
02 Oct 2024
Thinking Outside of the Differential Privacy Box: A Case Study in Text
  Privatization with Language Model Prompting
Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting
Stephen Meisenbacher
Florian Matthes
29
2
0
01 Oct 2024
Multimodal Coherent Explanation Generation of Robot Failures
Multimodal Coherent Explanation Generation of Robot Failures
Pradip Pramanick
Silvia Rossi
29
2
0
01 Oct 2024
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling
  Large Language Models
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Shuhao Chen
Weisen Jiang
Baijiong Lin
James T. Kwok
Yu Zhang
RALM
MQ
48
5
0
30 Sep 2024
A Survey on the Honesty of Large Language Models
A Survey on the Honesty of Large Language Models
Siheng Li
Cheng Yang
Taiqiang Wu
Chufan Shi
Yuji Zhang
...
Jie Zhou
Yujiu Yang
Ngai Wong
Xixin Wu
Wai Lam
HILM
35
5
0
27 Sep 2024
The Lou Dataset -- Exploring the Impact of Gender-Fair Language in
  German Text Classification
The Lou Dataset -- Exploring the Impact of Gender-Fair Language in German Text Classification
Andreas Waldis
Joel Birrer
Anne Lauscher
Iryna Gurevych
33
1
0
26 Sep 2024
A fast and sound tagging method for discontinuous named-entity
  recognition
A fast and sound tagging method for discontinuous named-entity recognition
Caio Corro
28
0
0
24 Sep 2024
A Bayesian Interpretation of Adaptive Low-Rank Adaptation
A Bayesian Interpretation of Adaptive Low-Rank Adaptation
Haolin Chen
Philip N. Garner
55
1
0
16 Sep 2024
Rediscovering the Latent Dimensions of Personality with Large Language
  Models as Trait Descriptors
Rediscovering the Latent Dimensions of Personality with Large Language Models as Trait Descriptors
Joseph Suh
Suhong Moon
Minwoo Kang
David M. Chan
34
1
0
16 Sep 2024
Algorithmic Behaviors Across Regions: A Geolocation Audit of YouTube Search for COVID-19 Misinformation Between the United States and South Africa
Algorithmic Behaviors Across Regions: A Geolocation Audit of YouTube Search for COVID-19 Misinformation Between the United States and South Africa
Hayoung Jung
Prerna Juneja
Tanushree Mitra
MLAU
68
0
0
16 Sep 2024
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking,
  fine-tuning and deploying Rerankers for RAG
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG
Gabriel de Souza P. Moreira
Ronay Ak
Benedikt Schifferer
Mengyao Xu
Radek Osmulski
Even Oldridge
29
4
0
12 Sep 2024
Modeling Information Narrative Detection and Evolution on Telegram
  during the Russia-Ukraine War
Modeling Information Narrative Detection and Evolution on Telegram during the Russia-Ukraine War
Patrick Gerard
Svitlana Volkova
Louis Penafiel
Kristina Lerman
Tim Weninger
62
0
0
12 Sep 2024
Table-to-Text Generation with Pretrained Diffusion Models
Table-to-Text Generation with Pretrained Diffusion Models
Aleksei S. Krylov
Oleg D. Somov
40
1
0
10 Sep 2024
Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for
  Political Text
Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for Political Text
Michael Burnham
Kayla Kahn
Ryan Yank Wang
Rachel X. Peng
37
5
0
03 Sep 2024
TinyAgent: Function Calling at the Edge
TinyAgent: Function Calling at the Edge
Lutfi Eren Erdogan
Nicholas Lee
Siddharth Jha
Sehoon Kim
Ryan Tabrizi
Suhong Moon
Coleman Hooper
Gopala Anumanchipalli
Kurt Keutzer
Amir Gholami
LLMAG
41
12
0
01 Sep 2024
Previous
123456...121314
Next