ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09543
  4. Cited By
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with
  Gradient-Disentangled Embedding Sharing

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

18 November 2021
Pengcheng He
Jianfeng Gao
Weizhu Chen
ArXivPDFHTML

Papers citing "DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing"

14 / 664 papers shown
Title
IT5: Text-to-text Pretraining for Italian Language Understanding and
  Generation
IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
Gabriele Sarti
Malvina Nissim
AILaw
18
42
0
07 Mar 2022
Short-answer scoring with ensembles of pretrained language models
Short-answer scoring with ensembles of pretrained language models
Christopher M. Ormerod
39
8
0
23 Feb 2022
NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual
  Sentiment Analysis
NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis
Shamsuddeen Hassan Muhammad
David Ifeoluwa Adelani
Sebastian Ruder
I. Ahmad
Idris Abdulmumin
...
Chris C. Emezue
Saheed Abdul
Anuoluwapo Aremu
Alipio Jeorge
P. Brazdil
45
96
0
20 Jan 2022
CRASS: A Novel Data Set and Benchmark to Test Counterfactual Reasoning
  of Large Language Models
CRASS: A Novel Data Set and Benchmark to Test Counterfactual Reasoning of Large Language Models
Jorg Frohberg
Frank Binder
SLR
6
27
0
22 Dec 2021
ADBCMM : Acronym Disambiguation by Building Counterfactuals and
  Multilingual Mixing
ADBCMM : Acronym Disambiguation by Building Counterfactuals and Multilingual Mixing
Yixuan Weng
Fei Xia
Bin Li
Xiusheng Huang
Shizhu He
14
4
0
08 Dec 2021
Human Parity on CommonsenseQA: Augmenting Self-Attention with External
  Attention
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Yichong Xu
Chenguang Zhu
Shuohang Wang
Siqi Sun
Hao Cheng
Xiaodong Liu
Jianfeng Gao
Pengcheng He
Michael Zeng
Xuedong Huang
LRM
254
55
0
06 Dec 2021
LSA: Modeling Aspect Sentiment Coherency via Local Sentiment Aggregation
LSA: Modeling Aspect Sentiment Coherency via Local Sentiment Aggregation
Heng Yang
Ke Li
25
8
0
16 Oct 2021
BBQ: A Hand-Built Bias Benchmark for Question Answering
BBQ: A Hand-Built Bias Benchmark for Question Answering
Alicia Parrish
Angelica Chen
Nikita Nangia
Vishakh Padmakumar
Jason Phang
Jana Thompson
Phu Mon Htut
Sam Bowman
223
374
0
15 Oct 2021
Attacking Open-domain Question Answering by Injecting Misinformation
Attacking Open-domain Question Answering by Injecting Misinformation
Liangming Pan
Wenhu Chen
Min-Yen Kan
Wei Wang
HILM
AAML
206
22
0
15 Oct 2021
Small-Bench NLP: Benchmark for small single GPU trained models in
  Natural Language Processing
Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing
K. Kanakarajan
Bhuvana Kundumani
Malaikannan Sankarasubbu
ALM
MoE
11
5
0
22 Sep 2021
AmbiFC: Fact-Checking Ambiguous Claims with Evidence
AmbiFC: Fact-Checking Ambiguous Claims with Evidence
Max Glockner
Ieva Staliunaite
James Thorne
Gisela Vallejo
Andreas Vlachos
Iryna Gurevych
32
22
0
01 Apr 2021
COCO-LM: Correcting and Contrasting Text Sequences for Language Model
  Pretraining
COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
125
203
0
16 Feb 2021
Megatron-LM: Training Multi-Billion Parameter Language Models Using
  Model Parallelism
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
245
1,826
0
17 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
Previous
123...121314