DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

18 November 2021
Pengcheng He
Jianfeng Gao
Weizhu Chen

Papers citing "DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing"

50 / 665 papers shown
Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties
Ekaterina Artemova
Verena Blaschke
Barbara Plank
36
3
0
03 Feb 2024
Rethinking the Role of Proxy Rewards in Language Model Alignment
Sungdong Kim
Minjoon Seo
SyDa
ALM
31
0
0
02 Feb 2024
CABINET: Content Relevance based Noise Reduction for Table Question Answering
Sohan Patnaik
Heril Changwal
Milan Aggarwal
Sumita Bhatia
Yaman Kumar
Balaji Krishnamurthy
LMTD
RALM
44
19
0
02 Feb 2024
Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning
Tinghui Zhu
Kai Zhang
Jian Xie
Yu-Chuan Su
LRM
28
15
0
31 Jan 2024
Do We Need Language-Specific Fact-Checking Models? The Case of Chinese
Caiqi Zhang
Zhijiang Guo
Andreas Vlachos
13
9
0
27 Jan 2024
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation
Gökçe Uludoğan
Zeynep Yirmibeşoğlu Balal
Furkan Akkurt
Melikşah Türker
Onur Gungor
S. Uskudarli
39
12
0
25 Jan 2024
Genie: Achieving Human Parity in Content-Grounded Datasets Generation
Asaf Yehudai
Boaz Carmeli
Y. Mass
Ofir Arviv
Nathaniel Mills
Assaf Toledo
Eyal Shnarch
Leshem Choshen
45
22
0
25 Jan 2024
SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning
Guoxin Chen
Kexin Tang
Chao Yang
Fuying Ye
Yu Qiao
Yiming Qian
LRM
18
3
0
24 Jan 2024
Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data
Leonardo Castro-Gonzalez
Yi-Ling Chung
Hannah Rose Kirk
John Francis
Angus R. Williams
Pica Johansson
Jonathan Bright
50
1
0
22 Jan 2024
Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion
Aly M. Kassem
Sherif Saad
AAML
25
1
0
21 Jan 2024
PRILoRA: Pruned and Rank-Increasing Low-Rank Adaptation
Nadav Benedek
Lior Wolf
32
5
0
20 Jan 2024
End-to-End Argument Mining over Varying Rhetorical Structures
Elena Chistova
23
4
0
20 Jan 2024
Learning Shortcuts: On the Misleading Promise of NLU in Language Models
Geetanjali Bihani
Julia Taylor Rayz
33
3
0
17 Jan 2024
Hallucination Detection and Hallucination Mitigation: An Investigation
Junliang Luo
Tianyu Li
Di Wu
Michael R. M. Jenkin
Steve Liu
Gregory Dudek
HILM
LLMAG
46
22
0
16 Jan 2024
Authorship Obfuscation in Multilingual Machine-Generated Text Detection
Dominik Macko
Robert Moro
Adaku Uchendu
Ivan Srba
Jason Samuel Lucas
Michiharu Yamashita
Nafis Irtiza Tripto
Dongwon Lee
Jakub Simko
Maria Bielikova
DeLMO
40
17
0
15 Jan 2024
CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning
Weiqi Wang
Tianqing Fang
Chunyang Li
Haochen Shi
Wenxuan Ding
...
Jiaxin Bai
Xin Liu
Cheng Jiayang
Chunkit Chan
Yangqiu Song
LRM
28
28
0
14 Jan 2024
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Dennis Ulmer
Elman Mansimov
Kaixiang Lin
Justin Sun
Xibin Gao
Yi Zhang
LLMAG
35
27
0
10 Jan 2024
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Jinming Che
Qinyuan Wang
Sen Liang
Wei Ren
Jinlong Lin
Xixin Cao
EGVM
24
1
0
08 Jan 2024
Enhancing Essay Scoring with Adversarial Weights Perturbation and Metric-specific AttentionPooling
Jiaxin Huang
Xinyu Zhao
Change Che
Qunwei Lin
Bo Liu
AAML
11
21
0
06 Jan 2024
Semantic Similarity Matching for Patent Documents Using Ensemble BERT-related Model and Novel Text Processing Method
Liqiang Yu
Bo Liu
Qunwei Lin
Xinyu Zhao
Change Che
14
31
0
06 Jan 2024
An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction
Urchade Zaratiana
Nadi Tomeh
Pierre Holat
Thierry Charnois
44
15
0
02 Jan 2024
A Multi-Task, Multi-Modal Approach for Predicting Categorical and Dimensional Emotions
Alex-Răzvan Ispas
Théo Deschamps-Berger
Laurence Devillers
40
1
0
31 Dec 2023
The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness
Neeraj Varshney
Pavel Dolin
Agastya Seth
Chitta Baral
AAML
ELM
25
47
0
30 Dec 2023
Building Efficient Universal Classifiers with Natural Language Inference
Moritz Laurer
W. Atteveldt
Andreu Casas
Kasper Welbers
38
8
0
29 Dec 2023
MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining
Jacob P. Portes
Alex Trott
Sam Havens
Daniel King
Abhinav Venigalla
Moin Nadeem
Nikhil Sardana
D. Khudia
Jonathan Frankle
28
17
0
29 Dec 2023
S2M: Converting Single-Turn to Multi-Turn Datasets for Conversational Question Answering
Baokui Li
Sen Zhang
Wangshu Zhang
Yicheng Chen
Changlin Yang
Sen Hu
Teng Xu
Siye Liu
Jiwei Li
44
1
0
27 Dec 2023
Multilingual Bias Detection and Mitigation for Indian Languages
Ankita Maity
Anubhav Sharma
Rudra Dhar
Tushar Abhishek
Manish Gupta
Vasudeva Varma
39
2
0
23 Dec 2023
Catwalk: A Unified Language Model Evaluation Framework for Many Datasets
Dirk Groeneveld
Anas Awadalla
Iz Beltagy
Akshita Bhagia
Ian H. Magnusson
Hao Peng
Oyvind Tafjord
Pete Walsh
Kyle Richardson
Jesse Dodge
122
1
0
15 Dec 2023
Generative Context-aware Fine-tuning of Self-supervised Speech Models
Suwon Shon
Kwangyoun Kim
Prashant Sridhar
Yi-Te Hsu
Shinji Watanabe
Karen Livescu
39
2
0
15 Dec 2023
Probing Pretrained Language Models with Hierarchy Properties
Jesús Lovón-Melgarejo
José G. Moreno
Romaric Besançon
Olivier Ferret
L. Tamine
19
3
0
15 Dec 2023
BiPFT: Binary Pre-trained Foundation Transformer with Low-rank Estimation of Binarization Residual Polynomials
Xingrun Xing
Li Du
Xinyuan Wang
Xianlin Zeng
Yequan Wang
Zheng Zhang
Jiajun Zhang
15
3
0
14 Dec 2023
Labels Need Prompts Too: Mask Matching for Natural Language Understanding Tasks
Bo Li
Wei Ye
Quan-ding Wang
Wen Zhao
Shikun Zhang
VLM
37
1
0
14 Dec 2023
Conceptualizing Suicidal Behavior: Utilizing Explanations of Predicted Outcomes to Analyze Longitudinal Social Media Data
Van Minh Nguyen
Nasheen Nur
William Stern
Thomas Mercer
Chiradeep Sen
S. Bhattacharyya
Victor Tumbiolo
Seng Jhing Goh
27
3
0
13 Dec 2023
Explanatory Argument Extraction of Correct Answers in Resident Medical Exams
Iakes Goenaga
Aitziber Atutxa
Koldo Gojenola
Maite Oronoz
Rodrigo Agerri
ELM
70
8
0
01 Dec 2023
Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension
Akira Kawabata
Saku Sugawara
ELM
24
5
0
30 Nov 2023
SPIN: Sparsifying and Integrating Internal Neurons in Large Language Models for Text Classification
Difan Jiao
Yilun Liu
Zhenwei Tang
Daniel Matter
Jürgen Pfeffer
Ashton Anderson
19
1
0
27 Nov 2023
Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney
Shachar Don-Yehiya
Leshem Choshen
Omri Abend
21
5
0
20 Nov 2023
Sparse Low-rank Adaptation of Pre-trained Language Models
Ning Ding
Xingtai Lv
Qiaosen Wang
Yulin Chen
Bowen Zhou
Zhiyuan Liu
Maosong Sun
30
55
0
20 Nov 2023
Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals
Yanai Elazar
Bhargavi Paranjape
Hao Peng
Sarah Wiegreffe
Khyathi Raghavi
Vivek Srikumar
Sameer Singh
Noah A. Smith
AAML
OOD
34
0
0
16 Nov 2023
Reducing Privacy Risks in Online Self-Disclosures with Language Models
Yao Dou
Isadora Krsek
Tarek Naous
Anubha Kabra
Sauvik Das
Alan Ritter
Wei Xu
38
22
0
16 Nov 2023
SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU
E. Razumovskaia
Goran Glavaš
Anna Korhonen
Ivan Vulić
LRM
32
2
0
16 Nov 2023
Show Your Work with Confidence: Confidence Bands for Tuning Curves
Nicholas Lourie
Kyunghyun Cho
He He
23
2
0
16 Nov 2023
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems
Jon Saad-Falcon
Omar Khattab
Christopher Potts
Matei A. Zaharia
RALM
30
106
0
16 Nov 2023
Identifying Self-Disclosures of Use, Misuse and Addiction in Community-based Social Media Posts
Chenghao Yang
Tuhin Chakrabarty
K. Hochstatter
M. Slavin
N. El-Bassel
Smaranda Muresan
33
2
0
15 Nov 2023
MELA: Multilingual Evaluation of Linguistic Acceptability
Ziyin Zhang
Yikang Liu
Wei Huang
Junyu Mao
Rui Wang
Hai Hu
30
3
0
15 Nov 2023
Transformers in the Service of Description Logic-based Contexts
Angelos Poulis
Eleni Tsalapati
Manolis Koubarakis
LRM
ReLM
28
0
0
15 Nov 2023
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
Keming Lu
Hongyi Yuan
Runji Lin
Junyang Lin
Zheng Yuan
Chang Zhou
Jingren Zhou
MoE
LRM
48
52
0
15 Nov 2023
GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer
Urchade Zaratiana
Nadi Tomeh
Pierre Holat
Thierry Charnois
37
33
0
14 Nov 2023
KTRL+F: Knowledge-Augmented In-Document Search
Hanseok Oh
Haebin Shin
Miyoung Ko
Hyunji Lee
Minjoon Seo
36
3
0
14 Nov 2023
A Survey of Confidence Estimation and Calibration in Large Language Models
Jiahui Geng
Fengyu Cai
Yuxia Wang
Heinz Koeppl
Preslav Nakov
Iryna Gurevych
UQCV
41
56
0
14 Nov 2023