DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

18 November 2021
Pengcheng He
Jianfeng Gao
Weizhu Chen

Papers citing "DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing"

50 / 665 papers shown
RealKIE: Five Novel Datasets for Enterprise Key Information Extraction
Benjamin Townsend
Madison May
Christopher Wells
SyDa
42
0
0
29 Mar 2024
AIpom at SemEval-2024 Task 8: Detecting AI-produced Outputs in M4
Alexander Shirnin
Nikita Andreev
Vladislav Mikhailov
Ekaterina Artemova
DeLMO
22
1
0
28 Mar 2024
NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data
Manuel Tonneau
Pedro Vitor Quinta de Castro
Karim Lasri
I. Farouq
Lakshminarayanan Subramanian
Victor Orozco-Olvera
Samuel Fraiberger
44
10
0
28 Mar 2024
REFeREE: A REference-FREE Model-Based Metric for Text Simplification
Yichen Huang
Ekaterina Kochmar
58
1
0
26 Mar 2024
ELLEN: Extremely Lightly Supervised Learning For Efficient Named Entity Recognition
Haris Riaz
Razvan-Gabriel Dumitru
Mihai Surdeanu
MU
37
0
0
26 Mar 2024
MasonTigers at SemEval-2024 Task 8: Performance Analysis of Transformer-based Models on Machine-Generated Text Detection
Sadiya Sayara Chowdhury Puspo
Md. Nishat Raihan
Dhiman Goswami
Al Nahian Bin Emran
Amrita Ganguly
Özlem Uzuner
DeLMO
41
1
0
22 Mar 2024
Automatic Annotation of Grammaticality in Child-Caregiver Conversations
Mitja Nikolaus
Abhishek Agrawal
Petros Kaklamanis
Alex Warstadt
Abdellah Fourtassi
41
2
0
21 Mar 2024
Adaptive Ensembles of Fine-Tuned Transformers for LLM-Generated Text Detection
Zhixin Lai
Xuesheng Zhang
Suiyao Chen
DeLMO
41
32
0
20 Mar 2024
Don't be a Fool: Pooling Strategies in Offensive Language Detection from User-Intended Adversarial Attacks
Seunguk Yu
Juhwan Choi
Youngbin Kim
AAML
21
0
0
20 Mar 2024
SEVEN: Pruning Transformer Model by Reserving Sentinels
Jinying Xiao
Ping Li
Jie Nie
Zhe Tang
39
3
0
19 Mar 2024
ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models
Yuzhao Heng
Chun-Ying Deng
Yitong Li
Yue Yu
Yinghao Li
Rongzhi Zhang
Chao Zhang
33
4
0
17 Mar 2024
Team Trifecta at Factify5WQA: Setting the Standard in Fact Verification with Fine-Tuning
Shang-Hsuan Chiang
Ming-Chih Lo
Lin-Wei Chao
Wen-Chih Peng
28
2
0
15 Mar 2024
Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information
Shadi Iskander
Kira Radinsky
Yonatan Belinkov
56
4
0
14 Mar 2024
Language-Grounded Dynamic Scene Graphs for Interactive Object Search with Mobile Manipulation
Daniel Honerkamp
Martin Buchner
Fabien Despinoy
Tim Welschehold
Abhinav Valada
LM&Ro
40
28
0
13 Mar 2024
Tastle: Distract Large Language Models for Automatic Jailbreak Attack
Zeguan Xiao
Yan Yang
Guanhua Chen
Yun-Nung Chen
AAML
40
18
0
13 Mar 2024
Complex Reasoning over Logical Queries on Commonsense Knowledge Graphs
Tianqing Fang
Zeming Chen
Yangqiu Song
Antoine Bosselut
ReLM
LRM
37
12
0
12 Mar 2024
Calibrating Large Language Models Using Their Generations Only
Dennis Ulmer
Martin Gubri
Hwaran Lee
Sangdoo Yun
Seong Joon Oh
UQLM
432
18
1
09 Mar 2024
MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts
Zinan Zeng
Sen Ye
Zijian Cai
Heng Wang
Yuhan Liu
Qinghua Zheng
Minnan Luo
31
0
0
08 Mar 2024
Benchmarking Large Language Models for Molecule Prediction Tasks
Zhiqiang Zhong
Kuangyu Zhou
Davide Mottin
40
8
0
08 Mar 2024
Defending Against Unforeseen Failure Modes with Latent Adversarial Training
Stephen Casper
Lennart Schulze
Oam Patel
Dylan Hadfield-Menell
AAML
57
28
0
08 Mar 2024
Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and Insights
Zijie Zeng
Shiqi Liu
Lele Sha
Zhuang Li
Kaixun Yang
Sannyuya Liu
Dragan Gašević
Guanliang Chen
DeLMO
50
1
0
06 Mar 2024
A Decade of Privacy-Relevant Android App Reviews: Large Scale Trends
Omer Akgul
Sai Teja Peddinti
Nina Taft
Michelle L. Mazurek
Hamza Harkous
Animesh Srivastava
Benoit Seguin
33
5
0
04 Mar 2024
Formulation Comparison for Timeline Construction using LLMs
Kimihiro Hasegawa
Nikhil Kandukuri
Susan Holm
Yukari Yamakawa
Teruko Mitamura
43
0
0
01 Mar 2024
An Interpretable Ensemble of Graph and Language Models for Improving Search Relevance in E-Commerce
Nurendra Choudhary
E-Wen Huang
Karthik Subbian
Chandan K. Reddy
14
3
0
01 Mar 2024
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models
Kedi Chen
Qin Chen
Jie Zhou
Yishen He
Liang He
HILM
38
1
0
01 Mar 2024
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization
Tom Hosking
Hao Tang
Mirella Lapata
37
2
0
01 Mar 2024
Survey in Characterization of Semantic Change
Jader Martins Camboim de Sá
Marcos Da Silveira
C. Pruski
34
8
0
29 Feb 2024
RORA: Robust Free-Text Rationale Evaluation
Zhengping Jiang
Yining Lu
Hanjie Chen
Daniel Khashabi
Benjamin Van Durme
Anqi Liu
53
1
0
28 Feb 2024
Variational Learning is Effective for Large Deep Networks
Yuesong Shen
Nico Daheim
Bai Cong
Peter Nickl
Gian Maria Marconi
...
Rio Yokota
Iryna Gurevych
Daniel Cremers
Mohammad Emtiyaz Khan
Thomas Möllenhoff
43
22
0
27 Feb 2024
Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration
Xin Mao
Fengming Li
Huimin Xu
Wei Zhang
A. Luu
ALM
45
6
0
25 Feb 2024
Abdelhak at SemEval-2024 Task 9 : Decoding Brainteasers, The Efficacy of Dedicated Models Versus ChatGPT
Abdelhak Kelious
Mounir Okirim
LRM
26
1
0
24 Feb 2024
Is ChatGPT the Future of Causal Text Mining? A Comprehensive Evaluation and Analysis
Takehiro Takayanagi
Masahiro Suzuki
Ryotaro Kobayashi
Hiroki Sakaji
Kiyoshi Izumi
50
1
0
22 Feb 2024
DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain
Yanis Labrak
Adrien Bazoge
Oumaima El Khettari
Mickael Rouvier
Pacome Constant dit Beaufils
...
B. Daille
Solen Quiniou
Emmanuel Morin
P. Gourraud
Richard Dufour
LM&MA
34
6
0
20 Feb 2024
Harnessing Large Language Models as Post-hoc Correctors
Zhiqiang Zhong
Kuangyu Zhou
Davide Mottin
39
4
0
20 Feb 2024
A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction
Yinghao Li
R. Ramprasad
Chao Zhang
101
12
0
20 Feb 2024
Are ELECTRA's Sentence Embeddings Beyond Repair? The Case of Semantic Textual Similarity
Ivan Rep
David Dukić
Jan Šnajder
37
0
0
20 Feb 2024
Team QUST at SemEval-2024 Task 8: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting AI-generated Text
Xiaoman Xu
Xiangrun Li
Taihang Wang
Jianxiang Tian
Ye Jiang
DeLMO
37
3
0
19 Feb 2024
Uncovering Latent Human Wellbeing in Language Model Embeddings
Pedro Freire
ChengCheng Tan
Adam Gleave
Dan Hendrycks
Scott Emmons
36
1
0
19 Feb 2024
Prospector Heads: Generalized Feature Attribution for Large Models & Data
Gautam Machiraju
Alexander Derry
Arjun D Desai
Neel Guha
Amir-Hossein Karimi
James Zou
Russ Altman
Christopher Ré
Parag Mallick
AI4TS
MedIm
50
0
0
18 Feb 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks
Yichen Wang
Shangbin Feng
Abe Bohan Hou
Xiao Pu
Chao Shen
Xiaoming Liu
Yulia Tsvetkov
Tianxing He
DeLMO
48
17
0
18 Feb 2024
From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings
Aishik Rakshit
Smriti Singh
Shuvam Keshari
Arijit Ghosh Chowdhury
Vinija Jain
Aman Chadha
37
1
0
18 Feb 2024
Can We Verify Step by Step for Incorrect Answer Detection?
Xin Xu
Shizhe Diao
Can Yang
Yang Wang
LRM
130
14
0
16 Feb 2024
Long-form evaluation of model editing
Domenic Rosati
Robie Gonzales
Jinkun Chen
Xuemin Yu
Melis Erkan
Yahya Kayani
Satya Deepika Chavatapalli
Frank Rudzicz
Hassan Sajjad
KELM
22
11
0
14 Feb 2024
Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
Zhichen Dong
Zhanhui Zhou
Chao Yang
Jing Shao
Yu Qiao
ELM
52
58
0
14 Feb 2024
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data
B. Peng
Xinyi Ling
Ziru Chen
Huan Sun
Xia Ning
ELM
37
17
0
13 Feb 2024
Plausible Extractive Rationalization through Semi-Supervised Entailment Signal
Yeo Wei Jie
Ranjan Satapathy
Min Zhang
19
5
0
13 Feb 2024
Lying Blindly: Bypassing ChatGPT's Safeguards to Generate Hard-to-Detect Disinformation Claims at Scale
Freddy Heppell
M. Bakir
Kalina Bontcheva
DeLMO
33
1
0
13 Feb 2024
AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods in Low-resource Regimes
Juhwan Choi
Kyohoon Jin
Junho Lee
Sangmin Song
Youngbin Kim
30
1
0
08 Feb 2024
ApiQ: Finetuning of 2-Bit Quantized Large Language Model
Baohao Liao
Christian Herold
Shahram Khadivi
Christof Monz
CLL
MQ
47
12
0
07 Feb 2024
English Prompts are Better for NLI-based Zero-Shot Emotion Classification than Target-Language Prompts
Patrick Bareiss
Roman Klinger
Jeremy Barnes
27
7
0
05 Feb 2024