ResearchTrend.AI
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

18 November 2021
Pengcheng He
Jianfeng Gao
Weizhu Chen

Papers citing "DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing"

50 / 664 papers shown
Title
From Prediction to Application: Language Model-based Code Knowledge Tracing with Domain Adaptive Pre-Training and Automatic Feedback System with Pedagogical Prompting for Comprehensive Programming Education
Unggi Lee
Jiyeong Bae
Yeonji Jung
Minji Kang
Gyuri Byun
...
Sookbun Lee
Jaekwon Park
Taekyung Ahn
Gunho Lee
Hyeoncheol Kim
AI4Ed
KELM
39
1
0
31 Aug 2024
CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models
Jonathan Bourne
54
4
0
30 Aug 2024
Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models
Aradhye Agarwal
Suhas K Ramesh
Ayan Sengupta
Tanmoy Chakraborty
25
1
0
26 Aug 2024
Probing the Robustness of Vision-Language Pretrained Models: A Multimodal Adversarial Attack Approach
Jiwei Guan
Tianyu Ding
Longbing Cao
Lei Pan
Chen Wang
Xi Zheng
AAML
33
1
0
24 Aug 2024
MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries
Mohamed Elgaar
Jiali Cheng
Nidhi Vakil
Hadi Amiri
Leo Anthony Celi
36
2
0
23 Aug 2024
A Little Confidence Goes a Long Way
J. Scoville
Shang Gao
Devanshu Agrawal
Javed Qadrud-Din
29
0
0
20 Aug 2024
Improving VTE Identification through Language Models from Radiology Reports: A Comparative Study of Mamba, Phi-3 Mini, and BERT
Jamie Deng
Yusen Wu
Yelena Yesha
Phuong Nguyen
21
0
0
16 Aug 2024
SEAL: Systematic Error Analysis for Value ALignment
Manon Revel
Matteo Cargnelutti
Tyna Eloundou
Greg Leppert
40
3
0
16 Aug 2024
LoRA$^2$: Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models
Jia-Chen Zhang
Yu-Jie Xiong
He-Xi Qiu
Dong-Hai Zhu
Chun-Ming Xia
MoE
26
0
0
13 Aug 2024
LLM-Based Robust Product Classification in Commerce and Compliance
Sina Gholamian
Gianfranco Romani
Bartosz Rudnikowicz
Laura Skylaki
39
1
0
11 Aug 2024
Reference-free Hallucination Detection for Large Vision-Language Models
Qing Li
Chenyang Lyu
Jiahui Geng
Derui Zhu
Maxim Panov
Fakhri Karray
29
6
0
11 Aug 2024
A Psychology-based Unified Dynamic Framework for Curriculum Learning
Guangyu Meng
Qingkai Zeng
John P. Lalor
Hong-ye Yu
34
0
0
09 Aug 2024
Overview of the NLPCC 2024 Shared Task on Chinese Metaphor Generation
Xingwei Qu
Ge Zhang
Siwei Wu
Yizhi Li
Chenghua Lin
35
2
0
08 Aug 2024
Explicating the Implicit: Argument Detection Beyond Sentence Boundaries
Paul Roit
Aviv Slobodkin
Eran Hirsch
Arie Cattan
Ayal Klein
Valentina Pyatkin
Ido Dagan
55
1
0
08 Aug 2024
Training LLMs to Recognize Hedges in Spontaneous Narratives
Amie Paige
Adil Soubki
John Murzaku
Owen Rambow
Susan E. Brennan
32
0
0
06 Aug 2024
To Aggregate or Not to Aggregate. That is the Question: A Case Study on Annotation Subjectivity in Span Prediction
Kemal Kurniawan
Meladel Mistica
Timothy Baldwin
Jey Han Lau
30
1
0
05 Aug 2024
Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends
Giuliano Martinelli
Martin Larsson
Johannes Wiesel
31
7
0
31 Jul 2024
ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget
Adam Gould
Pere-Lluis Huguet-Cabot
S. Dadhania
Francesca Toni
79
9
0
31 Jul 2024
Evaluating Large Language Models for automatic analysis of teacher simulations
David de-Fitero-Dominguez
Mariano Albaladejo-González
Antonio Garcia-Cabot
Eva García-López
Antonio Moreno-Cediel
Erin Barno
Justin Reich
ELM
26
0
0
29 Jul 2024
KNOWCOMP POKEMON Team at DialAM-2024: A Two-Stage Pipeline for Detecting Relations in Dialogical Argument Mining
Zihao Zheng
Zhaowei Wang
Qing Zong
Yangqiu Song
LRM
48
1
0
29 Jul 2024
Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow
Tian Guo
E. Hauptmann
AIFin
41
3
0
25 Jul 2024
Papilusion at DAGPap24: Paper or Illusion? Detecting AI-generated Scientific Papers
Nikita Andreev
Alexander Shirnin
Vladislav Mikhailov
Ekaterina Artemova
35
1
0
24 Jul 2024
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
Hyuk Namgoong
Jeesu Jung
Sangkeun Jung
Yoonhyung Roh
46
1
0
24 Jul 2024
Multilingual Fine-Grained News Headline Hallucination Detection
Jiaming Shen
Tianqi Liu
Jialu Liu
Zhen Qin
Jay Pavagadhi
Simon Baumgartner
Michael Bendersky
56
0
0
22 Jul 2024
MetaSumPerceiver: Multimodal Multi-Document Evidence Summarization for Fact-Checking
Ting-Chih Chen
Chia-Wei Tang
Chris Thomas
52
3
0
18 Jul 2024
Navigating the Noisy Crowd: Finding Key Information for Claim Verification
Haisong Gong
Huanhuan Ma
Qiang Liu
Shu Wu
Liang Wang
48
1
0
17 Jul 2024
Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection
Gaetan Lopez Latouche
M. Carbonneau
Ben Swanson
32
0
0
16 Jul 2024
Enhancing Parameter Efficiency and Generalization in Large-Scale Models: A Regularized and Masked Low-Rank Adaptation Approach
Yuzhu Mao
Siqi Ping
Zihao Zhao
Yang Liu
Wenbo Ding
37
1
0
16 Jul 2024
BinaryAlign: Word Alignment as Binary Sequence Labeling
Gaetan Lopez Latouche
M. Carbonneau
Ben Swanson
31
1
0
16 Jul 2024
What distinguishes conspiracy from critical narratives? A computational analysis of oppositional discourse
Damir Korenčić
Berta Chulvi
Xavier Bonet Casals
Alejandro Toselli
M. Taulé
Paolo Rosso
41
4
0
15 Jul 2024
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents
Thomas Constum
Pierrick Tranouez
Thierry Paquet
32
5
0
12 Jul 2024
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
Yung-Sung Chuang
Linlu Qiu
Cheng-Yu Hsieh
Ranjay Krishna
Yoon Kim
James R. Glass
HILM
18
35
0
09 Jul 2024
NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification
Hongfei Huang
Tingting Liang
Xixi Sun
Zikang Jin
Yuyu Yin
NoLa
39
1
0
09 Jul 2024
MolTRES: Improving Chemical Language Representation Learning for Molecular Property Prediction
Jun-Hyung Park
Yeachan Kim
Mingyu Lee
Hyuntae Park
SangKeun Lee
40
0
0
09 Jul 2024
Open-world Multi-label Text Classification with Extremely Weak Supervision
Xintong Li
Jinya Jiang
Ria Dharmani
Jayanth Srinivasa
Gaowen Liu
Jingbo Shang
VLM
34
2
0
08 Jul 2024
See Further for Parameter Efficient Fine-tuning by Standing on the Shoulders of Decomposition
Chongjie Si
Xiaokang Yang
Wei Shen
45
5
0
07 Jul 2024
Open foundation models for Azerbaijani language
Jafar Isbarov
Kavsar Huseynova
Elvin Mammadov
Mammad Hajili
Duygu Ataman
AI4CE
45
1
0
02 Jul 2024
Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation?
Wataru Hashimoto
Hidetaka Kamigaito
Taro Watanabe
29
1
0
02 Jul 2024
Automated Text Scoring in the Age of Generative AI for the GPU-poor
C. Ormerod
Alexander Kwako
46
2
0
02 Jul 2024
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
Wataru Hashimoto
Hidetaka Kamigaito
Taro Watanabe
60
0
0
02 Jul 2024
Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning
Haobo Song
Hao Zhao
Soumajit Majumder
Tao Lin
30
3
0
01 Jul 2024
DP-MLM: Differentially Private Text Rewriting Using Masked Language Models
Stephen Meisenbacher
Maulik Chevli
Juraj Vladika
Florian Matthes
44
7
0
30 Jun 2024
Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring
Jiazheng Li
Hainiu Xu
Zhaoyue Sun
Yuxiang Zhou
David West
Cesare Aloisi
Yulan He
LRM
24
4
0
28 Jun 2024
DECOR: Improving Coherence in L2 English Writing with a Novel Benchmark for Incoherence Detection, Reasoning, and Rewriting
Xuanming Zhang
Anthony Diaz
Zixun Chen
Qingyang Wu
Kun Qian
Erik Voss
Zhou Yu
29
0
0
28 Jun 2024
RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs
Ekaterina Taktasheva
Maxim Bazhukov
Kirill Koncha
Alena Fenogenova
Ekaterina Artemova
Vladislav Mikhailov
42
9
0
27 Jun 2024
Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems
Italo Luis da Silva
Hanqi Yan
Lin Gui
Yulan He
CML
39
0
0
26 Jun 2024
RaTEScore: A Metric for Radiology Report Generation
W. Zhao
Chaoyi Wu
X. Zhang
Ya Zhang
Yanfeng Wang
Weidi Xie
24
8
0
24 Jun 2024
Exploring Factual Entailment with NLI: A News Media Study
Guy Mor-Lan
Effi Levi
69
0
0
24 Jun 2024
DemoRank: Selecting Effective Demonstrations for Large Language Models in Ranking Task
Wenhan Liu
Yutao Zhu
Zhicheng Dou
ALM
RALM
47
9
0
24 Jun 2024
xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics
Daniil Larionov
Mikhail Seleznyov
Vasiliy Viskov
Alexander Panchenko
Steffen Eger
37
3
0
20 Jun 2024