DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

18 November 2021

Papers citing "DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing"

50 / 664 papers shown

Title
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution Shikha Bordia AILaw 47 0 0 30 Oct 2024
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models Yiming Li Xiaogeng Liu SILM 42 5 0 30 Oct 2024
Toxicity of the Commons: Curating Open-Source Pre-Training Data Catherine Arnett Eliot Jones Ivan P. Yamshchikov Pierre-Carl Langlais 36 2 0 29 Oct 2024
Are BabyLMs Second Language Learners? Lukas Edman Lisa Bylinina Faeze Ghorbanpour Alexander Fraser 22 0 0 28 Oct 2024
uOttawa at LegalLens-2024: Transformer-based Classification Experiments Nima Meghdadi Diana Inkpen AILaw 26 0 0 28 Oct 2024
KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation Rambod Azimi Rishav Rishav M. Teichmann Samira Ebrahimi Kahou ALM 31 0 0 28 Oct 2024
GeoLoRA: Geometric integration for parameter efficient fine-tuning Steffen Schotthöfer Emanuele Zangrando Gianluca Ceruti Francesco Tudisco J. Kusch AI4CE 31 1 0 24 Oct 2024
Improving Pinterest Search Relevance Using Large Language Models Han Wang Mukuntha Narayanan Sundararaman Onur Gungor Yu Xu Krishna Kamath Rakesh Chalasani Kurchi Subhra Hazra Jinfeng Rao LRM 30 1 0 22 Oct 2024
RKadiyala at SemEval-2024 Task 8: Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts Ram Mohan Rao Kadiyala DeLMO 26 2 0 22 Oct 2024
A Statistical Analysis of LLMs' Self-Evaluation Using Proverbs Ryosuke Sonoda Ramya Srinivasan 61 1 0 22 Oct 2024
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning Arijit Das 26 1 0 21 Oct 2024
Redefining Proactivity for Information Seeking Dialogue Jing Yang Lee Seokhwan Kim Kartik Mehta Jiun-Yu Kao Yu-Hsiang Lin Arpit Gupta 30 0 0 20 Oct 2024
ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla Deeparghya Dutta Barua Md Sakib Ul Rahman Sourove Md Farhan Ishmam Fabiha Haider Fariha Tanjim Shifat Md Fahim Md Farhad Alam 29 0 0 19 Oct 2024
Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts German Gritsai Anastasia Voznyuk Andrey Grabovoy Yury Chekhovich DeLMO 80 1 0 18 Oct 2024
Breaking the Manual Annotation Bottleneck: Creating a Comprehensive Legal Case Criticality Dataset through Semi-Automated Labeling Ronja Stern Ken Kawamura Matthias Sturmer Ilias Chalkidis Joel Niklaus AILaw ELM 43 1 0 17 Oct 2024
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems Nandan Thakur Suleman Kazi Ge Luo Jimmy J. Lin Amin Ahmad VLM RALM 28 7 0 17 Oct 2024
On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs Herun Wan Minnan Luo Zhixiong Su Guang Dai Xiang Zhao DeLMO 35 0 0 16 Oct 2024
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation Xiaonan Jing Srinivas Billa Danny Godbout HILM 45 0 0 16 Oct 2024
AIC CTU system at AVeriTeC: Re-framing automated fact-checking as a simple RAG task Herbert Ullrich Tomás Mlynár Jan Drchal 37 2 0 15 Oct 2024
Transformer-based Language Models for Reasoning in the Description Logic ALCQ Angelos Poulis Eleni Tsalapati Manolis Koubarakis ReLM LRM 29 1 0 12 Oct 2024
LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning Jiachun Li Pengfei Cao Chenhao Wang Zhuoran Jin Yubo Chen Kang Liu Xiaojian Jiang Jiexin Xu Jun Zhao LRM KELM 39 0 0 12 Oct 2024
Yesterday's News: Benchmarking Multi-Dimensional Out-of-Distribution Generalisation of Misinformation Detection Models Ivo Verhoeven Pushkar Mishra Ekaterina Shutova 30 0 0 12 Oct 2024
Solving the Challenge Set without Solving the Task: On Winograd Schemas as a Test of Pronominal Coreference Resolution Ian Porada Jackie C.K. Cheung 44 0 0 12 Oct 2024
Zero-shot Commonsense Reasoning over Machine Imagination Hyuntae Park Yeachan Kim Jun-Hyung Park S. Lee ReLM VLM LRM 29 1 0 12 Oct 2024
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models Zheng Yi Ho Siyuan Liang Sen Zhang Yibing Zhan Dacheng Tao 34 2 0 11 Oct 2024
A Target-Aware Analysis of Data Augmentation for Hate Speech Detection Camilla Casula Sara Tonelli 31 0 0 10 Oct 2024
MoDEM: Mixture of Domain Expert Models Toby Simonds Kemal Kurniawan Jey Han Lau MoE 31 1 0 09 Oct 2024
Parameter Efficient Fine-tuning via Explained Variance Adaptation Fabian Paischer Lukas Hauzenberger Thomas Schmied Benedikt Alkin Marc Peter Deisenroth Sepp Hochreiter 37 4 0 09 Oct 2024
QERA: an Analytical Framework for Quantization Error Reconstruction Cheng Zhang Jeffrey T. H. Wong Can Xiao George A. Constantinides Yiren Zhao MQ 47 2 0 08 Oct 2024
CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures Ekaterina Sviridova Anar Yeginbergen A. Estarrona Elena Cabrio S. Villata Rodrigo Agerri 44 2 0 07 Oct 2024
Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics Stefano Perrella Lorenzo Proietti Pere-Lluís Huguet Cabot Edoardo Barba Roberto Navigli 23 3 0 07 Oct 2024
GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization Yangfan Ye Xiachong Feng Xiaocheng Feng Weitao Ma Libo Qin Dongliang Xu Qing Yang Hongtao Liu Bing Qin 37 2 0 05 Oct 2024
KidLM: Advancing Language Models for Children -- Early Insights and Future Directions Mir Tafseer Nayeem Davood Rafiei ALM 39 3 0 04 Oct 2024
How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics Adrian Cosma Stefan Ruseti Mihai Dascalu Cornelia Caragea 21 2 0 04 Oct 2024
NL-Eye: Abductive NLI for Images Mor Ventura Michael Toker Nitay Calderon Zorik Gekhman Yonatan Bitton Roi Reichart 28 1 0 03 Oct 2024
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models Seanie Lee Haebin Seong Dong Bok Lee Minki Kang Xiaoyin Chen Dominik Wagner Yoshua Bengio Juho Lee Sung Ju Hwang 67 2 0 02 Oct 2024
Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting Stephen Meisenbacher Florian Matthes 29 2 0 01 Oct 2024
Multimodal Coherent Explanation Generation of Robot Failures Pradip Pramanick Silvia Rossi 29 2 0 01 Oct 2024
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models Shuhao Chen Weisen Jiang Baijiong Lin James T. Kwok Yu Zhang RALM MQ 48 5 0 30 Sep 2024
A Survey on the Honesty of Large Language Models Siheng Li Cheng Yang Taiqiang Wu Chufan Shi Yuji Zhang ... Jie Zhou Yujiu Yang Ngai Wong Xixin Wu Wai Lam HILM 35 5 0 27 Sep 2024
The Lou Dataset -- Exploring the Impact of Gender-Fair Language in German Text Classification Andreas Waldis Joel Birrer Anne Lauscher Iryna Gurevych 33 1 0 26 Sep 2024
A fast and sound tagging method for discontinuous named-entity recognition Caio Corro 28 0 0 24 Sep 2024
A Bayesian Interpretation of Adaptive Low-Rank Adaptation Haolin Chen Philip N. Garner 55 1 0 16 Sep 2024
Rediscovering the Latent Dimensions of Personality with Large Language Models as Trait Descriptors Joseph Suh Suhong Moon Minwoo Kang David M. Chan 34 1 0 16 Sep 2024
Algorithmic Behaviors Across Regions: A Geolocation Audit of YouTube Search for COVID-19 Misinformation Between the United States and South Africa Hayoung Jung Prerna Juneja Tanushree Mitra MLAU 68 0 0 16 Sep 2024
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG Gabriel de Souza P. Moreira Ronay Ak Benedikt Schifferer Mengyao Xu Radek Osmulski Even Oldridge 29 4 0 12 Sep 2024
Modeling Information Narrative Detection and Evolution on Telegram during the Russia-Ukraine War Patrick Gerard Svitlana Volkova Louis Penafiel Kristina Lerman Tim Weninger 62 0 0 12 Sep 2024
Table-to-Text Generation with Pretrained Diffusion Models Aleksei S. Krylov Oleg D. Somov 40 1 0 10 Sep 2024
Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for Political Text Michael Burnham Kayla Kahn Ryan Yank Wang Rachel X. Peng 37 5 0 03 Sep 2024
TinyAgent: Function Calling at the Edge Lutfi Eren Erdogan Nicholas Lee Siddharth Jha Sehoon Kim Ryan Tabrizi Suhong Moon Coleman Hooper Gopala Anumanchipalli Kurt Keutzer Amir Gholami LLMAG 41 12 0 01 Sep 2024