ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.03208
  4. Cited By
NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification
  Tasks

NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification Tasks

5 June 2023
Jean-Michel Attendu
Jean-Philippe Corbeil
ArXivPDFHTML

Papers citing "NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification Tasks"

14 / 14 papers shown
Title
Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding
Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding
Binh-Nguyen Nguyen
Yang He
43
1
0
05 Jan 2025
Pre-training data selection for biomedical domain adaptation using
  journal impact metrics
Pre-training data selection for biomedical domain adaptation using journal impact metrics
Mathieu Lai-king
P. Paroubek
34
0
0
04 Sep 2024
LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable
  Objectives
LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives
Luísa Shimabucoro
Sebastian Ruder
Julia Kreutzer
Marzieh Fadaee
Sara Hooker
SyDa
36
4
0
01 Jul 2024
Sexism Detection on a Data Diet
Sexism Detection on a Data Diet
Rabiraj Bandyopadhyay
Dennis Assenmacher
J. Alonso-Moral
Claudia Wagner
51
0
0
07 Jun 2024
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small
  Reference Models
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models
Zachary Ankner
Cody Blakeney
Kartik K. Sreenivasan
Max Marion
Matthew L. Leavitt
Mansheej Paul
43
24
0
30 May 2024
A Survey of Multimodal Large Language Model from A Data-centric
  Perspective
A Survey of Multimodal Large Language Model from A Data-centric Perspective
Tianyi Bai
Hao Liang
Binwang Wan
Yanran Xu
Xi Li
...
Ping Huang
Jiulong Shan
Conghui He
Binhang Yuan
Wentao Zhang
58
37
0
26 May 2024
SMART: Submodular Data Mixture Strategy for Instruction Tuning
SMART: Submodular Data Mixture Strategy for Instruction Tuning
Kowndinya Renduchintala
S. Bhatia
Ganesh Ramakrishnan
46
3
0
13 Mar 2024
Take the Bull by the Horns: Hard Sample-Reweighted Continual Training
  Improves LLM Generalization
Take the Bull by the Horns: Hard Sample-Reweighted Continual Training Improves LLM Generalization
Xuxi Chen
Zhendong Wang
Daouda Sow
Junjie Yang
Tianlong Chen
Yingbin Liang
Mingyuan Zhou
Zhangyang Wang
34
6
0
22 Feb 2024
Efficient Architecture Search via Bi-level Data Pruning
Efficient Architecture Search via Bi-level Data Pruning
Chongjun Tu
Peng Ye
Weihao Lin
Hancheng Ye
Chong Yu
Tao Chen
Baopu Li
Wanli Ouyang
40
2
0
21 Dec 2023
DEFT: Data Efficient Fine-Tuning for Pre-Trained Language Models via
  Unsupervised Core-Set Selection
DEFT: Data Efficient Fine-Tuning for Pre-Trained Language Models via Unsupervised Core-Set Selection
Devleena Das
Vivek Khetan
29
1
0
25 Oct 2023
Training Dynamics for Curriculum Learning: A Study on Monolingual and
  Cross-lingual NLU
Training Dynamics for Curriculum Learning: A Study on Monolingual and Cross-lingual NLU
Fenia Christopoulou
Gerasimos Lampouras
Ignacio Iacobacci
48
3
0
22 Oct 2022
Dataset Pruning: Reducing Training Data by Examining Generalization
  Influence
Dataset Pruning: Reducing Training Data by Examining Generalization Influence
Shuo Yang
Zeke Xie
Hanyu Peng
Minjing Xu
Mingming Sun
P. Li
DD
155
107
0
19 May 2022
Big Bird: Transformers for Longer Sequences
Big Bird: Transformers for Longer Sequences
Manzil Zaheer
Guru Guruganesh
Kumar Avinava Dubey
Joshua Ainslie
Chris Alberti
...
Philip Pham
Anirudh Ravula
Qifan Wang
Li Yang
Amr Ahmed
VLM
288
2,017
0
28 Jul 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
1