1901.11196
Cited By
EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks
31 January 2019
Jason W. Wei
Kai Zou
Papers citing "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks"
50 / 268 papers shown
Title
BrightCookies at SemEval-2025 Task 9: Exploring Data Augmentation for Food Hazard Classification
Foteini Papadopoulou
Osman Mutlu
Neris Özen
Bas H. M. van der Velden
I. Hendrickx
Ali Hürriyetoğlu
ViT
39
0
0
29 Apr 2025
Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting
Z. R. K. Rostam
Gábor Kertész
24
0
0
26 Apr 2025
CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval
Hang Yu
Jiahao Wen
Zhedong Zheng
54
0
0
26 Apr 2025
Ustnlp16 at SemEval-2025 Task 9: Improving Model Performance through Imbalance Handling and Focal Loss
Zhuoang Cai
Zehan Li
Yi Liu
Liyuan Guo
Yangqiu Song
26
0
0
24 Apr 2025
ReSi: A Comprehensive Benchmark for Representational Similarity Measures
Max Klabunde
Tassilo Wald
Tobias Schumacher
Klaus H. Maier-Hein
Markus Strohmaier
Adriana Iamnitchi
AI4TS
VLM
76
5
0
13 Mar 2025
Diversity-Oriented Data Augmentation with Large Language Models
Zaitian Wang
Jinghan Zhang
Xinhao Zhang
Kunpeng Liu
Pengfei Wang
Yuanchun Zhou
80
1
0
17 Feb 2025
A Large-Scale Benchmark for Vietnamese Sentence Paraphrases
Sang Quang Nguyen
Kiet Van Nguyen
62
0
0
11 Feb 2025
TCProF: Time-Complexity Prediction SSL Framework
Joonghyuk Hahn
Hyeseon Ahn
Jungin Kim
Soohan Lim
Yo-Sub Han
49
0
0
10 Feb 2025
Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks
Ziwei Liu
Qi Zhang
Lifu Gao
36
0
0
28 Jan 2025
Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Models
Minki Kang
Sung Ju Hwang
Gibbeum Lee
Jaewoong Cho
KELM
43
0
0
01 Nov 2024
Vulnerability of LLMs to Vertically Aligned Text Manipulations
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Zhen Xiong
Nanyun Peng
Kai-Wei Chang
61
1
0
26 Oct 2024
Natural Language Processing for the Legal Domain: A Survey of Tasks, Datasets, Models, and Challenges
Farid Ariai
Gianluca Demartini
ELM
AILaw
VLM
43
4
0
25 Oct 2024
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
Sreyan Ghosh
Sonal Kumar
Zhifeng Kong
Rafael Valle
Bryan Catanzaro
Dinesh Manocha
DiffM
49
2
0
02 Oct 2024
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
Seanie Lee
Haebin Seong
Dong Bok Lee
Minki Kang
Xiaoyin Chen
Dominik Wagner
Yoshua Bengio
Juho Lee
Sung Ju Hwang
67
2
0
02 Oct 2024
Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification
Guanyi Mou
Yichuan Li
Kyumin Lee
36
3
0
26 Sep 2024
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
Zheng Hui
Zhaoxiao Guo
Hang Zhao
Juanyong Duan
Congrui Huang
38
6
0
23 Sep 2024
Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking
Jihyun Lee
Solee Im
Wonjun Lee
Gary Geunbae Lee
36
0
0
10 Sep 2024
Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage
Md. Rafi Ur Rashid
Jing Liu
T. Koike-Akino
Shagufta Mehnaz
Ye Wang
MU
SILM
43
3
0
30 Aug 2024
P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
Shuo Yang
Chenchen Yuan
Yao Rong
Felix Steinbauer
Gjergji Kasneci
38
1
0
17 Jun 2024
A Comprehensive Survey on Data Augmentation
Zaitian Wang
Pengfei Wang
Kunpeng Liu
Pengyang Wang
Yanjie Fu
Chang-Tien Lu
Charu Aggarwal
Jian Pei
Yuanchun Zhou
ViT
109
22
0
15 May 2024
Aspect-based Sentiment Evaluation of Chess Moves (ASSESS): an NLP-based Method for Evaluating Chess Strategies from Textbooks
Haifa Alrdahi
R. Batista-Navarro
47
0
0
10 May 2024
Data Augmentation Policy Search for Long-Term Forecasting
Liran Nochumsohn
Omri Azencot
AI4TS
TPM
46
3
0
01 May 2024
Event-enhanced Retrieval in Real-time Search
Yanan Zhang
Xiaoling Bai
Tianhua Zhou
37
1
0
09 Apr 2024
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning
Andrei Semenov
Vladimir Ivanov
Aleksandr Beznosikov
Alexander Gasnikov
42
6
0
04 Apr 2024
EDDA: A Encoder-Decoder Data Augmentation Framework for Zero-Shot Stance Detection
Daijun Ding
Li Dong
Zhichao Huang
Guangning Xu
Xu Huang
Bo Liu
Liwen Jing
Bowen Zhang
41
3
0
23 Mar 2024
Large Language Models on Fine-grained Emotion Detection Dataset with Data Augmentation and Transfer Learning
Kaipeng Wang
Zhi Jing
Yongye Su
Yikun Han
37
3
0
10 Mar 2024
FormulaReasoning: A Dataset for Formula-Based Numerical Reasoning
Xiao Li
Bolin Zhu
Kaiwen Shi
Sichen Liu
Yin Zhu
Yiwei Liu
Gong Cheng
AIMat
34
0
0
20 Feb 2024
PASCL: Supervised Contrastive Learning with Perturbative Augmentation for Particle Decay Reconstruction
Junjian Lu
Siwei Liu
Dmitrii Kobylianski
Etienne Dreyer
Eilam Gross
Houcheng Su
32
3
0
18 Feb 2024
AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods in Low-resource Regimes
Juhwan Choi
Kyohoon Jin
Junho Lee
Sangmin Song
Youngbin Kim
27
1
0
08 Feb 2024
ConFit: Improving Resume-Job Matching using Data Augmentation and Contrastive Learning
Xiao Yu
Jinzhong Zhang
Zhou Yu
43
1
0
29 Jan 2024
Importance-Aware Data Augmentation for Document-Level Neural Machine Translation
Ming-Ru Wu
Yufei Wang
George F. Foster
Lizhen Qu
Gholamreza Haffari
43
6
0
27 Jan 2024
Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pretraining
Bumsoo Kim
Yeonsik Jo
Jinhyung Kim
S. Kim
VLM
27
7
0
19 Dec 2023
Expediting Contrastive Language-Image Pretraining via Self-distilled Encoders
Bumsoo Kim
Jinhyung Kim
Yeonsik Jo
S. Kim
VLM
26
3
0
19 Dec 2023
Leveraging Domain Adaptation and Data Augmentation to Improve Qur'anic IR in English and Arabic
Vera Pavlova
23
2
0
05 Dec 2023
CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts
Yichao Cai
Yuhang Liu
Zhen Zhang
Javen Qinfeng Shi
CLIP
VLM
34
5
0
28 Nov 2023
NNG-Mix: Improving Semi-supervised Anomaly Detection with Pseudo-anomaly Generation
Hao Dong
Gaëtan Frusque
Yue Zhao
Eleni Chatzi
Olga Fink
AAML
35
5
0
20 Nov 2023
Ask Language Model to Clean Your Noisy Translation Data
Quinten Bolding
Baohao Liao
Brandon James Denis
Jun Luo
Christof Monz
32
5
0
20 Oct 2023
Using Weak Supervision and Data Augmentation in Question Answering
Chumki Basu
Binyuan Hui
Allen McIntosh
Wei Wang
J. Wullert
OOD
52
0
0
28 Sep 2023
AMPLIFY: Attention-based Mixup for Performance Improvement and Label Smoothing in Transformer
Leixin Yang
Yu Xiang
28
0
0
22 Sep 2023
Are Large Language Models Really Robust to Word-Level Perturbations?
Haoyu Wang
Guozheng Ma
Cong Yu
Ning Gui
Linrui Zhang
...
Sen Zhang
Li Shen
Xueqian Wang
Peilin Zhao
Dacheng Tao
KELM
28
22
0
20 Sep 2023
ExpCLIP: Bridging Text and Facial Expressions via Semantic Alignment
Yicheng Zhong
Huawei Wei
Pei-Yin Yang
Zhisheng Wang
CLIP
37
6
0
28 Aug 2023
Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation
Charles O'Neill
Y. Ting 丁
I. Ciucă
Jack Miller
Thang Bui
SyDa
37
1
0
15 Aug 2023
AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models
Siheng Li
Cheng Yang
Yichun Yin
Xinyu Zhu
Ze-Long Cheng
Lifeng Shang
Xin Jiang
Qun Liu
Yujiu Yang
SyDa
35
3
0
12 Aug 2023
I-WAS: a Data Augmentation Method with GPT-2 for Simile Detection
Yongzhu Chang
Rongsheng Zhang
Jiashu Pu
38
1
0
08 Aug 2023
Tag Prediction of Competitive Programming Problems using Deep Learning Techniques
Taha Lokat
Divya Prajapati
Shubhada Labde
16
1
0
03 Aug 2023
Feature-aware conditional GAN for category text generation
Xinze Li
K. Mao
Fanfan Lin
Zijian Feng
GAN
29
15
0
02 Aug 2023
Towards Generalising Neural Topical Representations
Xiaohao Yang
He Zhao
Dinh Q. Phung
Lan Du
BDL
OOD
MedIm
29
1
0
24 Jul 2023
Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation
Letian Peng
Yuwei Zhang
Jingbo Shang
LRM
24
7
0
14 Jul 2023
Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications
Saed Rezayi
Zheng Liu
Zihao Wu
Chandra Dhakal
Bao Ge
...
Gengchen Mai
Ninghao Liu
Chen Zhen
Tianming Liu
Sheng Li
28
32
0
20 Jun 2023
Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Lifan Yuan
Yangyi Chen
Ganqu Cui
Hongcheng Gao
Fangyuan Zou
Xingyi Cheng
Heng Ji
Zhiyuan Liu
Maosong Sun
39
73
0
07 Jun 2023