2012.02952
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation
5 December 2020
Ruibo Liu
Guangxuan Xu
Chenyan Jia
Weicheng Ma
Lili Wang
Soroush Vosoughi
Papers citing "Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation"
42 / 42 papers shown
Title
Semantic Probabilistic Control of Language Models
Kareem Ahmed
Catarina G Belém
Padhraic Smyth
Sameer Singh
119
1
0
04 May 2025
The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks
Joan C. Timoneda
DiffM
SyDa
68
0
0
21 Apr 2025
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
Zheng Hui
Zhaoxiao Guo
Hang Zhao
Juanyong Duan
Congrui Huang
160
7
0
23 Sep 2024
Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMs
Shang Zhou
Feng Yao
Chengyu Dong
Zihan Wang
Jingbo Shang
77
2
0
06 Jun 2024
Animal Behavior Analysis Methods Using Deep Learning: A Survey
Edoardo Fazzari
Donato Romano
Fabrizio Falchi
Cesare Stefanini
79
6
0
22 May 2024
A Comprehensive Survey on Data Augmentation
Zaitian Wang
Pengfei Wang
Kunpeng Liu
Pengyang Wang
Yanjie Fu
Chang-Tien Lu
Charu Aggarwal
Jian Pei
Yuanchun Zhou
ViT
175
28
0
15 May 2024
A Framework for Real-time Safeguarding the Text Generation of Large Language Model
Ximing Dong
Dayi Lin
Shaowei Wang
Ahmed E. Hassan
133
1
0
29 Apr 2024
DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness
Yuqi Wang
Zeqiang Wang
Wei Wang
Qi Chen
Kaizhu Huang
Anh Nguyen
Suparna De
59
2
0
14 Apr 2024
Evaluation Metrics for Text Data Augmentation in NLP
Marcellus Amadeus
William Alberto Cruz Castañeda
70
1
0
09 Feb 2024
Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion
Aly M. Kassem
Sherif Saad
AAML
69
1
0
21 Jan 2024
Using GPT-4 to Augment Unbalanced Data for Automatic Scoring
Luyang Fang
Gyeong-Geon Lee
Xiaoming Zhai
70
19
0
25 Oct 2023
"A Tale of Two Movements": Identifying and Comparing Perspectives in #BlackLivesMatter and #BlueLivesMatter Movements-related Tweets using Weakly Supervised Graph-based Structured Prediction
Shamik Roy
Dan Goldwasser
93
4
0
11 Oct 2023
Reward Dropout Improves Control: Bi-objective Perspective on Reinforced LM
Changhun Lee
Chiehyeon Lim
80
0
0
06 Oct 2023
Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation
Charles O'Neill
Y. Ting 丁
I. Ciucă
Jack Miller
Thang Bui
SyDa
121
1
0
15 Aug 2023
Training Models to Generate, Recognize, and Reframe Unhelpful Thoughts
Mounica Maddela
Megan Ung
Jing Xu
Andrea Madotto
H. Foran
Y-Lan Boureau
LRM
107
23
0
06 Jul 2023
Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions
John Joon Young Chung
Ece Kamar
Saleema Amershi
ALM
106
121
0
07 Jun 2023
Adaptive and Personalized Exercise Generation for Online Language Learning
Peng Cui
Mrinmaya Sachan
AI4Ed
96
23
0
04 Jun 2023
BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases
Xin Liu
Muhammad Khalifa
Lu Wang
115
20
0
19 May 2023
Data Augmentation for Neural NLP
Domagoj Pluscec
Jan Snajder
97
6
0
22 Feb 2023
DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Generation
Hanqing Zhang
Dawei Song
89
38
0
18 Oct 2022
Improving Short Text Classification With Augmented Data Using GPT-3
Salvador Balkus
Donghui Yan
61
37
0
23 May 2022
PromptDA: Label-guided Data Augmentation for Prompt-based Few-shot Learners
Canyu Chen
Kai Shu
VLM
104
8
0
18 May 2022
TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
Le Zhang
Zichao Yang
Diyi Yang
112
25
0
12 May 2022
EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
Minyi Zhao
Lu Zhang
Yi Xu
Jiandong Ding
Jihong Guan
Shuigeng Zhou
VLM
86
11
0
24 Apr 2022
Non-Parallel Text Style Transfer with Self-Parallel Supervision
Ruibo Liu
Chongyang Gao
Chenyan Jia
Guangxuan Xu
Soroush Vosoughi
VLM
84
16
0
18 Apr 2022
Knowledge Infused Decoding
Ruibo Liu
Guoqing Zheng
Shashank Gupta
Radhika Gaonkar
Chongyang Gao
Soroush Vosoughi
Milad Shokouhi
Ahmed Hassan Awadallah
KELM
88
14
0
06 Apr 2022
Recent Advances in Neural Text Generation: A Task-Agnostic Survey
Chen Tang
Frank Guerin
Chenghua Lin
AI4CE
OOD
125
19
0
06 Mar 2022
A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models
Hanqing Zhang
Haolin Song
Shaoyu Li
Ming Zhou
Dawei Song
143
230
0
14 Jan 2022
Revisiting Contextual Toxicity Detection in Conversations
Atijit Anuchitanukul
Julia Ive
Lucia Specia
57
15
0
24 Nov 2021
RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing
Hemlata Tak
Madhu R. Kamble
J. Patino
Massimiliano Todisco
Nicholas W. D. Evans
122
114
0
08 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
197
1,101
0
01 Nov 2021
Data Augmentation Approaches in Natural Language Processing: A Survey
Bohan Li
Yutai Hou
Wanxiang Che
219
284
0
05 Oct 2021
Good-Enough Example Extrapolation
Jason W. Wei
60
6
0
12 Sep 2021
GOLD: Improving Out-of-Scope Detection in Dialogues using Data Augmentation
Derek Chen
Zhou Yu
77
31
0
07 Sep 2021
AEDA: An Easier Data Augmentation Technique for Text Classification
Akbar Karimi
L. Rossi
Andrea Prati
95
158
0
30 Aug 2021
A Survey on Data Augmentation for Text Classification
Markus Bayer
M. Kaufhold
Christian A. Reuter
145
355
0
07 Jul 2021
Efficient (Soft) Q-Learning for Text Generation with Limited Good Data
Han Guo
Bowen Tan
Zhengzhong Liu
Eric P. Xing
Zhiting Hu
OffRL
92
35
0
14 Jun 2021
A Survey of Data Augmentation Approaches for NLP
Steven Y. Feng
Varun Gangal
Jason W. Wei
Sarath Chandar
Soroush Vosoughi
Teruko Mitamura
Eduard H. Hovy
AIMat
123
832
0
07 May 2021
Mitigating Political Bias in Language Models Through Reinforced Calibration
Ruibo Liu
Chenyan Jia
Jason W. Wei
Guangxuan Xu
Lili Wang
Soroush Vosoughi
75
99
0
30 Apr 2021
FUDGE: Controlled Text Generation With Future Discriminators
Kevin Kaichuang Yang
Dan Klein
109
338
0
12 Apr 2021
Substructure Substitution: Structured Data Augmentation for NLP
Freda Shi
Karen Livescu
Kevin Gimpel
250
45
0
02 Jan 2021
Enhanced Offensive Language Detection Through Data Augmentation
Ruibo Liu
Guangxuan Xu
Soroush Vosoughi
53
10
0
05 Dec 2020