Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.09039
Cited By
Sequence-Level Mixed Sample Data Augmentation
18 November 2020
Demi Guo
Yoon Kim
Alexander M. Rush
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sequence-Level Mixed Sample Data Augmentation"
50 / 67 papers shown
Title
Learning to Substitute Components for Compositional Generalization
ZeLin Li
Gangwei Jiang
Chenwang Wu
Ying Wei
Defu Lian
Enhong Chen
62
0
0
28 Feb 2025
TARDiS : Text Augmentation for Refining Diversity and Separability
Kyungmin Kim
Sanghun Im
Gibaeg Kim
Heung-Seon Oh
VLM
34
0
0
06 Jan 2025
Compositional Generalization Across Distributional Shifts with Sparse Tree Operations
Paul Soulos
Henry Conklin
Mattia Opper
P. Smolensky
Jianfeng Gao
Roland Fernandez
73
4
0
18 Dec 2024
Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models
Weiqiu You
Youngja Park
71
0
0
27 Nov 2024
Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Zhengkai Lin
Z. Fu
Kai Liu
Liang Xie
Binbin Lin
Wenxiao Wang
D. Cai
Yue Wu
Jieping Ye
LRM
25
3
0
24 Oct 2024
The Effects of Hallucinations in Synthetic Training Data for Relation Extraction
Steven Rogulsky
Nicholas Popovic
Michael Färber
HILM
32
1
0
10 Oct 2024
SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Yuxin Xiao
Shujian Zhang
Wenxuan Zhou
Marzyeh Ghassemi
Sanqiang Zhao
109
0
0
07 Oct 2024
InstaSynth: Opportunities and Challenges in Generating Synthetic Instagram Data with ChatGPT for Sponsored Content Detection
T. Bertaglia
Lily Heisig
Rishabh Kaushal
Adriana Iamnitchi
25
4
0
22 Mar 2024
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
Changyu Chen
Xiting Wang
Ting-En Lin
Ang Lv
Yuchuan Wu
Xin Gao
Ji-Rong Wen
Rui Yan
Yongbin Li
ReLM
LRM
28
9
0
04 Mar 2024
Enhancing Protein Predictive Models via Proteins Data Augmentation: A Benchmark and New Directions
Rui Sun
Lirong Wu
Haitao Lin
Yufei Huang
Stan Z. Li
33
1
0
01 Mar 2024
Retrieval-Augmented Data Augmentation for Low-Resource Domain Tasks
Minju Seo
Jinheon Baek
James Thorne
Sung Ju Hwang
RALM
37
9
0
21 Feb 2024
Non-Fluent Synthetic Target-Language Data Improve Neural Machine Translation
Víctor M. Sánchez-Cartagena
Miquel Espla-Gomis
J. A. Pérez-Ortiz
F. Sánchez-Martínez
35
4
0
29 Jan 2024
IndiText Boost: Text Augmentation for Low Resource India Languages
Onkar Litake
Niraj Yagnik
S. Labhsetwar
VLM
26
3
0
23 Jan 2024
RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training
Jaehyung Kim
Yuning Mao
Rui Hou
Hanchao Yu
Davis Liang
Pascale Fung
Qifan Wang
Fuli Feng
Lifu Huang
Madian Khabsa
AAML
23
2
0
07 Dec 2023
SegMix: A Simple Structure-Aware Data Augmentation Method
Yuxin Pei
Pushkar Bhuse
Zhengzhong Liu
Eric P. Xing
18
1
0
16 Nov 2023
TreeSwap: Data Augmentation for Machine Translation via Dependency Subtree Swapping
Attila Nagy
Dorina Lakatos
Botond Barta
Judit Ács
24
1
0
04 Nov 2023
MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
Arkil Patel
S. Bhattamishra
Siva Reddy
Dzmitry Bahdanau
34
5
0
18 Oct 2023
Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation
Charles OÑeill
Y. Ting 丁
I. Ciucă
Jack Miller
Thang Bui
SyDa
37
1
0
15 Aug 2023
Layer-wise Representation Fusion for Compositional Generalization
Yafang Zheng
Lei Lin
Shantao Liu
Binling Wang
Zhaohong Lai
Wenhao Rao
Biao Fu
Yidong Chen
Xiaodon Shi
AI4CE
43
2
0
20 Jul 2023
Data Augmentation for Machine Translation via Dependency Subtree Swapping
Attila Nagy
Dorina Lakatos
Botond Barta
Patrick Nanys
Judit Ács
31
1
0
13 Jul 2023
Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions
John Joon Young Chung
Ece Kamar
Saleema Amershi
ALM
34
109
0
07 Jun 2023
Learning to Substitute Spans towards Improving Compositional Generalization
Zhaoyi Li
Ying Wei
Defu Lian
10
9
0
05 Jun 2023
GDA: Generative Data Augmentation Techniques for Relation Extraction Tasks
Xuming Hu
Aiwei Liu
Zeqi Tan
Xin Zhang
Chenwei Zhang
Irwin King
Philip S. Yu
44
16
0
26 May 2023
Weakly Supervised Vision-and-Language Pre-training with Relative Representations
Chi Chen
Peng Li
Maosong Sun
Yang Liu
22
1
0
24 May 2023
Generating Data for Symbolic Language with Large Language Models
Jiacheng Ye
Chengzu Li
Lingpeng Kong
Tao Yu
33
10
0
23 May 2023
Understanding Compositional Data Augmentation in Typologically Diverse Morphological Inflection
Farhan Samir
Miikka Silfverberg
22
3
0
23 May 2023
Learning to Compose Representations of Different Encoder Layers towards Improving Compositional Generalization
Lei Lin
Shuangtao Li
Yafang Zheng
Biao Fu
Shantao Liu
Yidong Chen
Xiaodon Shi
CoGe
25
2
0
20 May 2023
Adversarial Word Dilution as Text Data Augmentation in Low-Resource Regime
J. Chen
Richong Zhang
Zheyan Luo
Chunming Hu
Yongyi Mao
14
4
0
16 May 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
30
41
0
07 Apr 2023
Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection
Rabiul Awal
Roy Ka-Wei Lee
Eshaan Tanwar
Tanmay Garg
Tanmoy Chakraborty
24
26
0
04 Mar 2023
On Robustness of Prompt-based Semantic Parsing with Large Pre-trained Language Model: An Empirical Study on Codex
Terry Yue Zhuo
Zhuang Li
Yujin Huang
Fatemeh Shiri
Weiqing Wang
Gholamreza Haffari
Yuan-Fang Li
AAML
28
54
0
30 Jan 2023
A Survey of Mix-based Data Augmentation: Taxonomy, Methods, Applications, and Explainability
Chengtai Cao
Fan Zhou
Yurou Dai
Jianping Wang
Kunpeng Zhang
AAML
24
28
0
21 Dec 2022
SOLD: Sinhala Offensive Language Dataset
Tharindu Ranasinghe
Isuri Anuradha
Damith Premasiri
Kanishka Silva
Hansi Hettiarachchi
Lasitha Uyangodage
Marcos Zampieri
33
8
0
01 Dec 2022
A Short Survey of Systematic Generalization
Yuanpeng Li
AI4CE
38
1
0
22 Nov 2022
Categorizing Semantic Representations for Neural Machine Translation
Yongjing Yin
Yafu Li
Fandong Meng
Jie Zhou
Yue Zhang
24
6
0
13 Oct 2022
DATScore: Evaluating Translation with Data Augmented Translations
Moussa Kamal Eddine
Guokan Shang
Michalis Vazirgiannis
44
5
0
12 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Ryan Cotterell
Zhijing Jin
114
93
0
06 Oct 2022
MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation
Kshitij Gupta
VLM
LRM
6
2
0
01 Oct 2022
DoubleMix: Simple Interpolation-Based Data Augmentation for Text Classification
Hui Chen
Wei Han
Diyi Yang
Soujanya Poria
15
12
0
12 Sep 2022
GeoECG: Data Augmentation via Wasserstein Geodesic Perturbation for Robust Electrocardiogram Prediction
Jiacheng Zhu
Jielin Qiu
Zhuolin Yang
Douglas Weber
M. Rosenberg
Emerson Liu
Bo-wen Li
Ding Zhao
OOD
28
13
0
02 Aug 2022
Building Korean Sign Language Augmentation (KoSLA) Corpus with Data Augmentation Technique
Changnam An
Eunkyung Han
Dongmyeong Noh
O. Kwon
Sumi Lee
H. Han
SLR
11
1
0
12 Jul 2022
TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
Le Zhang
Zichao Yang
Diyi Yang
36
24
0
12 May 2022
AdMix: A Mixed Sample Data Augmentation Method for Neural Machine Translation
Chang-Hu Jin
Shigui Qiu
Nini Xiao
Hao Jia
6
7
0
10 May 2022
BLISS: Robust Sequence-to-Sequence Learning via Self-Supervised Input Representation
Zheng-Wei Zhang
Liang Ding
Dazhao Cheng
Xuebo Liu
Min Zhang
Dacheng Tao
24
11
0
16 Apr 2022
CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation
Nishant Kambhatla
Logan Born
Anoop Sarkar
21
16
0
01 Apr 2022
Structurally Diverse Sampling for Sample-Efficient Training and Comprehensive Evaluation
Shivanshu Gupta
Sameer Singh
Matt Gardner
49
7
0
16 Mar 2022
Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation
Yong Cheng
Ankur Bapna
Orhan Firat
Yuan Cao
Pidong Wang
Wolfgang Macherey
23
13
0
15 Mar 2022
Revisiting the Compositional Generalization Abilities of Neural Sequence Models
Arkil Patel
S. Bhattamishra
Phil Blunsom
Navin Goyal
BDL
CoGe
26
32
0
14 Mar 2022
Grammar-Based Grounded Lexicon Learning
Jiayuan Mao
Haoyue Shi
Jiajun Wu
R. Levy
J. Tenenbaum
NAI
16
14
0
17 Feb 2022
Data Augmentation for Mental Health Classification on Social Media
Gunjan Ansari
Muskan Garg
Chandni Saxena
36
19
0
19 Dec 2021
1
2
Next