arXiv: 1703.02573
Data Noising as Smoothing in Neural Network Language Models
7 March 2017
Ziang Xie
Sida I. Wang
Jiwei Li
Daniel Levy
Allen Nie
Dan Jurafsky
A. Ng
Papers citing
"Data Noising as Smoothing in Neural Network Language Models"
49 / 49 papers shown
Reducing and Exploiting Data Augmentation Noise through Meta Reweighting Contrastive Learning for Text Classification
Guanyi Mou
Yichuan Li
Kyumin Lee
36
3
0
26 Sep 2024
Predictive Dynamic Fusion
Bing Cao
Yinan Xia
Yi Ding
Changqing Zhang
Qinghua Hu
39
9
0
07 Jun 2024
A Comprehensive Survey on Data Augmentation
Zaitian Wang
Pengfei Wang
Kunpeng Liu
Pengyang Wang
Yanjie Fu
Chang-Tien Lu
Charu Aggarwal
Jian Pei
Yuanchun Zhou
ViT
109
22
0
15 May 2024
Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages
Baban Gain
Dibyanayan Bandyopadhyay
Subhabrata Mukherjee
Chandranath Adak
Asif Ekbal
33
2
0
30 Aug 2023
DropDim: A Regularization Method for Transformer Networks
Hao Zhang
Dan Qu
Kejia Shao
Xu Yang
28
12
0
20 Apr 2023
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELM
CLL
35
2
0
24 Oct 2022
Semantically Consistent Data Augmentation for Neural Machine Translation via Conditional Masked Language Model
Qiao Cheng
Jin Huang
Yitao Duan
31
7
0
22 Sep 2022
Selective Text Augmentation with Word Roles for Low-Resource Text Classification
Biyang Guo
Songqiao Han
Hailiang Huang
19
9
0
04 Sep 2022
A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval
Alex Falcon
G. Serra
Oswald Lanz
VGen
42
25
0
03 Aug 2022
TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
Le Zhang
Zichao Yang
Diyi Yang
36
24
0
12 May 2022
EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
Minyi Zhao
Lu Zhang
Yi Xu
Jiandong Ding
Jihong Guan
Shuigeng Zhou
VLM
49
10
0
24 Apr 2022
Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation
Xiangpeng Wei
Heng Yu
Yue Hu
Rongxiang Weng
Weihua Luo
Jun Xie
Rong Jin
CLL
17
24
0
14 Apr 2022
CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation
Nishant Kambhatla
Logan Born
Anoop Sarkar
21
16
0
01 Apr 2022
Syntax-based data augmentation for Hungarian-English machine translation
Attila Nagy
Patrick Nanys
Balázs Frey Konrád
Bence Bial
Judit Ács
14
2
0
18 Jan 2022
Semantic-based Data Augmentation for Math Word Problems
Ai Li
Jiaqing Liang
Yanghua Xiao
AAML
24
7
0
07 Jan 2022
Developing neural machine translation models for Hungarian-English
A. Nagy
32
1
0
07 Nov 2021
GNN-LM: Language Modeling based on Global Contexts via GNN
Yuxian Meng
Shi Zong
Xiaoya Li
Xiaofei Sun
Tianwei Zhang
Fei Wu
Jiwei Li
LRM
24
37
0
17 Oct 2021
Metadata Shaping: Natural Language Annotations for the Tail
Simran Arora
Sen Wu
Enci Liu
Christopher Ré
32
0
0
16 Oct 2021
Data Augmentation Approaches in Natural Language Processing: A Survey
Bohan Li
Yutai Hou
Wanxiang Che
130
271
0
05 Oct 2021
OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts
Shuhe Wang
Yuxian Meng
Xiaoya Li
Xiaofei Sun
Rongbin Ouyang
Jiwei Li
MLLM
VLM
30
21
0
27 Sep 2021
Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach
Víctor M. Sánchez-Cartagena
M. Esplà-Gomis
Juan Antonio Pérez-Ortiz
Felipe Sánchez-Martínez
45
27
0
08 Sep 2021
AEDA: An Easier Data Augmentation Technique for Text Classification
Akbar Karimi
L. Rossi
Andrea Prati
32
151
0
30 Aug 2021
Layer-wise Model Pruning based on Mutual Information
Chun Fan
Jiwei Li
Xiang Ao
Fei Wu
Yuxian Meng
Xiaofei Sun
48
19
0
28 Aug 2021
Influence-guided Data Augmentation for Neural Tensor Completion
Sejoon Oh
Sungchul Kim
Ryan A. Rossi
Srijan Kumar
28
10
0
23 Aug 2021
A Survey on Data Augmentation for Text Classification
Markus Bayer
M. Kaufhold
Christian A. Reuter
36
334
0
07 Jul 2021
An Empirical Survey of Data Augmentation for Limited Data Learning in NLP
Jiaao Chen
Derek Tam
Colin Raffel
Joey Tianyi Zhou
Diyi Yang
28
172
0
14 Jun 2021
Reweighting Augmented Samples by Minimizing the Maximal Expected Loss
Mingyang Yi
Lu Hou
Lifeng Shang
Xin Jiang
Qun Liu
Zhi-Ming Ma
12
19
0
16 Mar 2021
BERT-based Acronym Disambiguation with Multiple Training Strategies
Chunguang Pan
Bingyan Song
Shengguang Wang
Zhipeng Luo
21
18
0
25 Feb 2021
Learning to Augment for Data-Scarce Domain BERT Knowledge Distillation
Lingyun Feng
Minghui Qiu
Yaliang Li
Haitao Zheng
Ying Shen
46
10
0
20 Jan 2021
OpenViDial: A Large-Scale, Open-Domain Dialogue Dataset with Visual Contexts
Yuxian Meng
Shuhe Wang
Qinghong Han
Xiaofei Sun
Fei Wu
Rui Yan
Jiwei Li
27
28
0
30 Dec 2020
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation
Ruibo Liu
Guangxuan Xu
Chenyan Jia
Weicheng Ma
Lili Wang
Soroush Vosoughi
23
107
0
05 Dec 2020
PHICON: Improving Generalization of Clinical Text De-identification Models via Data Augmentation
Xiang Yue
Shuang Zhou
17
10
0
11 Oct 2020
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
Dayiheng Liu
Yeyun Gong
Jie Fu
Yu Yan
Jiusheng Chen
Jiancheng Lv
Nan Duan
M. Zhou
15
37
0
04 Oct 2020
Not Enough Data? Deep Learning to the Rescue!
Ateret Anaby-Tavor
Boaz Carmeli
Esther Goldbraich
Amir Kantor
George Kour
Segev Shlomov
N. Tepper
Naama Zwerdling
16
365
0
08 Nov 2019
BPE-Dropout: Simple and Effective Subword Regularization
Ivan Provilkov
Dmitrii Emelianenko
Elena Voita
38
276
0
29 Oct 2019
Learning Data Manipulation for Augmentation and Weighting
Zhiting Hu
Bowen Tan
Ruslan Salakhutdinov
Tom Michael Mitchell
Eric P. Xing
29
116
0
28 Oct 2019
GraphMix: Improved Training of GNNs for Semi-Supervised Learning
Vikas Verma
Meng Qu
Kenji Kawaguchi
Alex Lamb
Yoshua Bengio
Arno Solin
Jian Tang
33
62
0
25 Sep 2019
Synthetic Data for Deep Learning
Sergey I. Nikolenko
46
348
0
25 Sep 2019
AutoML: A Survey of the State-of-the-Art
Xin He
Kaiyong Zhao
Xiangxiang Chu
20
1,420
0
02 Aug 2019
Soft Contextual Data Augmentation for Neural Machine Translation
Jinhua Zhu
Fei Gao
Lijun Wu
Yingce Xia
Tao Qin
Wen-gang Zhou
Xueqi Cheng
Tie-Yan Liu
19
125
0
25 May 2019
Integrating Semantic Knowledge to Tackle Zero-shot Text Classification
Jingqing Zhang
Piyawat Lertvittayakumjorn
Yike Guo
VLM
33
118
0
29 Mar 2019
Text Data Augmentation Made Simple By Leveraging NLP Cloud APIs
Claude Coulombe
19
117
0
05 Dec 2018
Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training
Yao Wan
Wenqiang Yan
Jianwei Gao
Zhou Zhao
Jian Wu
Philip S. Yu
21
10
0
12 Nov 2018
Retraining-Based Iterative Weight Quantization for Deep Neural Networks
Dongsoo Lee
Byeongwook Kim
MQ
36
16
0
29 May 2018
Denoising Distant Supervision for Relation Extraction via Instance-Level Adversarial Training
Xu Han
Zhiyuan Liu
Maosong Sun
27
16
0
28 May 2018
Token-level and sequence-level loss smoothing for RNN language models
Maha Elbayad
Laurent Besacier
Jakob Verbeek
22
19
0
14 May 2018
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
Taku Kudo
27
1,147
0
29 Apr 2018
Improving Variational Encoder-Decoders in Dialogue Generation
Xiaoyu Shen
Hui Su
Shuzi Niu
Vera Demberg
DRL
32
99
0
06 Feb 2018
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015