2106.14448
R-Drop: Regularized Dropout for Neural Networks
28 June 2021
Xiaobo Liang
Lijun Wu
Juntao Li
Yue Wang
Qi Meng
Tao Qin
Wei Chen
M. Zhang
Tie-Yan Liu
Papers citing
"R-Drop: Regularized Dropout for Neural Networks"
47 / 47 papers shown
Title
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
98
0
0
17 Apr 2025
Neuroplasticity in Artificial Intelligence -- An Overview and Inspirations on Drop In & Out Learning
Yupei Li
M. Milling
Björn Schuller
AI4CE
107
0
0
27 Mar 2025
Deterministic Reversible Data Augmentation for Neural Machine Translation
Jiashu Yao
Heyan Huang
Zeming Liu
Yuhang Guo
49
0
0
21 Feb 2025
CR-CTC: Consistency regularization on CTC for improved speech recognition
Zengwei Yao
Wei Kang
Xiaoyu Yang
Fangjun Kuang
Liyong Guo
Han Zhu
Zengrui Jin
Zhaoqing Li
Long Lin
Daniel Povey
53
0
0
17 Feb 2025
Enhancing Retrosynthesis with Conformer: A Template-Free Method
Jiaxi Zhuang
Qian Zhang
Ying Qian
125
0
0
21 Jan 2025
Semi-Supervised Self-Learning Enhanced Music Emotion Recognition
Yifu Sun
Xulong Zhang
Monan Zhou
Wei Li
35
0
0
29 Oct 2024
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization
Yao Ni
Shan Zhang
Piotr Koniusz
140
2
0
25 Sep 2024
Exploring the traditional NMT model and Large Language Model for chat translation
Jinlong Yang
Hengchao Shang
Daimeng Wei
Jiaxin Guo
Zongyao Li
...
Yuhao Xie
Yuanchang Luo
Jiawei Zheng
Bin Wei
Hao Yang
18
0
0
24 Sep 2024
Segmenting Medical Images with Limited Data
Zhaoshan Liu
Qiujie Lv
Chau Hung Lee
L. Shen
54
11
0
12 Jul 2024
Generalization Measures for Zero-Shot Cross-Lingual Transfer
Saksham Bassi
Duygu Ataman
Kyunghyun Cho
29
0
0
24 Apr 2024
Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation
Jingxuan Wei
Linzhuang Sun
Yichong Leng
Xu Tan
Bihui Yu
Ruifeng Guo
45
3
0
23 Apr 2024
Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors
Victoria Graf
Qin Liu
Muhao Chen
AAML
34
8
0
02 Apr 2024
FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation
Kaiyuan Gao
Qizhi Pei
Jinhua Zhu
Kun He
Lijun Wu
34
6
0
29 Mar 2024
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
26
0
0
06 Jan 2024
AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking
Yang Yu
Qi Liu
Kai Zhang
Yuren Zhang
Chao Song
Min Hou
Yuqing Yuan
Zhihao Ye
Zaixin Zhang
Sanshi Lei Yu
30
2
0
15 Oct 2023
Partition-and-Debias: Agnostic Biases Mitigation via A Mixture of Biases-Specific Experts
Jiaxuan Li
D. Vo
Hideki Nakayama
26
3
0
19 Aug 2023
Research on Named Entity Recognition in Improved transformer with R-Drop structure
Weidong Ji
Yousheng Zhang
Guohui Zhou
Xu Wang
26
0
0
14 Jun 2023
From Shortcuts to Triggers: Backdoor Defense with Denoised PoE
Qin Liu
Fei Wang
Chaowei Xiao
Muhao Chen
AAML
31
21
0
24 May 2023
Enhancing Clinical Predictive Modeling through Model Complexity-Driven Class Proportion Tuning for Class Imbalanced Data: An Empirical Study on Opioid Overdose Prediction
Yinan Liu
Xinyu Dong
Weimin Lyu
R. Rosenthal
Rachel Wong
Tengfei Ma
Fusheng Wang
27
0
0
09 May 2023
PAMI: partition input and aggregate outputs for model interpretation
Wei Shi
Wentao Zhang
Weishi Zheng
Ruixuan Wang
FAtt
22
3
0
07 Feb 2023
ZhichunRoad at Amazon KDD Cup 2022: MultiTask Pre-Training for E-Commerce Product Search
Xuange Cui
Wei Xiong
Songlin Wang
27
1
0
31 Jan 2023
Towards Personalized Review Summarization by Modeling Historical Reviews from Customer and Product Separately
Xin Cheng
Shen Gao
Yuchi Zhang
Yongliang Wang
Xiuying Chen
Mingzhe Li
Dongyan Zhao
Rui Yan
18
10
0
27 Jan 2023
Discovering Customer-Service Dialog System with Semi-Supervised Learning and Coarse-to-Fine Intent Detection
Zhitong Yang
Xing Ma
Anqi Liu
Zheyu Zhang
18
1
0
23 Dec 2022
On-the-fly Denoising for Data Augmentation in Natural Language Understanding
Tianqing Fang
Wenxuan Zhou
Fangyu Liu
Hongming Zhang
Yangqiu Song
Muhao Chen
36
1
0
20 Dec 2022
Semi-Supervised Lifelong Language Learning
Ying Zhao
Yinhe Zheng
Yu Bowen
Zhiliang Tian
Dongkyu Lee
Jian Sun
Haiyang Yu
Yongbin Li
N. Zhang
CLL
KELM
32
3
0
23 Nov 2022
Rega-Net: Retina Gabor Attention for Deep Convolutional Neural Networks
Chun Bao
Jie Cao
Yaqian Ning
Yang Cheng
Q. Hao
26
1
0
23 Nov 2022
ConNER: Consistency Training for Cross-lingual Named Entity Recognition
Ran Zhou
Xin Li
Lidong Bing
Erik Cambria
Luo Si
Chun Miao
34
18
0
17 Nov 2022
FF2: A Feature Fusion Two-Stream Framework for Punctuation Restoration
Yangjun Wu
Kebin Fang
Yao Zhao
Hao Zhang
Lifeng Shi
Mengqi Zhang
34
0
0
09 Nov 2022
ROSE: Robust Selective Fine-tuning for Pre-trained Language Models
Lan Jiang
Hao Zhou
Yankai Lin
Peng Li
Jie Zhou
R. Jiang
AAML
27
8
0
18 Oct 2022
AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning
Tao Yang
Jinghao Deng
Xiaojun Quan
Qifan Wang
Shaoliang Nie
28
3
0
12 Oct 2022
Relaxed Attention for Transformer Models
Timo Lohrenz
Björn Möller
Zhengyang Li
Tim Fingscheidt
KELM
24
11
0
20 Sep 2022
UBARv2: Towards Mitigating Exposure Bias in Task-Oriented Dialogs
Yunyi Yang
Hong Ding
Qing Liu
Xiaojun Quan
34
1
0
15 Sep 2022
SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation
Longxuan Ma
Ziyu Zhuang
Weinan Zhang
Mingda Li
Ting Liu
21
4
0
17 Aug 2022
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers
R. Liu
Young Jin Kim
Alexandre Muzio
Hany Awadalla
MoE
42
22
0
28 May 2022
On the Use of BERT for Automated Essay Scoring: Joint Learning of Multi-Scale Essay Representation
Yongjie Wang
Chuan Wang
Ruobing Li
Hui-Ching Lin
14
74
0
08 May 2022
A Survey on Dropout Methods and Experimental Verification in Recommendation
Y. Li
Weizhi Ma
C. L. Philip Chen
M. Zhang
Yiqun Liu
Shaoping Ma
Yue Yang
33
9
0
05 Apr 2022
CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation
Nishant Kambhatla
Logan Born
Anoop Sarkar
6
16
0
01 Apr 2022
RecursiveMix: Mixed Learning with History
Lingfeng Yang
Xiang Li
Borui Zhao
Renjie Song
Jian Yang
VLM
27
18
0
14 Mar 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLM
ObjD
48
850
0
07 Feb 2022
Sharpness-Aware Minimization with Dynamic Reweighting
Wenxuan Zhou
Fangyu Liu
Huan Zhang
Muhao Chen
AAML
19
8
0
16 Dec 2021
Context-guided Triple Matching for Multiple Choice Question Answering
Xun Yao
Junlong Ma
Xinrong Hu
Junping Liu
Jie Yang
Wanqing Li
14
2
0
27 Sep 2021
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation
Pan Xie
Zexian Li
Xiaohui Hu
26
11
0
19 Aug 2021
SEED: Self-supervised Distillation For Visual Representation
Zhiyuan Fang
Jianfeng Wang
Lijuan Wang
Lei Zhang
Yezhou Yang
Zicheng Liu
SSL
236
190
0
12 Jan 2021
AutoDropout: Learning Dropout Patterns to Regularize Deep Networks
Hieu H. Pham
Quoc V. Le
70
56
0
05 Jan 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,956
0
20 Apr 2018
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
285
9,136
0
06 Jun 2015
Improving neural networks by preventing co-adaptation of feature detectors
Geoffrey E. Hinton
Nitish Srivastava
A. Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
VLM
266
7,634
0
03 Jul 2012