Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.03013
Cited By
SemiReward: A General Reward Model for Semi-supervised Learning
4 October 2023
Siyuan Li
Weiyang Jin
Zedong Wang
Fang Wu
Zicheng Liu
Cheng Tan
Stan Z. Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SemiReward: A General Reward Model for Semi-supervised Learning"
13 / 13 papers shown
Title
SUMix: Mixup with Semantic and Uncertain Information
Huafeng Qin
Xin Jin
Hongyu Zhu
Hongchao Liao
M. El-Yacoubi
Xinbo Gao
UQCV
31
5
0
10 Jul 2024
Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning
Cheng Tan
Jingxuan Wei
Linzhuang Sun
Zhangyang Gao
Siyuan Li
Bihui Yu
Ruifeng Guo
Stan Z. Li
ReLM
LRM
3DV
66
6
0
31 May 2024
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data
Zifan Song
Yudong Wang
Wenwei Zhang
Kuikun Liu
Chengqi Lyu
...
Qipeng Guo
Hang Yan
Dahua Lin
Kai-xiang Chen
Cairong Zhao
SyDa
41
2
0
29 May 2024
LiqD: A Dynamic Liquid Level Detection Model under Tricky Small Containers
Yukun Ma
Zikun Mao
16
0
0
13 Mar 2024
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Siyuan Li
Zicheng Liu
Juanxi Tian
Ge Wang
Zedong Wang
...
Cheng Tan
Tao Lin
Yang Liu
Baigui Sun
Stan Z. Li
30
6
0
14 Feb 2024
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning
Yidong Wang
Hao Chen
Qiang Heng
Wenxin Hou
Yue Fan
...
Marios Savvides
T. Shinozaki
Bhiksha Raj
Bernt Schiele
Xing Xie
185
258
0
15 May 2022
Harnessing Hard Mixed Samples with Decoupled Regularizer
Zicheng Liu
Siyuan Li
Ge Wang
Cheng Tan
Lirong Wu
Stan Z. Li
56
18
0
21 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,953
0
04 Mar 2022
FlexMatch: Boosting Semi-Supervised Learning with Curriculum Pseudo Labeling
Bowen Zhang
Yidong Wang
Wenxin Hou
Hao Wu
Jindong Wang
Manabu Okumura
T. Shinozaki
AAML
234
862
0
15 Oct 2021
Co-learning: Learning from Noisy Labels with Self-supervision
Cheng Tan
Jun-Xiong Xia
Lirong Wu
Stan Z. Li
NoLa
73
116
0
05 Aug 2021
Localization Distillation for Dense Object Detection
Zhaohui Zheng
Rongguang Ye
Ping Wang
Dongwei Ren
W. Zuo
Qibin Hou
Ming-Ming Cheng
ObjD
98
115
0
24 Feb 2021
Meta Pseudo Labels
Hieu H. Pham
Zihang Dai
Qizhe Xie
Minh-Thang Luong
Quoc V. Le
VLM
253
656
0
23 Mar 2020
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results
Antti Tarvainen
Harri Valpola
OOD
MoMe
261
1,275
0
06 Mar 2017
1