1803.00942
Cited By
Not All Samples Are Created Equal: Deep Learning with Importance Sampling
2 March 2018
Angelos Katharopoulos
François Fleuret
Papers citing "Not All Samples Are Created Equal: Deep Learning with Importance Sampling"
50 / 106 papers shown
Title
Self-Evolving Curriculum for LLM Reasoning
Xiaoyin Chen
Jiarui Lu
Minsu Kim
Dinghuai Zhang
Jian Tang
Alexandre Piché
Nicolas Angelard-Gontier
Yoshua Bengio
Ehsan Kamalloo
ReLM
LRM
25
0
0
20 May 2025
Importance Sampling for Nonlinear Models
Prakash Palanivelu Rajmohan
Fred Roosta
12
0
0
18 May 2025
Randomized Pairwise Learning with Adaptive Sampling: A PAC-Bayes Analysis
Sijia Zhou
Yunwen Lei
Ata Kabán
34
0
0
03 Apr 2025
Geometric Median Matching for Robust k-Subset Selection from Noisy Data
Anish Acharya
Sujay Sanghavi
Alexandros G. Dimakis
Inderjit S Dhillon
AAML
62
0
0
01 Apr 2025
Geometric Median (GM) Matching for Robust Data Pruning
Anish Acharya
Inderjit S Dhillon
Sujay Sanghavi
AAML
59
0
0
20 Jan 2025
MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation
S. Joshi
Besmira Nushi
Vidhisha Balachandran
Varun Chandrasekaran
Vibhav Vineet
Neel Joshi
Baharan Mirzasoleiman
MLLM
VLM
49
0
0
07 Jan 2025
Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification
Hsun-Yu Kuo
Yin-Hsiang Liao
Yu-Chieh Chao
Wei-Yun Ma
Pu-Jen Cheng
SyDa
56
3
0
28 Oct 2024
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Xiyao Wang
Linfeng Song
Ye Tian
Dian Yu
Baolin Peng
Haitao Mi
Furong Huang
Dong Yu
LRM
57
11
0
09 Oct 2024
Accelerating Deep Learning with Fixed Time Budget
Muhammad Asif Khan
R. Hamila
Hamid Menouar
28
0
0
03 Oct 2024
Fair Class-Incremental Learning using Sample Weighting
Jaeyoung Park
Minsu Kim
Steven Euijong Whang
38
0
0
02 Oct 2024
Multiple Importance Sampling for Stochastic Gradient Estimation
Corentin Salaün
Xingchang Huang
Iliyan Georgiev
Niloy J. Mitra
Gurprit Singh
32
1
0
22 Jul 2024
Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks
Quang H. Nguyen
Nguyen Ngoc-Hieu
The-Anh Ta
Thanh Nguyen-Tang
Kok-Seng Wong
Hoang Thanh-Tung
Khoa D. Doan
AAML
33
2
0
15 Jul 2024
Diversified Batch Selection for Training Acceleration
Feng Hong
Yueming Lyu
Jiangchao Yao
Ya Zhang
Ivor W. Tsang
Yanfeng Wang
42
4
0
07 Jun 2024
SAVA: Scalable Learning-Agnostic Data Valuation
Samuel Kessler
Tam Le
Vu Nguyen
TDI
66
0
0
03 Jun 2024
Adaptive and Parallel Split Federated Learning in Vehicular Edge Computing
Xianke Qiang
Zheng Chang
Yun Hu
Lei Liu
Timo Hämäläinen
30
2
0
29 May 2024
Rho-1: Not All Tokens Are What You Need
Zheng-Wen Lin
Zhibin Gou
Yeyun Gong
Xiao Liu
Yelong Shen
...
Chen Lin
Yujiu Yang
Jian Jiao
Nan Duan
Weizhu Chen
CLL
50
57
0
11 Apr 2024
On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions
Maximilian Böther
Abraham Sebastian
Pranjal Awasthi
Ana Klimovic
Srikumar Ramalingam
42
0
0
26 Feb 2024
Robust Training of Temporal GNNs using Nearest Neighbours based Hard Negatives
Shubham Gupta
Srikanta J. Bedathur
OOD
30
1
0
14 Feb 2024
Towards Improved Proxy-based Deep Metric Learning via Data-Augmented Domain Adaptation
Li Ren
Chen Chen
Liqiang Wang
Kien Hua
46
7
0
01 Jan 2024
Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding
Talfan Evans
Shreya Pathak
Hamza Merzic
Jonathan Schwarz
Ryutaro Tanno
Olivier J. Hénaff
23
16
0
08 Dec 2023
Choosing Wisely and Learning Deeply: Selective Cross-Modality Distillation via CLIP for Domain Generalization
Jixuan Leng
Yijiang Li
Haohan Wang
VLM
37
0
0
26 Nov 2023
Efficient Trigger Word Insertion
Yueqi Zeng
Ziqiang Li
Pengfei Xia
Lei Liu
Bin Li
AAML
21
5
0
23 Nov 2023
Online Continual Knowledge Learning for Language Models
Yuhao Wu
Tongjun Shi
Karthick Sharma
Chun Seah
Shuhao Zhang
CLL
KELM
33
4
0
16 Nov 2023
EcoLearn: Optimizing the Carbon Footprint of Federated Learning
Talha Mehboob
Noman Bashir
Jesus Omana Iglesias
Michael Zink
David Irwin
33
0
0
27 Oct 2023
No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models
Jean Kaddour
Oscar Key
Piotr Nawrot
Pasquale Minervini
Matt J. Kusner
32
41
0
12 Jul 2023
Personalized Privacy Amplification via Importance Sampling
Dominik Fay
Sebastian Mair
Jens Sjölund
68
0
0
05 Jul 2023
AdaSelection: Accelerating Deep Learning Training through Data Subsampling
Minghe Zhang
Chaosheng Dong
Jinmiao Fu
Tianchen Zhou
Jia Liang
...
Bo Liu
Michinari Momma
Bryan Wang
Yan Gao
Yi Sun
37
3
0
19 Jun 2023
Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization
Ramnath Kumar
Kushal Majmundar
Dheeraj M. Nagaraj
A. Suggala
ODL
34
6
0
15 Jun 2023
Sample-Level Weighting for Multi-Task Learning with Auxiliary Tasks
Emilie Grégoire
M. H. Chaudhary
Sam Verboven
26
1
0
07 Jun 2023
NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification Tasks
Jean-Michel Attendu
Jean-Philippe Corbeil
41
15
0
05 Jun 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
30
41
0
07 Apr 2023
GAS: A Gaussian Mixture Distribution-Based Adaptive Sampling Method for PINNs
Yuling Jiao
Dingwei Li
Xiliang Lu
J. Yang
Cheng Yuan
36
9
0
28 Mar 2023
Data Selection for Language Models via Importance Resampling
Sang Michael Xie
Shibani Santurkar
Tengyu Ma
Percy Liang
46
173
0
06 Feb 2023
A Survey on Efficient Training of Transformers
Bohan Zhuang
Jing Liu
Zizheng Pan
Haoyu He
Yuetian Weng
Chunhua Shen
31
47
0
02 Feb 2023
Leveraging Importance Weights in Subset Selection
Gui Citovsky
Giulia DeSalvo
Sanjiv Kumar
Srikumar Ramalingam
Afshin Rostamizadeh
Yunjuan Wang
40
3
0
28 Jan 2023
Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks
Jimmy Z. Di
Jack Douglas
Jayadev Acharya
Gautam Kamath
Ayush Sekhari
MU
32
44
0
21 Dec 2022
Man-recon: manifold learning for reconstruction with deep autoencoder for smart seismic interpretation
Ahmad Mustafa
Ghassan AlRegib
29
13
0
15 Dec 2022
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Zhenglun Kong
Haoyu Ma
Geng Yuan
Mengshu Sun
Yanyue Xie
...
Tianlong Chen
Xiaolong Ma
Xiaohui Xie
Zhangyang Wang
Yanzhi Wang
ViT
34
22
0
19 Nov 2022
Informative Sample-Aware Proxy for Deep Metric Learning
Aoyu Li
Ikuro Sato
Kohta Ishikawa
Rei Kawakami
Rio Yokota
24
1
0
18 Nov 2022
Simulation-Based Parallel Training
Lucas Meyer
Alejandro Ribés
Bruno Raffin
AI4CE
33
2
0
08 Nov 2022
Enhancing Efficiency in Multidevice Federated Learning through Data Selection
Fan Mo
Mohammad Malekzadeh
S. Chatterjee
F. Kawsar
Akhil Mathur
FedML
40
2
0
08 Nov 2022
Client Selection in Federated Learning: Principles, Challenges, and Opportunities
Lei Fu
Huan Zhang
Ge Gao
Mi Zhang
Xin Liu
FedML
39
118
0
03 Nov 2022
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval
Kun Zhou
Yeyun Gong
Xiao Liu
Wayne Xin Zhao
Yelong Shen
...
Jing Lu
Rangan Majumder
Ji-Rong Wen
Nan Duan
Weizhu Chen
44
33
0
21 Oct 2022
DPIS: An Enhanced Mechanism for Differentially Private SGD with Importance Sampling
Jianxin Wei
Ergute Bao
X. Xiao
Yifan Yang
46
20
0
18 Oct 2022
Data-Efficient Augmentation for Training Neural Networks
Tian Yu Liu
Baharan Mirzasoleiman
32
7
0
15 Oct 2022
ISFL: Federated Learning for Non-i.i.d. Data with Local Importance Sampling
Zheqi Zhu
Yuchen Shi
Pingyi Fan
Chenghui Peng
Khaled B. Letaief
FedML
25
8
0
05 Oct 2022
Stop Wasting My Time! Saving Days of ImageNet and BERT Training with Latest Weight Averaging
Jean Kaddour
MoMe
3DH
24
40
0
29 Sep 2022
Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics
Shoaib Ahmed Siddiqui
Nitarshan Rajkumar
Tegan Maharaj
David M. Krueger
Sara Hooker
55
27
0
20 Sep 2022
Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models
Ethan Pickering
T. Sapsis
24
6
0
27 Aug 2022
Fast Heterogeneous Federated Learning with Hybrid Client Selection
Guangyuan Shen
D. Gao
Duanxiao Song
Libin Yang
Xukai Zhou
Shirui Pan
W. Lou
Fang Zhou
FedML
45
12
0
10 Aug 2022