ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.00942
  4. Cited By
Not All Samples Are Created Equal: Deep Learning with Importance
  Sampling

Not All Samples Are Created Equal: Deep Learning with Importance Sampling

2 March 2018
Angelos Katharopoulos
F. Fleuret
ArXivPDFHTML

Papers citing "Not All Samples Are Created Equal: Deep Learning with Importance Sampling"

50 / 94 papers shown
Title
Importance Sampling for Nonlinear Models
Importance Sampling for Nonlinear Models
Prakash Palanivelu Rajmohan
Fred Roosta
7
0
0
18 May 2025
Randomized Pairwise Learning with Adaptive Sampling: A PAC-Bayes Analysis
Randomized Pairwise Learning with Adaptive Sampling: A PAC-Bayes Analysis
Sijia Zhou
Yunwen Lei
Ata Kabán
34
0
0
03 Apr 2025
Geometric Median Matching for Robust k-Subset Selection from Noisy Data
Geometric Median Matching for Robust k-Subset Selection from Noisy Data
Anish Acharya
Sujay Sanghavi
Alexandros G. Dimakis
Inderjit S Dhillon
AAML
62
0
0
01 Apr 2025
Geometric Median (GM) Matching for Robust Data Pruning
Geometric Median (GM) Matching for Robust Data Pruning
Anish Acharya
Inderjit S Dhillon
Sujay Sanghavi
AAML
59
0
0
20 Jan 2025
MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation
MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation
S. Joshi
Besmira Nushi
Vidhisha Balachandran
Varun Chandrasekaran
Vibhav Vineet
Neel Joshi
Baharan Mirzasoleiman
MLLM
VLM
49
0
0
07 Jan 2025
Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification
Not All LLM-Generated Data Are Equal: Rethinking Data Weighting in Text Classification
Hsun-Yu Kuo
Yin-Hsiang Liao
Yu-Chieh Chao
Wei-Yun Ma
Pu-Jen Cheng
SyDa
53
3
0
28 Oct 2024
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge
  with Curriculum Preference Learning
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Xiyao Wang
Linfeng Song
Ye Tian
Dian Yu
Baolin Peng
Haitao Mi
Furong Huang
Dong Yu
LRM
55
10
0
09 Oct 2024
Accelerating Deep Learning with Fixed Time Budget
Accelerating Deep Learning with Fixed Time Budget
Muhammad Asif Khan
R. Hamila
Hamid Menouar
28
0
0
03 Oct 2024
Multiple Importance Sampling for Stochastic Gradient Estimation
Multiple Importance Sampling for Stochastic Gradient Estimation
Corentin Salaün
Xingchang Huang
Iliyan Georgiev
Niloy J. Mitra
Gurprit Singh
32
1
0
22 Jul 2024
Wicked Oddities: Selectively Poisoning for Effective Clean-Label
  Backdoor Attacks
Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks
Quang H. Nguyen
Nguyen Ngoc-Hieu
The-Anh Ta
Thanh Nguyen-Tang
Kok-Seng Wong
Hoang Thanh-Tung
Khoa D. Doan
AAML
33
2
0
15 Jul 2024
Diversified Batch Selection for Training Acceleration
Diversified Batch Selection for Training Acceleration
Feng Hong
Yueming Lyu
Jiangchao Yao
Ya Zhang
Ivor W. Tsang
Yanfeng Wang
42
4
0
07 Jun 2024
SAVA: Scalable Learning-Agnostic Data Valuation
SAVA: Scalable Learning-Agnostic Data Valuation
Samuel Kessler
Tam Le
Vu Nguyen
TDI
64
0
0
03 Jun 2024
Rho-1: Not All Tokens Are What You Need
Rho-1: Not All Tokens Are What You Need
Zheng-Wen Lin
Zhibin Gou
Yeyun Gong
Xiao Liu
Yelong Shen
...
Chen Lin
Yujiu Yang
Jian Jiao
Nan Duan
Weizhu Chen
CLL
50
57
0
11 Apr 2024
On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions
On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions
Maximilian Böther
Abraham Sebastian
Pranjal Awasthi
Ana Klimovic
Srikumar Ramalingam
42
0
0
26 Feb 2024
Robust Training of Temporal GNNs using Nearest Neighbours based Hard
  Negatives
Robust Training of Temporal GNNs using Nearest Neighbours based Hard Negatives
Shubham Gupta
Srikanta J. Bedathur
OOD
30
1
0
14 Feb 2024
Towards Improved Proxy-based Deep Metric Learning via Data-Augmented
  Domain Adaptation
Towards Improved Proxy-based Deep Metric Learning via Data-Augmented Domain Adaptation
Li Ren
Chen Chen
Liqiang Wang
Kien Hua
44
7
0
01 Jan 2024
Bad Students Make Great Teachers: Active Learning Accelerates
  Large-Scale Visual Understanding
Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding
Talfan Evans
Shreya Pathak
Hamza Merzic
Jonathan Schwarz
Ryutaro Tanno
Olivier J. Hénaff
20
16
0
08 Dec 2023
Choosing Wisely and Learning Deeply: Selective Cross-Modality
  Distillation via CLIP for Domain Generalization
Choosing Wisely and Learning Deeply: Selective Cross-Modality Distillation via CLIP for Domain Generalization
Jixuan Leng
Yijiang Li
Haohan Wang
VLM
37
0
0
26 Nov 2023
Efficient Trigger Word Insertion
Efficient Trigger Word Insertion
Yueqi Zeng
Ziqiang Li
Pengfei Xia
Lei Liu
Bin Li
AAML
21
5
0
23 Nov 2023
Online Continual Knowledge Learning for Language Models
Online Continual Knowledge Learning for Language Models
Yuhao Wu
Tongjun Shi
Karthick Sharma
Chun Seah
Shuhao Zhang
CLL
KELM
30
4
0
16 Nov 2023
EcoLearn: Optimizing the Carbon Footprint of Federated Learning
EcoLearn: Optimizing the Carbon Footprint of Federated Learning
Talha Mehboob
Noman Bashir
Jesus Omana Iglesias
Michael Zink
David Irwin
33
0
0
27 Oct 2023
No Train No Gain: Revisiting Efficient Training Algorithms For
  Transformer-based Language Models
No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models
Jean Kaddour
Oscar Key
Piotr Nawrot
Pasquale Minervini
Matt J. Kusner
22
41
0
12 Jul 2023
Personalized Privacy Amplification via Importance Sampling
Personalized Privacy Amplification via Importance Sampling
Dominik Fay
Sebastian Mair
Jens Sjölund
63
0
0
05 Jul 2023
AdaSelection: Accelerating Deep Learning Training through Data
  Subsampling
AdaSelection: Accelerating Deep Learning Training through Data Subsampling
Minghe Zhang
Chaosheng Dong
Jinmiao Fu
Tianchen Zhou
Jia Liang
...
Bo Liu
Michinari Momma
Bryan Wang
Yan Gao
Yi Sun
35
3
0
19 Jun 2023
Stochastic Re-weighted Gradient Descent via Distributionally Robust
  Optimization
Stochastic Re-weighted Gradient Descent via Distributionally Robust Optimization
Ramnath Kumar
Kushal Majmundar
Dheeraj M. Nagaraj
A. Suggala
ODL
32
6
0
15 Jun 2023
Sample-Level Weighting for Multi-Task Learning with Auxiliary Tasks
Sample-Level Weighting for Multi-Task Learning with Auxiliary Tasks
Emilie Grégoire
M. H. Chaudhary
Sam Verboven
24
1
0
07 Jun 2023
NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification
  Tasks
NLU on Data Diets: Dynamic Data Subset Selection for NLP Classification Tasks
Jean-Michel Attendu
Jean-Philippe Corbeil
41
15
0
05 Jun 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature
  Review
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
30
41
0
07 Apr 2023
GAS: A Gaussian Mixture Distribution-Based Adaptive Sampling Method for
  PINNs
GAS: A Gaussian Mixture Distribution-Based Adaptive Sampling Method for PINNs
Yuling Jiao
Dingwei Li
Xiliang Lu
J. Yang
Cheng Yuan
34
9
0
28 Mar 2023
A Survey on Efficient Training of Transformers
A Survey on Efficient Training of Transformers
Bohan Zhuang
Jing Liu
Zizheng Pan
Haoyu He
Yuetian Weng
Chunhua Shen
31
47
0
02 Feb 2023
Leveraging Importance Weights in Subset Selection
Leveraging Importance Weights in Subset Selection
Gui Citovsky
Giulia DeSalvo
Sanjiv Kumar
Srikumar Ramalingam
Afshin Rostamizadeh
Yunjuan Wang
40
3
0
28 Jan 2023
Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks
Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks
Jimmy Z. Di
Jack Douglas
Jayadev Acharya
Gautam Kamath
Ayush Sekhari
MU
32
44
0
21 Dec 2022
Man-recon: manifold learning for reconstruction with deep autoencoder
  for smart seismic interpretation
Man-recon: manifold learning for reconstruction with deep autoencoder for smart seismic interpretation
Ahmad Mustafa
Ghassan AlRegib
23
13
0
15 Dec 2022
Peeling the Onion: Hierarchical Reduction of Data Redundancy for
  Efficient Vision Transformer Training
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
Zhenglun Kong
Haoyu Ma
Geng Yuan
Mengshu Sun
Yanyue Xie
...
Tianlong Chen
Xiaolong Ma
Xiaohui Xie
Zhangyang Wang
Yanzhi Wang
ViT
34
22
0
19 Nov 2022
Informative Sample-Aware Proxy for Deep Metric Learning
Informative Sample-Aware Proxy for Deep Metric Learning
Aoyu Li
Ikuro Sato
Kohta Ishikawa
Rei Kawakami
Rio Yokota
24
1
0
18 Nov 2022
Simulation-Based Parallel Training
Simulation-Based Parallel Training
Lucas Meyer
Alejandro Ribés
Bruno Raffin
AI4CE
31
2
0
08 Nov 2022
Enhancing Efficiency in Multidevice Federated Learning through Data Selection
Enhancing Efficiency in Multidevice Federated Learning through Data Selection
Fan Mo
Mohammad Malekzadeh
S. Chatterjee
F. Kawsar
Akhil Mathur
FedML
38
2
0
08 Nov 2022
Client Selection in Federated Learning: Principles, Challenges, and
  Opportunities
Client Selection in Federated Learning: Principles, Challenges, and Opportunities
Lei Fu
Huan Zhang
Ge Gao
Mi Zhang
Xin Liu
FedML
37
118
0
03 Nov 2022
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval
Kun Zhou
Yeyun Gong
Xiao Liu
Wayne Xin Zhao
Yelong Shen
...
Jing Lu
Rangan Majumder
Ji-Rong Wen
Nan Duan
Weizhu Chen
42
33
0
21 Oct 2022
DPIS: An Enhanced Mechanism for Differentially Private SGD with
  Importance Sampling
DPIS: An Enhanced Mechanism for Differentially Private SGD with Importance Sampling
Jianxin Wei
Ergute Bao
X. Xiao
Yifan Yang
46
20
0
18 Oct 2022
Data-Efficient Augmentation for Training Neural Networks
Data-Efficient Augmentation for Training Neural Networks
Tian Yu Liu
Baharan Mirzasoleiman
29
7
0
15 Oct 2022
ISFL: Federated Learning for Non-i.i.d. Data with Local Importance
  Sampling
ISFL: Federated Learning for Non-i.i.d. Data with Local Importance Sampling
Zheqi Zhu
Yuchen Shi
Pingyi Fan
Chenghui Peng
Khaled B. Letaief
FedML
25
8
0
05 Oct 2022
Stop Wasting My Time! Saving Days of ImageNet and BERT Training with
  Latest Weight Averaging
Stop Wasting My Time! Saving Days of ImageNet and BERT Training with Latest Weight Averaging
Jean Kaddour
MoMe
3DH
24
39
0
29 Sep 2022
Metadata Archaeology: Unearthing Data Subsets by Leveraging Training
  Dynamics
Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics
Shoaib Ahmed Siddiqui
Nitarshan Rajkumar
Tegan Maharaj
David M. Krueger
Sara Hooker
47
27
0
20 Sep 2022
Information FOMO: The unhealthy fear of missing out on information. A
  method for removing misleading data for healthier models
Information FOMO: The unhealthy fear of missing out on information. A method for removing misleading data for healthier models
Ethan Pickering
T. Sapsis
24
6
0
27 Aug 2022
Fast Heterogeneous Federated Learning with Hybrid Client Selection
Fast Heterogeneous Federated Learning with Hybrid Client Selection
Guangyuan Shen
D. Gao
Duanxiao Song
Libin Yang
Xukai Zhou
Shirui Pan
W. Lou
Fang Zhou
FedML
45
12
0
10 Aug 2022
Efficient NLP Model Finetuning via Multistage Data Filtering
Efficient NLP Model Finetuning via Multistage Data Filtering
Ouyang Xu
S. Ansari
F. Lin
Yangfeng Ji
35
2
0
28 Jul 2022
Rank-based Decomposable Losses in Machine Learning: A Survey
Rank-based Decomposable Losses in Machine Learning: A Survey
Shu Hu
Xin Wang
Siwei Lyu
38
32
0
18 Jul 2022
Prioritized Training on Points that are Learnable, Worth Learning, and
  Not Yet Learnt
Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet Learnt
Sören Mindermann
J. Brauner
Muhammed Razzak
Mrinank Sharma
Andreas Kirsch
...
Benedikt Höltgen
Aidan Gomez
Adrien Morisot
Sebastian Farquhar
Y. Gal
62
149
0
14 Jun 2022
The Environmental Discontinuity Hypothesis for Down-Sampled Lexicase
  Selection
The Environmental Discontinuity Hypothesis for Down-Sampled Lexicase Selection
Ryan Boldi
Thomas Helmuth
Lee Spector
35
5
0
31 May 2022
12
Next