Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.02418
Cited By
Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data
5 December 2023
Yu Yang
Aaditya K. Singh
Mostafa Elhoushi
Anas Mahmoud
Kushal Tirumala
Fabian Gloeckle
Baptiste Rozière
Carole-Jean Wu
Ari S. Morcos
Newsha Ardalani
AAML
SyDa
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data"
1 / 1 papers shown
Title
MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation
S. Joshi
Besmira Nushi
Vidhisha Balachandran
Varun Chandrasekaran
Vibhav Vineet
Neel Joshi
Baharan Mirzasoleiman
MLLM
VLM
49
0
0
07 Jan 2025
1