ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.06439
  4. Cited By
What can Data-Centric AI Learn from Data and ML Engineering?

What can Data-Centric AI Learn from Data and ML Engineering?

13 December 2021
N. Polyzotis
Matei A. Zaharia
    AI4CE
ArXivPDFHTML

Papers citing "What can Data-Centric AI Learn from Data and ML Engineering?"

24 / 24 papers shown
Title
Minimizing Risk Through Minimizing Model-Data Interaction: A Protocol For Relying on Proxy Tasks When Designing Child Sexual Abuse Imagery Detection Models
Minimizing Risk Through Minimizing Model-Data Interaction: A Protocol For Relying on Proxy Tasks When Designing Child Sexual Abuse Imagery Detection Models
Thamiris Coelho
Leo S. F. Ribeiro
João Macedo
J. A. dos Santos
Sandra Avila
29
0
0
10 May 2025
Data Acquisition for Improving Model Fairness using Reinforcement
  Learning
Data Acquisition for Improving Model Fairness using Reinforcement Learning
Jahid Hasan
Romila Pradhan
57
0
0
04 Dec 2024
Survey and Taxonomy: The Role of Data-Centric AI in Transformer-Based
  Time Series Forecasting
Survey and Taxonomy: The Role of Data-Centric AI in Transformer-Based Time Series Forecasting
Jingjing Xu
Caesar Wu
Yuan-Fang Li
Grégoire Danoy
Pascal Bouvry
AI4TS
45
1
0
29 Jul 2024
Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation
  in low-data regimes
Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes
Nabeel Seedat
Nicolas Huynh
B. V. Breugel
M. Schaar
35
25
0
19 Dec 2023
Better, Not Just More: Data-Centric Machine Learning for Earth Observation
Better, Not Just More: Data-Centric Machine Learning for Earth Observation
R. Roscher
M. Rußwurm
Caroline Gevaert
Michael C. Kampffmeyer
J. A. dos Santos
...
Ronny Hansch
Stine Hansen
Keiller Nogueira
Jonathan Prexl
D. Tuia
39
10
0
08 Dec 2023
Trust, Accountability, and Autonomy in Knowledge Graph-based AI for
  Self-determination
Trust, Accountability, and Autonomy in Knowledge Graph-based AI for Self-determination
Luis-Daniel Ibánez
J. Domingue
Sabrina Kirrane
Oshani Seneviratne
Aisling Third
Maria-Esther Vidal
28
2
0
30 Oct 2023
TRIAGE: Characterizing and auditing training data for improved
  regression
TRIAGE: Characterizing and auditing training data for improved regression
Nabeel Seedat
Jonathan Crabbé
Zhaozhi Qian
M. Schaar
26
5
0
29 Oct 2023
Can You Rely on Your Model Evaluation? Improving Model Evaluation with
  Synthetic Test Data
Can You Rely on Your Model Evaluation? Improving Model Evaluation with Synthetic Test Data
B. V. Breugel
Nabeel Seedat
F. Imrie
M. Schaar
SyDa
31
20
0
25 Oct 2023
Dataset Factory: A Toolchain For Generative Computer Vision Datasets
Dataset Factory: A Toolchain For Generative Computer Vision Datasets
Daniel Kharitonov
Ryan Turner
16
1
0
20 Sep 2023
Towards Data-centric Graph Machine Learning: Review and Outlook
Towards Data-centric Graph Machine Learning: Review and Outlook
Xin Zheng
Yixin Liu
Zhifeng Bao
Meng Fang
Xia Hu
Alan Wee-Chung Liew
Shirui Pan
GNN
AI4CE
39
19
0
20 Sep 2023
Synthetic Alone: Exploring the Dark Side of Synthetic Data for
  Grammatical Error Correction
Synthetic Alone: Exploring the Dark Side of Synthetic Data for Grammatical Error Correction
Chanjun Park
Seonmin Koo
Seolhwa Lee
Jaehyung Seo
Sugyeong Eo
Hyeonseok Moon
Heu-Jeoung Lim
42
0
0
26 Jun 2023
GPT Self-Supervision for a Better Data Annotator
GPT Self-Supervision for a Better Data Annotator
Xiaohuan Pei
Yanxi Li
Chang Xu
30
7
0
07 Jun 2023
Transition Role of Entangled Data in Quantum Machine Learning
Transition Role of Entangled Data in Quantum Machine Learning
Xinbiao Wang
Yuxuan Du
Zhuozhuo Tu
Yong Luo
Xiao Yuan
Dacheng Tao
53
8
0
06 Jun 2023
Dynamic Datasets and Market Environments for Financial Reinforcement
  Learning
Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Xiao-Yang Liu
Ziyi Xia
Hongyang Yang
Jiechao Gao
Daochen Zha
Ming Zhu
Chris Wang
Zhaoran Wang
Jian Guo
OffRL
32
27
0
25 Apr 2023
Data-centric Artificial Intelligence: A Survey
Data-centric Artificial Intelligence: A Survey
Daochen Zha
Zaid Pervaiz Bhat
Kwei-Herng Lai
Fan Yang
Zhimeng Jiang
Shaochen Zhong
Xia Hu
27
193
0
17 Mar 2023
Learning to Select Pivotal Samples for Meta Re-weighting
Learning to Select Pivotal Samples for Meta Re-weighting
Yinjun Wu
Adam Stein
Jacob R. Gardner
Mayur Naik
29
0
0
09 Feb 2023
Data-centric AI: Perspectives and Challenges
Data-centric AI: Perspectives and Challenges
Daochen Zha
Zaid Pervaiz Bhat
Kwei-Herng Lai
Fan Yang
Xia Hu
25
68
0
12 Jan 2023
DMOps: Data Management Operation and Recipes
DMOps: Data Management Operation and Recipes
E. Choi
Chanjun Park
29
7
0
02 Jan 2023
The Principles of Data-Centric AI (DCAI)
The Principles of Data-Centric AI (DCAI)
M. H. Jarrahi
Ali Memariani
Shion Guha
24
55
0
26 Nov 2022
DC-Check: A Data-Centric AI checklist to guide the development of
  reliable machine learning systems
DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
Nabeel Seedat
F. Imrie
M. Schaar
32
12
0
09 Nov 2022
Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular
  data
Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data
Nabeel Seedat
Jonathan Crabbé
Ioana Bica
M. Schaar
19
24
0
24 Oct 2022
DendroMap: Visual Exploration of Large-Scale Image Datasets for Machine
  Learning with Treemaps
DendroMap: Visual Exploration of Large-Scale Image Datasets for Machine Learning with Treemaps
Donald Bertucci
M. Hamid
Yashwanthi Anand
Anita Ruangrotsakun
Delyar Tabatabai
Melissa Perez
Minsuk Kahng
43
29
0
14 May 2022
Modern Views of Machine Learning for Precision Psychiatry
Modern Views of Machine Learning for Precision Psychiatry
Z. Chen
Prathamesh Kulkarni
Kulkarni
I. Galatzer-Levy
Benedetta Bigio
C. Nasca
Yu Zhang
57
91
0
04 Apr 2022
Data Smells: Categories, Causes and Consequences, and Detection of
  Suspicious Data in AI-based Systems
Data Smells: Categories, Causes and Consequences, and Detection of Suspicious Data in AI-based Systems
Harald Foidl
Michael Felderer
Rudolf Ramler
15
31
0
19 Mar 2022
1