ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.10742
  4. Cited By
Machine Learning Testing: Survey, Landscapes and Horizons

Machine Learning Testing: Survey, Landscapes and Horizons

19 June 2019
Jie M. Zhang
Mark Harman
Lei Ma
Yang Liu
    VLM
    AILaw
ArXivPDFHTML

Papers citing "Machine Learning Testing: Survey, Landscapes and Horizons"

50 / 224 papers shown
Title
Not Just Training, Also Testing: High School Youths' Perspective-Taking
  through Peer Testing Machine Learning-Powered Applications
Not Just Training, Also Testing: High School Youths' Perspective-Taking through Peer Testing Machine Learning-Powered Applications
Luis Morales-Navarro
Meghan Shah
Yasmin B. Kafai
26
5
0
21 Nov 2023
Designing monitoring strategies for deployed machine learning
  algorithms: navigating performativity through a causal lens
Designing monitoring strategies for deployed machine learning algorithms: navigating performativity through a causal lens
Jean Feng
Adarsh Subbaswamy
Alexej Gossmann
Harvineet Singh
B. Sahiner
Mi-Ok Kim
Gene Pennello
N. Petrick
Romain Pirracchio
Fan Xia
19
1
0
20 Nov 2023
Search-Based Fairness Testing: An Overview
Search-Based Fairness Testing: An Overview
Hussaini Mamman
S. Basri
A. Balogun
A. A. Imam
Ganesh M. Kumar
L. F. Capretz
37
1
0
10 Nov 2023
Test & Evaluation Best Practices for Machine Learning-Enabled Systems
Test & Evaluation Best Practices for Machine Learning-Enabled Systems
Jaganmohan Chandrasekaran
Tyler Cody
Nicola McCarthy
Erin Lanus
Laura J. Freeman
32
5
0
10 Oct 2023
RAI4IoE: Responsible AI for Enabling the Internet of Energy
RAI4IoE: Responsible AI for Enabling the Internet of Energy
Minhui Xue
Surya Nepal
Ling Liu
Subbu Sethuvenkatraman
Xingliang Yuan
Carsten Rudolph
Ruoxi Sun
Greg Eisenhauer
39
4
0
20 Sep 2023
Machine Learning Data Suitability and Performance Testing Using Fault
  Injection Testing Framework
Machine Learning Data Suitability and Performance Testing Using Fault Injection Testing Framework
Manal Rahal
Bestoun S. Ahmed
Jorgen Samuelsson
AAML
21
1
0
20 Sep 2023
Hazards in Deep Learning Testing: Prevalence, Impact and Recommendations
Hazards in Deep Learning Testing: Prevalence, Impact and Recommendations
Salah Ghamizi
Maxime Cordy
Yuejun Guo
Mike Papadakis
And Yves Le Traon
16
1
0
11 Sep 2023
Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using
  EmotionBench
Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench
Jen-tse Huang
Man Ho Adrian Lam
E. Li
Shujie Ren
Wenxuan Wang
Wenxiang Jiao
Zhaopeng Tu
Michael R. Lyu
48
40
0
07 Aug 2023
Bias Behind the Wheel: Fairness Analysis of Autonomous Driving Systems
Bias Behind the Wheel: Fairness Analysis of Autonomous Driving Systems
Xinyue Li
Zhenpeng Chen
Jie M. Zhang
Federica Sarro
Wenjie Qu
Xuanzhe Liu
14
4
0
05 Aug 2023
Bug Characterization in Machine Learning-based Systems
Bug Characterization in Machine Learning-based Systems
Mohammad Mehdi Morovati
Amin Nikanjam
Florian Tambon
Foutse Khomh
Zhen Ming
Z. Jiang
28
20
0
26 Jul 2023
Fairness Improvement with Multiple Protected Attributes: How Far Are We?
Fairness Improvement with Multiple Protected Attributes: How Far Are We?
Zhenpeng Chen
Jie M. Zhang
Federica Sarro
Mark Harman
FaML
38
4
0
25 Jul 2023
Model Reporting for Certifiable AI: A Proposal from Merging EU
  Regulation into AI Development
Model Reporting for Certifiable AI: A Proposal from Merging EU Regulation into AI Development
Danilo Brajovic
Niclas Renner
Vincent Philipp Goebels
Philipp Wagner
Benjamin Frész
M. Biller
Mara Klaeb
Janika Kutz
Jens Neuhuettler
Marco F. Huber
19
9
0
21 Jul 2023
Badgers: generating data quality deficits with Python
Badgers: generating data quality deficits with Python
Julien Siebert
Daniel Seifert
P. Kelbert
Michael Kläs
Adam Trendowicz
8
1
0
10 Jul 2023
AutoML in Heavily Constrained Applications
AutoML in Heavily Constrained Applications
Felix Neutatz
Marius Lindauer
Ziawasch Abedjan
25
4
0
29 Jun 2023
Evaluating Superhuman Models with Consistency Checks
Evaluating Superhuman Models with Consistency Checks
Lukas Fluri
Daniel Paleka
Florian Tramèr
ELM
50
42
0
16 Jun 2023
Fault Localization for Buggy Deep Learning Framework Conversions in
  Image Recognition
Fault Localization for Buggy Deep Learning Framework Conversions in Image Recognition
Nikolaos Louloudakis
Perry Gibson
José Cano
A. Rajan
17
6
0
10 Jun 2023
Best Practices for Machine Learning Systems: An Industrial Framework for
  Analysis and Optimization
Best Practices for Machine Learning Systems: An Industrial Framework for Analysis and Optimization
G. Chouliaras
Kornel Kielczewski
Amit Beka
D. Konopnicki
Lucas Bernardi
6
0
0
09 Jun 2023
DeltaNN: Assessing the Impact of Computational Environment Parameters on
  the Performance of Image Recognition Models
DeltaNN: Assessing the Impact of Computational Environment Parameters on the Performance of Image Recognition Models
Nikolaos Louloudakis
Perry Gibson
José Cano
A. Rajan
17
8
0
05 Jun 2023
FITNESS: A Causal De-correlation Approach for Mitigating Bias in Machine
  Learning Software
FITNESS: A Causal De-correlation Approach for Mitigating Bias in Machine Learning Software
Ying Xiao
Shangwen Wang
Sicen Liu
Dingyuan Xue
Xian Zhan
Yepang Liu
CML
21
0
0
23 May 2023
Validating Multimedia Content Moderation Software via Semantic Fusion
Validating Multimedia Content Moderation Software via Semantic Fusion
Wenxuan Wang
Jingyuan Huang
Chang Chen
Jiazhen Gu
Jianping Zhang
Weibin Wu
Pinjia He
Michael Lyu
75
9
0
23 May 2023
Testing of Deep Reinforcement Learning Agents with Surrogate Models
Testing of Deep Reinforcement Learning Agents with Surrogate Models
Matteo Biagiola
Paolo Tonella
41
19
0
22 May 2023
BiasAsker: Measuring the Bias in Conversational AI System
BiasAsker: Measuring the Bias in Conversational AI System
Yuxuan Wan
Wenxuan Wang
Pinjia He
Jiazhen Gu
Haonan Bai
Michael Lyu
29
67
0
21 May 2023
Evaluating the Robustness of Interpretability Methods through
  Explanation Invariance and Equivariance
Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance
Jonathan Crabbé
M. Schaar
AAML
24
6
0
13 Apr 2023
D-Score: A White-Box Diagnosis Score for CNNs Based on Mutation
  Operators
D-Score: A White-Box Diagnosis Score for CNNs Based on Mutation Operators
Xin Zhang
Yuqi Song
Xiang Wang
Fei Zuo
MedIm
DiffM
15
2
0
03 Apr 2023
A Meta-Summary of Challenges in Building Products with ML Components --
  Collecting Experiences from 4758+ Practitioners
A Meta-Summary of Challenges in Building Products with ML Components -- Collecting Experiences from 4758+ Practitioners
Nadia Nahar
Haoran Zhang
Grace A. Lewis
Shurui Zhou
Christian Kastner
23
36
0
31 Mar 2023
POLAR-Express: Efficient and Precise Formal Reachability Analysis of
  Neural-Network Controlled Systems
POLAR-Express: Efficient and Precise Formal Reachability Analysis of Neural-Network Controlled Systems
Yixuan Wang
Weichao Zhou
Jiameng Fan
Zhilu Wang
Jiajun Li
Xin Chen
Chao Huang
Wenchao Li
Qi Zhu
35
15
0
31 Mar 2023
An investigation of licensing of datasets for machine learning based on
  the GQM model
An investigation of licensing of datasets for machine learning based on the GQM model
Junyu Chen
Norihiro Yoshida
Hiroaki Takada
33
2
0
24 Mar 2023
Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring
  Platform
Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform
P. Heck
Gerard Schouten
13
0
0
23 Mar 2023
Challenges and Practices of Deep Learning Model Reengineering: A Case
  Study on Computer Vision
Challenges and Practices of Deep Learning Model Reengineering: A Case Study on Computer Vision
Wenxin Jiang
Vishnu Banna
Naveen Vivek
Abhinav Goel
Nicholas Synovic
George K. Thiruvathukal
James C. Davis
VLM
40
18
0
13 Mar 2023
Greener yet Powerful: Taming Large Code Generation Models with
  Quantization
Greener yet Powerful: Taming Large Code Generation Models with Quantization
Xiaokai Wei
Sujan Kumar Gonugondla
W. Ahmad
Shiqi Wang
Baishakhi Ray
...
Ben Athiwaratkun
Mingyue Shang
M. K. Ramanathan
Parminder Bhatia
Bing Xiang
MQ
28
6
0
09 Mar 2023
Iterative Assessment and Improvement of DNN Operational Accuracy
Iterative Assessment and Improvement of DNN Operational Accuracy
Antonio Guerriero
R. Pietrantuono
S. Russo
AI4CE
20
4
0
02 Mar 2023
Towards Fair Machine Learning Software: Understanding and Addressing
  Model Bias Through Counterfactual Thinking
Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking
Zichong Wang
Yangze Zhou
M. Qiu
I. Haque
Laura Brown
Yi He
Jianwu Wang
David Lo
Wenbin Zhang
FaML
28
23
0
16 Feb 2023
Reliability Assurance for Deep Neural Network Architectures Against
  Numerical Defects
Reliability Assurance for Deep Neural Network Architectures Against Numerical Defects
Linyi Li
Yuhao Zhang
Luyao Ren
Yingfei Xiong
Tao Xie
30
7
0
13 Feb 2023
MTTM: Metamorphic Testing for Textual Content Moderation Software
MTTM: Metamorphic Testing for Textual Content Moderation Software
Wenxuan Wang
Jen-tse Huang
Weibin Wu
Jianping Zhang
Yizhan Huang
Shuqing Li
Pinjia He
Michael Lyu
58
29
0
11 Feb 2023
To Be Forgotten or To Be Fair: Unveiling Fairness Implications of
  Machine Unlearning Methods
To Be Forgotten or To Be Fair: Unveiling Fairness Implications of Machine Unlearning Methods
Dawen Zhang
Shidong Pan
Thong Hoang
Zhenchang Xing
Mark Staples
Xiwei Xu
Lina Yao
Qinghua Lu
Liming Zhu
MU
30
16
0
07 Feb 2023
Identifying the Hazard Boundary of ML-enabled Autonomous Systems Using
  Cooperative Co-Evolutionary Search
Identifying the Hazard Boundary of ML-enabled Autonomous Systems Using Cooperative Co-Evolutionary Search
S. Sharifi
Donghwan Shin
Lionel C. Briand
Nathan Aschbacher
10
3
0
31 Jan 2023
Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Florian Tambon
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
G. Antoniol
28
7
0
13 Jan 2023
Understanding the Complexity and Its Impact on Testing in ML-Enabled
  Systems
Understanding the Complexity and Its Impact on Testing in ML-Enabled Systems
Junming Cao
Bihuan Chen
Longjie Hu
Jie Ying Gao
Kaifeng Huang
Xin Peng
26
3
0
10 Jan 2023
Stealthy Backdoor Attack for Code Models
Stealthy Backdoor Attack for Code Models
Zhou Yang
Bowen Xu
Jie M. Zhang
Hong Jin Kang
Jieke Shi
Junda He
David Lo
AAML
19
65
0
06 Jan 2023
When and Why Test Generators for Deep Learning Produce Invalid Inputs:
  an Empirical Study
When and Why Test Generators for Deep Learning Produce Invalid Inputs: an Empirical Study
Vincenzo Riccio
Paolo Tonella
AAML
24
29
0
21 Dec 2022
QVIP: An ILP-based Formal Verification Approach for Quantized Neural
  Networks
QVIP: An ILP-based Formal Verification Approach for Quantized Neural Networks
Yedi Zhang
Zhe Zhao
Fu Song
Hao Fei
Tao Chen
Jun Sun
36
17
0
10 Dec 2022
QEBVerif: Quantization Error Bound Verification of Neural Networks
QEBVerif: Quantization Error Bound Verification of Neural Networks
Yedi Zhang
Fu Song
Jun Sun
MQ
26
11
0
06 Dec 2022
An Empirical Study of Library Usage and Dependency in Deep Learning
  Frameworks
An Empirical Study of Library Usage and Dependency in Deep Learning Frameworks
Mohamed Raed El aoun
L. Tidjon
Ben Rombaut
Foutse Khomh
Ahmed E. Hassan
24
5
0
28 Nov 2022
Quality Assurance in MLOps Setting: An Industrial Perspective
Quality Assurance in MLOps Setting: An Industrial Perspective
Ayan Chatterjee
Bestoun S. Ahmed
Erik Hallin
Anton Engman
20
2
0
23 Nov 2022
Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled
  Systems
Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled Systems
Fitash Ul Haq
Donghwan Shin
Lionel C. Briand
OffRL
44
39
0
27 Oct 2022
A.I. Robustness: a Human-Centered Perspective on Technological
  Challenges and Opportunities
A.I. Robustness: a Human-Centered Perspective on Technological Challenges and Opportunities
Andrea Tocchetti
Lorenzo Corti
Agathe Balayn
Mireia Yurrita
Philip Lippmann
Marco Brambilla
Jie-jin Yang
27
10
0
17 Oct 2022
Space, Time, and Interaction: A Taxonomy of Corner Cases in Trajectory
  Datasets for Automated Driving
Space, Time, and Interaction: A Taxonomy of Corner Cases in Trajectory Datasets for Automated Driving
Kevin Rösch
Florian Heidecker
Julian Truetsch
K. Kowol
Clemens Schicktanz
Maarten Bieshaar
Bernhard Sick
Christoph Stiller
15
8
0
17 Oct 2022
TestAug: A Framework for Augmenting Capability-based NLP Tests
TestAug: A Framework for Augmenting Capability-based NLP Tests
Guanqun Yang
Mirazul Haque
Qiaochu Song
Wei Yang
Xueqing Liu
ELM
34
0
0
14 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
21
7
0
07 Oct 2022
Empowering the trustworthiness of ML-based critical systems through
  engineering activities
Empowering the trustworthiness of ML-based critical systems through engineering activities
J. Mattioli
Agnès Delaborde
Souhaiel Khalfaoui
Freddy Lecue
H. Sohier
F. Jurie
22
2
0
30 Sep 2022
Previous
12345
Next