Trusting my predictions: on the value of Instance-Level analysis

August, 2023

Ana C. Lorena, Pedro Y. A. Paiva, and Ricardo B. C. Prudêncio. 2023. “Trusting my predictions: on the value of Instance-Level analysis”. ACM Comput. Surv. Just Accepted (August 2023). https://doi.org/10.1145/3615354

Abstract

Machine Learning solutions have spread along many domains, including critical applications. The development of such models usually relies on a dataset containing labeled data. This dataset is then split into training and test sets and the accuracy of the models in replicating the test labels is assessed. This process is often iterated in a cross-validation procedure for obtaining average performance estimates. But is the average of the predictive performance on test sets enough for assessing the trustfulness of a Machine Learning model? This paper discusses the importance of knowing which individual observations of a dataset are more challenging than others and how this characteristic can be measured and used in order to improve classification performance and trustfulness. A set of strategies for measuring the hardness level of the instances of a dataset is surveyed and a Python package containing their implementation is provided.

Authors

Ana Carolina Lorena, Instituto Tecnológico de Aeronáutica, Brazil

Pedro Yuri Paiva, Instituto Tecnológico de Aeronáutica, Brazil

Ricardo Bastos C Prudêncio, Centro de Informática, Universidade Federal de Pernambuco, Brazil

Comentários desativados

Endereço

Av. Jornalista Anibal Fernandes, s/n, Cidade Universitária (Campus Recife)

CEP: 50740-560

Recife/PE

Acompanhe o CIn-UFPE em todas as redes sociais

CONTATO

+ 55 81 2126-8430

contato@cin.ufpe.br

Trusting my predictions: on the value of Instance-Level analysis

Sobre este site

Encontre-nos

Endereço

CONTATO