Перейти к основному содержанию
AkademIndex

Продукты

Для разработчиков

AkademBaseОткрытый API экосистемы
Статья

IMPROVING THE RELIABILITY OF MACHINE LEARNING MODELS BY FILLING IN MISSING NAN VALUES IN MEDICAL DATASETS USING A GENETIC ALGORITHM

Shokhrukh SariyevSamarkand State University named after Sharof Rashidov, assistant, Samarkand, Uzbekistan
ABI

Аннотация

This article proposes a genetic algorithm-based approach to optimize the filling of missing NaN values in a dataset. The focus is on selecting NaN values in the dataset directly corresponding to the results of the classification task. In the proposed method, each individual is represented as a chromosome in the form of a vector of all missing values. The search space is bounded by the given intervals for numerical attributes, and by the set of appropriate categories for categorical attributes. The accuracy indicator of the Random Forest ensemble model was used as the fitness function in the genetic algorithm.

Перевод пока недоступен

Темы

Идентификаторы

Цитирования и источники

Цитирований: 0Использованных источников: 0