
Jurnal Riset Informatika, Год журнала: 2024, Номер 6(4), С. 211 - 222
Опубликована: Сен. 15, 2024
Stroke occurs due to disrupted blood flow the brain, either from a clot (ischemic) or ruptured vessel (hemorrhagic), leading brain tissue damage and neurological dysfunction. It remains cause of death disability worldwide, making early prediction crucial for timely intervention. This study evaluates impact data balancing techniques on stroke performance across different machine learning models. Random Forest (RF) consistently achieves highest accuracy (98%) but struggles with precision recall variations depending method. Decision Tree (DT) K-Nearest Neighbors (KNN) benefit most SMOTE SMOTETomek, improving their F1-scores (11.21% 9.18%), indicating better balance between recall. Under Sampling enhances all models reduces precision, lower overall predictive reliability. SMOTETomek emerge as effective techniques, particularly DT KNN, while RF accurate requires further optimization improve balance.
Язык: Английский