ARTICLE
TITLE

Modified balanced random forest for improving imbalanced data prediction

SUMMARY

This paper proposes a Modified Balanced Random Forest (MBRF) algorithm as a classification technique to address imbalanced data. The MBRF process changes the process in a Balanced Random Forest by applying an under-sampling strategy based on clustering techniques for each data bootstrap decision tree in the Random Forest algorithm. To find the optimal performance of our proposed method compared with four clustering techniques, like: K-MEANS, Spectral Clustering, Agglomerative Clustering, and Ward Hierarchical Clustering. The experimental result show the Ward Hierarchical Clustering Technique achieved optimal performance, also the proposed MBRF method yielded better performance compared to the Balanced Random Forest (BRF) and Random Forest (RF) algorithms, with a sensitivity value or true positive rate (TPR) of 93.42%, a specificity or true negative rate (TNR) of 93.60%, and the best AUC accuracy value of 93.51%. Moreover, MBRF also reduced process running time.

 Articles related

Aditya Gumilar, Sri Suryani Prasetiyowati, Yuliant Sibaroni    

This study proposes several methods to analyze the performance of the hybrid machine learning method using Voting and Stacking on rainfall classification. The two hybrid methods will combine five classification methods, namely Logistic Regression, Suppor... see more


Ajwa Helisa,Triando Hamonangan Saragih,Irwan Budiman,Fatma Indriani,Dwi Kartini    

Lung cancer is the most common cause of cancer death globally. Thoracic surgery is a common treatment for patients with lung cancer. However, there are many risks and postoperative complications leading to death. In this study, we will predict life expec... see more


Sayidati Karima,Achmad Benny Mutiara    

Kemajuan teknologi informasi memberikan dampak yang besar, seperti penyebaran berita online. Namun, kabar yang tersebar belum tentu benar adanya. Dalam beberapa penelitian, pendeteksian berita hoax telah dilakukan. Namun, terdapat perbedaan hasil dari be... see more

Revista: Faktor Exacta