Modified possibilistic fuzzy c-means algorithm for clustering incomplete data sets

Rustam, Koredianto Usman, Mudyawati Kamaruddin, Dina Chamidah, Nopendri, Khaerudin Saleh, Yulinda Eliskar, Ismail Marzuki

Issue: 2/2021
Journal: Acta Polytechnica
DOI: 10.14311/AP.2021.61.0364

Keywords: incomplete data, fuzzy clustering, possibilistic clustering, missing values imputation.

Abstract: The possibilistic fuzzy c-means (PFCM) algorithm is a reliable algorithm proposed to address two weaknesses of its predecessors: the noise sensitivity of fuzzy c-means (FCM) and the coincident clusters of possibilistic c-means (PCM). However, PFCM is applicable only to complete data sets. This research therefore modifies PFCM for clustering incomplete data sets, yielding two variants, OCSPFCM and NPSPFCM, whose performance is evaluated on three criteria: 1) accuracy percentage, 2) number of iterations, and 3) centroid errors. The results show that NPSPFCM outperforms OCSPFCM on all experimental data sets for missing-value rates from 5% to 30%. Furthermore, the two algorithms achieve average accuracies of 97.75% to 78.98% and 98.86% to 92.49%, respectively.
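The abstract does not expand the variant names. As a rough illustration only, the sketch below assumes that "NPS" stands for a nearest-prototype strategy, as in earlier fuzzy c-means work on incomplete data: each missing feature is filled from the closest cluster centroid, with closeness measured on the observed features only. The function names `mask_at_random` and `nearest_prototype_fill`, the toy data, and the centroids are all hypothetical; this is not the authors' algorithm, which would also update memberships, typicalities, and centroids at every iteration.

```python
import numpy as np

def mask_at_random(X, rate, rng):
    """Return a copy of X with roughly `rate` of its entries set to NaN."""
    X = X.astype(float).copy()
    mask = rng.random(X.shape) < rate
    # Keep at least one observed feature per row so every point stays usable.
    for i in np.where(mask.all(axis=1))[0]:
        mask[i, rng.integers(X.shape[1])] = False
    X[mask] = np.nan
    return X

def nearest_prototype_fill(X, centroids):
    """Fill NaNs in each row with the matching features of its nearest centroid."""
    filled = X.copy()
    for i, row in enumerate(X):
        observed = ~np.isnan(row)
        # Partial distance: compare only on the features that are present.
        d = ((centroids[:, observed] - row[observed]) ** 2).sum(axis=1)
        nearest = centroids[np.argmin(d)]
        filled[i, ~observed] = nearest[~observed]
    return filled

# Toy data (assumed for illustration): two well-separated centroids.
centroids = np.array([[0.0, 0.0], [10.0, 10.0]])
X = np.array([[0.5, np.nan], [np.nan, 9.0], [10.2, 9.8]])
print(nearest_prototype_fill(X, centroids))
```

In an experiment like the one summarized above, a helper such as `mask_at_random` could generate test sets at each missing-value rate between 5% and 30% before clustering.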