Local cover image
Local cover image
Local cover image
Local cover image

Nearest neighbor-based imputation in treating data sets with missing values and their effects in the clustering accuracy / Raphael John Rule Onggo.

By: Material type: TextTextLanguage: English Publication details: 2008Description: 73 leavesSubject(s): Dissertation note: Thesis (BS Applied Mathematics) -- University of the Philippines Mindanao, 2008 Abstract: The K-Nearest Neighbor (KNN) imputation method, along with the more commonly used imputation methods mean and median imputations, were used in treating incomplete data sets. In order to obtain a clear comparison, three complete data sets were used with two types of missingness: missing completely at random (MCAR) and missing at random (MAR). missing values were generated from these complete data sets at rates 1%, 5%, 10%, and 20%. The treated incomplete data sets were then clustered using then k-mean clustering algorithm. The incomplete data sets were also clustered using the modified k-means clustering algorithms to the imputed data sets obtained from the three imputation methods were compared to each other and to that of the results obtained after applying the modified k-means clustering algorithm with adaptive imputation to the incomplete data sets. Results revealed that the k-nearest neighbor, mean, and medium imputation methods and the modified k-means clustering algorithm attained high cluster recovery even at 20% missing values. Furthermore, clustering results obtained from the k-nearest neighbor imputed data sets showed to have the most accurate clustering results as compared to the clustering results obtained from the mean imputed data sets and the median imputed data sets, and also the clustering results obtained after applying the modified k-means clustering algorithm with adaptive imputation to the incomplete data sets in MAR and MCAR types of missing values.
List(s) this item appears in: BS Applied Mathematics
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Cover image Item type Current library Collection Call number Status Date due Barcode
University Library Theses Room-Use Only LG993.5 2008 A64 O55 (Browse shelf(Opens below)) Not For Loan 3UPML00012279
University Library Archives and Records Preservation Copy LG993.5 2008 A64 O55 (Browse shelf(Opens below)) Not For Loan 3UPML00033234

Thesis (BS Applied Mathematics) -- University of the Philippines Mindanao, 2008

The K-Nearest Neighbor (KNN) imputation method, along with the more commonly used imputation methods mean and median imputations, were used in treating incomplete data sets. In order to obtain a clear comparison, three complete data sets were used with two types of missingness: missing completely at random (MCAR) and missing at random (MAR). missing values were generated from these complete data sets at rates 1%, 5%, 10%, and 20%. The treated incomplete data sets were then clustered using then k-mean clustering algorithm. The incomplete data sets were also clustered using the modified k-means clustering algorithms to the imputed data sets obtained from the three imputation methods were compared to each other and to that of the results obtained after applying the modified k-means clustering algorithm with adaptive imputation to the incomplete data sets. Results revealed that the k-nearest neighbor, mean, and medium imputation methods and the modified k-means clustering algorithm attained high cluster recovery even at 20% missing values. Furthermore, clustering results obtained from the k-nearest neighbor imputed data sets showed to have the most accurate clustering results as compared to the clustering results obtained from the mean imputed data sets and the median imputed data sets, and also the clustering results obtained after applying the modified k-means clustering algorithm with adaptive imputation to the incomplete data sets in MAR and MCAR types of missing values.

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer

Local cover image Local cover image
 
University of the Philippines Mindanao
The University Library, UP Mindanao, Mintal, Tugbok District, Davao City, Philippines
Email: library.upmindanao@up.edu.ph
Contact: (082)295-7025
Copyright @ 2022 | All Rights Reserved