An implementation of the K-means clustering algorithm using Silhouette plot as evaluation technique / (Record no. 453)

MARC details
000 -LEADER
fixed length control field 02435nam a2200241 4500
001 - CONTROL NUMBER
control field UPMIN-00000010150
003 - CONTROL NUMBER IDENTIFIER
control field UPMIN
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20221012110413.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 221012b |||||||| |||| 00| 0 eng d
040 ## - CATALOGING SOURCE
Original cataloging agency DLC
Transcribing agency UPMin
Modifying agency upmin
041 ## - LANGUAGE CODE
Language code of text/sound track or separate title eng
090 ## - LOCALLY ASSIGNED LC-TYPE CALL NUMBER (OCLC); LOCAL CALL NUMBER (RLIN)
Classification number (OCLC) (R) ; Classification number, CALL (RLIN) (NR) LG993.5 2005
Local cutter number (OCLC) ; Book number/undivided call number, CALL (RLIN) C6 A27
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Abroguena, Karen A.
245 00 - TITLE STATEMENT
Title An implementation of the K-means clustering algorithm using Silhouette plot as evaluation technique /
Statement of responsibility, etc. Karen A. Abroguena
260 ## - PUBLICATION, DISTRIBUTION, ETC.
Date of publication, distribution, etc. 2005
300 ## - PHYSICAL DESCRIPTION
Extent 68 leaves
502 ## - DISSERTATION NOTE
Dissertation note Thesis (BS Computer Science) -- University of the Philippines Mindanao, 2005
520 3# - SUMMARY, ETC.
Summary, etc. This project developed software that clusters data according to similarity. The study used the classical k-means algorithm, enhanced with a feature important to users: an evaluation method. Centroids were initialized randomly. The distance between each centroid and a data point was then computed using the squared Euclidean distance formula; the smaller the distance, the more similar the two are. The Transferring pass-Global best improving technique was used to relocate data points. An evaluation technique known as the silhouette plot was added so that the user can visualize how good or bad the clustering is. The software accepts only numerical input data with no missing values, in spreadsheet form. Categorical or mixed (numerical and categorical) data are not accepted. The larger the sample, the longer the software takes to cluster it. With this software, users who analyze their data with the k-means algorithm need not open another application to evaluate the generated clusters. When the iris (flower) data set was clustered with the newly implemented software, the results differed from those generated by established software, and some data points were misclassified. The misclassified points were resolved by the evaluation technique, which gives the user a report on which cluster each data point belongs to. The implemented algorithm may not differ much from those used in established software, but the evaluation technique compensates for this weakness.
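Illustrative note (not part of the thesis or the catalog record): the summary names three ingredients, namely random centroid initialization, squared Euclidean distance, and silhouette-based evaluation. The sketch below shows one way these pieces might fit together in plain Python. It is an assumption-laden illustration only: the function names (sq_dist, kmeans, silhouette_values) are hypothetical, the relocation step is the standard batch reassignment rather than the Transferring pass-Global best improving technique mentioned in the summary, and the silhouette values are computed with ordinary (non-squared) Euclidean distance, which the record does not specify.

```python
import math
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length numeric points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(data, k, max_iter=100, seed=0):
    """Batch k-means with random initialization (assumed variant).

    Returns (labels, centroids). Initial centroids are k distinct
    rows sampled at random from the data.
    """
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(data, k)]
    labels = [None] * len(data)
    for _ in range(max_iter):
        # Assign every point to the centroid with the smallest squared distance.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in data
        ]
        if new_labels == labels:
            break  # no point changed cluster, so the algorithm has converged
        labels = new_labels
        # Recompute each centroid as the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(data, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def silhouette_values(data, labels):
    """Per-point silhouette s = (b - a) / max(a, b), each value in [-1, 1].

    a: mean distance to the other points in the same cluster.
    b: smallest mean distance to the points of any other cluster.
    Points in singleton clusters get 0 by convention.
    """
    clusters = {}
    for i, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(i)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two non-empty clusters")

    def dist(i, j):
        return math.sqrt(sq_dist(data[i], data[j]))

    scores = []
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)
            continue
        a = sum(dist(i, j) for j in own if j != i) / (len(own) - 1)
        b = min(
            sum(dist(i, j) for j in idx) / len(idx)
            for other, idx in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return scores


if __name__ == "__main__":
    # Made-up toy data: two small, well-separated groups of 2-D points.
    points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.9, 8.1)]
    labels, centroids = kmeans(points, k=2)
    for point, label, s in zip(points, labels, silhouette_values(points, labels)):
        print(point, "cluster", label, "silhouette", round(s, 3))
```

A silhouette value near 1 suggests a point sits well inside its cluster, while a negative value suggests it may fit a neighboring cluster better. This is the kind of per-point report the summary describes for flagging misclassified points.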
658 ## - INDEX TERM--CURRICULUM OBJECTIVE
Main curriculum objective Undergraduate Thesis
Curriculum code CMSC200,
Source of term or code BSCS
905 ## - LOCAL DATA ELEMENT E, LDE (RLIN)
a Fi
905 ## - LOCAL DATA ELEMENT E, LDE (RLIN)
a UP
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme Library of Congress Classification
Koha item type Thesis
Holdings
Withdrawn status | Lost status | Source of classification or shelving scheme | Damaged status | Status | Collection | Home library | Current library | Shelving location | Date acquired | Source of acquisition | Accession Number | Total Checkouts | Full call number | Barcode | Date last seen | Koha item type
 | | Library of Congress Classification | | Not For Loan | Preservation Copy | University Library | University Library | Archives and Records | 2005-07-06 | donation | UAR-T-gd599 | | LG993.5 2005 C6 A27 | 3UPML00022040 | 2022-09-21 | Thesis
 | | Library of Congress Classification | | Not For Loan | Room-Use Only | College of Science and Mathematics | University Library | General Reference | 2005-05-24 | donation | CSM-T-gd1228 | | LG993.5 2005 C6 A27 | 3UPML00011342 | 2022-09-21 | Thesis
 