Dissimilarity coefficients in hierarchical mixed-type data clustering /

Gomez, Rhyz C.

Dissimilarity coefficients in hierarchical mixed-type data clustering / Rhyz C. Gomez. - 2008 - 78 leaves.

Thesis, Undergraduate (BS Applied Mathematics) - U.P. Mindanao

Yang's dissimilarity coefficient for mixed-type data was modified using two different aggregating equations of De Carvalho. L? Eixample normalized dissimilarity coefficient for continuous attributes was used instead of Yang's dissimilarity coefficient. These modified versions of Yang's dissimilarity coefficient were then used to construct hierarchical trees with single linkage, complete linkage and UPGMA on the auto, heart and credit data sets. The single linkage clustering algorithm was found to give more misclassifications on the auto data, because single linkage tends to produce the chaining phenomenon. The effectiveness of the two modified dissimilarity coefficients was then evaluated in terms of accuracy, entropy and purity. The first modified coefficient was found to give the greater improvement over Yang's dissimilarity coefficient in accuracy, entropy and purity.
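The workflow summarized in the abstract (a precomputed dissimilarity matrix, a hierarchical linkage rule, then an evaluation against known classes) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names `agglomerate`, `purity` and `entropy` are hypothetical, the 4 x 4 matrix `D` is made-up toy data and is not computed with Yang's or De Carvalho's coefficients, and the purity and entropy formulas are the commonly used ones, which may differ from the definitions adopted in the thesis.

```python
import math
from collections import Counter


def agglomerate(dist, n_clusters, linkage="average"):
    """Naive agglomerative clustering on a precomputed dissimilarity matrix.

    linkage is "single" (minimum), "complete" (maximum) or
    "average" (UPGMA, mean of all between-cluster pairs).
    Returns one cluster label per object.
    """
    clusters = [[i] for i in range(len(dist))]

    def between(a, b):
        pairs = [dist[i][j] for i in a for j in b]
        if linkage == "single":
            return min(pairs)
        if linkage == "complete":
            return max(pairs)
        return sum(pairs) / len(pairs)  # UPGMA

    while len(clusters) > n_clusters:
        # Find the two closest clusters under the chosen linkage and merge them.
        _, p, q = min(
            (between(clusters[p], clusters[q]), p, q)
            for p in range(len(clusters))
            for q in range(p + 1, len(clusters))
        )
        clusters[p] = clusters[p] + clusters[q]
        del clusters[q]

    labels = [0] * len(dist)
    for k, members in enumerate(clusters):
        for i in members:
            labels[i] = k
    return labels


def purity(labels, classes):
    """Fraction of objects that belong to the majority class of their cluster."""
    total = 0
    for k in set(labels):
        counts = Counter(c for l, c in zip(labels, classes) if l == k)
        total += max(counts.values())
    return total / len(labels)


def entropy(labels, classes):
    """Size-weighted average of the class entropy (base 2) inside each cluster."""
    n = len(labels)
    h = 0.0
    for k in set(labels):
        counts = Counter(c for l, c in zip(labels, classes) if l == k)
        n_k = sum(counts.values())
        h_k = -sum((m / n_k) * math.log2(m / n_k) for m in counts.values())
        h += (n_k / n) * h_k
    return h


# Toy example (made-up dissimilarities, two known classes).
D = [
    [0, 1, 5, 6],
    [1, 0, 4, 5],
    [5, 4, 0, 1],
    [6, 5, 1, 0],
]
classes = ["a", "a", "b", "b"]
labels = agglomerate(D, 2, linkage="average")
print(labels, purity(labels, classes), entropy(labels, classes))
```

Higher purity and lower entropy indicate clusters that agree more closely with the known classes. The naive merge loop recomputes every between-cluster dissimilarity at each step, which is adequate for a small illustration but not for data sets of realistic size.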


Aggregating equations.
Clustering.
Mixed-type data.
Dissimilarity coefficients.
Yang's dissimilarity coefficients.
UPGMA (Unweighted Pair Group Method with Arithmetic Mean).
Auto data.
Cluster solutions.
Distance measures.
Dendrograms.
Hierarchical clustering.


Undergraduate Thesis -- AMAT200
 