TY - BOOK AU - Ubas, Apple Grace Otero. TI - Fuzzy Jaccard similarity approach in handling missing values for randomly amplified polymorphic DNA (RAPD) analysis PY - 2008/// KW - Fuzzy sets KW - Hierarchical clustering KW - Clustering KW - Jaccard similarity coefficients KW - k-nearest neighbors KW - Missing values KW - Modified Jaccard similarity coefficients KW - Zero replacements KW - RAPD (Randomly Amplified Polymorphic DNA) KW - UPGMA (Unweighted Pair Group Mean Average) KW - Clustering algorithms KW - Undergraduate Thesis KW - AMAT200, KW - BSAM N1 - Thesis (BS Applied Mathematics) -- University of the Philippines Mindanao, 2008 N2 - In RAPD analyses, ambiguous hands are discarded as missing values. The missing values in RAPD data add the complexity to the clustering of organisms. One way of dealing with the missing values in RAPD analyses is to tolerate the ambiguity of the bands that are considered missing. In this study, this was done by the introduction of the concept of fuzziness. It was proposed that an analyst may opt to score bands with values with in the interval [0,1]. The fuzzy interpretation of RAPD experiments requires the use of appropriate similarity measures. In this light, three fuzzy Jaccard similarity coefficients were presented and applied to three RAPD data sets with scores. The performance of the three fuzzy Jaccard similarity measures were evaluated and compared to those of zero replacement, KNN, and the modified Jaccard similarity approach in terms of their ability in recovering the similarity matrices and dendrograms of the data sets used in the study. The Spearman rank correlation index was used to measure the performance of the methods at the similarity matrix level, while the Symmetric Difference was used at the clustering level. Results of the study showed that the fuzzy Jaccard similarity measures had generally performed almost as good as the KNN method at almost all levels of missing value incidence at both the similarity matrix and dendogram levels. Moreover, the fuzzy similarity measures outperformed the zero replacement and the modified Jaccard similarity approaches in handling RAPD data missing values ER -