000 02822nam a22003733a 4500
001 UPMIN-00003211640
003 UPMIN
005 20230316103908.0
008 230213b |||||||| |||| 00| 0 eng d
040 _aDLC
_cUPMin
_dupmin
041 _aeng
090 0 _aLG993.5 2008
_bA64 U23
100 _aUbas, Apple Grace Otero.
_92397
245 _aFuzzy Jaccard similarity approach in handling missing values for randomly amplified polymorphic DNA (RAPD) analysis /
_cApple Grace Otero Ubas
260 _c2008
300 _a82 leaves.
502 _aThesis (BS Applied Mathematics) -- University of the Philippines Mindanao, 2008
520 3 _aIn RAPD analyses, ambiguous hands are discarded as missing values. The missing values in RAPD data add the complexity to the clustering of organisms. One way of dealing with the missing values in RAPD analyses is to tolerate the ambiguity of the bands that are considered missing. In this study, this was done by the introduction of the concept of fuzziness. It was proposed that an analyst may opt to score bands with values with in the interval [0,1]. The fuzzy interpretation of RAPD experiments requires the use of appropriate similarity measures. In this light, three fuzzy Jaccard similarity coefficients were presented and applied to three RAPD data sets with scores. The performance of the three fuzzy Jaccard similarity measures were evaluated and compared to those of zero replacement, KNN, and the modified Jaccard similarity approach in terms of their ability in recovering the similarity matrices and dendrograms of the data sets used in the study. The Spearman rank correlation index was used to measure the performance of the methods at the similarity matrix level, while the Symmetric Difference was used at the clustering level. Results of the study showed that the fuzzy Jaccard similarity measures had generally performed almost as good as the KNN method at almost all levels of missing value incidence at both the similarity matrix and dendogram levels. Moreover, the fuzzy similarity measures outperformed the zero replacement and the modified Jaccard similarity approaches in handling RAPD data missing values.
650 1 7 _aFuzzy sets.
_92398
650 1 7 _aHierarchical clustering.
_91294
650 1 7 _aClustering.
_9366
650 1 7 _aJaccard similarity coefficients.
_92399
650 1 7 _ak-nearest neighbors.
_92400
650 1 7 _aMissing values.
_9990
650 1 7 _aModified Jaccard similarity coefficients.
_92401
650 1 7 _aZero replacements.
_92402
650 1 7 _aRAPD (Randomly Amplified Polymorphic DNA)
_92403
650 1 7 _aUPGMA (Unweighted Pair Group Mean Average)
_92404
650 1 7 _aClustering algorithms.
_92405
658 _aUndergraduate Thesis
_cAMAT200,
_2BSAM
905 _aFi
905 _aUP
942 _2lcc
_cTHESIS
999 _c2236
_d2236