Next: Algorithm Performance Comparisons
Up: The CLICK Algorithm
Previous: Refinements
When the ``correct'' solution for the clustering problem is known, we
can use the same methods as for the HCS algorithm
(sec 11.3.5).
In most cases, unfortunately, the ``correct'' solution for the
clustering problems is unknown. In this case we evaluate the quality
of the solution by computing two figures of merit to measure the
homogeneity and separation of the produced clusters. For
fingerprint data, homogeneity is evaluated by the average and minimum
correlation coefficient between the fingerprint of an element and the
fingerprint of its corresponding cluster. Precisely, if cl(u) is the
cluster of u, F(X) and F(u) are the fingerprints of a cluster X
and an element u respectively, and S(x,y) is the correlation coefficient
(or any other similarity measure) of fingerprints x and y, then
Separation is evaluated by the weighted average and the maximum correlation
coefficient between cluster fingerprints. That is, if the clusters are
then
Hence, a solution improves if HAve increases and HMin increases, and
if SAve decreases and SMax decreases.
Next: Algorithm Performance Comparisons
Up: The CLICK Algorithm
Previous: Refinements
Peer Itsik
2001-01-31