How Dendrogram works

A dendrogram is a diagram that shows the attribute distances between each pair of sequentially merged classes. To avoid crossing lines, the diagram is graphically arranged so that members of each pair of classes to be merged are neighbors in the diagram.

The Dendrogram tool uses a hierarchical clustering algorithm. The program first computes the distances between each pair of classes in the input signature file. Then it iteratively merges the closest pair of classes and successively merges the next closest pair of classes and the succeeding closest until the classes are all merged. After each merging, the distances between all pairs of classes are updated. The distances at which the signatures of classes are merged are used to construct a dendrogram.

When the Use variance in distance calculations option is unchecked (NO_STD), the distance dmn between a pair of classes m and n is measured as a distance between their means:

Distance between means formula

When the variance option is checked (STD), the Dendrogram tool measures distances between pairs of classes based on their means and variances using the following formula:

Equation for measuring distances between pairs of classes

The new statistics (means and variances) describing the merged class are based on the original mean and variance of the samples that comprise the merged class. Therefore, the merged class is produced using the pooled mean and variance. The two signatures that are used to create the merged class are replaced by a single signature of the combined class. The new mean signature is computed based on the locations in the multidimensional attribute space of all member cells of the merged class. The new signature retains the lower number of the two input classes for the merged class ID.

The value levels, or the distances at which each pair of classes is merged, can be interpolated using the scale bars of the dendrogram graph. Due to the limitation of the size of a character (the graphic's coarse resolution), the levels of merging are rounded for display. However, the precise values of the levels of merging are presented as DISTANCE in the table associated with the dendrogram.

Variances, not covariances, are used for distance computation after a pair of classes is merged. The algorithm used by Dendrogram does not use Mahalanobis distance to determine the distance between classes. Therefore, the distances between classes and the merged classes may not match the results from those grid tools that are based on Mahalanobis distance, such as Edit Signatures, Maximum Likelihood Classification, and Class Probability.

Example

In the following example, classes 3 and 5 are the nearest neighbors in attribute space; therefore, they are merged at level 3.443. This value indicates the relative degree of similarity, which can also be viewed as the distance in multidimensional space. The two classes are merged and treated as a single class. The statistics for the merged class and the distances from the merged class to the other classes are computed. The next two closest classes are then identified. The two candidates are classes 4 and 6. The distance between them is 3.609, and they are merged. The process iterates. All classes are sequentially merged into bigger classes until all classes are merged into a single class.

The output dendrogram file would be as follows:

Distances between pairs of combined classes (in the sequence of merging):

Remaining   Merged   Between-Class
Class      Class      Distance
----------------------------------
  3         5        3.442680
  4         6        3.608904
  7         9        3.899360
  2         7        3.795288
  3         4        4.883098
  2         8        6.073256
  1         3        6.257798
  1         2        9.350019
----------------------------------

Dendrogram of /discb/topdir/myspace/isoclust12.gsg

C       DISTANCE
L
A
S   0      1.0     2.1     3.1     4.1     5.2     6.2     7.2     8.3     9.3
S   |-------|-------|-------|-------|-------|-------|-------|-------|------
   
   5 -------------------------|
                              |----------|
   3 -------------------------|          |
                                         |----------|
   6 ---------------------------|        |          |
                                |--------|          |-------------------|
   4 ---------------------------|                   |                   |
                                                    |                   |
   1 -----------------------------------------------|                   |
                                                                        |-
   9 -----------------------------|                                     |
                                  |                                     |
   7 ---------------------------------------------|                     |
                                   |              |                     |
   2 ------------------------------|              |---------------------|
                                                  |
   8 ---------------------------------------------|
   
    |-------|-------|-------|-------|-------|-------|-------|-------|------
    0      1.0     2.1     3.1     4.1     5.2     6.2     7.2     8.3     9.3

Related Topics


6/29/2011