Hierarchical Cluster Analysis — HCA • structToolbox

Hierarchical Cluster Analysis is a numerical technique that uses agglomerative clustering to identify clusters or groupings of samples.

Usage

HCA(
  dist_method = "euclidean",
  cluster_method = "complete",
  minkowski_power = 2,
  factor_name,
  ...
)

Arguments

dist_method

(character) Distance measure. Allowed values are limited to the following:

"euclidean": The euclidean distance (2 norm).
"maximum": The maximum distance.
"manhattan": The absolute distance (1 norm).
"canberra": A weighted version of the mahattan distance.
"minkowski": A generalisation of manhattan and euclidean distance to nth norm.

The default is "euclidean".

cluster_method

(character) Agglomeration method. Allowed values are limited to the following:

"ward.D": Ward clustering.
"ward.D2": Ward clustering using sqaured distances.
"single": Single linkage.
"complete": Complete linkage.
"average": Average linkage (UPGMA).
"mcquitty": McQuitty linkage (WPGMA).
"median": Median linkage (WPGMC).
"centroid": Centroid linkage (UPGMC).

The default is "complete".

minkowski_power

(numeric) The default is 2.

factor_name

(character) The name of a sample-meta column to use.

...

Additional slots and values passed to struct_class.

Value

A HCA object with the following output slots:

`dist_matrix`	(dist) An object containing pairwise distance information between samples.
`hclust`	(hclust) An object of class hclust which describes the tree produced by the clustering process.
`factor_df`	(data.frame)

Details

This object makes use of functionality from the following packages:

stats

Inheritance

A HCA object inherits the following struct classes:

[HCA] >> [model] >> [struct_class]

References

R Core Team (2024). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.

Examples

M = HCA(
      dist_method = "euclidean",
      cluster_method = "complete",
      minkowski_power = numeric(0),
      factor_name = "V1")

D = iris_DatasetExperiment()
M = HCA(factor_name='Species')
M = model_apply(M,D)