Hierarchical Cluster Analysis is a numerical technique that uses agglomerative clustering to identify clusters or groupings of samples.
Usage
HCA(
dist_method = "euclidean",
cluster_method = "complete",
minkowski_power = 2,
factor_name,
...
)
Arguments
- dist_method
(character) Distance measure. Allowed values are limited to the following:
"euclidean"
: The euclidean distance (2 norm)."maximum"
: The maximum distance."manhattan"
: The absolute distance (1 norm)."canberra"
: A weighted version of the mahattan distance."minkowski"
: A generalisation of manhattan and euclidean distance to nth norm.
The default is
"euclidean"
.- cluster_method
(character) Agglomeration method. Allowed values are limited to the following:
"ward.D"
: Ward clustering."ward.D2"
: Ward clustering using sqaured distances."single"
: Single linkage."complete"
: Complete linkage."average"
: Average linkage (UPGMA)."mcquitty"
: McQuitty linkage (WPGMA)."median"
: Median linkage (WPGMC)."centroid"
: Centroid linkage (UPGMC).
The default is
"complete"
.- minkowski_power
(numeric) The default is
2
.- factor_name
(character) The name of a sample-meta column to use.
- ...
Additional slots and values passed to
struct_class
.
Value
A HCA
object with the following output
slots:
dist_matrix | (dist) An object containing pairwise distance information between samples. |
hclust | (hclust) An object of class hclust which describes the tree produced by the clustering process. |
factor_df | (data.frame) |
References
R Core Team (2024). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.
Examples
M = HCA(
dist_method = "euclidean",
cluster_method = "complete",
minkowski_power = numeric(0),
factor_name = "V1")
D = iris_DatasetExperiment()
M = HCA(factor_name='Species')
M = model_apply(M,D)