Hierarchical Clustering in R: The Essentials

Cluster Analysis in R

5 Lessons

2 hours 0 mins

Free

Hierarchical clustering, also called hierarchical cluster analysis (HCA), is an alternative to partitional clustering for grouping objects based on their similarity.

In contrast to partitional clustering, hierarchical clustering does not require the number of clusters to be specified in advance.

Hierarchical clustering can be subdivided into two types:

Agglomerative clustering, in which each observation is initially considered a cluster of its own (leaf). Then, the most similar clusters are successively merged until there is just one single big cluster (root).
Divisive clustering, the inverse of agglomerative clustering, which begins with the root, in which all objects are included in one cluster. Then, the most heterogeneous clusters are successively divided until all observations are in their own cluster.
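
As a minimal sketch of the two approaches, the example below runs both on the built-in USArrests data set. The data set, the scaling step, and the "average" linkage are assumptions chosen for illustration, not choices taken from the course. It uses hclust() from base R for the agglomerative case and diana() from the cluster package for the divisive case.

```r
# Illustrative sketch: USArrests and "average" linkage are assumed choices.
library(cluster)                  # provides diana()

df <- scale(USArrests)            # standardize so no variable dominates the distances
d  <- dist(df, method = "euclidean")

# Agglomerative (bottom-up): start from single observations and merge upward
hc_agg <- hclust(d, method = "average")

# Divisive (top-down): start from one all-inclusive cluster and split downward
hc_div <- diana(df)

# A diana result can be converted to the same tree class that hclust() returns
div_tree <- as.hclust(hc_div)
```

Both objects describe a full tree over the same observations, so the number of groups is chosen afterwards rather than before fitting.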

The result of hierarchical clustering is a tree-based representation of the objects, which is also known as a dendrogram (see the figure below).

The dendrogram is a multilevel hierarchy where clusters at one level are joined together to form the clusters at the next level. This makes it possible to decide the level at which to cut the tree to generate suitable groups of data objects.
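
Cutting the tree can be sketched with cutree() from base R. The data set, the "ward.D2" linkage, the group count of 4, and the cut height of 5 below are arbitrary assumptions for illustration only.

```r
# Illustrative sketch: data set, linkage, k and h are assumed values.
df <- scale(USArrests)
hc <- hclust(dist(df), method = "ward.D2")

# Cut the tree by asking for a fixed number of groups ...
groups_k <- cutree(hc, k = 4)

# ... or by cutting at a given height on the dendrogram
groups_h <- cutree(hc, h = 5)

# Count how many observations fall in each group
table(groups_k)
```

To see where a cut falls, plot(hc) draws the dendrogram and rect.hclust(hc, k = 4) outlines the resulting groups on it.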

In this course, you will learn:

The hierarchical clustering algorithms
Examples of computing and visualizing hierarchical clustering in R
How to cut dendrograms into groups
How to compare two dendrograms
Solutions for handling dendrograms of large data sets

Related Book

Practical Guide to Cluster Analysis in R

Lessons

Agglomerative Hierarchical Clustering
30 mins
Alboukadel Kassambara

In this article, we start by describing the agglomerative clustering algorithms. Next, we provide R lab sections with many examples for computing and visualizing hierarchical clustering. We continue by explaining how to interpret a dendrogram. Finally, we provide R code for cutting dendrograms into groups.
Divisive Hierarchical Clustering
5 mins
Alboukadel Kassambara

This article introduces the divisive clustering algorithms and provides practical examples showing how to compute divisive clustering using R.
Comparing Cluster Dendrograms in R
20 mins
Alboukadel Kassambara

This article describes how to compare cluster dendrograms in R using the dendextend R package.
Examples of Dendrograms Visualization
30 mins
Alboukadel Kassambara

This article provides examples of dendrogram visualizations using R software. Additionally, we show how to save and zoom into a large dendrogram.
Heatmap in R: Static and Interactive Visualization
35 mins
Alboukadel Kassambara

A heatmap is another way to visualize hierarchical clustering. It's also called a false-color image, where data values are mapped to a color scale. Here, we'll demonstrate how to draw and arrange a heatmap in R.
