Clustering

Machine Learning

Jesus A. Gonzalez

February 7, 2020

Clustering

Clustering

Challenges

Challenges

Similarity Measures

Similarity Measures

Similarity Measures

Similarity Measures

Similarity Measures

Similarity Measures

Similarity Measures

Similarity Measures

Similarity Measures

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

Clustering Methods

The k-Means Algorithm

The k-Means Algorithm

The k-Means Algorithm - the algorithm

The k-Means Algorithm

The k-Medoids Algorithm

Hierarchical k-Means

COBWEB (a hierarchical clustering method)

COBWEB (a hierarchical clustering method)

\(CU=\frac{\sum\limits_{k=1}^{n}P(C_k)[\sum\limits_i\sum\limits_j P(A_i=V_{ij}|C_k)^2-\sum\limits_{i}\sum\limits_{j} P(A_i = v_{ij})^2]}{n}\)

COBWEB (a hierarchical clustering method)

COBWEB (a hierarchical clustering method)

COBWEB (a hierarchical clustering method)

COBWEB (a hierarchical clustering method)

\(f(a) = \frac{1}{\sqrt{2\pi\sigma^2}}e^{-\frac{(a-\mu)^2}{2\sigma^2}}\)

\(\sum\limits_{i} P(A_i = V_{ij})^2 \sim \int f(a_i)^2 da_i = \frac{1}{2\sqrt{\pi\sigma_i}}\)

COBWEB (a hierarchical clustering method)

\(CU = \frac{1}{k} \sum\limits_{k=1}{n}P(C_k) \frac{1}{2\sqrt{\pi}}\sum\limits_{i}(\frac{1}{\sigma_{ik}} - \frac{1}{\sigma_i})\)

Clustering based on Probabilities

Clustering based on Probabilities

Clustering based on Probabilities

\(\mu=\frac{x_1+x_2+\dots+x_n}{n}\)

\(\sigma^2=\frac{(x_1-\mu)^2+(x_2-\mu)^2+\dots+(x_n-\mu)^2}{n-1}\)

Clustering based on Probabilities

\(P(A|x) = \frac{P(x|A)P(A)}{P(x)} = \frac{f(x;\mu_A,\sigma_A)P_A}{P(x)}\)

where \(f(x;\mu_A,\sigma_A)\) is a normal distribution

\(f(x;\mu_A,\sigma_A) = \frac{1}{\sqrt{2\pi\sigma^2}}e^{-\frac{(x-\mu)^2}{2\sigma^2}}\)

Expectation Maximization Algorithm

Expectation Maximization Algorithm

Expectation Maximization Algorithm

\(\mu_A=\frac{w_1x_1+w_2x_2+\dots+w_nx_n}{w_1+w_2+\dots+w_n}\)

\(\sigma^2_A=\frac{w_1(x_1-\mu)^2+w_2(x_2-\mu)^2+\dots+w_n(x_n-\mu)^2}{w_1+w_2+\dots+w_n}\)

\(w_i\) is the probability that object \(i\) belongs to cluster \(A\) and we sum over all objects (not only those of A)

Expectation Maximization Algorithm

\(\prod(P_AP(x_i|A)+P_BP(x_i|B))\)

Expectation Maximization Algorithm

How many Clusters?

How many Clusters?

How many Clusters?