Growing self-organizing map

The GSOM was developed to address the issue of identifying a suitable map size in the SOM.

By using the value called Spread Factor (SF), the data analyst has the ability to control the growth of the GSOM.

The GSOM process is as follows: The GSOM can be used for many preprocessing tasks in Data mining, for Nonlinear dimensionality reduction, for approximation of principal curves and manifolds, for clustering and classification.

It gives often the better representation of the data geometry than the SOM (see the classical benchmark for principal curves on the left).