Automatic taxonomy construction

Automatic taxonomy construction (ATC) is the use of software programs to generate taxonomical classifications from a body of texts called a corpus.[1][2][3][4][5][6]

Among other things, a taxonomy can be used to organize and index knowledge (stored as documents, articles, videos, etc.).

Manually developing and maintaining a taxonomy is a labor-intensive task that requires significant time and resources, including familiarity with or expertise in the taxonomy's domain (scope, subject, or field), which drives up costs and limits the scope of such projects.

ATC uses artificial intelligence techniques to generate a taxonomy for a domain quickly and automatically, in order to avoid these problems and remove these limitations.

A taxonomy that links specific objects to more general concepts is called an is-a model, because the specific objects are considered instances of a concept.
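
As a rough illustration of the is-a model, the sketch below stores each term with a link to its more general concept and walks those links upward. The `Taxonomy` class, its method names, and the animal terms are hypothetical, chosen only for this example; an ATC system would derive such links from a corpus rather than have them entered by hand.

```python
from typing import Dict, List, Optional


class Taxonomy:
    """A tiny is-a hierarchy (hypothetical): each term has at most one parent."""

    def __init__(self) -> None:
        # Maps each term to its more general concept, or None for a root.
        self.parent: Dict[str, Optional[str]] = {}

    def add_is_a(self, specific: str, general: str) -> None:
        """Record the link 'specific is-a general'."""
        self.parent.setdefault(general, None)  # register the concept if new
        self.parent[specific] = general

    def ancestors(self, term: str) -> List[str]:
        """Return the more general concepts above term, nearest first."""
        chain: List[str] = []
        seen = {term}  # guards against accidental cycles
        current = self.parent.get(term)
        while current is not None and current not in seen:
            chain.append(current)
            seen.add(current)
            current = self.parent.get(current)
        return chain

    def is_a(self, specific: str, general: str) -> bool:
        """True if general appears anywhere above specific."""
        return general in self.ancestors(specific)


# Illustrative links only; not taken from any real corpus.
taxonomy = Taxonomy()
taxonomy.add_is_a("mammal", "animal")
taxonomy.add_is_a("dog", "mammal")
taxonomy.add_is_a("cat", "mammal")

print(taxonomy.ancestors("dog"))       # ['mammal', 'animal']
print(taxonomy.is_a("dog", "animal"))  # True
print(taxonomy.is_a("animal", "dog"))  # False
```

Because the links are followed transitively, "dog" counts as an instance of "animal" even though only "dog is-a mammal" and "mammal is-a animal" were recorded. The relation is directional, so the reverse query is false.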