Additionally, a subcorpus of 6 million word forms has manually resolved homonymy.
The whole corpus has searchable tagging for lexical semantics (LS),[2] including morphosemantic POS subclasses (proper noun, reflexive pronoun, etc.), LS characteristics proper (thematic class, causativity, evaluation), and derivation (diminutive, adverb formed from adjective, etc.).
Users can define their own subcorpus and restrict searches for combinations of lemmata, POS/grammeme tags, and semantic tags to that subset.
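The idea of searching tag combinations within a user-defined subcorpus can be illustrated with a minimal Python sketch. It does not reflect the corpus's actual software or query interface: the `Token` and `Document` classes, the `select_subcorpus` and `search` functions, the tag names, and the sample data are all hypothetical and invented for illustration.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Token:
    """One annotated word form (all field and tag names are hypothetical)."""
    form: str
    lemma: str
    pos: str
    grammemes: frozenset = frozenset()
    semantics: frozenset = frozenset()


@dataclass
class Document:
    """A text plus the metadata used to select a subcorpus."""
    title: str
    year: int
    genre: str
    tokens: list = field(default_factory=list)


def select_subcorpus(corpus, predicate):
    """Keep only the documents whose metadata satisfy the predicate."""
    return [doc for doc in corpus if predicate(doc)]


def search(docs, lemma=None, pos=None, grammemes=(), semantics=()):
    """Return (title, form) pairs for tokens matching every given constraint."""
    wanted_gram = set(grammemes)
    wanted_sem = set(semantics)
    hits = []
    for doc in docs:
        for tok in doc.tokens:
            if lemma is not None and tok.lemma != lemma:
                continue
            if pos is not None and tok.pos != pos:
                continue
            # Every requested tag must be present on the token.
            if not wanted_gram <= tok.grammemes:
                continue
            if not wanted_sem <= tok.semantics:
                continue
            hits.append((doc.title, tok.form))
    return hits


# Invented sample data; the transliterated forms and tags are placeholders.
corpus = [
    Document("Text A", 1990, "fiction", [
        Token("domik", "domik", "S", frozenset({"nom", "sg"}),
              frozenset({"diminutive"})),
        Token("stoit", "stojat", "V", frozenset({"praes", "sg"})),
    ]),
    Document("Text B", 2005, "news", [
        Token("domika", "domik", "S", frozenset({"gen", "sg"}),
              frozenset({"diminutive"})),
    ]),
]

# Define a user subcorpus by metadata, then search only inside it.
fiction = select_subcorpus(corpus, lambda doc: doc.genre == "fiction")
hits = search(fiction, pos="S", semantics=["diminutive"])
print(hits)
```

Running the same `search` call on `corpus` instead of `fiction` would also return the match from "Text B", which shows how the subcorpus restriction narrows the result set.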