Russian National Corpus

Additionally, a subcorpus of 6 million word forms has manually resolved homonymy.

The whole corpus carries searchable tagging for lexical semantics (LS),[2] which covers morphosemantic part-of-speech (POS) subclasses (proper noun, reflexive pronoun, etc.), LS characteristics proper (thematic class, causativity, evaluation), and derivation (diminutive, adverb formed from adjective, etc.).

Users can define their own subcorpus and restrict searches for combinations of lemmata, POS, grammemes, and semantic tags to that subset.
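The idea of searching only within a user-defined subcorpus can be illustrated with a small in-memory sketch. This is not the corpus's actual interface or tag set: the `Token` and `Document` classes, the `define_subcorpus` and `search` functions, the `genre` metadata field, and the tag names such as `diminutive` are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    form: str                            # word form as it appears in the text
    lemma: str                           # dictionary form
    pos: str                             # part-of-speech label (hypothetical names)
    grammemes: frozenset = frozenset()   # grammatical tags, e.g. case, number
    semantics: frozenset = frozenset()   # lexical-semantic or derivation tags


@dataclass
class Document:
    meta: dict    # document-level metadata used to select a subcorpus
    tokens: list  # list of Token objects


def define_subcorpus(corpus, **meta):
    """Keep only documents whose metadata matches every given key/value pair."""
    return [d for d in corpus if all(d.meta.get(k) == v for k, v in meta.items())]


def search(docs, lemma=None, pos=None, grammemes=(), semantics=()):
    """Return tokens that satisfy all of the supplied conditions."""
    hits = []
    for doc in docs:
        for tok in doc.tokens:
            if lemma is not None and tok.lemma != lemma:
                continue
            if pos is not None and tok.pos != pos:
                continue
            if not set(grammemes) <= tok.grammemes:
                continue
            if not set(semantics) <= tok.semantics:
                continue
            hits.append(tok)
    return hits


# Toy two-document corpus; the tags are illustrative assumptions.
corpus = [
    Document({"genre": "fiction"}, [
        Token("домик", "домик", "S", frozenset({"nom", "sg"}), frozenset({"diminutive"})),
        Token("быстро", "быстро", "ADV", frozenset(), frozenset({"from_adjective"})),
    ]),
    Document({"genre": "news"}, [
        Token("домика", "домик", "S", frozenset({"gen", "sg"}), frozenset({"diminutive"})),
    ]),
]

fiction = define_subcorpus(corpus, genre="fiction")
hits = search(fiction, pos="S", semantics={"diminutive"})
```

Here the search for diminutive nouns runs only over the documents selected by `define_subcorpus`, so the token in the `news` document is never considered.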
