[1] In computational linguistics, lemmatization is the algorithmic process of determining the lemma of a word based on its intended meaning.
In fact, when used within information retrieval systems, stemming improves query recall accuracy, or true positive rate, when compared to lemmatization.
This is because the purpose of stemming is not to produce the appropriate lemma – that is a more challenging task that requires knowledge of context.
This works well for straightforward inflected forms, but a rule-based system will be needed for other cases, such as in languages with long compound words.
Morphological processing of biomedical text can be more effective by a specialized lemmatization program for biomedicine, and may improve the accuracy of practical information extraction tasks.