Stemming and Lemmatization

  • Stemming and Lemmatization are Text Normalization techniques in Natural Language Processing that are used to prepare text, words, and documents for further text processing.

  • Text normalization sometimes called as Word Normalization

  • Stemming in the process of keeping only the root of the words. It will remove affixes such as -s, -de, -ize

  • PorterStemmer is very simple to use and you can get a higer speed when using this. It is very useful for Information Retrieval environments such as IR Environments for quick recall and getting of search queries.