Regexp Stemmer
from nltk.stem import RegexpStemmer
re_stemmer = RegexpStemmer("ing$|s$|e$|able$", min=7)
words = [
"wheels",
"breaking",
"thrones",
"breakable"
]
words
['wheels', 'breaking', 'thrones', 'breakable']
result = [re_stemmer.stem(word) for word in words]
result
['wheels', 'break', 'throne', 'break']
As the minium length of the string is 7 in the RegexStemmer, ‘wheels’ is not stemmed properly