Skip to content

## Add PorterStemmer support to TextProcessor#46

Open
agentksimha wants to merge 6 commits intohumanai-foundation:mainfrom
agentksimha:refactor/added-optional-stemming
Open

## Add PorterStemmer support to TextProcessor#46
agentksimha wants to merge 6 commits intohumanai-foundation:mainfrom
agentksimha:refactor/added-optional-stemming

Conversation

@agentksimha
Copy link
Copy Markdown

Adds stem_text() using NLTK's PorterStemmer and a stemming: bool = False flag to preprocess() as a faster alternative to lemmatization. The two are mutually exclusive — combining them lets the stemmer undo the lemmatizer's precise output. PorterStemmer is eagerly initialized; it's rule-based with negligible cost.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants