removeStopWords
Remove stop words from documents
Syntax
Description
Words like "a", "and", "to", and "the" (known as stop words) can add noise to data. Use this function to remove stop words before analysis.
The function supports English, Japanese, German, and Korean text. To learn how to use
removeStopWords for other languages, see Language Considerations.
removes the stop words from the newDocuments = removeStopWords(documents)tokenizedDocument array
documents. The function, by default, uses the stop word list given by
the stopWords function according to the language details of
documents and is case insensitive.
To remove a custom list of words, use the removeWords function.
removes stop words with case matching the stop word list given by the
newDocuments = removeStopWords(documents,'IgnoreCase',false)stopWords function.
Tip
Use removeStopWords before using the
normalizeWords function as removeStopWords uses
information that is removed by this function.
Examples
Input Arguments
Output Arguments
More About
Algorithms
Version History
Introduced in R2018b