Length of documents in document array
N = doclength(documents)
N = doclength(documents) returns the number of tokens in each document in documents.
Find the number of words in an array of tokenized documents. Erase the punctuation characters so they do not get counted as words.
str = [ ...
    "An example of a short sentence."
    "A second short sentence."];
documents = tokenizedDocument(str)

documents =
  2x1 tokenizedDocument:

    7 tokens: An example of a short sentence .
    5 tokens: A second short sentence .
documents = erasePunctuation(documents)
documents =
  2x1 tokenizedDocument:

    6 tokens: An example of a short sentence
    4 tokens: A second short sentence
Find the number of words in each document.
N = doclength(documents)

N = 2×1

     6
     4
documents: Input documents, specified as a tokenizedDocument array.
N: Document lengths, returned as a vector of nonnegative integers. The size of N is the same as the size of documents.
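Because N has the same size as documents, it can be used for logical indexing and simple summary statistics. The following is a minimal sketch, assuming Text Analytics Toolbox is available; the input strings and the variable names nonEmpty, totalTokens, and meanLength are illustrative and do not come from the original example.

```matlab
% Illustrative input (an assumption for this sketch); the last string is empty.
str = [ ...
    "An example of a short sentence."
    "A second short sentence."
    ""];
documents = erasePunctuation(tokenizedDocument(str));

% One length per document; N has the same size as documents.
N = doclength(documents);

% Keep only the documents that contain at least one token.
nonEmpty = documents(N > 0);

% Total token count, and mean length over the non-empty documents.
totalTokens = sum(N);
meanLength = mean(N(N > 0));
```

Indexing with N > 0 works because the logical mask and the document array have matching sizes.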
Introduced in R2017b
See Also: context | doc2cell | joinWords | string | tokenizedDocument