Natural language processing
Statistical distance
Multivariate statistics
Linear algebra
Document-term matrix
Similarity measure
Microsoft Word
Word count
HP2
Euclidean vector