To find patterns and relationships in text, we can use the same techniques as for numeric data sets. The input is the “document term matrix” we created during text preparation. Useful techniques include:
- k-means clustering – in R: kmeans
- find frequent words – in R: findFreqTerms
- plot frequent terms and their relationships – in R: plot with a correlation threshold (corThreshold)
- plot word cloud of the frequent words – command “wordcloud”
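
The techniques listed above can be tried together in a minimal R sketch. It assumes the `tm` package is installed; the sample sentences, the thresholds and the number of clusters are made up for illustration only. The two plotting steps are optional and assume the `Rgraphviz` and `wordcloud` packages, so they are skipped when those packages are missing or the session is not interactive.

```r
library(tm)

# Hypothetical sample documents; in practice use the corpus from text preparation
docs <- c(
  "text mining finds patterns in text",
  "clustering groups similar documents",
  "word clouds show frequent words",
  "text clustering uses a document term matrix",
  "frequent words describe text documents"
)
corpus <- VCorpus(VectorSource(docs))
dtm <- DocumentTermMatrix(corpus)  # rows = documents, columns = terms

# Frequent words: terms that occur at least twice across the corpus
frequent <- findFreqTerms(dtm, lowfreq = 2)

# k-means clustering on the matrix rows; 2 clusters is an arbitrary choice
set.seed(42)
clusters <- kmeans(as.matrix(dtm), centers = 2)

# Plot frequent terms and their relationships (needs Rgraphviz)
if (interactive() && requireNamespace("Rgraphviz", quietly = TRUE)) {
  plot(dtm, terms = frequent, corThreshold = 0.1)
}

# Word cloud of the frequent words (needs the wordcloud package)
if (interactive() && requireNamespace("wordcloud", quietly = TRUE)) {
  freq <- sort(colSums(as.matrix(dtm)), decreasing = TRUE)
  wordcloud::wordcloud(names(freq), freq, min.freq = 2)
}
```

After running it, `frequent` holds the frequent terms and `clusters$cluster` holds the cluster number assigned to each document. On a real corpus the thresholds and the number of clusters need tuning.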