Introduction
The emergence of text analytics
()
1. Introduction to Text Mining
Purpose
()
Document
()
Corpus
()
R text processing libraries
()
Setting up the environment
()
2. Corpus in R
PCorpus and VCorpus
()
Reading files with CorpusReader
()
Exploring the corpus
()
Persisting the corpus
()
3. Text Cleansing and Extraction
Setup for processing
()
Cleansing text
()
Stop word removal
()
Stemming
()
Managing metadata
()
4. TF-IDF
Introduction to tf-idf
()
Generating term frequency matrix
()
Improving term frequency matrix
()
Plotting term frequency
()
Generating tf-idf
()
5. N-Grams
N-grams concepts
()
Using RWeka NGramTokenizer
()
Creating an n-gram text frequency matrix
()
Extracting n-gram pairs
()
6. Best Practices
Storing text
()
Processing text data
()
Scalability
()
Ex_Files_Text_R_EssT.zip
(1.0 MB)