Text analytics(1): Text Preprocessing
Pipeline Model of Text Interpretation
The steps of text preprocessing
1.Language identification
2.Tokenization
3.Morphological analysis (simplest form: stemming)
4.Sentence splitting
5.Part of speech (POS) tagging
6.Parsing