I have long, complex texts which I want to classify to categories such as psychology, history etc.
What processes would you recommend to use? Eg. tokenization, n-grams etc.
Thank you