Hi to everybody,
after a classical process documents where i create a word-vector (TF-IDF) , is possible to select for each document only the terms (attributes) whose sum of the values of tf-idf rapresents the upperf half of the total sum of tf-idf of the same document , or other percentual? Maybe i have to set a weight operator but i don't know which and how. I need it to reduce the number of attributes.
Thank you all!