🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

IDF Calculation for Test Set

smjsmj1User: "smjsmj1"
New Altair Community Member
Updated by Jocelyn
Can anyone explain the calculation of IDF value for Test sets?
Is it based on the IDF of Training sets?
I see that test set take only the word list used by the training set and IDF is Calculated solely based on the test set. So, if Test set contain only 1 document, then there is a chance that IDF becomes 0, correct?

Find more posts tagged with

Sort by:
1 - 2 of 21
    frasUser: "fras"
    New Altair Community Member
    If you are using TF-IDF you must store model _and_ wordlist after training.
    To test or score unseen data you have to preprocess with exactly the same
    "Process Documents"-Operator that you used for training including the wordlist.
    smjsmj1User: "smjsmj1"
    New Altair Community Member
    OP
    Thank you for the reply