🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

Error when applying a trained model to a new unlabeled data set

User: "Stann"
New Altair Community Member
Updated by Jocelyn
I want to apply a Naive Bayes model to a new (unlabeled) data set. The model has already been trained and tested via cross-validation. However when I try to apply the model to a brand new data set I get an error message.

Here is an overview of my process and the error I get:


The "Retrieve aggregate" is the new (unlabeled) data set, which I want to predict using my trained model.

"Process Documents from Data" contains a "Tokenize" operator.

The subprocesses within the Cross Validation operator are:


I am new to RapidMiner and I have no clue as to why I get this error :(
I would greatly appreciate your help as I need to carry on with my research :)
Sort by:
1 - 1 of 11
    User: "lionelderkrikor"
    New Altair Community Member
    Accepted Answer
    @Stann,

    Yes it is possible :

    As said apply the same preprocessing steps in your test set "branch"

    and connect the word output (wor) of Process Documents from Data  operator of your training "branch" to the word input (wor) of your Process Documents from Data of your test set branch.

    Regards,

    Lionel