Error when applying a trained model to a new unlabeled data set

New Altair Community Member

May 9, 2021

Updated Nov 5, 2024 by Jocelyn

I want to apply a Naive Bayes model to a new (unlabeled) data set. The model has already been trained and tested via cross-validation. However, when I try to apply the model to the brand-new data set, I get an error message.

Here is an overview of my process and the error I get:

The "Retrieve aggregate" operator loads the new (unlabeled) data set, for which I want to generate predictions using my trained model.

"Process Documents from Data" contains a "Tokenize" operator.

The subprocesses within the Cross Validation operator are:

I am new to RapidMiner and I have no clue as to why I get this error.

I would greatly appreciate your help, as I need to carry on with my research.


lionelderkrikor

New Altair Community Member

Accepted Answer

May 10, 2021

@Stann,

Yes, it is possible:

As mentioned, apply the same preprocessing steps in your test set "branch",

and connect the word list output (wor) of the Process Documents from Data operator in your training "branch" to the word list input (wor) of the Process Documents from Data operator in your test set "branch". That way the new documents are converted into the same word attributes the model was trained on.
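To illustrate why the shared word list matters, here is a minimal sketch in plain Python (not RapidMiner code). It assumes simple word counts rather than whatever vector type is configured in your operator, and the names `tokenize`, `build_wordlist` and `vectorize` as well as the sample sentences are hypothetical, chosen only for this illustration. The word list is built once from the training documents and then reused for the new documents, so both sets end up with identical columns.

```python
import re
from collections import Counter


def tokenize(text):
    # Lowercase the text and keep runs of letters as tokens.
    return re.findall(r"[a-z]+", text.lower())


def build_wordlist(documents):
    # The word list is derived from the TRAINING documents only.
    return sorted({token for doc in documents for token in tokenize(doc)})


def vectorize(document, wordlist):
    # One column per word in the shared word list; words that are
    # not in the list (e.g. words seen only in new data) are ignored.
    counts = Counter(tokenize(document))
    return [counts[word] for word in wordlist]


# Hypothetical example data.
train_docs = ["good movie", "bad movie"]
new_docs = ["good plot", "bad bad acting"]

wordlist = build_wordlist(train_docs)
train_vectors = [vectorize(doc, wordlist) for doc in train_docs]
new_vectors = [vectorize(doc, wordlist) for doc in new_docs]

print(wordlist)       # ['bad', 'good', 'movie']
print(new_vectors)    # [[0, 1, 0], [2, 0, 0]]
```

If the new documents were tokenized with their own, independently built word list, they would get columns such as "plot" and "acting" that the model has never seen, and would lack "movie", which is the kind of attribute mismatch that typically causes Apply Model to fail.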

Regards,

Lionel
