A program to recognize and reward our most engaged community members
Is there a way to specify in the dataset that certain variables are positively correlated so such rules will never be examined?
I am however interested in the cases where we can classify, so the individual rules that provide strong evidence for a given outcome. Cross validation in rapid miner, as I have used it, performs poorly because it is based on trying to classify everything. I would however like to see how well the best individual rules perform on unseen data instead of how well the entire rule set performs on unseen data, is this possible?