"Help - Clustering?"
I'm very new to this data mining lark, so apologies in advance.
I have an example set containing only "yes" data, and I have been asked to score records in a new example set based on their similarity to records in the "yes" set. I don't really know what I'm doing, but I have a feeling clustering might be involved somehow. So far, all I have done is create clusters from the "yes" set and then label each new record with a prediction of which cluster it would fall into.
That's not quite what I'm after; the desired result is to give each record a label from 1 to 10 indicating how closely it matches the "yes" set.
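To make the goal a bit more concrete, here is a rough Python sketch of the kind of scoring I have in mind. It is only a guess at an approach, not something I know to be right: it assumes every attribute is numeric and on a comparable scale, it uses plain Euclidean distance to the nearest "yes" record as the similarity measure, and it stretches those distances over the new set to get the 1 to 10 label. The function names and the toy data are made up for illustration.

```python
import math

def nearest_distance(record, yes_set):
    """Euclidean distance from a record to its closest record in the "yes" set."""
    return min(math.dist(record, y) for y in yes_set)

def score_records(new_records, yes_set, levels=10):
    """Give each new record an integer from 1 to `levels`.

    Higher means closer to the "yes" set. Scores are relative to the
    other new records (min-max scaling), which is an assumption of
    this sketch rather than a requirement of the task.
    """
    distances = [nearest_distance(r, yes_set) for r in new_records]
    lo, hi = min(distances), max(distances)
    span = hi - lo
    scores = []
    for d in distances:
        if span == 0:
            # Every record is equally far away; arbitrarily give the top score.
            scores.append(levels)
        else:
            closeness = (hi - d) / span  # 1.0 for the nearest, 0.0 for the farthest
            scores.append(1 + min(levels - 1, int(closeness * levels)))
    return scores

# Toy data, purely illustrative.
yes_set = [(0.0, 0.0), (1.0, 1.0)]
new_records = [(0.0, 0.0), (5.0, 5.0), (10.0, 10.0)]
print(score_records(new_records, yes_set))
```

Because the scores are scaled against the new set itself, a record's label would change if the batch of new records changed; scaling against distances measured within the "yes" set might be a more stable alternative.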
Any pointers would be appreciated.
Thanks,
JEdward