Result of distance operator cannot be reproduced
Hello,
I'm using a distance operator to match some elements, but unfortunately I get strange results that I'm not able to reproduce. In this particular case I use Jaccard similarity for the distance, but other measures give strange results as well.
As far as I understand this method, it takes all attributes from two examples and compares them: an attribute present in both examples is a hit; an attribute present in only one example is a miss. In the end, the distance (or similarity) is calculated as the number of hits divided by the number of attributes used. For example, 6 matching attributes out of 13 attributes gives 6/13 ≈ 0.46. I've debugged my process and checked the cases manually, but I don't get any result from RapidMiner that corresponds to this calculation. There are cases with no hits out of x attributes that get a result in the range of 0.35 to 0.4, and other cases with one hit out of x where the given distance (or similarity) is lower than in the cases before.
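To make the calculation I expect concrete, here is a minimal Python sketch of the textbook Jaccard similarity on sets of attributes. The function name `jaccard_similarity` and the attribute names `att1` to `att13` are made up for illustration, and this only reflects my understanding of the measure, not necessarily what RapidMiner computes internally.

```python
def jaccard_similarity(a, b):
    """Textbook Jaccard similarity: |A intersect B| / |A union B|."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        # Assumption: treat two empty attribute sets as having similarity 0.
        return 0.0
    return len(a & b) / len(union)


# Hypothetical examples: 13 attributes used in total, 6 of them shared.
example_1 = {f"att{i}" for i in range(1, 10)}   # att1 .. att9
example_2 = {f"att{i}" for i in range(4, 14)}   # att4 .. att13

similarity = jaccard_similarity(example_1, example_2)
print(round(similarity, 2))      # 6 hits / 13 attributes
print(round(1 - similarity, 2))  # corresponding Jaccard distance
```

Under this definition, two examples with no hit at all would get a similarity of exactly 0, which is why values such as 0.35 or 0.4 for those cases look wrong to me.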
I'm totally confused about this. Does anybody know about this strange behaviour and how to solve it?
Regards
Mario