"Preprocessing Data for Decision Tree (Weights)"
Hi,
I have a special problem because of the characteristics of my data. The attributes are:
- ID (I declared as ID)
- contact (nominal and declared as regular)
- product (nominal and declared as regular)
- execution (nominal and declared as label)
- quantity (numerical and declared as weight)
The data covers all possible combinations of contact, product and execution, if the combination doesn't exist, the quantity is zero, if the quantity is 300, then this case appeared 300 times (in reality but not in the datasheet). So it isn´t leading to the desired results, when i build a decision tree or some rules. I tried to declare the quantity-attribute as weight, but seemingly it isn´t the right way. Can someone tell me, how to weight the data correctly?
Thanks a lot!