How to manually give attribute weight
Hi to all,
I'm using a categorical dataset with 20 attributes and analyzed it successfuly with ID3 deision tree. What I'm trying do to for next step is to give attribute some weights so these higher weighted attributes to appear in first levels (nearest too root) of the tree. Can you help me about this? My biggest problem is I could not find an operator that will allow me to give attribute weights manually
I'm using a categorical dataset with 20 attributes and analyzed it successfuly with ID3 deision tree. What I'm trying do to for next step is to give attribute some weights so these higher weighted attributes to appear in first levels (nearest too root) of the tree. Can you help me about this? My biggest problem is I could not find an operator that will allow me to give attribute weights manually
Find more posts tagged with
Sort by:
1 - 4 of
41

Have you tried the "Weight by User Specification" operator? Here is an example (the weight-operator is inside the decision tree operator):
I have a similar request. in my case, I am looking to create a sentiment analysis model for user comments but do not have enough training data to use that, so I am using a list of words with sentiment weights from -5 to +5 (it's the AFINN database). I have a document term matrix so that the terms occurring in each user comment are all listed - in my case it is term frequency.
I wish to multiply these frequencies by the weight given to each term, eg if the document reads 'This was dire, a complete failure' I have the word 'dire' appearing once and also 'failure' appearing once. 'Dire' scores -3 and 'failure' scores -2, so the score for that comment would be 1*-3 plus 1*-2 = -5.
I can't see how to do that - the Weight by User Specification operator requires each word to be entered separately and there are about 2500 words in my AFINN database so that's no good...
many thanks
David Hampton
I wish to multiply these frequencies by the weight given to each term, eg if the document reads 'This was dire, a complete failure' I have the word 'dire' appearing once and also 'failure' appearing once. 'Dire' scores -3 and 'failure' scores -2, so the score for that comment would be 1*-3 plus 1*-2 = -5.
I can't see how to do that - the Weight by User Specification operator requires each word to be entered separately and there are about 2500 words in my AFINN database so that's no good...
many thanks
David Hampton