Find more posts tagged with
Sort by:
1 - 8 of
81
i worte a fast implementation for that on the combine documents operator sourcecode, which seems to be working, any comments?
Hi,
we have thought about this and think it is a good idea in general. However, assuming that you have something like "label_weight_0.7" in the annotations looks a bit weird. We should at least have a weight meta data or something similar that does not require this parsing operation. How are you constructing this string in your case?
Best,
Simon
we have thought about this and think it is a good idea in general. However, assuming that you have something like "label_weight_0.7" in the annotations looks a bit weird. We should at least have a weight meta data or something similar that does not require this parsing operation. How are you constructing this string in your case?
Best,
Simon
Hi Simon,
doing the weighting within the label was the easiest way for me to integrate it in my program.
Of which string are you talking about?
if you are talking about the string for the label than it goes like that:
first a bit context:
i want to cluster webservices, and for that i have documents related to the service. as not every document has the same importance, i have to weight them.
now how i build the label name:
the prefix is allways the service id, then i have "_weight_" and then i have a weight value like 0.5
e.g.: SMSService01_weight_0.5
all the best,
simon
doing the weighting within the label was the easiest way for me to integrate it in my program.
Of which string are you talking about?
if you are talking about the string for the label than it goes like that:
first a bit context:
i want to cluster webservices, and for that i have documents related to the service. as not every document has the same importance, i have to weight them.
now how i build the label name:
the prefix is allways the service id, then i have "_weight_" and then i have a weight value like 0.5
e.g.: SMSService01_weight_0.5
all the best,
simon
Hi Simon,
thanks for clarifying this. Aytually I was thinking about which operator you are using to construct these strings. Is it an RM operator or your own implementation?
Do you agree that this concatenation of strings is not the most elegant solution if we want to incorportate it into the release?
Best,
Simon
thanks for clarifying this. Aytually I was thinking about which operator you are using to construct these strings. Is it an RM operator or your own implementation?
Do you agree that this concatenation of strings is not the most elegant solution if we want to incorportate it into the release?
Best,
Simon
Hi Simon,
The string is not constructed by a rapidminer operator, but by my own code, where im setting the labelnames of create document operators.
But i agree with you that for a release there should be a more elegant/general way. Maybe a metadata which can be set for every document as you mentioned in your previous post.
This was just a quick n' dirty coding which fit into my own implementation. Nevertheless also i would appreciate, if this comes into a release, that one can handle this by metadata for instance.
all the best,
Simon
The string is not constructed by a rapidminer operator, but by my own code, where im setting the labelnames of create document operators.
But i agree with you that for a release there should be a more elegant/general way. Maybe a metadata which can be set for every document as you mentioned in your previous post.
This was just a quick n' dirty coding which fit into my own implementation. Nevertheless also i would appreciate, if this comes into a release, that one can handle this by metadata for instance.
all the best,
Simon