Hi together,
what did once work with RapidMiner 4.3 unfortunately doesn't work with version 4.4 any more. I'm talking about the following scenario:
(Note: when i speak of RapidMiner 4.4 i actually mean the most recent CVS version as i am unable to install the officially released 4.4 version as it wants to update the already installed 4.3 version which i need.)
- time series data read with ExampleSource (1 example, 4000 attributes, you might know it: sales_series.aml/dat)
- immediately after this: Series2WindowExamples with series_representation = encode_series_by_attributes, horizon = 1, window_size = 100, step_size = 5
In RapidMiner 4.3 a windowed ExampleSet was generated consisting of 100 attributes (window_size) and 1 label, which is obviously needed for training. RapidMiner 4.4 on the other hand does not produce the label attribute with these settings. Taking a look into UnivariateSeries2WindowExamples.createLabel() reveals:
if (representation == SERIES_AS_EXAMPLES) {
Attribute seriesAttribute = exampleSet.getAttributes().iterator().next();
int valueType = seriesAttribute.getValueType();
return AttributeFactory.createAttribute("label", valueType);
} else {
return null;
}
As i have representation == SERIES_AS_ATTRIBUTES it is indeed correct that no label will be created. Now the code lets me believe that there is a good reason for this behavior which i might miss. I do remember that the help recommends encoding the series as examples in favor of efficiency with respect to the memory usage. Maybe with version 4.4 RapidMiner forces me to use another encoding by simply omitting the label attribute now? ;-) I mean, it is absolutely no problem to transpose the data prior to using Series2WindowExamples, i just wanted to know what's behind all this. Maybe i am really missing some important point here.
Thanks in advance,
Oliver