Process Documents Output
Hyram
New Altair Community Member
Hi. The output of this operator when pre-processing text, is an example set with a number of special and regular attributes. When I try to apply other operators to this output at a later stage, RapidMiner seems to ignore all the regular attributes whilst keeping special attribute (text) i.e. the data table has only one attribute (text) and the regular attributes are ignored. In the first printout, one can see that regular attributes are ignored. On the second printout, looking at the results output, I can see the regular attributes - in fact it states there are 2906 regular attributes.
Why is this and how can I ensure that the attributes are kept in the table for processing at a later stage?
Why is this and how can I ensure that the attributes are kept in the table for processing at a later stage?
Tagged:
0
Best Answer
-
The regular attributes there are the ones created by the word vector. Unforunately it drops any other regular attributes you might have. The best solution is to join them back in after the text processing (make sure your examples have an id and then the subsequent join will be easy).5
Answers
-
The regular attributes there are the ones created by the word vector. Unforunately it drops any other regular attributes you might have. The best solution is to join them back in after the text processing (make sure your examples have an id and then the subsequent join will be easy).5
-
Great @Telcontar120. Thank you!
1