"Text Mining / Process documents"

Question

Hello, I'm trying to make a text classification modeling and I'm using the following processes: 1)Read Excel 2)Data to documents 3)Process documents 4)Validation The problem a get is in the validation process. I receive a mensage tha i don't have a label atribbute althought I have the following structure in my data:: Column1 Column2 Column3 ROW 1 ID (ID) attribute (text) LABEL(binomial) To make a test I've put a SELECT ATRIBUTES operator after the Read Excel and I'm able to see the tree atributes. When I use the Select atributes operator after the Process Documents operator i canot select any atribute (the same occurs when i put after the Data to documents operators). ?xml version="1.0" encoding="UTF-8" standalone="no"?> Thank you

MariusHelf · Answer

First of all you are using RapidMiner 5.2.8 - this version is very old, please upgrade to the latest version 5.3.12.

Do you know breakpoints? If you set a breakpoint on an operator, the process stops after that operator such that you can investigate the results.

If Select Attributes does not show the attributes you want to select, you can simply type them into the field with the keyboard.

To solve your problem you could try to remove the connection that comes in from the left to the Read Excel operator, and select the file you want to import via the wizard (if you have not done so already).

Good luck!

~Marius