🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

[SOLVED] Data ordering

User: "rosana"
New Altair Community Member
Updated by Jocelyn
Good afternoon,

I’m a new user of RapidMiner. I would like to build the following process with RapidMiner and the data ordering is being quite complicated for me.
I explain bellow what I would like to do:

1. The final goal of my RapidMiner process is to plot curves and correlation matrix between different attributes over a period of time.

2. I will explain later the source of the data. I would like to obtain an “example set” with the following structure:

[move][/move]Pressure Flow Temperature
dd/mm/yy hh:mm P1 F1 T1
dd/mm/yy hh:mm P2 F2 T2
dd/mm/yy hh:mm P3 F3 T3
…. …. … ….
(table 1)

From this table I would be able to plot curves, correlation matrix, etc.

3. The problem is that I don’t have the data stored in such a simple format.
4. I have the data stored in .txt files. (>1000 rows per file). I can read it from the operator “Read CSV”.
5. But the data are stored as follows:

dd/mm/yy hh:01 Pressure P1
dd/mm/yy hh:02 Flow F1
dd/mm/yy hh:02 Pressure P1
dd/mm/yy hh:03 Temperature T1
dd/mm/yy hh:03 Temperature T1
…. …. ….
(table 2)

So, a very different order to what I would like to have.

6. I need to build an “example set” reordering the data (from table 2 to table 1). How can I do that?

7. Besides that I would like to reduce the final number of examples considering only an example per every 5 minutes instead of per minute. Therearefore I would need to have an example set whose examples are averages of several examples included in another “example set”. Again, I dind’t find any operator to do that.

I hope I explained my problem clearly.

Thank you very much for your help.
Best regards! Rosana

Find more posts tagged with