Split column value with fixed and dynamic values
Hello,
I have a column, 'col', that contains data in the following format.
rownum col
1 lab 1 100
2 lab 1 200|lab3 910
Each value is a pipe delimited name value pair. Names are a fixed set of labels that could contain space, such as 'lab 1'. Values such as 910 and 100 are dynamic.
I'd like to get the data into the format below. Can you please advise how to achieve this? I am looking at operators such as split value and tokenizer, but neither seems to handle this out of the box.
rownum lab 1 lab3
1 100 ?
2 200 910
Thanks very much!
TS
Answers
-
Split Values first on the pipe, and then again on the space. That should give you 4 attributes (although some will be empty). Then you can De-Pivot to change to the structure you want (take a look at this helpful recent thread for some example processes using De-Pivot: https://community.rapidminer.com/t5/RapidMiner-Studio-Forum/Pivot-date-time-columns-into-a-certain-way-EventLog-using-Turbo/m-p/53790#M33295
1