Preprocessing grouped data
Dear All,
Is it possible to pre-process groups of data points differently then other groups?
For example, a dataset with 2 groups (ordered by Object ID), with 4 data points in each group.
GroupID, Date, Red, Blue, Class
0000001, 12-4, 113, 122, 0
0000001, 13-4, 114, 122, 0
0000001, 14-4, 112, 121, 1
0000001, 15-4, 113, 122, 0
0000002, 12-4, 119, 122, 0
0000002, 13-4, 133, 122, 0
0000002, 14-4, 100, 121, 1
0000002, 15-4, 114, 122, 0
Is it possible to discretise the attributes red and blue into {High, Medium, Low} ordered by group?
H = "greater then group_mean + group_standard_deviation"
M = "in between group_mean +- standard_deviation"
L = "smaller then group_mean - standard_deviation"
So the result would be:
0000001, 12-4, M, M, 0
0000001, 13-4, H, M, 0
0000001, 14-4, L, L, 1
0000001, 15-4, M, M, 0
0000002, 12-4, M, M, 0
0000002, 13-4, H, M, 0
0000002, 14-4, L, L, 1
0000002, 15-4, M, M, 0
Best regards,
Wessel