Question about group-wise windowing (lagged variable per city)
Hello, everyone
How are you?
I have one question.
My data looks like this:
city / date / amount
NY / 20210401 / 100
NY / 20210402 / 150
NY / 20210403 / 50
NY / 20210404 / 30
LA / 20210401 / 40
LA / 20210402 / 20
LA / 20210403 / 50
Chicago / 20210401 / 30
Chicago / 20210402 / 40
Huston / 20210401 / 30
Huston / 20210402 / 20
Huston / 20210403 / 40
....
....
....
The data contains over a thousand cities.
I want to create a new variable (previous amount) in the data above, like this:
city / date / amount / previous amount
NY / 20210401 / 100 / NA
NY / 20210402 / 150 / 100
NY / 20210403 / 50 / 150
NY / 20210404 / 30 / 50
LA / 20210401 / 40 / NA
LA / 20210402 / 20 / 40
LA / 20210403 / 50 / 20
Chicago / 20210401 / 30 / NA
Chicago / 20210402 / 40 / 30
Huston / 20210401 / 30 / NA
Huston / 20210402 / 20 / 30
Huston / 20210403 / 40 / 20
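That is, I want to generate a lagged variable for each city: each row should carry that city's amount from the previous date, and the first date of each city should be NA.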
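To make the intended logic concrete, here is a minimal sketch in plain Python. It is only an illustration of the result I am after, not a RapidMiner process. The function name add_previous_amount is made up for this example, and it assumes the dates are YYYYMMDD strings with at most one row per city and date. It goes through the data once and remembers the last amount seen for each city, so it does not need a separate loop or filter per city.

```python
def add_previous_amount(rows):
    """Return copies of rows with a per-city lagged 'previous amount' field.

    Assumes each row is a dict with 'city', 'date' (YYYYMMDD string)
    and 'amount'. None plays the role of NA.
    """
    last_amount = {}  # city -> amount on that city's most recent earlier date
    lagged = [None] * len(rows)

    # Visit rows in date order (YYYYMMDD strings sort chronologically),
    # but remember each row's original position so input order is kept.
    for i in sorted(range(len(rows)), key=lambda i: rows[i]["date"]):
        city = rows[i]["city"]
        lagged[i] = last_amount.get(city)  # None for the city's first date
        last_amount[city] = rows[i]["amount"]

    return [{**row, "previous amount": prev} for row, prev in zip(rows, lagged)]


# A few rows from the sample data above.
rows = [
    {"city": "NY", "date": "20210401", "amount": 100},
    {"city": "NY", "date": "20210402", "amount": 150},
    {"city": "NY", "date": "20210403", "amount": 50},
    {"city": "LA", "date": "20210401", "amount": 40},
    {"city": "LA", "date": "20210402", "amount": 20},
]

result = add_previous_amount(rows)
for row in result:
    print(row["city"], row["date"], row["amount"], row["previous amount"])
```

I am looking for a way to get the same single-pass, per-city behavior out of RapidMiner operators.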
So I used the operators "Loop by value", "Filter example" and "Windowing",
but this requires a huge amount of memory and is very slow.
Could you please help me with this task?
How can I do this using RapidMiner operators?
Thank you in advance
