I need help with predicting an attribute
Hallo all,
I have a dataset that contains the yearly gas use 2010- 2018, temperature from 2010 - 2018, the provinces in the country that i want to analyze, and the speed of wind. What i want to do know is to predict what the usage of gas will be in the future. When i observe the data set it is clearly that the usage of gas is getting less by year and that there is correlation between temperature and gas usage. I tried to get an decision tree but it wont work for some reason.
Can someone help me with predicting the gas usage?
Thanks in advance
I have a dataset that contains the yearly gas use 2010- 2018, temperature from 2010 - 2018, the provinces in the country that i want to analyze, and the speed of wind. What i want to do know is to predict what the usage of gas will be in the future. When i observe the data set it is clearly that the usage of gas is getting less by year and that there is correlation between temperature and gas usage. I tried to get an decision tree but it wont work for some reason.
Can someone help me with predicting the gas usage?
Thanks in advance
Find more posts tagged with
Sort by:
1 - 13 of
131
Hi @ramos213,
Have you tried to connect the in port to the Generate Macro operator input port ?
If the issue is still present after doing this connection, please share your data and your process in order we can reproduce and understand your issue.
Regards,
Lionel
Have you tried to connect the in port to the Generate Macro operator input port ?
If the issue is still present after doing this connection, please share your data and your process in order we can reproduce and understand your issue.
Regards,
Lionel
Hello @ramos213
The input of cross-validation is an "example set". I guess you are using a wrong input to cross-validation operator.
The input of cross-validation is an "example set". I guess you are using a wrong input to cross-validation operator.
Hello @ramos213
Can you provide us with the data and your process? You can download your process by going to FILE --> Export Process and then attach here in the thread with data set.
Can you provide us with the data and your process? You can download your process by going to FILE --> Export Process and then attach here in the thread with data set.
@ramos213
You have only 9 examples in your initial example set and 4 examples after windowing !!!
Thus you can not perform a 10-folds cross -validation (there are no enough examples).
You can not build a reliable model with so few data : You have to increase significantly the size of your training set....
Regards,
Lionel
PS : if you want absolutly a working process, set the k-folds parameters of CV to k = 4.
You have only 9 examples in your initial example set and 4 examples after windowing !!!
Thus you can not perform a 10-folds cross -validation (there are no enough examples).
You can not build a reliable model with so few data : You have to increase significantly the size of your training set....
Regards,
Lionel
PS : if you want absolutly a working process, set the k-folds parameters of CV to k = 4.
Yes , in deed the error is due to the fact that you have duplicate years values in your initial dataset.
You have 27 examples in your initial dataset for your 3 provinces, thus you have 9 examples for each province after splitting according the provinces :
That's you did and you did the right thing but ...I have to insist : 9 examples is no enough to build a relevant and reliable model.
Try to increase the size of your dataset by finding the variables values before 2010 (for example) ....
Thanks you for your understanding,
Regards,
Lionel
You have 27 examples in your initial dataset for your 3 provinces, thus you have 9 examples for each province after splitting according the provinces :
That's you did and you did the right thing but ...I have to insist : 9 examples is no enough to build a relevant and reliable model.
Try to increase the size of your dataset by finding the variables values before 2010 (for example) ....
Thanks you for your understanding,
Regards,
Lionel
Sort by:
1 - 2 of
21
Hi @ramos213,
Have you tried to connect the in port to the Generate Macro operator input port ?
If the issue is still present after doing this connection, please share your data and your process in order we can reproduce and understand your issue.
Regards,
Lionel
Have you tried to connect the in port to the Generate Macro operator input port ?
If the issue is still present after doing this connection, please share your data and your process in order we can reproduce and understand your issue.
Regards,
Lionel
This seems like you are trying to forecast future usage of gas. If you don't have any future data without labels then you need to go for forecasting related to time series data. Here is a link that helps you understand how time series analysis works.
https://rapidminer.com/resource/time-series-analysis/
Do let us know if this helps. If not, please clearly inform, how your data looks and do you have any unlabelled data to predict and the way you are building models.