How to use Polynomial Regression in rapidminer correctly

Question

Hello, everyone. This is my first forum post asking questions
about polynomial regression in rapidminer.

The original data is：x:4194.06 3466.45 
2070.08   874.98  corresponding to   y：91540.07 
109460.36  120338.64  102182.19

As shown in the first flow, the first result expression is
obtained by using the polynomial regression operator.

I want to ask why the results of the two processes are not
the same, the original data presents a quadratic nonlinear relationship, and
why the quadratic expression cannot be made by polynomial regression.

Thanks you very much!

rookie · Answer

hi @yyhuang，
       Sorry in advance, I don't know how to use the function of this forum.That's why it took so long to reply
         First of all, thank you for your answer 
 . According to your description, I am as the data is too little, and not standardized, to lead to the results out? But these four samples are real data , need the four data to construct a yuan quadratic polynomial, Because nonlinear equations can be converted to linear equations , so I use z instead of x2, I have the linear regression equation. But why do with polynomial regression is not to come out, how do you explain that please？Polynomial regression is there any limit to this operator ？

rookie · Answer

First of all, thank you for your answer <3  . According to your description, I am as the data is too little, and not standardized, to lead to the results out? But these four samples are real data , need the four data to construct a yuan quadratic polynomial, Because nonlinear equations can be converted to linear equations , so I use z instead of x2, I have the linear regression equation. But why do with polynomial regression is not to come out, how do you explain that please？Polynomial regression is there any limit to this operator ？

YYH · Answer

Hi @rookie,

Thanks for sharing the data and process.

If we have got four (4) example and train a polynomial regression, we may fail for the model. So I filled up the gap with interpolation to add more data here. Also this Polynomial regression will not perform well without the normalization...

Process attached here for your reference.

These two models are close but I can not guarantee the polynomial will output similar coefficient without normalization ;)
I would strongly suggest to use GLM with new attribute manually created or attributes from Auto Feature Engineer.

Happy Rapid-Mining and Stay Healthy!

YY