A program to recognize and reward our most engaged community members
haddock wrote:Hi there,It takes a bit of time to see the world from the RM point of view; of course any view is acceptable in a liberal world, but the one you are grappling with comes from Dortmund. In that world models get made from data and are applied to other data; here application means something quite specific, namely that a new prediction column is created, and filled with values for the attribute being modeled ( the label ). What tends to happen is that the model gets applied to lots of new cases at once, as in validation, or its parameters get tweaked as it gets applied to lots of new cases, as in optimisation ploys. So applying a model to just one case is actually quite rare from the modeling point of view, but may be handy from the application point of view. In the latter case you should consider RapidAnalytics, a mighty server that lets the masses play with your delicately engineered models ... ( check out Simon's rather good videos to see this beast at work ).But the bottom line remains that if you really want to get the most out of RM you need to work through the support material, tutorials,samples, videos etc.. it just saves time in the long run. Look at operators as Lego that data goes through and you won't go far wrong!Have fun...
so are you saying if I or the masses would like to play with my delicate model such as asking it what cluster this example falls in etc... I would need to use RA and run the model as a process on the RA server? Is there no way how to accomplish that from RM?
It takes a bit of time to see the world from the RM point of view; of course any view is acceptable in a liberal world, but the one you are grappling with comes from Dortmund. In that world models get made from data and are applied to other data; here application means something quite specific, namely that a new prediction column is created, and filled with values for the attribute being modeled ( the label ).What tends to happen is that the model gets applied to lots of new cases at once, as in validation, or its parameters get tweaked as it gets applied to lots of new cases, as in optimisation ploys. So applying a model to just one case is actually quite rare from the modeling point of view, but may be handy from the application point of view. In the latter case you should consider RapidAnalytics, a mighty server that lets the masses play with your delicately engineered models ... ( check out Simon's rather good videos to see this beast at work ).But the bottom line remains that if you really want to get the most out of RM you need to work through the support material, tutorials,samples, videos etc.. it just saves time in the long run.Look at operators as Lego that data goes through and you won't go far wrong!
one cannot do inference, classification and the like on single input instance under a learned model in a straight forward fashion.
I do not know how to do it , I do not want to learn the models params every single time I want to load it the learned model and submit to it a new example.
Ingo Mierswa wrote:Hi,(EDIT: Ok, Haddock had answered while I was typing. Anyway, find my answer below...)I don't know really what you are talking about but I will try it in another way (although Haddock gave a perfect concise description of the basic concepts):Of course it is possible to create predictions for fresh and unseen data (which is called "apply a model" in the RM-world) with RapidMiner as well as with RapidAnalytics. The latter might be by far more appropriate if you want to integrate those predictions into other software but still it's also possible with RM directly.The basic idea is: load the training data learn the model and tune its parametersstore the trained model for later useload new dataapply the model on this new data with the operator "Apply Model" (by the way, it's called 'apply model' instead of 'create prediction' since there are also (preprocessing) models which do not create a prediction but still can be applied for data transformation...) The following resources should help you: The Manual: The samples which are delivered with RapidMiner in your first repository (especially 01_Learner/11_ModelApplier) The Video Tutorials (especially tutorial No 3): http://rapid-i.com/content/view/189/212/RapidMiner Resources: http://rapidminerresources.com/ Cheers,Ingo