-
Beginner question regarding train / test set
First of all: I am a total beginner in data science. For my university project, I need to create a process in RapidMiner that predicts customer satisfaction based on a survey. The dataset can be obtained from Kaggle by searching for "Airline Passenger Satisfaction" by TJ Klein (cannot post links yet). I get a train and…
-
Logic based Rule/Decision Tree/Flowcharts in RapidMiner
Hi, I want to know whether RapidMiner has operators that let me build decision trees using basic operations like {if else}, and maybe even conditions such as {>, <}. I know RM supports learning decision tree models through its operator, but can I manually build such a tree, the way people draw a tree in other…
-
Decision Tree - only one attribute per branch?
Hello, I hope everyone here is doing great! I have a question regarding decision trees. Is it possible to set up the decision tree so that the model uses every attribute just once per branch? I need this for a project for my studies, and it would mean a lot if someone here could help me. Thanks in…
-
Using a decision tree on the Titanic dataset
This may sound like a dumb question, but I am trying to create a decision tree using the Titanic dataset and I always end up failing. I am trying to have male/female separated (which works), and right after that I want the split for both of them to be based on age or passenger class. Instead, it keeps getting different…
-
How do I use Optimize Parameters to find a seed where the minimum class precision is maximized?
I was using the Cross-Validation operator to create a Gradient Boosted Trees model on a small database (419 examples). I had 5 different classes and I wanted to find a seed where the class precision percentage was best distributed. For example, in the image above I had a good average accuracy, but the class precision…
-
Converting Dates for Use in Model to Predict Oil & Gas Drill Timing?
Hi there! I recently completed MIT's "No-Code AI & Machine Learning" certificate, where we used RapidMiner on some simpler models. I'm now trying to put what I've learned to use in my business. I am trying to use certain "key drilling indicators" to predict when a drilling rig will show up on a given tract of land.…
-
Any recommendations on how to share/utilize a decision tree algorithm amongst a team?
I created a decision tree algorithm, but the rest of my team does not use RapidMiner. Any ideas on how to share something like this so it can be used by multiple people?
-
Gradient Boosted Tree doesn't give the final prediction
Hello RapidMiner Community! I want to ask about the Gradient Boosted model that I used for my study on predicting corporate default risk. My dependent variable is default and non-default, and I use 1 for default and 0 for non-default. I already set the data type to binominal. After I call the related operators such…
-
Gradient Boosted Tree doesn't show the dependent variable in the resulting trees
Hello, I want to ask about the Gradient Boosted model that I used for my study on corporate default risk. My dependent variable is default and non-default, and I use 1 for default and 0 for non-default. I already set the data type to binominal for default and non-default. After I call the related operators such…
-
Convert categorical variables into dummy variables
Hi, I want to perform a regression task to predict a continuous response. I have 4 categorical variables; the others are numerical. The categorical variables are: age = (≤20, 21-35, 36-50, ≥51); gender = (Female, Male); income level = (1 = insufficient, 2 = sufficient); BMI range = (1 = <25, 2 = >25). *Income level & BMI are keyed in as numerical codes in my…
-
Decision trees (Multiway): how can I fill in the subprocess?
Hello! How can I fill in the subprocess of Decision trees (Multiway)? I need an example, please. I did it, but I'm not sure...
-
Using C5.0 with Criterion as Information Gain?
I'm trying to learn more about decision trees (DT), and I stumbled upon C5.0 DT. I know that it uses information gain as the attribute selection measure. So I'm asking if I can modify the criterion of the DT operator to Information Gain, instead of the default, to make it use C5.0 DT? And I've read that the DT operator on…
-
Use QUEST, CART, C5.0 algorithms
I want to use Decision Tree algorithms such as QUEST, CART, and C5.0, but there are no such options in the parameters panel of the Decision Tree prediction model.
-
How to do Post-Pruning in RapidMiner
Hi everyone, I’m having trouble with an overfitting decision tree and my teacher suggested using post-pruning to improve accuracy. However, I’m not sure how to apply post-pruning in RapidMiner. Can anyone offer some guidance?
-
Where are the decision tree operator parameters?
Hello, I am currently using an Educational version of RapidMiner, Version 10.1.001, and while designing a decision tree model, I was unable to change the maximal depth or any other parameter of this operator. I have attached a screenshot of the issue. If you could help me out with this, it would be great.
-
Why are attribute names not recognized?
Hello everyone, I'd like to ask why the attribute names in Excel are not automatically recognized here, and why they are not recognized in the "Set Role" step either. Looking forward to your reply. Thank you very much! Charlene
-
How do you reduce variables before doing a decision tree?
Hi! Just wanted some help. 1) Do you need to reduce the number of variables before you run a decision tree analysis? Currently, I have 19. It makes the decision tree hard to read, as I need to go to 12 layers to get the accuracy up. 2) If I use the select attributes for the ones I believe are important after doing some…
-
Criterion for overfitting evaluation
Hello everyone. Have a nice day. I am having some trouble with overfitting. I have been searching for information on the RM Community and other websites. They say that if the accuracy is greater than 90%, I am most probably facing overfitting. My case is below: I have datasets like this: Then I created the process using…
-
How to implement Network Optimization in RapidMiner?
Dear folks, I am Prabakaran, working on building a POV on supply network optimization for a retail chain, for which we need to consider RapidMiner as the data science tool. Meanwhile, we need to perform many POVs in the network optimization context using RapidMiner. I tried to find relevant optimization-related operators in the…
-
Deploy model
Hi! I am a student doing research for the first time. Could you please guide me on how to deploy a model? I have a data set, and I tried following the instructions on this site, but I got an error.
-
How do I create balanced clusters?
Hi guys, I'm pretty new to the community, so sorry if my question seems quite elementary, but how do I create balanced clusters (k-means), meaning that each cluster will have the same number of items in it? Or is there a way to force a minimum cluster size other than 1? (What I am trying to do is to create…
-
How come there is a difference in the Model and the Prediction in a decision tree made by automodel?
Hi everyone, I am training a model for classic churn prediction. I would like to get a decision tree made by this feature. What I don't understand is that it creates a model/tree, but the predictions do not match this tree. As you can see, we have a customer (number 12) who was predicted as 'Kündiger'; he has a LZ Abo bis…
-
Why do the branches of the decision tree differ in thickness?
I created a model using a decision tree.
After that, I checked the output. It seems that a branch is thick if the number of samples after the split is large, and thin if it is small. But I haven't been able to confirm this.
Is this interpretation correct? Thank you.
-
Where did this number come from?
I'm using the operator "Optimize by Generation (Evolutionary Aggregation)" with Decision Tree. Compared with Decision Tree alone, this operator uses only 2 attributes and 1 generated attribute. I want to know how this attribute gets its number from the other 2 attributes. This is before using Optimize by Generation (Evolutionary…
-
Decision Tree Result not showing
When I run it, it only shows the performance vector, attribute weights, and example set; the decision tree result is not shown. Before I used Optimize Selection (Evolutionary), it worked just fine. Here is my model's screenshot. Here is the inside of Optimize Selection (Evolutionary). Here is the inside of Split Validation. Here is my XML <?xml…