predicting float label which depends on polynominal attributes
denmla
Hi there!
I am new to RapidMiner and to data mining. RapidMiner is my first data mining tool and I'm very pleased with it; it is very good for newcomers. But I have some problems and I need an expert to help me.
I have data in Excel that looks like this:
variable X - label, a float variable (student grade, for example 3.73)
variable Y1 - attribute, nominal value (can have 4 values that I've coded as numbers: 1, 2, 3, 4)
variable Z1 - attribute, nominal value (can have 6 values that I've coded as numbers from 1 to 6)
variable Y2 - attribute, nominal value (can have 4 values that I've coded as numbers: 1, 2, 3, 4)
variable Z2 - attribute, nominal value (can have 6 values that I've coded as numbers from 1 to 6)
I want to predict X depending on Y1, Z1, Y2, Z2.
First I thought of using linear regression after converting nominal to binominal (dummy coding), but RapidMiner produced a model in which X depends on only 2 variables instead of all 4 (and it makes no sense that the others have no influence on X).
I've also used the Weka library linear regression, without dummy coding first, and got the same result.
Any help? Can you point me to how to set this up? Are there other algorithms for this problem (the label is a number, but the attributes are polynominal)?
I hope I'm making my problem clear.
Thank you,
Mladen
homburg
Hi denmla.
The reason for this problem is the built-in feature selection of the linear regression methods. By default, M5 prime is used in both cases (RM and Weka). Simply turn it off (RM: feature selection = none, Weka: S = 1.0) and you should receive a model that refers to more than two attributes.
Greetings,
Helge
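For readers who want to see the idea outside RapidMiner, here is a minimal Python sketch of a linear regression on dummy-coded nominal attributes with no feature selection step, so every attribute keeps its coefficients. It is not how RapidMiner or Weka implement their operators. The toy data, the made-up grade formula, and the helper name `dummy_code` are all assumptions for illustration.

```python
import itertools
import numpy as np

def dummy_code(values, levels):
    """Return one 0/1 column per level, skipping the first (reference) level."""
    values = np.asarray(values)
    return np.column_stack([(values == lv).astype(float) for lv in levels[1:]])

# Hypothetical toy data: every combination of Y1, Z1, Y2, Z2 appears once.
levels = {"Y1": [1, 2, 3, 4], "Z1": [1, 2, 3, 4, 5, 6],
          "Y2": [1, 2, 3, 4], "Z2": [1, 2, 3, 4, 5, 6]}
rows = np.array(list(itertools.product(*levels.values())))

# Made-up grade: a base of 2.0 plus a fixed step per attribute level (assumption).
step = np.array([0.20, 0.10, 0.05, 0.02])
grade = 2.0 + (rows - 1) @ step

# Design matrix: intercept column plus the dummy columns of all four attributes.
blocks = [dummy_code(rows[:, j], lv) for j, lv in enumerate(levels.values())]
X = np.column_stack([np.ones(len(rows))] + blocks)

# Ordinary least squares on ALL columns, i.e. no feature selection.
coef, *_ = np.linalg.lstsq(X, grade, rcond=None)

names = ["intercept"] + [f"{name}={lv}" for name, lvs in levels.items() for lv in lvs[1:]]
for name, c in zip(names, coef):
    print(f"{name:>10s}  {c: .3f}")
```

Each nominal attribute with k values contributes k - 1 dummy columns here, because the first value serves as the reference level and is absorbed into the intercept.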
denmla
Hi Helge,
Sorry for the late response; I had some exams and was not around my computer. Your advice helped: RapidMiner managed to output all variables. I have some questions concerning the output, since I'm not sure whether RapidMiner is using dummy coding as binary. I want to calculate residuals. I'll post the result and explain the question better later when I get to my desktop PC.
Mladen
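As a rough illustration of the residual calculation mentioned above: with 0/1 dummy coding, a prediction is the intercept plus the coefficient of whichever level is active, and the residual is the actual value minus that prediction. The sketch below assumes a single attribute Y1 with level 1 as the reference; the intercept, coefficients, and observations are invented numbers, not RapidMiner output.

```python
# Hypothetical fitted model: intercept plus one coefficient per non-reference level.
intercept = 3.0
coef = {"Y1=2": 0.25, "Y1=3": -0.5, "Y1=4": 0.5}

# Hypothetical observations: (Y1 level, actual grade).
observations = [(1, 3.1), (2, 3.0), (3, 2.5), (4, 3.75)]

def predict(y1):
    # Level 1 is the reference level, so it adds nothing beyond the intercept.
    return intercept + coef.get(f"Y1={y1}", 0.0)

# Residual = actual value minus predicted value.
residuals = [actual - predict(y1) for y1, actual in observations]

for (y1, actual), r in zip(observations, residuals):
    print(f"Y1={y1}  actual={actual:.2f}  predicted={predict(y1):.2f}  residual={r:+.2f}")
```

Whether a given tool uses exactly this coding (reference level dropped, 0/1 columns) has to be checked against the model output itself, for example by comparing a hand-computed prediction with the one the tool reports.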