🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

Why does the correlation matrix show the minority class when the majority class was selected

User: "GeezerDoc"
New Altair Community Member
Updated by Jocelyn
I'm running a simple classification model to predict the presence or absence of heart disease based on multiple risk factors. When I run Automodel I specify the class of interest being the "presence of heart disease". After the algorithms have run I took a look at the correlation matrix and the obvious risk factors showed a negative correlation with heart disease. When I expanded the attribute column I now realize that the correlation matrix is based on the "absence of heart disease". That is why the results are counter-intuitive. Any idea why this might be? Thanka

Find more posts tagged with

Sort by:
1 - 5 of 51
    User: "DocMusher"
    New Altair Community Member
    Accepted Answer

    @GeezerDoc

    If you don't mind could you send me the data you are mentioning or is the data available somewhere. From my perspective I am also interested to see what happens and how the correlation matrix looks like. It is important to be able to explain the step or how you should make any interpretation if you want to demonstrate your results to your colleagues.

    Sven Van Poucke, MD, PhD

    User: "GeezerDoc"
    New Altair Community Member
    OP
    Accepted Answer
    I will attach the Heart Prediction File and you can run it to predict "Heart Disease Present" and see what happens. I have also run it with the class as 0,1 but RapidMiner interpreted that as regression so had to tell it to do classification
    User: "GeezerDoc"
    New Altair Community Member
    OP
    Accepted Answer
    @varunm1
    While I don't think your response answered my initial question it may have answered another question that I had and that was related to "one hot encoding". Are you saying that AutoModel automatically uses this technique when it sees categorical data? If that is so, there is not need to convert categorical data using a visual operator before uploading the data to AutoModel. Please elaborate and thanks
    User: "DocMusher"
    New Altair Community Member
    Accepted Answer
    Hi @GeezerDoc
    I used your data with the following process:
    Any feedback is welcome!
    Sven

    User: "MartinLiebig"
    Altair Employee
    Accepted Answer
    @GeezerDoc,

    yes. AM is doing this if the model cannot handle this and the model cannot handle nominal data. This is a very good go-to method to do - thats why AM is doing it.

    Cheers,
    Martin