(SOLVED) Wrong Correlation Matrix Output

ndf-ls
ndf-ls New Altair Community Member
edited November 5 in Community Q&A
Hello,

i just started working with rapidminer and have a problem using a correlation matrix:
My database is an excel sheet. The feed into Rapidminer works, however when i connect a correlation matrix operator next to it, the results in the matrix show many "?" instead of values between -1 and 1. Single cells show actual values, however some of them are outside of the normal -1 to What does this mean? I have 15 attributes, is that to much?!

Thank you very much in advance

Answers

  • MariusHelf
    MariusHelf New Altair Community Member
    Hi,

    15 attributes are definitely not too much. There is either a problem with the process setup, with the data or maybe there is a bug in the Correlation Matrix operator. Can you please post the XML of your process?
    It would also be good if you could provide a sample of your data.

    Best regards,
    Marius
  • ndf-ls
    ndf-ls New Altair Community Member
    The problem is now solved. The reason for the wrong Output is the following:
    The used Database consisted of many constant values for most of the attributes. The "Correlation Matrix"-Operator uses the Pearson Correlation Coefficient. As the standard deviation of this formula is listed in the denominator, the correlation is not defined for constant values. This was the reason for the "?" Output in many of the correlation matrix cells.
    Big thanks to Marius for his great support!
  • Mrudu
    Mrudu New Altair Community Member
    Hi, I am facing similar problem with 15 attributes. You explained the reason, can you please tell the solution also. I am attaching mt process xml file for your reference.
  • sgenzer
    sgenzer
    Altair Employee
    hi @Mrudu this is a thread from 2014. I'd highly recommend starting a new discussion :wink:

    Scott