🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

Continuous and categorical mixed features

User: "Yin"
New Altair Community Member
Updated by Jocelyn
What function can i use to perform PCA on a dataset with mixed continuous and mixed features in rapidminer? I applied PCA on continuous features that have been standardized and left the categorical variables with dummy encoding only without PCA. Is there a dimensionality reduction method (e.g.FAMD) that can be used on such dataset? Thanks in advance.

Sort by:
1 - 4 of 41
    Hi,
    PCA is simply not defined on nominal values. You would need to transform it first to numericals (i.e. using Target encoding).

    Best,
    Martin
    User: "Yin"
    New Altair Community Member
    OP
    Hi, my data is transformed, but i am asking if the PCA in rapid miner can handle categorical variables. Typically speaking, it is not good to use PCA on one-hot encoded variables or categorical dummy encoded ones. There should be a specific function implementation that deals with mixed data and i'm asking if this is already integrated here.
    User: "MartinLiebig"
    Altair Employee
    Accepted Answer
    Hi,
    PCA itself is simply not defined on non-numerical types. Any other solution would not be a PCA.

    Best,
    Martin
    User: "Yin"
    New Altair Community Member
    OP
    Agreed. I should have said dimentionality reduction.  I will resolve this Q and start a new one.