🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

"Numeric Dimension Reduction (PCA, etc) for Cluster Analysis"

User: "WI_Noble"
New Altair Community Member
Updated by Jocelyn
I'm working on a clustering application and want to reduce the numeric attributes, especially correlated ones, prior to clustering.  A proven technique to doing so is PCA, which RapidMiner contains.  What I also want to do is the Varimax rotation algorithm to "load" a minimum number of attributes per component to improve interpretability.  In a prior post, it was mentioned that PCA rotation (Varimax, etc.) is on the future wish list.  I have the following questions on this:

1.) How close is RapidMiner to including rotation in future releases?
2.) If not "close", would I be better off looking at using an R package within RapidMiner to accomplish this?
3.) What other techniques have people used in RM to reduce numeric dimensions to a minimum, non-correlated set that are just as good/better than PCA for cluster analysis?

Thanks!

Comments

No comments on this post.