Is an attribute whose role is declared to be id used by default by the software when building clusters (i.e., does the attribute participate in the computation of distances, etc.)?
What about when building a supervised learning model, such as a:
- decision tree - does the implemented algorithm compute the gain ratio for an id attribute by default?
- naive Bayes classifier - does the algorithm compute conditional probabilities (and implicitly sample means and standard deviations) for the declared id attribute?
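
To illustrate the concern behind the decision tree question, here is a minimal plain-Python sketch. It is not the software's implementation: the toy data, the `row_id` and `outlook` columns, and the helper names are all made up for illustration. It shows that if a unique id column were scored like any other attribute, it would reach the maximum information gain, and on this toy data it would still beat a genuine attribute on gain ratio.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a list of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def info_gain_and_ratio(attribute, labels):
    """Return (information gain, gain ratio) of splitting labels by attribute."""
    n = len(labels)
    groups = {}
    for value, label in zip(attribute, labels):
        groups.setdefault(value, []).append(label)
    # Weighted entropy of the label inside each branch of the split.
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    gain = entropy(labels) - remainder
    # Split information penalizes attributes with many distinct values.
    split_info = entropy(attribute)
    ratio = gain / split_info if split_info > 0 else 0.0
    return gain, ratio


# Hypothetical toy data: 8 rows, a binary label, a unique id, one real attribute.
labels = ["yes", "yes", "no", "no", "yes", "no", "yes", "no"]
row_id = list(range(8))
outlook = ["sun", "sun", "rain", "rain", "sun", "rain", "rain", "sun"]

gain_id, ratio_id = info_gain_and_ratio(row_id, labels)
gain_outlook, ratio_outlook = info_gain_and_ratio(outlook, labels)

print(f"row_id : gain={gain_id:.3f}, gain ratio={ratio_id:.3f}")
print(f"outlook: gain={gain_outlook:.3f}, gain ratio={ratio_outlook:.3f}")
```

The id column separates every row perfectly, so its information gain equals the full label entropy. The gain ratio divides this by the split information, which reduces the score but does not necessarily eliminate the problem. That is why it matters whether the software excludes id attributes before scoring.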