Dear RM-Enthusiasts,
Working on an online advertising dataset I have a list with Product-IDs. Every product has a number of attributes and I want to predict one of them. A fairly basic Decision Tree model is already yielding acceptable results.
However I still have one source of predictive potential that is not used yet and that is the number of observations. The data for some Product-IDs are based on 1 observation, while others are based on 20 or more observations. Obviously I would like to weigh the data for the IDs with many observations heavier than the ones with few observations.
Can anybody direct me to a way of handling this? Maybe a tutorial or youtube video?
Any advice would be greatly appreciated. Thanks in advance!
Best,
Marc