Predict (assign) viewers to emissions
Hi all!
I have a data set that contains TV commercial emissions. It has many properties, like
- date & time of emission,
- GRP value
- TV channel
- TV show
- commercial position (beginning of the block, middle, or end)
- Channel subject group (cooking, traveling, etc)
each property is important. Date and time determines whether the emission was during the prime time, night, etc., GRP value indicates range of emission, etc.
on the other side I have new website visitors count (based on Google Analytics), so I can clearly see how many people each emission has brought to the site and how effective it was.
Visitors data set is aggregated to minutes, so I have information like
- 2020-05-10 13:30:00 - 7 visitors
- 2020-05-10 13:31:00 - 10 visitors
- 2020-05-10 13:32:00 - 8 visitors
- 2020-05-10 13:33:00 - 2 visitors
so I can estimate, that this particular emissions has brought 27 new visitors to my website.
Problem is when emissions interfere. So having two (or more) emissions colliding all I know is that they have brought together eg. 57 visitors.
Is it possible to estimate how many visitors came from particular, interferred emission, using information based on "clean" (not colliding) emissions? Each emission is described by many properties. How to achieve it with RapidMiner? I'm trying hard with Impute Missing Values and k-NN operator with no luck.
Any help will be appreciated!