I am new to the world of data mining, and for a school project I have to do an exercise. I have 3 datasets with a total of over 100 variables. From these variables I have chosen a few that are relevant to the case I am trying to solve. I want to find which IDs correspond to the chosen variables. What kind of model should I use to do so?