I'm doing research on the CIC IDS 2017 dataset, where a single file is 200-300 MB.
I am trying to run an automodel to predict the source IP from the other attributes. I run into memory issues when doing this (I have 16 GB of RAM), and I assume the dataset is too large or has too many attributes for the modeling.
So my question is: how many lines (rows) and attributes can I expect this to handle?
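
To make the question concrete, here is a rough back-of-envelope sketch in Python of how memory might scale with rows and attributes. It assumes every value is held as an 8-byte number in one dense table, which is only an assumption and not necessarily how the automodel stores data; the function names, the 500,000 rows, the 80 attributes, and the 4 GiB budget are placeholder figures, not measurements from the dataset. Modeling usually needs several working copies on top of this, and a nominal target with many distinct values (such as source IP) may add further overhead.

```python
GIB = 1024 ** 3  # bytes in one gibibyte


def dense_matrix_bytes(n_rows, n_cols, bytes_per_value=8):
    """Bytes needed to hold n_rows x n_cols values in one dense table.

    Assumes a fixed width per value (8 bytes, e.g. a 64-bit float).
    """
    return n_rows * n_cols * bytes_per_value


def max_rows_for_budget(budget_bytes, n_cols, bytes_per_value=8):
    """Largest row count whose single dense copy fits in budget_bytes."""
    return budget_bytes // (n_cols * bytes_per_value)


if __name__ == "__main__":
    # Placeholder figures, not taken from the actual CSV files.
    rows, cols = 500_000, 80
    one_copy = dense_matrix_bytes(rows, cols)
    print(f"one dense copy: {one_copy / GIB:.2f} GiB")

    # Hypothetical budget: 4 GiB of the 16 GB reserved for a single copy.
    print("rows that fit in 4 GiB:", max_rows_for_budget(4 * GIB, cols))
```

The estimate is linear in both rows and attributes, so halving either one halves the footprint of a single copy. The real limit depends on how many copies the tool makes and on how it encodes nominal attributes.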