Learn the model on a subset that fits into memory, then apply it to the large dataset incrementally, one chunk at a time.
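To make the idea concrete, here is a rough sketch in plain Python. It is not tied to any particular tool from this thread, and everything in it is an assumption made for illustration: `stream_rows` is a hypothetical stand-in for a dataset that does not fit into memory, the sample is drawn with reservoir sampling, and the "model" is just a nearest-centroid classifier on one numeric feature.

```python
import random
from itertools import islice

def stream_rows(n, seed=0):
    """Hypothetical stand-in for a huge dataset: yields (x, label) one row at a time."""
    rng = random.Random(seed)
    for _ in range(n):
        label = rng.randint(0, 1)
        # Class 0 is centred on 0.0, class 1 on 5.0 (synthetic data).
        yield rng.gauss(5.0 * label, 1.0), label

def reservoir_sample(rows, k, seed=0):
    """Keep a uniform random sample of k rows while reading the stream once."""
    rng = random.Random(seed)
    sample = []
    for i, row in enumerate(rows):
        if i < k:
            sample.append(row)
        else:
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                sample[j] = row
    return sample

def fit_centroids(sample):
    """Learn one mean per class from the in-memory sample."""
    sums, counts = {}, {}
    for x, label in sample:
        sums[label] = sums.get(label, 0.0) + x
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}

def predict(centroids, x):
    """Return the label whose centroid is closest to x."""
    return min(centroids, key=lambda label: abs(x - centroids[label]))

def apply_in_chunks(centroids, rows, chunk_size):
    """Score the full stream chunk by chunk; only one chunk is held in memory."""
    rows = iter(rows)
    correct = total = 0
    while True:
        chunk = list(islice(rows, chunk_size))
        if not chunk:
            break
        for x, label in chunk:
            correct += predict(centroids, x) == label
            total += 1
    return correct, total

# Step 1: learn on a subset that fits into memory.
sample = reservoir_sample(stream_rows(100_000), k=1_000)
centroids = fit_centroids(sample)

# Step 2: apply the learned model to the full data incrementally.
correct, total = apply_in_chunks(centroids, stream_rows(100_000), chunk_size=10_000)
accuracy = correct / total
```

The sample size and chunk size here are arbitrary; in practice you would pick the largest values your memory allows.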
siamak_want wrote: Would you please explain more? Do you mean I should use a sampling method to select some examples manually, learn on them, and then apply the result to the whole data set? And how would that work for clustering?