generate a subset of wordlist based on a given weight table
I have generated a wordlist file based on the processing a document corpus. The following is a screenshot of part of the wordlist file.. Thare are around 15000 rows(15000 different tokenized words). Based on the feature selection method, I already have a list of words that should be kept. This list only contains 500 words, and is saved in the weight object. How can I join this two items, a wordlist and a weight table to generate a short wordlist which only has 500 rows.