"Hierarchical Text Classification"
Hi,
I am planning to do a hierarchical classification using a top-down approach. My idea of the top-down approach is to first classify the example set into the top-level classes, then filter the examples predicted for each top class and run a second classification on them using that top class's subclasses.
Here is an example procedure:
1. generate exampleset
2. classify into class X, Y or Z
3. filter examples of predicted X class
4. classify X examples into subclass X1, X2 or X3
... iterate the other top classes
9. merge the filtered examples into one exampleset.
10. end
I was about to start building the process when I realized that using the ModelApplier also requires applying the training word list to the example set. That means I need as many training word lists as I have models. How do I load the other training word lists? I know that the first training word list can be loaded in the TextInput operator, but what about the others?
Or is there a better way of doing this in RM?
Thanks in advance.
Matthew