"defining new variables (text-mining)"

User: "derchief"
New Altair Community Member
Updated by Jocelyn
hi,

I would like to define some additional variables for comparing groups of textfiles. for example, I would like to differentiate between texts written by certain age-groups as well as between male and female sources. the result should allow interpretations like "older female people use this words" or "women tend to that explanation". consequently, the mentioned variables should be choosable in the plot views.


thanx and best regards,

chris

Find more posts tagged with

Sort by:
1 - 3 of 31
    User: "fischer"
    New Altair Community Member
    Dear Chris,

    can you elaborate on what exactly your problem is? Once the vartiables are in your data set they should be appearing in the plot views. You might want to use an ExampleFilter to train different models for subsets of your data defined by values of these variables or ChangeAttributeRole to declare these variables as the label.

    Cheers,
    Simon
    User: "derchief"
    New Altair Community Member
    OP
    Dear Simon,

    I´m about to solve the problem! I added an attribute (gender), that is choosable in the plot view. Is it possible now to set a value (eg. "male") for a selection of examples in rapidminer or do I have to change the values in the dat-file using another application such as excel?

    Cheers,
    Chris
    User: "land"
    New Altair Community Member
    Hi Chris,
    for this purpose I would recommend the AttributeConstruction operator. There you can specify a list of attributes to construct and insert some scripting language style commands for setting the value. For example the operator provides something like conditions (if) or mathematical constructions.
    One hint on the usage: Strings (and hence nominal value) have to be included into ".

    Greetings,
      Sebastian