Select Terms

gustavo_medeiro
gustavo_medeiro New Altair Community Member
edited November 2024 in Community Q&A
Hi, Everybody

I don't if the topic is in correct place. Anyway...

Is it possible to make a filtering of the terms as follows in RapidMiner?

Supose:

NA - Number of occurrence of a word in Class A
NB - Number of occurrence of a word in Class B
NC - Number of occurrence of a word in Class C
Total = NA + NB + NC

They remain the terms that meet the following criteria:

(NA / Total) * 100%> X% or (NB / Total) * 100%> Y% or (CN / Total) * 100%> Z%

Is it possible?
Tagged:

Answers

  • MartinLiebig
    MartinLiebig
    Altair Employee
    Sure,

    tokenize, generalte attributes and filter example should do that.
  • gustavo_medeiro
    gustavo_medeiro New Altair Community Member
    Classes are defined by the value of "label1" which are respectively -1, 0, 1. Suppose I want to remain the only terms which meet the following criteria:

    ([occurrence of words with label1] = -1) / total)> 0.7
    ([occurrence of words with label1] = -0) / total)> 0.7
    ([occurrence of words with label1] = 1) / total)> 0.7

    Total =[ Occurrence in label1= -1] + [ Occurrence in label1= 0] +[ Occurrence in label1= 1]


    What should I write on the fields inside "Generate Attribuite" and "Filter Examples"?



    I cannot sendo the image to show you better. But there are fields to be filled inside "Generate Attribuite" and "Filter Examples".
  • gustavo_medeiro
    gustavo_medeiro New Altair Community Member
    Do the people here abandoned me?
  • MartinLiebig
    MartinLiebig
    Altair Employee
    no,

    but you posted at x differnt threads got different answers and i do not know what the status is.

    ~Martin
  • gustavo_medeiro
    gustavo_medeiro New Altair Community Member
    I did not know the correct place this is a reason I had done it.
  • gustavo_medeiro
    gustavo_medeiro New Altair Community Member
    What should I do now?