"Input for FPGrowth"

svpriyan
svpriyan New Altair Community Member
edited November 5 in Community Q&A
Hai
Thanks earmijo, land for the previous hints.
I want to ask here, i got an error when i apply FPGrowth & Association Analysis.
I have the *.csv file and i try to do the following
-example source
-nominaltobinary
-FPGrowth
-Association Analysis

My *.CSV file contains the answer of the query [quote author=earmijo link=topic=499.msg1929#msg1929 date=1228839575]


I did not understand anyway there exists to make it correct.
thanks



[attachment deleted by admin]

Answers

  • TobiasMalbrecht
    TobiasMalbrecht New Altair Community Member
    Hi Pryian,

    one look into the operator info does help here ...

    Please note that the given data set is only allowed to contain binominal attributes, i.e. nominal attributes with only two different values. Simply use the provided preprocessing operators in order to transform your data set. The necessary operators are the discretization operators for changing the value types of numerical attributes to nominal and the operator Nominal2Binominal for transforming nominal attributes into binominal / binary ones.
    It seems that your transaction id is not nominal. However, the id should be not considered in the analysis at all. You should therefore mark it as special (use the [tt]ChangeAttributeRole[/tt] operator).

    Regards,
    Tobias
  • svpriyan
    svpriyan New Altair Community Member
    Thanks for the hints, I did in this way

    <operator name="Root" class="Process" expanded="yes">
        <operator name="CSVExampleSource" class="CSVExampleSource">
            <parameter key="filename" value="C:\Book1.csv"/>
        </operator>
        <operator name="Numerical2Polynominal" class="Numerical2Polynominal">
        </operator>
        <operator name="Aggregation" class="Aggregation">
            <list key="aggregation_attributes">
              <parameter key="item" value="count"/>
            </list>
            <parameter key="group_by_attributes" value="tid|item"/>
        </operator>
        <operator name="Example2AttributePivoting" class="Example2AttributePivoting">
            <parameter key="group_attribute" value="tid"/>
            <parameter key="index_attribute" value="item"/>
        </operator>
        <operator name="ChangeAttributeRole" class="ChangeAttributeRole">
            <parameter key="name" value="tid"/>
        </operator>
        <operator name="UserBasedDiscretization" class="UserBasedDiscretization">
            <list key="classes">
              <parameter key="yes" value="1.0"/>
              <parameter key="no" value="0.0"/>
            </list>
        </operator>
        <operator name="FPGrowth" class="FPGrowth">
            <parameter key="min_support" value="0.2"/>
        </operator>
        <operator name="AssociationRuleGenerator" class="AssociationRuleGenerator">
            <parameter key="keep_frequent_item_sets" value="true"/>
            <parameter key="min_confidence" value="1.0"/>
        </operator>
    </operator>
    Do I have some mistakes here.
    thanks
  • TobiasMalbrecht
    TobiasMalbrecht New Altair Community Member
    Hi,

    well, is it working the way you did it?

    Tobias
  • svpriyan
    svpriyan New Altair Community Member
    Hai
    Thanks for the reply
    Yea, I got the answers but Can I Simplify more this code.
    also, I have lot of Rules with me.. how can i reduce the rules , means until which level i can increase min_sup & min_conf. normally max 1. is it?

    thanks
    priyan
  • land
    land New Altair Community Member
    Hi,
    yes. 1 is the maximum. As stated in the parameter description.

    Greetings,
      Sebastian