🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

KernelKMeans now produces error when classify text

User: "B_"
New Altair Community Member
Updated by Jocelyn
RM team

I have switched to RM 4.2.  I began testing by using an existing project that classifies text by KernelKMeans.  Text is read from a database and passed through StringtextInput and StringTokenizer.  This operator chain worked before.  Now I receive an error message

Error 104 - non-numeric
Error in: KernelKMeans (KernelKMeans) The example set contains non-numerical attribute #0: StockItemDesc (nominal/single_value)/values=

Using KMediods to classify text works.  Looking at the metadata with examplevisualizer there are string vectors and weights.

Here is the project.

<operator name="Root" class="Process" expanded="yes">
    <description text="#ylt#h3#ygt#Specifying texts by an example set#ylt#/h3#ygt##ylt#p#ygt#Using the parameter list or the wizard are simple methods for setting up the directories from which the text documents are read. Sometimes, however, a more flexible solution is needed. If, for instance, your text documents have different types of encoding or are written in different languages, you might wish to provide this information  for each input directory separately.#ylt#/p#ygt# #ylt#p#ygt#You can do this by using an example set that contains one row for each input directory and corresponding attributes for source, encoding, type and class. If such an example set is provided, the texts in the parameter list are ignored.#ylt#/p#ygt#"/>
    <operator name="DatabaseExampleSource" class="DatabaseExampleSource">
        <parameter key="database_system" value="Microsoft SQL Server (JTDS)"/>
        <parameter key="database_url" value="jdbc:jtds:sqlserver://localhost:1433/XXX"/>
        <parameter key="id_attribute" value="IDNbr"/>
        <parameter key="password" value="y6sa3JX9Wrc="/>
        <parameter key="query" value="SELECT [Text], [IDNbr] FROM [Classify]"/>
        <parameter key="username" value="sa"/>
    </operator>
    <operator name="StringTextInput" class="StringTextInput" expanded="yes">
        <parameter key="filter_nominal_attributes" value="true"/>
        <list key="namespaces">
        </list>
        <operator name="StringTokenizer" class="StringTokenizer">
        </operator>
    </operator>
    <operator name="ExampleVisualizer" class="ExampleVisualizer" breakpoints="before">
    </operator>
    <operator name="KernelKMeans" class="KernelKMeans" breakpoints="after">
        <parameter key="k" value="500"/>
        <parameter key="kernel_type" value="KernelDot"/>
    </operator>
    <operator name="ClusterModel2ExampleSet" class="ClusterModel2ExampleSet">
        <parameter key="keep_cluster_model" value="false"/>
    </operator>
    <operator name="ExampleSetWriter" class="ExampleSetWriter">
        <parameter key="example_set_file" value="Example.dat"/>
        <parameter key="special_format" value="$i $v[cluster]"/>
    </operator>
</operator>

Thanks for your help.

B