"Cluster Purity, cluster validation"

Legacy User
Legacy User New Altair Community Member
edited November 5 in Community Q&A
Hi,

I wonder whether the new version 4.3 allows the user to validate clustering results,
for example with cluster purity or with recall and precision.

I already know the labels (supervised clustering).

Thank you very much.

Answers

  • land
    land New Altair Community Member
    Hi,
    This feature was not included in the release because it was not yet in a stable state. If you use the developer version from CVS, it is now included. The operator is called Cluster2Label and searches for the best fit between cluster and label values. You can then use all the techniques available for measuring supervised learning performance.
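As a rough illustration of the idea described above (map clusters to label values, then score the result like a classifier), here is a minimal Python sketch. It is not the Cluster2Label implementation: it assumes a simple majority-vote mapping, in which each cluster gets the label that occurs most often among its members. The function names `map_clusters_to_labels`, `purity` and `precision_recall` are made up for this example.

```python
from collections import Counter, defaultdict


def map_clusters_to_labels(clusters, labels):
    """Assign each cluster the most frequent true label among its members.

    Assumption: a plain majority vote; the actual operator may search for
    the best fit differently (for example with a one-to-one matching).
    """
    members = defaultdict(Counter)
    for cluster, label in zip(clusters, labels):
        members[cluster][label] += 1
    return {c: counts.most_common(1)[0][0] for c, counts in members.items()}


def purity(clusters, labels):
    """Fraction of examples whose label equals their cluster's majority label."""
    mapping = map_clusters_to_labels(clusters, labels)
    hits = sum(1 for c, l in zip(clusters, labels) if mapping[c] == l)
    return hits / len(labels)


def precision_recall(clusters, labels, target):
    """Precision and recall for one label after relabelling the clusters."""
    mapping = map_clusters_to_labels(clusters, labels)
    predicted = [mapping[c] for c in clusters]
    pairs = list(zip(predicted, labels))
    tp = sum(1 for p, l in pairs if p == target and l == target)
    fp = sum(1 for p, l in pairs if p == target and l != target)
    fn = sum(1 for p, l in pairs if p != target and l == target)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


if __name__ == "__main__":
    # Hypothetical toy data: cluster ids and the known labels.
    clusters = [0, 0, 0, 1, 1, 1]
    labels = ["a", "a", "b", "b", "b", "a"]
    print(purity(clusters, labels))
    print(precision_recall(clusters, labels, "a"))
```

With a majority-vote mapping, the accuracy of the relabelled clusters is the same quantity as cluster purity, which is why a single mapping step is enough to reuse supervised measures.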

    Greetings,
      Sebastian
  • ema
    ema New Altair Community Member
    Thank you very much. I really appreciate it.
  • ema
    ema New Altair Community Member
    Hi again

    I spent the last 4 hours trying to run the CVS version from Eclipse.

    Unfortunately, I am getting 17 compilation errors when I try to run build.xml.

    Any advice?

    The errors are all in the import com.sun.javadoc..... statements.

    Thank you

  • IngoRM
    IngoRM New Altair Community Member
    Hi,

    Did you follow the instructions given at

    http://rapid-i.com/content/view/25/48/

    exactly? The javadoc package is only needed if you want to build the documentation (i.e. the tutorial) yourself, not for simply compiling and running the RM core. The important thing is to use the project description file ".project", which is also part of the CVS module. If you use this project file, everything should work right away. If not, here are two hints:

    - add the jar files "rt.jar" and "tools.jar" from a JDK installation with a version >= 5
    - exclude the package "com.rapidminer.doc" from the source in the project preferences

    Cheers,
    Ingo
  • ema
    ema New Altair Community Member
    Thank you very much for your fast response.

    I have a question. You said:
    "add the jar files "rt.jar" and "tools.jar" from a JDK installation with a version >= 5"

    Where should I add them?

    Thank you very much
  • IngoRM
    IngoRM New Altair Community Member
    Add them to the project classpath in the project properties dialog. But you could try the second option (excluding the source of the package "*.doc") first in the same dialog.

    Cheers,
    Ingo
  • ema
    ema New Altair Community Member
    Hi,
    I know I am asking too many questions...
    Sorry in advance :)

    I did exactly what you told me,
    and I got the exact message mentioned in the link you sent me:

    ERROR: cannot find resources. Probably the ant target
         'copy-resources' must be performed!

    (below step 6)

    So I right-clicked the file build.xml
    and chose Run As > Ant Build.

    This is what I got:

    Buildfile: C:\Documents and Settings\Administrator\workspace\yale\build.xml
    init:
    compile:
        [echo] Compile with Java from dir: C:\Program Files\Java\jre6
        [echo] using Java version: 1.6.0_10-beta
       [javac] Compiling 12 source files to C:\Documents and Settings\Administrator\workspace\yale\build
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\AbstractOperatorDocGenerator.java:49: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.ClassDoc;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\AbstractOperatorDocGenerator.java:50: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.RootDoc;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\AbstractOperatorDocGenerator.java:51: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.SeeTag;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\AbstractOperatorDocGenerator.java:52: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.Tag;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\AbstractOperatorDocGenerator.java:53: package com.sun.tools.doclets does not exist
       [javac] import com.sun.tools.doclets.Taglet;
       [javac]                             ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\OperatorDocGenerator.java:28: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.RootDoc;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\AbstractOperatorDocGenerator.java:113: cannot find symbol
       [javac] symbol  : class Taglet
       [javac] location: class com.rapidminer.doc.AbstractOperatorDocGenerator
       [javac] private Map<String, Taglet> tagletMap = new HashMap<String, Taglet>();
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\AbstractOperatorDocGenerator.java:138: cannot find symbol
       [javac] symbol  : class RootDoc
       [javac] location: class com.rapidminer.doc.AbstractOperatorDocGenerator
       [javac] public void generateDoc(Operator op, RootDoc rootDoc, PrintWriter out) {
       [javac]                                      ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\OperatorDocGenerator.java:40: cannot find symbol
       [javac] symbol  : class RootDoc
       [javac] location: interface com.rapidminer.doc.OperatorDocGenerator
       [javac] public void generateDoc(Operator operator, RootDoc rootDoc, PrintWriter out);
       [javac]                                           ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\CiteTaglet.java:27: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.Tag;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\CiteTaglet.java:28: package com.sun.tools.doclets does not exist
       [javac] import com.sun.tools.doclets.Taglet;
       [javac]                             ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\TexTaglet.java:25: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.Tag;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\TexTaglet.java:26: package com.sun.tools.doclets does not exist
       [javac] import com.sun.tools.doclets.Taglet;
       [javac]                             ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\TexTaglet.java:34: cannot find symbol
       [javac] symbol: class Taglet
       [javac] public interface TexTaglet extends Taglet {
       [javac]                                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\CiteTaglet.java:74: cannot find symbol
       [javac] symbol  : class Taglet
       [javac] location: class com.rapidminer.doc.CiteTaglet
       [javac] public static void register(Map<String, Taglet> tagletMap) {
       [javac]                                        ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\CiteTaglet.java:83: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.CiteTaglet
       [javac] public String toString(Tag tag) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\CiteTaglet.java:87: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.CiteTaglet
       [javac] public String toString(Tag[] tags) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\CiteTaglet.java:91: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.CiteTaglet
       [javac] public String toTex(Tag tag) {
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\CiteTaglet.java:95: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.CiteTaglet
       [javac] public String toTex(Tag[] tag) {
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\TexTaglet.java:36: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: interface com.rapidminer.doc.TexTaglet
       [javac] public String toTex(Tag tag);
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\TexTaglet.java:38: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: interface com.rapidminer.doc.TexTaglet
       [javac] public String toTex(Tag[] tag);
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\DocumentationGenerator.java:40: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.RootDoc;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\DocumentationGenerator.java:56: cannot find symbol
       [javac] symbol  : class RootDoc
       [javac] location: class com.rapidminer.doc.DocumentationGenerator
       [javac] private static RootDoc rootDoc = null;
       [javac]               ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\DocumentationGenerator.java:76: cannot find symbol
       [javac] symbol  : class RootDoc
       [javac] location: class com.rapidminer.doc.DocumentationGenerator
       [javac] public static boolean start(RootDoc rootDoc) {
       [javac]                            ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\MathTaglet.java:27: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.Tag;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\MathTaglet.java:28: package com.sun.tools.doclets does not exist
       [javac] import com.sun.tools.doclets.Taglet;
       [javac]                             ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\MathTaglet.java:73: cannot find symbol
       [javac] symbol  : class Taglet
       [javac] location: class com.rapidminer.doc.MathTaglet
       [javac] public static void register(Map<String, Taglet> tagletMap) {
       [javac]                                        ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\MathTaglet.java:82: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.MathTaglet
       [javac] public String toString(Tag tag) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\MathTaglet.java:86: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.MathTaglet
       [javac] public String toString(Tag[] tags) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\MathTaglet.java:90: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.MathTaglet
       [javac] public String toTex(Tag tag) {
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\MathTaglet.java:94: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.MathTaglet
       [javac] public String toTex(Tag[] tag) {
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\ProgramHTMLOperatorDocGenerator.java:31: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.ClassDoc;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\ProgramHTMLOperatorDocGenerator.java:32: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.RootDoc;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\ProgramHTMLOperatorDocGenerator.java:42: cannot find symbol
       [javac] symbol  : class RootDoc
       [javac] location: class com.rapidminer.doc.ProgramHTMLOperatorDocGenerator
       [javac] public void generateDoc(Operator op, RootDoc rootDoc, PrintWriter out) {
       [javac]                                     ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\RefTaglet.java:27: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.Tag;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\RefTaglet.java:28: package com.sun.tools.doclets does not exist
       [javac] import com.sun.tools.doclets.Taglet;
       [javac]                             ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\RefTaglet.java:74: cannot find symbol
       [javac] symbol  : class Taglet
       [javac] location: class com.rapidminer.doc.RefTaglet
       [javac] public static void register(Map<String, Taglet> tagletMap) {
       [javac]                                        ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\RefTaglet.java:83: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.RefTaglet
       [javac] private String[] split(Tag tag) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\RefTaglet.java:93: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.RefTaglet
       [javac] public String toString(Tag tag) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\RefTaglet.java:97: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.RefTaglet
       [javac] public String toString(Tag[] tags) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\RefTaglet.java:101: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.RefTaglet
       [javac] public String toTex(Tag tag) {
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\RefTaglet.java:105: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.RefTaglet
       [javac] public String toTex(Tag[] tag) {
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\ReferenceTaglet.java:37: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.Tag;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\ReferenceTaglet.java:38: package com.sun.tools.doclets does not exist
       [javac] import com.sun.tools.doclets.Taglet;
       [javac]                             ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\ReferenceTaglet.java:48: cannot find symbol
       [javac] symbol: class Taglet
       [javac] public class ReferenceTaglet implements Taglet {
       [javac]                                         ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\ReferenceTaglet.java:120: cannot find symbol
       [javac] symbol  : class Taglet
       [javac] location: class com.rapidminer.doc.ReferenceTaglet
       [javac] public static void register(Map<String, Taglet> tagletMap) {
       [javac]                                        ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\ReferenceTaglet.java:129: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.ReferenceTaglet
       [javac] public String toString(Tag tag) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\ReferenceTaglet.java:133: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.ReferenceTaglet
       [javac] public String toString(Tag[] tags) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:30: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.SourcePosition;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:31: package com.sun.javadoc does not exist
       [javac] import com.sun.javadoc.Tag;
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:32: package com.sun.tools.doclets does not exist
       [javac] import com.sun.tools.doclets.Taglet;
       [javac]                             ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:79: cannot find symbol
       [javac] symbol  : class Taglet
       [javac] location: class com.rapidminer.doc.XMLExampleTaglet
       [javac] public static void register(Map<String, Taglet> tagletMap) {
       [javac]                                        ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:88: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.XMLExampleTaglet
       [javac] private String[] split(Tag tag) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:97: cannot find symbol
       [javac] symbol  : class SourcePosition
       [javac] location: class com.rapidminer.doc.XMLExampleTaglet
       [javac] private File resolve(String file, SourcePosition source) {
       [javac]                                  ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:101: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.XMLExampleTaglet
       [javac] public String toString(Tag tag) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:123: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.XMLExampleTaglet
       [javac] public String toString(Tag[] tags) {
       [javac]                       ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:127: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.XMLExampleTaglet
       [javac] public String toTex(Tag tag) {
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\XMLExampleTaglet.java:140: cannot find symbol
       [javac] symbol  : class Tag
       [javac] location: class com.rapidminer.doc.XMLExampleTaglet
       [javac] public String toTex(Tag[] tag) {
       [javac]                    ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer\doc\AbstractOperatorDocGenerator.java:113: cannot find symbol
       [javac] symbol  : class Taglet
       [javac] location: class com.rapidminer.doc.AbstractOperatorDocGenerator
       [javac] private Map<String, Taglet> tagletMap = new HashMap<String, Taglet>();
       [javac]                                                            ^
       [javac] C:\Documents and Settings\Administrator\workspace\yale\src\com\rapidminer
    .
    .
    .
       [javac] 74 errors

    BUILD FAILED
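The log above is long, but its errors fall into a few groups, and a small script can make that visible. The following is a minimal Python sketch; the function name `summarize_javac_errors` is made up, and the regular expressions only assume the message formats visible in the output above.

```python
import re
from collections import Counter

# Matches lines such as:
#   [javac] C:\...\CiteTaglet.java:27: package com.sun.javadoc does not exist
MISSING_PACKAGE = re.compile(r"package (\S+) does not exist")
# Matches the source file path at the start of a javac error line.
SOURCE_FILE = re.compile(r"\[javac\]\s+(.+\.java):\d+:")


def summarize_javac_errors(log_text):
    """Count missing packages and collect the source directories involved."""
    missing = Counter()
    directories = set()
    for line in log_text.splitlines():
        package = MISSING_PACKAGE.search(line)
        if package:
            missing[package.group(1)] += 1
        source = SOURCE_FILE.search(line)
        if source:
            # Keep only the directory part; handles both \ and / separators.
            path = source.group(1).replace("\\", "/")
            directories.add(path.rsplit("/", 1)[0])
    return missing, directories
```

On the visible part of the output, such a summary would point at two missing packages, com.sun.javadoc and com.sun.tools.doclets, and at a single source directory ending in com/rapidminer/doc, which matches the two hints given earlier in the thread.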



    So what should I do?

    Thanks a lot
  • IngoRM
    IngoRM New Altair Community Member

    "So what should I do?"

    Follow the instructions and hints given here ;)

    You need a JDK, not only a runtime environment. Specify this JDK in the Eclipse properties as the standard Java compiler. Wait for the rebuild. Start the target again.
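One way to check whether a Java installation directory is a full JDK rather than only a runtime environment is to look for the compiler and, on JDK versions of that era, for lib/tools.jar. A minimal Python sketch, assuming that directory layout (the function name `looks_like_jdk` is made up for this example):

```python
import os


def looks_like_jdk(java_home):
    """Return True if java_home appears to contain a JDK, not just a JRE.

    Assumption: a JDK ships the javac compiler in bin/ and, for the older
    JDK versions discussed here, lib/tools.jar. A plain JRE has neither.
    """
    compiler_names = ("javac", "javac.exe")
    has_compiler = any(
        os.path.isfile(os.path.join(java_home, "bin", name))
        for name in compiler_names
    )
    has_tools_jar = os.path.isfile(os.path.join(java_home, "lib", "tools.jar"))
    return has_compiler or has_tools_jar
```

The build log earlier in the thread reports C:\Program Files\Java\jre6 as the Java directory, which is the kind of path this check would be expected to reject.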

    Don't get me wrong, but here is another hint: most of your problems stem from a lack of basic knowledge about Java programming with Eclipse and are independent of compiling RapidMiner. Maybe you should look for some good tutorials about Java project setup in Eclipse.

    Cheers,
    Ingo
  • ema
    ema New Altair Community Member
    Hi


    Thank you, thank you, thank you,
    from my heart...

    It is working :)
  • ema
    ema New Altair Community Member
    Hi,

    It is working now, but I could not find the Cluster2Label operator.

    However, I found new cluster validation operators such as clusterModelFscore.

    Can you please post an XML example of how to use this operator?

    Thank you
  • Legacy User
    Legacy User New Altair Community Member
    Thank you so much, Ingo Mierswa. Your advice helped me a lot, and I can finally run RapidMiner.
    Once again, thank you so much.

    So the important things are:
      1. Exclude the package com.rapidminer.doc from the source in the project preferences.
      2. Don't forget to include tools.jar and rt.jar; you can get those jars from your JDK directory.

    Best