An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
<?xml version="1.0" encoding="UTF-8" standalone="no"?><process version="5.1.005"> <context> <input/> <output/> <macros/> </context> <operator activated="true" class="process" compatibility="5.1.005" expanded="true" name="Process"> <process expanded="true" height="280" width="413"> <operator activated="true" class="text:create_document" compatibility="5.1.001" expanded="true" height="60" name="Create Document" width="90" x="45" y="30"> <parameter key="text" value="word1 word2 (word3) word4 word5 (word6)"/> </operator> <operator activated="true" class="text:tokenize" compatibility="5.1.001" expanded="true" height="60" name="Tokenize" width="90" x="179" y="30"> <parameter key="mode" value="regular expression"/> <parameter key="expression" value="\s"/> </operator> <operator activated="true" class="text:filter_tokens_by_content" compatibility="5.1.001" expanded="true" height="60" name="Filter Tokens (by Content)" width="90" x="313" y="30"> <parameter key="condition" value="contains match"/> <parameter key="regular_expression" value=".*[()]"/> <parameter key="invert condition" value="true"/> </operator> <connect from_op="Create Document" from_port="output" to_op="Tokenize" to_port="document"/> <connect from_op="Tokenize" from_port="document" to_op="Filter Tokens (by Content)" to_port="document"/> <connect from_op="Filter Tokens (by Content)" from_port="document" to_port="result 1"/> <portSpacing port="source_input 1" spacing="0"/> <portSpacing port="sink_result 1" spacing="0"/> <portSpacing port="sink_result 2" spacing="0"/> </process> </operator></process>
<?xml version="1.0" encoding="UTF-8" standalone="no"?><process version="5.1.004"> <context> <input/> <output/> <macros/> </context> <operator activated="true" class="process" compatibility="5.1.004" expanded="true" name="Process"> <process expanded="true" height="341" width="681"> <operator activated="true" class="retrieve" compatibility="5.1.004" expanded="true" height="60" name="Retrieve" width="90" x="45" y="165"> <parameter key="repository_entry" value="//Samples/data/Golf"/> </operator> <operator activated="true" class="k_nn" compatibility="5.1.004" expanded="true" height="76" name="k-NN" width="90" x="179" y="165"/> <operator activated="true" class="apply_model" compatibility="5.1.004" expanded="true" height="76" name="Apply Model" width="90" x="380" y="165"> <list key="application_parameters"/> </operator> <operator activated="true" class="select_attributes" compatibility="5.1.004" expanded="true" height="76" name="Select Attributes" width="90" x="581" y="165"> <parameter key="attribute_filter_type" value="regular_expression"/> <parameter key="regular_expression" value=".*\(.*"/> <parameter key="invert_selection" value="true"/> <parameter key="include_special_attributes" value="true"/> </operator> <connect from_op="Retrieve" from_port="output" to_op="k-NN" to_port="training set"/> <connect from_op="k-NN" from_port="model" to_op="Apply Model" to_port="model"/> <connect from_op="k-NN" from_port="exampleSet" to_op="Apply Model" to_port="unlabelled data"/> <connect from_op="Apply Model" from_port="labelled data" to_op="Select Attributes" to_port="example set input"/> <connect from_op="Select Attributes" from_port="example set output" to_port="result 1"/> <connect from_op="Select Attributes" from_port="original" to_port="result 2"/> <portSpacing port="source_input 1" spacing="0"/> <portSpacing port="sink_result 1" spacing="0"/> <portSpacing port="sink_result 2" spacing="0"/> <portSpacing port="sink_result 3" spacing="0"/> </process> </operator></process>
haddock wrote: that is down to the the plus sign. '[()]+' would match a round bracket followed by '+', so ' (+' or ' )+' but not...';^)'
Colo[()]+Options: ^ and $ match at line breaksMatch a single character present in the list “()” «[()]»Match the character “+” literally «+»Created with RegexBuddy
colo wrote:Hello,please stop double-posting new questions (perhaps abandon your new topic http://rapid-i.com/rapidforum/index.php/topic,3476.0.html).Do the memory problems occur before the model is trained? Otherwise you could store the model and then make your predictions in smaller pieces to avoid memory limitations. How much of your memory do you provide for RapidMiner?I absolutely have to disagree on this, the plus sign is a valid quantifier saying that at least one bracket has to occur. In my experience character classes are mostly used with a following quantifier. This works fine in other areas, only the "contains match" option didn't do what I expected.