"Text Mining"
Hi,
I am working on text mining with a huge dataset. I have applied a tokenizer, a stemmer and the EnglishStopwordFilter, but every TFIDF value in the result is 0. I have tried the same code on another system and it works fine there.
Please find the code below:
<operator name="Root" class="Process" expanded="yes">
    <operator name="TextInput" class="TextInput" expanded="yes">
        <list key="texts">
            <parameter key="review" value="C:\Documents and Settings\ADMIN\Desktop\dd"/>
        </list>
        <list key="namespaces">
        </list>
        <operator name="StringTokenizer" class="StringTokenizer">
        </operator>
        <operator name="EnglishStopwordFilter" class="EnglishStopwordFilter">
        </operator>
        <operator name="TokenLengthFilter" class="TokenLengthFilter">
        </operator>
    </operator>
</operator>
Thanks,
Nancy :-[
This process works for me with the newsgroup texts. Did you check whether any texts are loaded at all? Set a breakpoint inside the TextInput operator and inspect what arrives there.
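If you want to rule out an empty or unreadable input directory before opening the process at all, a quick check outside RapidMiner can help. The following is only a rough sketch in Python: the helper name `summarize_text_dir` is made up for illustration, and it runs against a throwaway directory here, so you would pass your own texts directory instead.

```python
import tempfile
from pathlib import Path


def summarize_text_dir(directory):
    """Return (number of files, number of empty files) directly inside directory."""
    files = [p for p in Path(directory).iterdir() if p.is_file()]
    empty = [p for p in files if p.stat().st_size == 0]
    return len(files), len(empty)


# Demonstration on a temporary directory (hypothetical file names and contents).
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "a.txt").write_text("some review text")
    (Path(tmp) / "b.txt").write_text("")
    demo_counts = summarize_text_dir(tmp)

print(demo_counts)
```

If the first number is 0, or it equals the second, the operator has nothing to tokenize and there is no point in looking at the TFIDF values yet.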
Depending on the texts, this result might even be correct: for example, if every word occurs equally often in all texts, the TFIDF would be 0.
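To illustrate why that can happen, here is a minimal sketch in Python using the textbook weighting tf * log(N / df). This is an assumption for illustration only: the exact formula and normalization used inside RapidMiner may differ, and the function name `tfidf` and the sample texts are made up.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    n_docs = len(docs)
    tokenized = [doc.lower().split() for doc in docs]
    # Document frequency: in how many documents each term appears at least once.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Every term occurs in every text, so log(N / df) = log(1) = 0 for all of them.
same_vocab = ["good movie good plot", "good plot movie movie"]
# "good" and "bad" each occur in only one text, so they get a positive weight.
mixed_vocab = ["good movie", "bad movie"]

same_weights = tfidf(same_vocab)
mixed_weights = tfidf(mixed_vocab)
print(same_weights)
print(mixed_weights)
```

With this definition, any term that appears in every text gets weight 0 regardless of how often it occurs, so a corpus whose texts all share the same vocabulary produces an all-zero result.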
Greetings,
Sebastian