🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

Stop Word and Stemming List / Dictionary

User: "arizah78"
New Altair Community Member
Updated by Jocelyn

Dear All...

I've been using RapidMiner for quite some time, especially for the text mining function. I have difficulty in retrieving the stop word list and stemming (snowball), both for English. The list would help me in updating the content and increase the preciseness of my text mining process. I do really hope if anybody could share with me these lists (stop word and stemming) or at least let me know where/how I can find these lists. Your kind assistance is highly appreciated. 

Thanks.

 

 

Find more posts tagged with

Sort by:
1 - 5 of 51
    User: "Telcontar120"
    New Altair Community Member

    There is a lot of more detailed information available about the snowball project here: https://snowballstem.org/algorithms/

     

    User: "sgenzer"
    Altair Employee

    hello @arizah78 -

     

    Just to add a bit...there was a similar request in another thread for the Arabic stopword list and I'm looking into it.  The lists are easy to access; we just want to make sure that we're allowed to (the extension is not open-source and hence the author of the list has copyright ownership by default).  I will let folks here on the community know when I get this answered.

     

    Thanks for understanding.


    Scott

    User: "arizah78"
    New Altair Community Member
    OP

    Hi Scott,

    Thanks for your update.

    Really hope to get a positive feedback soon.

     

     

     

    User: "arizah78"
    New Altair Community Member
    OP

    Thanks. Appreciate the link sharing.

    User: "sgenzer"
    Altair Employee

    hello @arizah78 - I have the code to the extension (which contains the wordlists) and it is indeed open source.  I will send you the file via PM.

     

    Scott