🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

python scripting operator

User: "Mohamad1367"
New Altair Community Member
Updated by Jocelyn
hi dear community...i have a question about python scripting operator...i have a dataset in persian that i want to tokenize and then filter stop words...for tokenization i use rosette extension  but for stopword removal i want to use a code written in python...where i shoul put this operator in my procces?is there anyone to help me?

Find more posts tagged with

Sort by:
1 - 4 of 41
    User: "kayman"
    New Altair Community Member
    Is there a real need to use an external stopwordlist or Rosette? 
    The text mining extension has all of these options also, and this way you could keep your work flow a bit more organized. 
    User: "Mohamad1367"
    New Altair Community Member
    OP
    @kayman yes i need the rosette extension to tokenize my dataset because my data set is in persian language which rosette supports this language but text mining extension don't support persian for this pupose
    User: "kayman"
    New Altair Community Member
    Clear, Farsi is indeed not in the standard toolkit.
    I'm not familiar with the rosetta output, but as it seems an exampleset you could just add the python operator straight after your Rosetta operator. This way you can reuse your existing code.

    User: "sgenzer"
    Altair Employee
    @Mohamad1367 there's a Persian stopword dictionary in the Community Samples repo:



    Scott