How to Segment Text Effectively Using TextTiling with Python 3.x

koushikdeb2009
koushikdeb2009 New Altair Community Member
edited November 5 in Community Q&A

I have written these lines of code:

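import nltk

# Read the whole file; the context manager closes it afterwards.
with open("1.ref", "r") as f:
    alice = f.read()

tt = nltk.tokenize.TextTilingTokenizer()
tiles = tt.tokenize(alice[0:2000])
print(tiles)  # Prints the whole text as a single-element list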

I need to process the full text. If I omit [0:2000], I get this error:

TypeError: slice indices must be integers or None or have an __index__ method


When I print tiles, I get the full text as a single item. I need to show the segmented text instead.
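
One possible explanation for the TypeError, assuming an older NLTK release is installed: in Python 3 the `/` operator returns a float, so a slice bound computed with it is rejected once the input is long enough for the division to matter. That would also fit the observation that the 2000-character excerpt runs while the full text fails. The sketch below only reproduces the Python behaviour; the names `scores`, `clip_float` and `clip_int` are illustrative and are not quoted from the NLTK source.

```python
# Illustrative list standing in for a sequence of per-gap scores.
scores = list(range(30))

# True division yields a float in Python 3, even when the result is whole.
clip_float = min(max(len(scores) / 10, 2), 5)   # 3.0

# Floor division keeps the value an int, which is valid as a slice index.
clip_int = min(max(len(scores) // 10, 2), 5)    # 3

try:
    scores[clip_float:-clip_float]
    message = ""
except TypeError as exc:
    message = str(exc)

trimmed = scores[clip_int:-clip_int]

print(message)
print(trimmed[0], trimmed[-1])
```

If this is the cause, upgrading NLTK (for example with `pip install --upgrade nltk`) is likely to be a simpler fix than slicing the input.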

Answers

  • Thomas_Ott
    Thomas_Ott New Altair Community Member

    @koushikdeb2009 can you post the XML process using the </> button? It would be helpful to see how you set up the Python script in the Execute Python operator.

  • koushikdeb2009
    koushikdeb2009 New Altair Community Member

    There is no XML process. It is a standalone program that implements the TextTiling process. If there is a code sample that implements TextTiling properly with Python 3.x, please send the link. It would be a great help to my project.

  • Thomas_Ott
    Thomas_Ott New Altair Community Member

    @koushikdeb2009 we usually see Python executed in RapidMiner via the Execute Python operator, which is why I asked whether you could post your XML process. If you're working strictly in Python, I'd suggest visiting a forum related to TextTiling.
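
For readers looking for the code sample requested in this thread, here is a minimal sketch of TextTiling on a whole file with Python 3 and a recent NLTK. It rests on several assumptions: that `1.ref` is plain UTF-8 text, that paragraphs are separated by blank lines (the tokenizer places segment boundaries at paragraph breaks), and that the English stopword corpus can be downloaded. The helper names `normalize_paragraph_breaks` and `segment` are hypothetical, introduced only for this example.

```python
import re


def normalize_paragraph_breaks(text):
    """Unify line endings and collapse any run of blank lines into one blank line."""
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # A newline, optional whitespace (including further newlines), then a newline
    # is treated as a single paragraph break.
    return re.sub(r"\n\s*\n", "\n\n", text).strip()


def segment(path, w=20, k=10):
    """Read a text file and return its TextTiling segments as a list of strings."""
    # Imported lazily so the helper above can be used without NLTK installed.
    import nltk
    from nltk.tokenize import TextTilingTokenizer

    # The tokenizer uses NLTK's English stopword list unless one is passed in.
    nltk.download("stopwords", quiet=True)

    with open(path, encoding="utf-8") as handle:
        text = normalize_paragraph_breaks(handle.read())

    # w is the pseudosentence size in words, k the block size used for comparison.
    return TextTilingTokenizer(w=w, k=k).tokenize(text)
```

Calling `segment("1.ref")` returns a list, so looping over it and printing each item shows the segments one by one instead of as a single list. A short excerpt gives the algorithm few gaps and few paragraph breaks to choose from, so getting a single tile from the first 2000 characters is a plausible result; smaller values of `w` and `k` may help on short texts.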