How to Segment Text Effectively Using TextTiling with Python 3.x
I have written these line of codes
f = open("1.ref","r")
alice=f.read()
tt = nltk.tokenize.TextTilingTokenizer()
tiles = tt.tokenize(alice[0:2000])
print(tiles) #Total text in a single valued list
I need to consider full text.If I omit [0:2000] ,then getting an error
TypeError: slice indices must be integers or None or have an index method
While printing tiles ,I am getting full text.I need to show segmented text.
Answers
-
@koushikdeb2009 can you post the XML process using the </> button? It will be helpful to see how you formed the python script in Execute Python operator.
1 -
There is no XML process.Its program to implement Text Tile process.If there is any code sample to implement texttiling properly with python 3.x ,then please send the link.It will be great help to my project.
0 -
@koushikdeb2009 we usually see python executed in RapidMiner via the Execute Python operator, hence me asking if you can post your XML process. If you're strictly working in python, then I'd suggest visiting a forum related to Text Tiling.
0