Hi. I am new to RapidMiner text plugin.
I have an XML file consisting of <document> elements. Each document tag contains one document as follows:
<documents>
<document>
<id> 1 </id>
<text>...............</text>
</document>
<document>
<id> 1 </id>
<text>...............</text>
</document>
...
</documents>
I think I have to split them first and extract documents to be able to construct the word vector. Is there any way to do that?