Keep the date column after tokenization for text mining

Question

Hi everyone, When conducting sentiment analysis on my dataset, I receive a positivy/negativity score for each article (each row) by counting positive and negative words. I would now need this table to be expanded by the date for each article (each row) from the original file, which not only included text but also date information. How can I do that? Here is my code: Thanks a lot in advance!

Titzaaa · Answer

Thank you to both of you! Unfortunately the "set role" does not work, as the column remains available only before applying a loop collection and applying the dictionary, afterwards it is not there anymore. For the join operator I cannot find a key attribute. Please find attached the text I conduct the Sentiment Analysis with. Here again the code: Export_finanzen.net_V5.xlsx

kayman · Answer

You could use the 'set role' operator for that. You are actually not restricted to the dropdown options but can give any name to whatever attribute.

So if you would give your datefield the role 'date' before you start your sentiment flow,it becomes a special attribute, and by default it will remain available for the rest of your process.

lionelderkrikor · Answer

Hi @Titzaaa,
I would say that you have to use the Join operator.
But to better understand can you share all your data ?
if not can you at least provide a sample example of what you have, and from this example, what you want to obtain ?

Regards,

Lionel