[SOLVED] Text mining: Does pruning make sense at all?

New Altair Community Member

May 30, 2012

Updated Nov 5, 2024 by Jocelyn

Hi,
i have a question (of cause):
The process document from text-operator can create fectors using the tf-idf-measure.
Further, it allows pruning the text beforehand based on e.g. the occurence of terms.
So, does it make sense at all to prune the text from frequen terms, when i want to use the tf-idf-measure?
Does pruning beforehand bias the resulting tf-idf-values?

Thank you very much,
Julian

Find more posts tagged with

AI Studio

Sort by:

1 - 2 of 21

MariusHelf

New Altair Community Member

May 30, 2012

Hi Julian,

often pruning does help, but there is no general answer. Just put the Process Documents operator into a Parameter Optimization and experiment with the parameter settings until you get good results.

Best, Marius

chaosbringer

New Altair Community Member

May 30, 2012

Thank you for your answer.
It seems to me that this is a bit fishing/dredging for data, but obviously i have to live with that. Thank you.

Best,
Julian

🎉Community Raffle - Win $25

[SOLVED] Text mining: Does pruning make sense at all?

Find more posts tagged with

Quick Links