What would be the best operator to use to filter/remove bad data for Sentiment Analysis?

fishmansf New Altair Community Member
edited November 2024 in Community Q&A
I have a data set of reviews for the Spotify app, but some of the entries are corrupted and show up as strings of non-alphabetical, non-numerical characters, as seen below. What would be a good operator to remove this kind of data? Thank you!

Answers

  • Marco_Barradas
    Altair Employee
    Hi @fishmansf

    You can use the Filter Examples operator: set the filter condition to a regex match and define a regular expression that looks for these non-word characters, so the matching rows are removed from your example set.
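
To make the regex idea concrete, here is a minimal sketch in plain Python, outside the tool itself. It assumes a "bad" review is one made up entirely of non-letter characters (symbols, digits, underscores, whitespace); the pattern, the function names, and the sample reviews are all hypothetical, and the threshold for what counts as corrupted would need tuning against the real data.

```python
import re

# Assumed definition of a corrupted row: the whole text consists only of
# non-word characters, digits, or underscores (so it contains no letters).
# An empty string also counts as corrupted under this pattern.
ONLY_JUNK = re.compile(r"[\W\d_]*")


def is_corrupted(review: str) -> bool:
    """Return True when the entire review matches the junk pattern."""
    return ONLY_JUNK.fullmatch(review) is not None


def filter_reviews(reviews):
    """Keep only the reviews that contain at least one letter."""
    return [r for r in reviews if not is_corrupted(r)]


# Hypothetical sample data, not taken from the original data set.
sample = [
    "Great app, love the playlists",
    "@@@###???",
    "12345 !!!",
    "Too many ads",
]
cleaned = filter_reviews(sample)
print(cleaned)
```

A similar pattern could probably be tried in the operator's regex field, possibly with the condition inverted so that matching rows are dropped rather than kept, but that behaviour is an assumption worth checking on a small sample first.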
