How can we implement dropna() in the rapidminer?

Anusha
Anusha New Altair Community Member
edited November 2024 in Community Q&A
Hi All!

I have a dataset that has NAs, N/A, null, NULL, and multiple spaces in different cells. I just want to remove those particular rows.
Can anyone guide me.

Source Data:

C1                  C2                      C3                C4

12                ADNF                   NCJK               NA
34                HDDW                  CNJ                        -(single space ) 
38               CNJKD                  JIC                  N/A
78                NJDS                    NCSW            NULL
90                 CJNEK                 C JDSK          12NJDNC
08             DNCJS                      CSKJ               null
13                           -(tab space)  bdjf                ndf097

Desired Data:

C1                  C2                    C3                 C4

90                 CJNEK               C JDSK          12NJDNC

Thanks in Advance!
Tagged:

Best Answer

  • Caperez
    Caperez Altair Community Member
    Answer ✓
    Hi @Anusha

    Into the Select Attributes operator you have many alternatives to carry out the filtration of your dataset, for example remove the missing values, or work with regular expressions. 

    Best

Answers

  • Caperez
    Caperez Altair Community Member
    Answer ✓
    Hi @Anusha

    Into the Select Attributes operator you have many alternatives to carry out the filtration of your dataset, for example remove the missing values, or work with regular expressions. 

    Best
  • MartinLiebig
    MartinLiebig
    Altair Employee
    Hi,
    First you use declare missing values to make it a missing, then you can use filter examples with 'is not missing' to remove it.

    Best,
    Martin