-
Is there a way to filter by values in specific rows?
For example, using 'Filter Examples':I can filter data ('rows') where the value of the first 'column' is not null. But what I want to do is:I want to filter out 'columns' where the value of the first 'row' is not null. I've been using 'transpose' to handle it, but is there another way? Please help if there is a way to…
-
How to create a filter contains AND/OR clauses
I want to remove outliers based on the market segements, different market segment has different ranges of prices, so I want to filter based on both attributes for example (Market segment = online and price > 500) or (Market segment = offline and price > 1000)
-
I am trying to perform null validation on a table
null validation has to performed on the table consisting of more than one column. Then storing it in two different tables in SQL Server such that records which pass the validation that is the records that are not null must get stored in one table and the records which fail the validation should get stored in separate table…
-
Filter data: If data contains | then don't consider those records
Hello, If multiple attribute contains attribute value like A|B or 123|987 how to handles dynamically w.r.t all attributes Thanks, Sachin N
-
Filtering WordList
Hello RM Community, I got a WordList something like Word InDocuments Total aaa 5 5 bbb 5 5 ccc 5 3 I want to filter the word with a different number between InDocuments and Total. It should be something like: InDocuments "does not equal" or "=" Total, but I failed this with Filter Examples. Any idea to solve this? Thanks!
-
Extraction of sentences based on a wordlist (to create a new doc)
Hello, For the purpose of my thesis I have to analyze multiple corporate reports. I have to extract from these reports sentences that contains specific words (from a wordlist) and create a document with all the selected sentences, which will be used later for further analysis. For that I used first a "read document"…
-
How to I sort out various unstandardised data from a single cell ?
Hello everyone, For context, I'm trying to find out which marketing medium is the most effective to be used by Starbucks. Attached is the dataset used and the column pertaining to my part is question 19. As you can see, the data are retrieved from survey forms, as such, the format of their answer varies (e.g. some people…
-
How to filter out examples with missing values!
Hello everyone! I am using the data set "Titanic" and am wondering how to filter out the examples with missing values. I see how to filter and see the ones with missing values, but not the other way around!
-
Time series data with binary column analysis
Hello there , i m trying to generate two timestamp columns (from,to) out of the table given below in such a way that whenever there is '1' after '0' in the resultant column that particular timestamp should copied to the from column and whenever there is '0' after '1' that particular timestamp should be copied to the "to"…
-
Multi Label for Deep Learning Modell
Hey, I'm kinda new with RapidMiner. I'm working on a Deep Learning Modell and my dataset has around 400 attributes which has to be predicted. Is there an possible way to label these attributes without doing it one by one? Thanks!
-
create a Logistic Regression with restrictions
Hi I want/need to build a Log. Regression Model with additional restrictions for my Bachelor Thesis. in my case: ( value of Attribute - avarage of this atribute) * Confidence(1) / 1000 <= x (extern Variable) && ( value of Attribute - avarage of this atribute) * Confidence(1) / 1000 >= -x in Words: Get best Regression Model…
-
Decision tree looks inaccurate
Hey guys, I am fairly new to using rapid miner and in this dataset, I'm trying to analyze the viability of a property (based on the reviews, overall satisfaction and the date listed). So I have used filter example to eliminate data records that do not have a review nor a satisfaction rate, because I'm assuming the…
-
Best way to convert Nominal values by using expressions
I’ve set of data with INCH measurement unit , the values are so missy and I’m trying to figure out a way to unified them and convert them to a way I can work with them Example of the values : * 1/8 , 1/16 , 19/17 , 13/3 , 1.1/8 * 0.139 , 0.07 , 3.37 , .250 , 0.14 * 3-1/8 , 1-1/26 , 1-18/23 , 10-1/2 * 20 , 3 , 10 *…
-
Need all the timestamp between two dates.
I have a dataset which consist of timestamp. The timestamp are in a gap of 15 mins. If I have two dates stored into macros ( e.g. first_date, last_date), how can I fetch all the timestamp between two macros? Example: 2020-08-15T02:30:00.000Z 2020-08-15T02:45:00.000Z 2020-08-15T03:00:00.000Z 2020-08-15T03:15:00.000Z…
-
Tell k-NN (and possibly other models) to ignore training data dated past the Unlabeled record's time
I have a large database of news records and their published timestamp. I'm currently experimenting with using k-NN to classify the company's stock behavior by comparing the news to similar cases that have occurred in the past. Naturally, I don't want the model to use any news that has been published AFTER the…
-
How to change metadata especially values of a label attirbute?
Hi I have a label attribute that has some values. When I filter the example set to remove those records including a special value for their labels, the remind example set has that special value in its metadata, and my average precision and recall will be reduced even though my example set has not that value. I have to use…
-
How I can erase a column from the data base?
If my data base has a column with a missing value. And I want to delete this column or hide it. How can I erase that and then when I press de play button show me the result with out this column ?
-
How can I take only the variables with at least 5.000 observations?
Hello folks, I need a hand here... How can I take only the variables with at least 5.000 observations? I have too many variables, thank
you in advance. Cecilia
-
Comparing every row of an exampleset with all the rows in another
I have two examplesets, say A and B, with the same set of attribute names, and each individual row from A needs to be compared with all rows in B to be categorized based on a criteria on a particular attribute. My initial thought was to use a Loop Examples operator to iterate over the rows of A, and to retrieve B and apply…
-
Hello everyone,I want prediction with aply model and linear regression but ı want prediction all at
Hello everyone,I want prediction with aply model and linear regression but ı want prediction all attrübutes.How can use set role
-
Filter examples with dictionary
Hi, I'm looking for a solution to filter out examples of a data set, using a dictionary containing words. That would be a "filter example" operator working as the "replace (Dictionary)". That could allow to filter out all examples if a chosen attribute would contain words contained in the dictionary (or, with the "invert…
-
How do i aggregate per Attribute where value of row is 1?
Hello all together! I got a csv file which contains this: i need to count the number of 1 per attribute. I had an Idea to first filter for every attribute = 1 and then to aggregate each attribute with count, but with 110 attributes this could take a while. Does anybody have an Idea how to speed up this process?
-
Remove seemingly "empty" rows in Excel Sheet - Containing Spaces
Hi guys, I'm currently cleaning my data and also used an operator to remove missing values. I realized, after writing an excel sheet, that there were still some "empty" cells left. These cells aren't empty though, they are containing a space, thus they seem empty but are really not and Rapidminer won't detect them as such.…
-
how to comapare date in filter example operator
Hello, how to compare date in filter example operator e.g. I have multiples date in "Date" column and i want to filtered only 16/8/2020 data. how can i do that? I tried this [TRUNC(INPUTDATETIME)]==16/8/2020 not work for me
-
how Can I filtered out multiple product through filter example expression.
Hello , I have two macro that contain %{key} %{value} In key I have "Product" and Value I have "Remittence,Travel Card" In my input data contains column "Product" that contains many product i want to filtered out only two i mentinoned in macro How can i do?