-
Clustering based on geographical and non-geographical attributes
I'm working on an order dataset and I want to cluster orders by different clustering algorithms. also I want to add following constraint. For clustering, I have both geographical and non-geographical, and I want to cluster in such a way that each city is placed in only one cluster, and therefore all orders that include…
-
How to calculate predictive weights?
Hello! I am using multiple variables (rainfall, temperature, wind speed, slope, distance to settlements, etc) for predicting fire events using different machine-learning algorithms. In the below-given example, I have used Random Forest to do so. Using 'Explain Predictions" this result was generated. In Support Prediction,…
-
Where I will get certification in Altair?
Process
-
Get min row column name
I have a table like this A B C D 5 2 1 9 I should get C in the results tab. Is there any operator to do this? Can somebody please help.
-
Loop +Aggregate; color to a column
Hi, I have say S.No, Required, A, B, C, D, E.... 2 columns are surely present (S.No, Required) and others are unknown. But at least 1 will be there.(S.No, Required, A). What I want is to multiply Quantity* A, Quantity*B,Quantity*c like that.. with all the columns except S.No. later want to add all the values =>…
-
RapidMiner - aggregate operator issue
Hi, i'm following the rapidminer data engineering course, im using an ubuntu laptop with jdk version >8 & have mentioned rapid miner version above. Here is the screenshot of the error i'm facing - i encountered it while i was following the generate attributes chapter I also tried to use market place to find relevant plugin…
-
How to I sort out various unstandardised data from a single cell ?
Hello everyone, For context, I'm trying to find out which marketing medium is the most effective to be used by Starbucks. Attached is the dataset used and the column pertaining to my part is question 19. As you can see, the data are retrieved from survey forms, as such, the format of their answer varies (e.g. some people…
-
Aggregate operator: How to use "most" as an aggregation function?
Dear RapidMiner community, in the Aggregate operator (Blending->Table->Grouping->Aggregate) I do see "least", but not "most". How can I use "most" as aggregation function?
-
One-Hot Encoding Top 10 Items (Fractional) Rest Other
Hello together, i am searching for a smart solution for One-Hot Encoding to the Top 10 (Fractional) Items. Currently I solve the problem by creating a new attribute for the top 10 values. For example: For each Attribute I need to generate a new Column: if((contains([Attri],"Example Data")) ,1,0) Does anybody have a smart…
-
Aggregate / Generate Attribute - merge and sum up multiple example rows
Dear Ladies and Gentlemen, I would like to sum up my Data row values per date for each item, in order to have only one row per date with the total quantity data ("Menge kg") for the individual item. Varying prices ("Preisherkunft") exist, which has resulted in multiple entries for the same item ("Artikel-Nr",…
-
Analyzing Categorical Data or Polynomial Data in Rapid Miner
Hi! I need help. I am currently task to analyze and cluster the data that I've got in our Learning Management System. But, I have a problem. I am really new to R and I do not know how to perform analysis on the Polynomial data. It doesn't allow me to normalize the data or put K means clustering to it. Please help the…
-
Time series data with binary column analysis
Hello there , i m trying to generate two timestamp columns (from,to) out of the table given below in such a way that whenever there is '1' after '0' in the resultant column that particular timestamp should copied to the from column and whenever there is '0' after '1' that particular timestamp should be copied to the "to"…
-
Group & Sum items based on basketID
Hi to community, This is my first post here. I want to sum the unique products ab-15 (1 +1 ) as quantity, but i want to sum products included in each basketID so to output 13151 ab-15 2 13151 ab-24 3 etc... 13152 ab-15 1 13152 ab-17 3
-
Outlook Emails exported to excel to be grouped by Subject
Hi Rapidminer Community, I came across the Aggregate function and would like to group by the Subject type. However, I do realise that the group by function works differently from the Group by function in Outlook. Take the below email subject type for instance: Email 1: This is it Email 2: Re: This is it Email 3: FW: This…
-
What is fractional count? What is percentage count? What is fractional sum?
In the Aggregate operator I could find the aggregation functions count(fractional), count(percentage) and sum(fractional).How do these aggregation functions work?
-
Map Operator isn't working with Aggregation Operator
Hello, I calculated the average value of each product with the Aggregation operator. I got those values: Now I'm trying to categorize 2 products together. Wines and Meat, Fish and Gold, Fruits and Sweets, like this: I want to visualize all 3 categories in the Pie Plot Type but if I run this process I still only get all…
-
Distinct counts
I have a dataset with thousands of IDs that have multiple encounters tied to them and each encounter has multiple events in it (each ID, encounter and event are listed rows). These all have specific dates tied to each encounter. The dataset is thousands of rows long. I am trying to count the number of distinct encounters…
-
count total occurrence and sort by date
I have rapidminer example set like this, ID Issue Exp 100 9/8/2020 11/8/2020
100 8/5/2019 9/5/2019
101 6/3/2020 10/1/2020
102 8/15/2020 12/12/2020 I want to add a new column which will count the occurrence of the ID by adding the numbers and sort by the earliest date so we know at what date how many count I had. Output…
-
How do I use aggregation to find the average of a row?
A pleasant day/evening to the community. I hope you can help me again with my problem. I have a dataset that has around 500 rows relating to student grades. It is really hard to explain in words but here's what the dataset looks like student_id | Math | English | Science | PE | History | Computer student1 80 92 87 95 90 86…
-
How do i aggregate per Attribute where value of row is 1?
Hello all together! I got a csv file which contains this: i need to count the number of 1 per attribute. I had an Idea to first filter for every attribute = 1 and then to aggregate each attribute with count, but with 110 attributes this could take a while. Does anybody have an Idea how to speed up this process?
-
I want consider columns through regex in generate aggregation operator
Hello, I have total 20 columns in my data ,three column like scenario_1,scenario_2,scenario_3 i want concat these three through generate aggregation operator how can i do this. I write regex like "\w+_ " but not working for me
-
How to transform daily time series to monthly data and keeping the years ?
These are the dataset statistics. As you can see it runs over several years (from jan 2015 till may 2020) Here is a sample of how my data looks like before transformation. How can I transform my dataset to something like this ? Sum of Sales per month per year. I already tried the following: - convert date to numerical…
-
Specific case for Joining multiple datasets
Hi everyboday, In a data science project I have recieved 4 cleaned datasets on an intersting topic healthy diet to protects ourselselves against Covid-19. I have already imported these 4 data sets in Rapidminer Studio. before I analyse them with machine learning, modelling and statistical preditions I would like to join…
-
Aggregate using the same set
I am trying to aggregate on the set itself on the basis of intra day amount happening. May be below query will help you understand my requirement more SELECT B.time, B.date, B.att1, SUM(A.COST) FROM TBL1 B, TBL1 A WHERE A.time <= B.time AND A.DATE = B.DATE AND A.att1 = B.att1 GROUP BY A.att1,A.att2,A.time, A.date; I am not…
-
grouping
Hi! I am a beginner so I just started to learn RapidMiner, and I have a question. I have a huge database and I want to simplify my database. I want to create groups. The database contains people data, like AGE of the people. From 18 years old to 100 years old and I want to make 4 group. Age below 20, Age between 21 and 40…