-
Map operator function
Hi. First off, this may or may not relate to RapidMiner's problem directly. It's just that i am stuck with my data, to go through with using Map operator (as part of ETL process). Please note that I am first and foremost a language student and I had to take this course as a requirement. So, i basically have zero knowledge…
-
cross project dataset
Hello, Can RapidMiner make data mining on cross project (more than 2) to make 1 dataset for testing and 3 different data set for training? What data integration tool / node can do this? Thank you
-
Hiring at my employer
My team is looking for someone experienced in RapidMiner process development, data engineering, and building a data lake. We keep getting resumes from developers in various coding languages, but that's not our focus. We want someone who knows RapidMiner. Please check it out, I can't post links, goto SchwabJobs then search…
-
Aggregate / Generate Attribute - merge and sum up multiple example rows
Dear Ladies and Gentlemen, I would like to sum up my Data row values per date for each item, in order to have only one row per date with the total quantity data ("Menge kg") for the individual item. Varying prices ("Preisherkunft") exist, which has resulted in multiple entries for the same item ("Artikel-Nr",…
-
Pattern Recognition
Hello, I m trying to identify patterns in a time series data(please refer to the data below),in such a way that it tells me about the signature(s) (pattern it has in a particular period of time and therefore the algorithm extract those continuous time period in the form of 'from' and 'to' dates). Suppose my resultant…
-
Dummy-Encoding of Movie Genres
Hello Community,I am new to Rapidminer and currently trying to prepare a dataset through turbo-prep and I am stuck at the movie genres.The genres are given like : ['Action', 'Drama', ...] I would like to preserve the genres as a attribute, to do this I tried to dummy encode them but then each combination of genres gets…
-
Time series data with binary column analysis
Hello there , i m trying to generate two timestamp columns (from,to) out of the table given below in such a way that whenever there is '1' after '0' in the resultant column that particular timestamp should copied to the from column and whenever there is '0' after '1' that particular timestamp should be copied to the "to"…
-
Impute missing values of more than one attribute
Hello I need to impute the missing values of 2 different attributes, i have tested and selected a knn and a neural network to predict each variable, what i don´t know is how to implement both algorithms in the subprocess "Impute missing values"; when i try aplying the neural network with his specific label, the process…
-
Excel worksheet loop using Macro
Hi All - I'm having trouble with something I thought would be relatively easy and unable to get through it. I basically would like to "Read Excel" in a loop by using a macro to loop through however many sheets are in this file as they can change from month to month. The data structure is the same on each tab but each…
-
Help with disappeared column
Good morning,I am trying to use several join operators to be able to make 1 to n relationships from several tables created previously. That join seems to have only the new table Accessibility and the previously selected attributes. The weird thing is that I have not selected Id_Marcas, anywhere and it appears alone. Where…
-
How to change data of value
How to change data of value? like example the data contains 0 and 1, i want replace the number if 0 mean male and 1 is female, anyone have idea? I already used replace and map operators before but its not work
-
Example Set Transpose to Attribute Names
Hi All, Hoping I'm missing the simple answer here but I'm trying to transpose an aggregated set of data (Sales Months) to be Attribute Names (trying to make this dynamic as the number of months varies from data set to data set). below is a screenshot of current view. I attempted to "Rename by Example Set" but ended up with…
-
Stemming Dictionary
Hi, I want to know how to implement stem dictionary, for stemming process in malay language , I help anyone can help me resolve this problem. Thanks,
-
Example Transformation and formatting
Hi All - I have question around transforming / formatting an incredibly ugly Excel file (export from PDF). I have essentially rows 1-3 as important parts of a clothing product (color / product name / style number) listed on 3 rows repeating (Total file is roughly 900 rows). I then need to apply the corresponding headers /…
-
Filter Stopwords with Regular Expression
Hi guys, I'm currently doing a sentiment analysis in Rapidminer with Knn. I want to count the number of words that are left in the document when removing stopwords. Using the "Filter stopwords" operator inside the "process documents from data operator" only works if I tokenize the data and use the "Nominal to Text"…
-
Remove seemingly "empty" rows in Excel Sheet - Containing Spaces
Hi guys, I'm currently cleaning my data and also used an operator to remove missing values. I realized, after writing an excel sheet, that there were still some "empty" cells left. These cells aren't empty though, they are containing a space, thus they seem empty but are really not and Rapidminer won't detect them as such.…
-
Removing mentions with "@" and emojis from Excel Data
Hello RapidMiner Community, I am currently working on a supervised sentiment analysis. I had success doing the sentiment analysis itself, but I'm not quiet happy with the data it uses. As part of the data preparation, I wand to remove mentions (thus names following an "@" ) and I have tried out some suggestions. The…
-
How to fill missing values, based on other records in my dataset?
Hello everyone, I am working on data preperation for training a ML algorithm on a classification problem. In my dataset (~23000 records) some values (dates) are missing. The dataset looks like in the screenshot below: Every row is a position on a particular customer order. Every order has a unique Number (column A).…
-
Cannot map index of nominal attribute to nominal value
Hello Sir: I got a error for use LSTM in Rapidminer ,and I don't have missing value in data. process : <?xml version="1.0" encoding="UTF-8"?><process version="9.7.002"> <context> <input/> <output/> <macros/> </context> <operator activated="true" class="process" compatibility="9.4.000" expanded="true" name="Process"…
-
Error reading Excel files - issue probably due to merged cells in first row
Hello eveverybody, I am simply trying to import data from Excel files. Unfortunately, I am facing an issue using the read Excel function as shown in the next picture : I notice that this is most probably due to the fact that cell 1A is merge with cell B1. Indeed, when I try my process using "Unmerged.xlsx" file, it works…
-
how create this attribute
Hi, I´m new in RM. I have to do the next task: Transform the attribute "index_evento". Excel to RM. I need you help !!! I need the attribute "index_event" as like in Excel. In the next comment you will find the XML. I´m going to share the last part of the process. THANKS !!!
-
how to make a football ranking?
Hi guys, so i'm stuck to make a football ranking with my data. i did the data processing but now i don't have any idea how to group each team with the points. So if home team won + 3 Home team lose + 0 scoreless + 1 away team + 3 away team lose + 0 it would help me a lot :)
-
Why does a correctly formatted Excel date import as a long row of nominal numbers?
Hi there, Wondering why my correctly formatted (short date) in Excel (dd/mm/yyyy) imports as a long row of nominal numbers when using xlsx? I've seen a huge amount of coverage on this but nothing that addresses the reason why a string of (meaningless to me!) nominal numbers would somehow be easier to interpret?…
-
How to select external data that is relevant to train models for the use in the company?
Help others to get started with Machine Learning. To democratize the organizational use of AI especially for those with little data, I have set up a survey to gain insights on the training of Machine Learning models on external data. https://www.surveyhero.com/s/external_data_sources_ML But I need your help (if you are…
-
Converting text to numerical values with generate attributes
Good morning I am trying to generate a new column in the file using the generate attribute operator. This new column would take the text in the serviciointernet column and replace these text strings with numerical values. For example, a "no" would be replaced with a 0, DSL with a 5, and fibraoptica with a 10. I am quite…