Altair RISE
A program to recognize and reward our most engaged community members
Nominate Yourself Now!
Home
Discussions
Community Q&A
I want to combine multiple data.
furufuru
Hello,
I started to learn rapid miner recently.
I want to combine multiple data. I import 10 excel files, so I want to combine these files into one. 'Join' operator connect only two files. It's annoying to connect one by one. Please tell me efficient method.
Sorry, I'm not good at English.
Thank you
Find more posts tagged with
AI Studio
Accepted answers
Pavithra_Rao
Hi
@furufuru
I would recommend using the 'Turbo Prep' feature in RapidMiner Studio to make it much easier to Join 10 excel files.
https://rapidminer.com/products/turbo-prep/
Cheers,
Pavithra
YYH
Hi
@furufuru
welcome to RapidMiner community.
The joining of 10 different data set is doable with "loop". I am listing step by step SUDO codes that might be useful
remember your 1st table, "remember" operator, name it Merged_Tab
join your 1st and 2nd table with some keys, you will need "recall" to bring the 1st table and "remember" the joined results, name it the same Merged_Tab
repeat step2 as you always have any updated Merged_Tab and a new table, keep joining the new tables
finish up for all 10 tables
Before you execute the loop, you could load all data into repository and "loop repository" will automatically take each table as input for each iteration. The important thing is to "remember" the joined results from two inputs for each iteration i, and "recall" the the joined table for the next iteration i+1
Hope it helps.
YY
All comments
Pavithra_Rao
Hi
@furufuru
I would recommend using the 'Turbo Prep' feature in RapidMiner Studio to make it much easier to Join 10 excel files.
https://rapidminer.com/products/turbo-prep/
Cheers,
Pavithra
YYH
Hi
@furufuru
welcome to RapidMiner community.
The joining of 10 different data set is doable with "loop". I am listing step by step SUDO codes that might be useful
remember your 1st table, "remember" operator, name it Merged_Tab
join your 1st and 2nd table with some keys, you will need "recall" to bring the 1st table and "remember" the joined results, name it the same Merged_Tab
repeat step2 as you always have any updated Merged_Tab and a new table, keep joining the new tables
finish up for all 10 tables
Before you execute the loop, you could load all data into repository and "loop repository" will automatically take each table as input for each iteration. The important thing is to "remember" the joined results from two inputs for each iteration i, and "recall" the the joined table for the next iteration i+1
Hope it helps.
YY
furufuru
Hi
@Pavithra_Rao
Turbo Prep is useful. I will study to use Turbo Prep.
Thank you.
furufuru
furufuru
Hi
@yyhuang
I could understand your description.
Thank you.
furufuru
Quick Links
All Categories
Recent Discussions
Activity
Unanswered
日本語 (Japanese)
한국어(Korean)
Groups