Geographic operations in RapidMiner

BalazsBaranyRM
edited November 2024 in Community Q&A
Sometimes you need to process data sets that contain geographic information, e.g. locations. For example, you might want to calculate the distances between your customers' addresses and your shop.

Over the last few weeks, I prepared some processes that solve common GIS tasks. They use the GeoScript project's libraries and RapidMiner scripting. You might find this functionality useful.

Part 1 describes the basics of Geographic Information System operations and the installation of GeoScript in RapidMiner Studio:
https://datascientist.at/2015/12/gis-in-rapidminer-1/#english

Part 2 is about importing Shapefiles, a popular file format in GIS circles:
https://datascientist.at/2015/12/gis-in-rapidminer-2-shapefile-import/#english

Part 3 describes distance and area calculations:
https://datascientist.at/2016/01/gis-in-rapidminer-3-distanz-flaeche/#english
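
To give a rough idea of what a distance calculation like the shop example involves, here is a minimal Python sketch using the haversine formula on a spherical Earth. It is not the GeoScript code from the linked post, just an illustration of the principle. The function name, the coordinates and the customer labels are made up for the example, and the result is only an approximation because the Earth is treated as a sphere.

```python
import math

# Mean Earth radius in km. Assumption: spherical Earth model, so results
# deviate slightly from a proper ellipsoidal calculation.
EARTH_RADIUS_KM = 6371.0088


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two lat/lon points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


# Hypothetical example data: one shop and two customers as (latitude, longitude).
shop = (48.2082, 16.3738)
customers = {
    "customer_a": (48.3069, 14.2858),
    "customer_b": (47.0707, 15.4395),
}

# Distance from the shop to every customer, in km.
distances = {name: haversine_km(*shop, *pos) for name, pos in customers.items()}

for name, km in distances.items():
    print(f"{name}: {km:.1f} km")
```

For projected coordinates or for high accuracy over long distances, a GIS library that handles coordinate reference systems is the better choice.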

Part 4, the last one for now, is about joining and filtering data sets using geographic criteria:
https://datascientist.at/2016/01/gis-in-rapidminer-4-geo-filter-und-joins/#english
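
As an illustration of what a geographic join does, here is a small Python sketch that assigns points to the polygon containing them, using a ray-casting point-in-polygon test. Again, this is not the GeoScript approach from the linked post. The function names, the two rectangular "districts" and the points are hypothetical, and the coordinates are assumed to be planar. Points exactly on a polygon edge are not handled specially.

```python
def point_in_polygon(x, y, polygon):
    """Ray-casting test: True if (x, y) lies inside the polygon.

    polygon is a list of (x, y) vertices in order; it is closed implicitly.
    """
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does this edge cross the horizontal line through the point?
        if (y1 > y) != (y2 > y):
            # x coordinate where the edge crosses that line
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def spatial_join(points, regions):
    """Map each point name to the first region that contains it, or None."""
    result = {}
    for name, (x, y) in points.items():
        result[name] = next(
            (region for region, poly in regions.items()
             if point_in_polygon(x, y, poly)),
            None,
        )
    return result


# Hypothetical example data: two rectangular districts and three points.
regions = {
    "north": [(0, 5), (10, 5), (10, 10), (0, 10)],
    "south": [(0, 0), (10, 0), (10, 5), (0, 5)],
}
points = {"p1": (2, 7), "p2": (8, 1), "p3": (20, 20)}

joined = spatial_join(points, regions)

# Geographic filter: keep only the points that fall into some region.
matched = {name: region for name, region in joined.items() if region is not None}
print(joined)
print(matched)
```

A real GIS library would also use a spatial index so that not every point has to be tested against every polygon.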

Feel free to ask if something is not described clearly enough. I'm also really curious about the use cases people solve with this, so please post your results!

Cheers

Balázs