Large data set with Time Series

User: "Toaldo"
New Altair Community Member
Updated by Jocelyn
Dear all - I am kind of new to using RapidMiner. So, I am working with a large data set with Time Series (from 2000 to the 2019 year). There are about ~200.000 lines and 4 different attributes (variable, region, times series, and values). The Decision Tree and Forecasting with Windowing are one of those that are on my radar. Anyway, I am kind of lost here... what type of analysis I could do within this type of database? Thanks in advance for your help! Alexsandro Toaldo

Find more posts tagged with

Sort by:
1 - 2 of 21
    User: "Toaldo"
    New Altair Community Member
    OP
    Accepted Answer
    Hi Martin -
    Thanks for your prompt response.
    This is a great question, therefore I am not sure yet. 
    As a background, I am working with public information about our city (Sao Paulo) which contain about ~200.000 register within 4 different attributes. As this is a time-series dataset, I am not sure where I could start and what type of analysis I can do. The attached file is a sample of the dataset. 


    User: "MartinLiebig"
    Altair Employee
    Accepted Answer
    Hi,
    first you likely want to Pivot this whole table to get something like:

    Date, Region, Value Of Taxa de Universalizacão, Value Of ... , Value of ...

    This is more the data set of interest.

    In German we got the saying: To saddle the horse from the wrong side. That's somewhat what you do here. Usually you have a problem and formulate a question to the data you want to answer. You are doing it more the other way around, which is tough.

    Besides forecasting a general thing to do with this data may be outlier detection. Are there values which are unexpected? And why? Maybe this helps.

    Cheers,
    Martin