A space for data analytics & AI related technical resources and discussions.
The Binning block enables you to group a variable into discrete categories. The following demonstrates how to use the Binning block to categorise an input dataset loan_data.csv (containing observations each of which describes a completed loan and the person who took the loan out) using the numerical variable Income: *…
</p The Transpose block enables you to restructure a dataset to convert rows to columns. The following demonstrates how the Transpose block is used to change the loandatasample.csv dataset (containing observations describing a loan and the person taking the loan out) structure such that each column represents an individual…
The Top Select block enables you to retain variables in a dataset. The block is used to keep or discard certain columns from an input loan_data.csv dataset (containing observations describing a loan and the person taking the loan out). The following demonstrates how to use the Top Select block to sort variables from the…
The Sampling block enables you take a sample of a dataset. Taking samples of datasets can help with processing times if you are working with a large dataset. The following demonstrates how to use the Sampling block to take ten percent of the dataset loan_data.csv: * Import the loan_data.csv dataset onto a Workflow canvas…
The Rank block enables you to grade observations of numeric variables based on their values through a ranking order. The following demonstrates how to use the Rank block to rank the values in the MockResult variable in an input dataset ExamResults.csv (which contains observations that describe a range of test scores from a…
The R block enables you to use R language programs in a Workflow. To use the R block, you must have a R interpreter installed and configured. The following demonstrates how the R block can be used to restrict an input lib_books.csv dataset (which contains observations that describe a range of books available from a lending…
The Query block enables you to create SQL code that you can use to join database tables or datasets, to get specific data from one or more database tables or datasets. The following demonstrates how to use the Query block to select rows from the input dataset lib_books.csv (which contains observations that describe a range…
The Python block enables you to use Python language programs in a Workflow. To use the Python block, you must have a Python interpreter installed and configured. The following demonstrates how the Python block can be used to sort an input lib_books.csv dataset that contains observations describing a range of books…
The Partition block enables you to split up a dataset variable into multiple parts. For example, if you are training a model, you can use this block to split up a dataset into training and testing datasets. The following demonstrates how to use the Partition block to split the input dataset IRIS.csv into three equal parts:…
The Mutate block enables you to create a new dataset variable based on existing variables in the dataset. The following demonstrates how to use an expression in the Mutate block to create a new variable in the loan_data dataset (containing observations each of which describes a completed loan and the person who took the…
There's a wealth of content waiting for you to explore! Need support or have a question? Login or join our community.