"Reading a CSV file"
ripple
Hi everyone, I'm a new RapidMiner user working in the analytics division of a bank. One issue I haven't been able to resolve: when reading CSV files, the reader handles the file properly if the delimiter is ~ or ;. But how do I read files that use | as the delimiter? If I'm not mistaken, | is taken as the XOR operator in RapidMiner. Unfortunately, the data I receive is occasionally a text file of around 700 MB, so replacing each | with ~ doesn't seem to be a feasible option.
regards
Answers
Greets,
After uni, my first job was in credit analysis, so you have my sympathies! You can use the CSV reader if you add "\|" to the front of the regex string that matches column separators. The backslash is needed because an unescaped | means "or" in a regular expression. The process below shows the setting; you just need to point it to the file.
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<process version="5.0">
  <context>
    <input/>
    <output/>
    <macros/>
  </context>
  <operator activated="true" class="process" expanded="true" name="Process">
    <process expanded="true" height="381" width="868">
      <operator activated="true" class="read_csv" expanded="true" height="60" name="Read CSV" width="90" x="93" y="75">
        <!-- \| matches a literal pipe; the other alternatives match comma, semicolon, and whitespace separators -->
        <parameter key="column_separators" value="\||,\s*|;\s*|\s+"/>
      </operator>
      <connect from_op="Read CSV" from_port="output" to_port="result 1"/>
      <portSpacing port="source_input 1" spacing="0"/>
      <portSpacing port="sink_result 1" spacing="0"/>
      <portSpacing port="sink_result 2" spacing="0"/>
    </process>
  </operator>
</process>
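If you want to check what that separator regex does before running it on a 700 MB file, here is a rough sketch in Python. It assumes Python's `re` module treats this particular pattern the same way as the Java regex engine RapidMiner runs on, which should hold for simple constructs like these. The `split_line` helper and the sample rows are made up for illustration and are not part of RapidMiner.

```python
import re

# Same pattern as the column_separators value in the process above:
# a literal pipe, OR comma + optional spaces, OR semicolon + optional
# spaces, OR a run of whitespace.
COLUMN_SEPARATORS = re.compile(r"\||,\s*|;\s*|\s+")


def split_line(line):
    """Split one line of text into fields using the separator regex."""
    return COLUMN_SEPARATORS.split(line.strip())


# Hypothetical sample rows, not taken from the original post.
print(split_line("1001|2010-03-15|250.75"))
# The \s+ alternative also splits on spaces inside a field.
print(split_line("John Smith|42"))
```

The second call is a reminder that the `\s+` part of the pattern splits on any whitespace, so if your fields can contain spaces you may want to drop that alternative and keep only `\|`.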
Ciao