A program to recognize and reward our most engaged community members
cgkolar wrote:1. Is there a way to use an unsupervised classifier to "cluster" student-years to identify aberrant performance. (ie: the student has a really good or a really bad year)
<?xml version="1.0" encoding="UTF-8" standalone="no"?><process version="5.2.008"> <context> <input/> <output/> <macros/> </context> <operator activated="true" class="process" compatibility="5.2.008" expanded="true" name="Process"> <process expanded="true" height="100" width="413"> <operator activated="true" class="generate_nominal_data" compatibility="5.2.008" expanded="true" height="60" name="Generate Nominal Data" width="90" x="45" y="30"/> <operator activated="true" class="loop_values" compatibility="5.2.008" expanded="true" height="94" name="Loop Values" width="90" x="246" y="30"> <parameter key="attribute" value="att1"/> <process expanded="true" height="527" width="658"> <operator activated="true" class="filter_examples" compatibility="5.2.008" expanded="true" height="76" name="Filter Examples" width="90" x="45" y="30"/> <operator activated="true" class="k_means" compatibility="5.2.008" expanded="true" height="76" name="Clustering" width="90" x="179" y="30"> <parameter key="measure_types" value="MixedMeasures"/> </operator> <connect from_port="example set" to_op="Filter Examples" to_port="example set input"/> <connect from_op="Filter Examples" from_port="example set output" to_op="Clustering" to_port="example set"/> <connect from_op="Clustering" from_port="cluster model" to_port="out 1"/> <connect from_op="Clustering" from_port="clustered set" to_port="out 2"/> <portSpacing port="source_example set" spacing="0"/> <portSpacing port="sink_out 1" spacing="0"/> <portSpacing port="sink_out 2" spacing="0"/> <portSpacing port="sink_out 3" spacing="0"/> </process> </operator> <connect from_op="Generate Nominal Data" from_port="output" to_op="Loop Values" to_port="example set"/> <connect from_op="Loop Values" from_port="out 1" to_port="result 1"/> <connect from_op="Loop Values" from_port="out 2" to_port="result 2"/> <portSpacing port="source_input 1" spacing="0"/> <portSpacing port="sink_result 1" spacing="0"/> <portSpacing port="sink_result 2" spacing="0"/> <portSpacing port="sink_result 3" spacing="0"/> </process> </operator></process>
cgkolar wrote:2. Assuming that one of the classifications amounts to a "bad year," are there techniques that would allow us to look at the time series data to predict if the coming year is going to be good or bad.
What are the variables you will be measuring?How many grades do students get in 1 year?Let's say you have 1 grade per year, then you'd have a data set consisting of 200 rows:student_id, grade1, grad2, grade3, grade4, ..., grade10In this setup, applying a standard clustering algorithm should be able to find students that have abnormal grades?