Correct ARFF Format?

User: "Legacy User"
New Altair Community Member
Updated by Jocelyn
Hi all,

I'm running a Naive Bayes classifier on a set of keywords/keyphrases and then using the resulting model to predict the label attribute for an unclassified set of keywords/keyphrases. However, I'm running into a strange problem: the result of the applied model shows a ? whenever there is a space between keywords. Could I be formatting my ARFF files incorrectly?

Here is my training set:
@RELATION c_training

@ATTRIBUTE keywords STRING
@ATTRIBUTE change {up,down,neutral}

@DATA
'delay acquisition',down
'facing the same conundrum',down
'restructuring',down
'delay acquisition',up
'divestiture',down
'profit dissipated',down
'delay acquisition',up
'profits up', up
'profits down', down
'delay acquisition',up
'delay acquisition',up
'delay acquisition',up
'delay acquisition',up

And here is my test set:
@RELATION c_test

@ATTRIBUTE keywords STRING

@DATA
'profit dissipated'
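
As a quick sanity check on the quoting itself, here is a minimal Python sketch that reads a few of the data rows above with single quotes as the quote character. It only approximates ARFF quoting using Python's standard `csv` module and is not the parser any particular tool uses; `parse_data_rows` is a hypothetical helper name chosen for illustration. If the quoted phrases come back as single values, the spaces inside the quotes are probably not what breaks the file.

```python
import csv
import io

# A few @DATA rows copied from the training set above.
DATA_ROWS = """\
'delay acquisition',down
'facing the same conundrum',down
'restructuring',down
'profits up', up
"""


def parse_data_rows(text):
    """Split ARFF-style data rows, treating single quotes as the quote character.

    Assumption: csv-style parsing is close enough to ARFF quoting for a
    quick check; it is not a full ARFF parser.
    """
    reader = csv.reader(
        io.StringIO(text),
        quotechar="'",
        skipinitialspace=True,  # tolerate the space after the comma in "'profits up', up"
    )
    # Skip blank lines, which csv.reader returns as empty lists.
    return [row for row in reader if row]


rows = parse_data_rows(DATA_ROWS)
for keywords, change in rows:
    print(repr(keywords), "->", change)
```

The same helper can be pointed at the test row, for example `parse_data_rows("'profit dissipated'\n")`, to confirm that the phrase is read as one value there too.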

Any help would be appreciated.

Thank you.
