"Problem PCA SVM Large DATA set"

Question

Dear All I have been using Rapidminer 4.5 64bit on Windows Vista 64 bit JDK64bit RAM 4 GB CPU intel core2Duo 2.0 GHZ My Dataset are ~30000 Attributes and ~12000 instances ---------------------------------------------------------------------------------------------------- I tried increasing the memory for Rapidminer 4.5 >> edit 2 file C:\Program Files (x86)\Rapid-I\RapidMiner\scripts\RapidMinerGUI ## set the maximum amount of memory Java uses here or in an environment variable #MAX_JAVA_MEMORY=4000 if [ -z "${MAX_JAVA_MEMORY}" ] ; then MAX_JAVA_MEMORY=4000 echo "No maximum Java memory defined, using 4000 Mb..." C:\Program Files (x86)\Rapid-I\RapidMiner\scripts\RapidMinerGUI.bat rem ########################################## rem ### Setting Maximal Amount of Memory ### rem ########################################## if "%MAX_JAVA_MEMORY%"=="" set MAX_JAVA_MEMORY=4000 -------------------------------------------------------------------------------------------------- I have some question? 1. Now I want to using the feature selection operator on a data with PCA transformation Keep Top K highest score and I want to leaner with SVM. How can I do? My XML 2. If I want to create new weighting . form my dataset thairath2.arff eg. (Log2(every attribute in my dataset +2))^2 How can I do it? Writing to new File and to learning with SVM…. Please suggest step by step.. 3. I have a problem "Out of memory " errors and the process stops . In my dataset . so if anyone has ideas / suggestions to solve my problem please let me know . Regard nivet

land · Answer

Hi,
I would suggest to switch to RapidMiner 5.0. It eases the process design a lot by omitting the implicit data flow and shows explicitly, where the data comes from and goes to. 
Unfortunately I didn't understand, which values you are going to change?

Greetings,
  Sebastian

nivet · Answer

2.  I Want to edit value in dataset (.arff)   in this formula ----> 
http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.59.6314&;rep=rep1&type=pdf

and export a new file .csv or   pre processing   to    infogainweighting -----> feature selction  ----> svm  --->accurary...

how can i do it?

regard
nivet

nivet · Answer

thankyou so much. I have any question? 1. I try form this tutorial ---> http://kmandcomputing.blogspot.com/search/label/datamining. but i cannot find Read-input vector on --->rapidminer 4.5 + text plugin 4.5 i have error ----> Error in: XValidation (XValidation) The operator needs some input of type com.rapidminer.example.ExampleSet which is not provided. Each operator defines which input is desired for applying this operator (these input objects are shown in operator info screen (F1)). Previous operators must load or produce the desired input objects. You can check the correct experiment setup by validating the experiment (via the icon or the menu item). --------------------------------------------- --------------------------------------------------------------------------------