"Integrating RM Process with Text Processing plugin into Java Application"
dranammari
New Altair Community Member
Hello all,
I am relatively new to RapidMiner integration into Java Applications, though I am familiar with building RapidMiner processes using the RapidMiner GUI platform. I am using RapidMiner 5.1.014.
I have built a process that uses many operators from the "Text Processing" plugin. The process takes input from a mySQL database and produces a "wordlist" and "model" outputs saved as files in a defined RapidMiner repository. The process runs successfully using the RapidMiner GUI.
My problem is in integrating the process into a Java Application that I created using Netbeans 6.9.1. I have included all the libraries (JARs) that exist in the RapidMiner5\lib folder. I have even found and downloaded the rapidminer-Text Processing-5.0.007.jar and included it in the Application library too. However, the process does not run successfully when I run the main Java class from Netbeans. My Java code that is supposed to launch the RapidMiner process is as follows:
RapidMiner.setExecutionMode(com.rapidminer.RapidMiner.ExecutionMode.EMBEDDED_WITHOUT_UI);
RapidMiner.init();
Process process = new Process(new File("BuildYouTubeNoiseFiltration.rmp"));
process.run();
The BuildYouTubeNoiseFiltration.rmp is the XML file that has all the process information after I built the process using the GUI. Here is what I get in the Netbeans output window:
Dec 20, 2011 2:45:38 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Property rapidminer.home is not set. Guessing.
Dec 20, 2011 2:45:38 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Trying parent directory of 'C:\Service Documentation\YouTubeNoiseFiltration\Source Code\Java\YouTubeNoiseFiltration\lib\launcher.jar'...gotcha!
Dec 20, 2011 2:45:38 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Trying parent directory of 'C:\Service Documentation\YouTubeNoiseFiltration\Source Code\Java\YouTubeNoiseFiltration\lib\rapidminer.jar'...gotcha!
Dec 20, 2011 2:45:38 PM com.rapidminer.tools.ParameterService init
INFO: Reading configuration resource com/rapidminer/resources/rapidminerrc.
Dec 20, 2011 2:45:40 PM com.rapidminer.parameter.ParameterTypePassword decryptPassword
WARNING: Password in XML file looks like unencrypted plain text.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.OperatorService init
INFO: Loading additional operators specified by RapidMiner.PROPERTY_RAPIDMINER_OPERATORS_ADDITIONAL (${RAPIDMINER_OPERATORS_ADDITIONAL})
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.OperatorService init
SEVERE: Cannot find operator description file '${RAPIDMINER_OPERATORS_ADDITIONAL}'
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver org.postgresql.Driver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver net.sourceforge.jtds.jdbc.Driver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver org.hsqldb.jdbcDriver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties <init>
WARNING: Missing database driver class name for 'ODBC Bridge (e.g. Access)'
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver net.sourceforge.jtds.jdbc.Driver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver com.ingres.jdbc.IngresDriver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver ca.ingres.jdbc.IngresDriver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver oracle.jdbc.driver.OracleDriver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The operator class 'text:data_to_documents' is unknown. Possibly you must install a plugin for operators of group 'text'.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: The parameter 'specify_weights' of type list is unknown for operator 'Data to Documents' (dummy).
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The operator class 'text:process_documents' is unknown. Possibly you must install a plugin for operators of group 'text'.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">Operator '<class>dummy</class>' may not have children. Ignoring.
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The input port <var>example set</var> is unknown at operator <var>Data to Documents</var>.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The output port <var>documents</var> is unknown at operator <var>Data to Documents</var>.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The output port <var>example set</var> is unknown at operator <var>Preprocessing</var>.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The output port <var>word list</var> is unknown at operator <var>Preprocessing</var>.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.WrapperLoggingHandler log
INFO: No filename given for result file, using stdout for logging results!
Dec 20, 2011 2:45:41 PM com.rapidminer.Process run
INFO: Process BuildYouTubeNoiseFiltration.rmp starts
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.DatabaseHandler executeStatement
INFO: Executing query: 'SELECT `commentID`, `commentText`, `binaryScore`
FROM `comments`'
Dec 20, 2011 2:45:42 PM filtration.BuildYouTubeNoiseFiltration buildYouTubeNoiseFiltration
SEVERE: null
com.rapidminer.operator.UserError: The dummy operator Data to Documents (replacing text:data_to_documents) cannot be executed.
at com.rapidminer.operator.DummyOperator.doWork(DummyOperator.java:88)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.operator.execution.SimpleUnitExecutor.execute(SimpleUnitExecutor.java:51)
at com.rapidminer.operator.ExecutionUnit.execute(ExecutionUnit.java:709)
at com.rapidminer.operator.OperatorChain.doWork(OperatorChain.java:379)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.Process.run(Process.java:925)
at com.rapidminer.Process.run(Process.java:848)
at com.rapidminer.Process.run(Process.java:807)
at com.rapidminer.Process.run(Process.java:802)
at com.rapidminer.Process.run(Process.java:792)
at filtration.BuildYouTubeNoiseFiltration.buildYouTubeNoiseFiltration(BuildYouTubeNoiseFiltration.java:29)
at filtration.BuildYouTubeNoiseFiltration.main(BuildYouTubeNoiseFiltration.java:43)
Can you please help in telling me what I am missing so the RapidMiner process can be run successfully from my Java application as it is run from the GUI?
Many thanks,
Ahmad
I am relatively new to RapidMiner integration into Java Applications, though I am familiar with building RapidMiner processes using the RapidMiner GUI platform. I am using RapidMiner 5.1.014.
I have built a process that uses many operators from the "Text Processing" plugin. The process takes input from a mySQL database and produces a "wordlist" and "model" outputs saved as files in a defined RapidMiner repository. The process runs successfully using the RapidMiner GUI.
My problem is in integrating the process into a Java Application that I created using Netbeans 6.9.1. I have included all the libraries (JARs) that exist in the RapidMiner5\lib folder. I have even found and downloaded the rapidminer-Text Processing-5.0.007.jar and included it in the Application library too. However, the process does not run successfully when I run the main Java class from Netbeans. My Java code that is supposed to launch the RapidMiner process is as follows:
RapidMiner.setExecutionMode(com.rapidminer.RapidMiner.ExecutionMode.EMBEDDED_WITHOUT_UI);
RapidMiner.init();
Process process = new Process(new File("BuildYouTubeNoiseFiltration.rmp"));
process.run();
The BuildYouTubeNoiseFiltration.rmp is the XML file that has all the process information after I built the process using the GUI. Here is what I get in the Netbeans output window:
Dec 20, 2011 2:45:38 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Property rapidminer.home is not set. Guessing.
Dec 20, 2011 2:45:38 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Trying parent directory of 'C:\Service Documentation\YouTubeNoiseFiltration\Source Code\Java\YouTubeNoiseFiltration\lib\launcher.jar'...gotcha!
Dec 20, 2011 2:45:38 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Trying parent directory of 'C:\Service Documentation\YouTubeNoiseFiltration\Source Code\Java\YouTubeNoiseFiltration\lib\rapidminer.jar'...gotcha!
Dec 20, 2011 2:45:38 PM com.rapidminer.tools.ParameterService init
INFO: Reading configuration resource com/rapidminer/resources/rapidminerrc.
Dec 20, 2011 2:45:40 PM com.rapidminer.parameter.ParameterTypePassword decryptPassword
WARNING: Password in XML file looks like unencrypted plain text.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.OperatorService init
INFO: Loading additional operators specified by RapidMiner.PROPERTY_RAPIDMINER_OPERATORS_ADDITIONAL (${RAPIDMINER_OPERATORS_ADDITIONAL})
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.OperatorService init
SEVERE: Cannot find operator description file '${RAPIDMINER_OPERATORS_ADDITIONAL}'
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver org.postgresql.Driver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver net.sourceforge.jtds.jdbc.Driver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver org.hsqldb.jdbcDriver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties <init>
WARNING: Missing database driver class name for 'ODBC Bridge (e.g. Access)'
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver net.sourceforge.jtds.jdbc.Driver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver com.ingres.jdbc.IngresDriver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver ca.ingres.jdbc.IngresDriver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver oracle.jdbc.driver.OracleDriver not found. Probably the driver is not installed.
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The operator class 'text:data_to_documents' is unknown. Possibly you must install a plugin for operators of group 'text'.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: The parameter 'specify_weights' of type list is unknown for operator 'Data to Documents' (dummy).
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The operator class 'text:process_documents' is unknown. Possibly you must install a plugin for operators of group 'text'.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">Operator '<class>dummy</class>' may not have children. Ignoring.
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The input port <var>example set</var> is unknown at operator <var>Data to Documents</var>.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The output port <var>documents</var> is unknown at operator <var>Data to Documents</var>.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The output port <var>example set</var> is unknown at operator <var>Preprocessing</var>.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.io.process.XMLImporter addMessage
INFO: <em class="error">The output port <var>word list</var> is unknown at operator <var>Preprocessing</var>.</em>
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.WrapperLoggingHandler log
INFO: No filename given for result file, using stdout for logging results!
Dec 20, 2011 2:45:41 PM com.rapidminer.Process run
INFO: Process BuildYouTubeNoiseFiltration.rmp starts
Dec 20, 2011 2:45:41 PM com.rapidminer.tools.jdbc.DatabaseHandler executeStatement
INFO: Executing query: 'SELECT `commentID`, `commentText`, `binaryScore`
FROM `comments`'
Dec 20, 2011 2:45:42 PM filtration.BuildYouTubeNoiseFiltration buildYouTubeNoiseFiltration
SEVERE: null
com.rapidminer.operator.UserError: The dummy operator Data to Documents (replacing text:data_to_documents) cannot be executed.
at com.rapidminer.operator.DummyOperator.doWork(DummyOperator.java:88)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.operator.execution.SimpleUnitExecutor.execute(SimpleUnitExecutor.java:51)
at com.rapidminer.operator.ExecutionUnit.execute(ExecutionUnit.java:709)
at com.rapidminer.operator.OperatorChain.doWork(OperatorChain.java:379)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.Process.run(Process.java:925)
at com.rapidminer.Process.run(Process.java:848)
at com.rapidminer.Process.run(Process.java:807)
at com.rapidminer.Process.run(Process.java:802)
at com.rapidminer.Process.run(Process.java:792)
at filtration.BuildYouTubeNoiseFiltration.buildYouTubeNoiseFiltration(BuildYouTubeNoiseFiltration.java:29)
at filtration.BuildYouTubeNoiseFiltration.main(BuildYouTubeNoiseFiltration.java:43)
Can you please help in telling me what I am missing so the RapidMiner process can be run successfully from my Java application as it is run from the GUI?
Many thanks,
Ahmad
Tagged:
0
Answers
-
Hi,
please use
instead of EMBEDDED_WITHOUT_UI, as the latter will NOT load plugins.
RapidMiner.setExecutionMode(ExecutionMode.COMMAND_LINE);
Regards,
Marco0 -
Hi Marco,
Actually I have used the ExecutionMode.COMMAND_LINE too. Unfortunately the program still gives me a set of Exceptions. Here is the complete output:Dec 20, 2011 5:19:33 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
What could be wrong? The exact process runs very fine from the RapidMiner GUI.
INFO: Property rapidminer.home is not set. Guessing.
Dec 20, 2011 5:19:33 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Trying parent directory of 'C:\Users\admin2\Documents\My LeedsUni Job\ImREAL\Service Documentation\YouTubeNoiseFiltration\Source Code\Java\YouTubeNoiseFiltration\lib\launcher.jar'...gotcha!
Dec 20, 2011 5:19:33 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Trying parent directory of 'C:\Users\admin2\Documents\My LeedsUni Job\ImREAL\Service Documentation\YouTubeNoiseFiltration\Source Code\Java\YouTubeNoiseFiltration\lib\rapidminer.jar'...gotcha!
Dec 20, 2011 5:19:33 PM com.rapidminer.tools.ParameterService init
INFO: Reading configuration resource com/rapidminer/resources/rapidminerrc.
Dec 20, 2011 5:19:34 PM com.rapidminer.PluginInitR initPlugin
INFO: Trying to load R Library...
Dec 20, 2011 5:19:34 PM org.rosuda.JRI.Rengine <clinit>
INFO: Creating Connection to R...
Dec 20, 2011 5:19:34 PM org.rosuda.JRI.Rengine loadLibrary
INFO: Trying to loaded R library from C:\Program Files\R\R-2.12.0\library\rJava\jri\jri.dll
Dec 20, 2011 5:19:34 PM org.rosuda.JRI.Rengine loadLibrary
INFO: Engine ID: class org.rosuda.JRI.Rengine ClassLoader:PluginClassLoader ([file:/C:/Users/admin2/.RapidMiner5/managed/rmx_r-5.1.4.jar])
Dec 20, 2011 5:19:34 PM org.rosuda.JRI.Rengine loadLibrary
INFO: Successfully loaded R library from C:\Program Files\R\R-2.12.0\library\rJava\jri\jri.dll(true)
Dec 20, 2011 5:19:36 PM com.rapidminer.parameter.ParameterTypePassword decryptPassword
WARNING: Password in XML file looks like unencrypted plain text.
Dec 20, 2011 5:19:37 PM com.rapidminer.tools.OperatorService init
INFO: Loading additional operators specified by RapidMiner.PROPERTY_RAPIDMINER_OPERATORS_ADDITIONAL (${RAPIDMINER_OPERATORS_ADDITIONAL})
Dec 20, 2011 5:19:37 PM com.rapidminer.tools.OperatorService init
SEVERE: Cannot find operator description file '${RAPIDMINER_OPERATORS_ADDITIONAL}'
[Fatal Error] :1:1: Content is not allowed in prolog.
Dec 20, 2011 5:19:39 PM com.rapidminer.operator.r.ROperatorFactory readROperatorFragments
WARNING: Could not read R fragments from 'file:/C:/Users/admin2/Documents/My%20LeedsUni%20Job/ImREAL/Service%20Documentation/YouTubeNoiseFiltration/Source%20Code/Java/YouTubeNoiseFiltration/', error: Content is not allowed in prolog.
org.xml.sax.SAXParseException: Content is not allowed in prolog.
at com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:249)
at com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:284)
at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:124)
at com.rapidminer.io.process.XMLTools.parse(XMLTools.java:175)
at com.rapidminer.operator.r.ROperatorFactory.readROperatorFragments(ROperatorFactory.java:166)
at com.rapidminer.operator.r.ROperatorFactory.<init>(ROperatorFactory.java:129)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at java.lang.Class.newInstance0(Class.java:355)
at java.lang.Class.newInstance(Class.java:308)
at com.rapidminer.tools.OperatorService.parseOperators(OperatorService.java:294)
at com.rapidminer.tools.OperatorService.parseOperators(OperatorService.java:232)
at com.rapidminer.tools.OperatorService.registerOperators(OperatorService.java:206)
at com.rapidminer.tools.plugin.Plugin.registerOperators(Plugin.java:471)
at com.rapidminer.tools.plugin.Plugin.registerAllPluginOperators(Plugin.java:725)
at com.rapidminer.tools.OperatorService.init(OperatorService.java:167)
at com.rapidminer.RapidMiner.init(RapidMiner.java:465)
at filtration.BuildYouTubeNoiseFiltration.buildYouTubeNoiseFiltration(BuildYouTubeNoiseFiltration.java:26)
at filtration.BuildYouTubeNoiseFiltration.main(BuildYouTubeNoiseFiltration.java:44)
[Fatal Error] :1:1: Content is not allowed in prolog.
Dec 20, 2011 5:19:40 PM com.rapidminer.operator.r.ROperatorFactory <init>
WARNING: Could not read documentation of user defined R operators from .
java.io.IOException: Malformed XML operator help bundle: org.xml.sax.SAXParseException: Content is not allowed in prolog.
at com.rapidminer.tools.documentation.XMLOperatorDocBundle.<init>(XMLOperatorDocBundle.java:119)
at com.rapidminer.operator.r.ROperatorFactory.<init>(ROperatorFactory.java:150)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at java.lang.Class.newInstance0(Class.java:355)
at java.lang.Class.newInstance(Class.java:308)
at com.rapidminer.tools.OperatorService.parseOperators(OperatorService.java:294)
at com.rapidminer.tools.OperatorService.parseOperators(OperatorService.java:232)
at com.rapidminer.tools.OperatorService.registerOperators(OperatorService.java:206)
at com.rapidminer.tools.plugin.Plugin.registerOperators(Plugin.java:471)
at com.rapidminer.tools.plugin.Plugin.registerAllPluginOperators(Plugin.java:725)
at com.rapidminer.tools.OperatorService.init(OperatorService.java:167)
at com.rapidminer.RapidMiner.init(RapidMiner.java:465)
at filtration.BuildYouTubeNoiseFiltration.buildYouTubeNoiseFiltration(BuildYouTubeNoiseFiltration.java:26)
at filtration.BuildYouTubeNoiseFiltration.main(BuildYouTubeNoiseFiltration.java:44)
Caused by: org.xml.sax.SAXParseException: Content is not allowed in prolog.
at com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:249)
at com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:284)
at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:124)
at com.rapidminer.tools.documentation.XMLOperatorDocBundle.<init>(XMLOperatorDocBundle.java:117)
... 16 more
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.plugin.Plugin registerOperators
INFO: No operator descriptor specified for plugin Community. Trying plugin initializtation class com.rapidminer.community.CommunityPluginInit.
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.plugin.Plugin registerOperators
WARNING: No operator descriptor defined for: Community
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver org.postgresql.Driver not found. Probably the driver is not installed.
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver net.sourceforge.jtds.jdbc.Driver not found. Probably the driver is not installed.
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver org.hsqldb.jdbcDriver not found. Probably the driver is not installed.
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.jdbc.JDBCProperties <init>
WARNING: Missing database driver class name for 'ODBC Bridge (e.g. Access)'
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver net.sourceforge.jtds.jdbc.Driver not found. Probably the driver is not installed.
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver com.ingres.jdbc.IngresDriver not found. Probably the driver is not installed.
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver ca.ingres.jdbc.IngresDriver not found. Probably the driver is not installed.
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver oracle.jdbc.driver.OracleDriver not found. Probably the driver is not installed.
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.WrapperLoggingHandler log
INFO: No filename given for result file, using stdout for logging results!
Dec 20, 2011 5:19:40 PM com.rapidminer.Process run
INFO: Process BuildYouTubeNoiseFiltration.rmp starts
Dec 20, 2011 5:19:40 PM com.rapidminer.tools.jdbc.DatabaseHandler executeStatement
INFO: Executing query: 'SELECT `commentID`, `commentText`, `binaryScore`
FROM `comments`'
Dec 20, 2011 5:19:43 PM filtration.BuildYouTubeNoiseFiltration buildYouTubeNoiseFiltration
SEVERE: null
com.rapidminer.operator.UserError: Input example set has no attributes
at com.rapidminer.operator.learner.AbstractLearner.doWork(AbstractLearner.java:142)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.operator.execution.SimpleUnitExecutor.execute(SimpleUnitExecutor.java:51)
at com.rapidminer.operator.ExecutionUnit.execute(ExecutionUnit.java:709)
at com.rapidminer.operator.validation.ValidationChain.executeLearner(ValidationChain.java:214)
at com.rapidminer.operator.validation.ValidationChain.learn(ValidationChain.java:305)
at com.rapidminer.operator.validation.XValidation.performIteration(XValidation.java:159)
at com.rapidminer.operator.validation.XValidation.estimatePerformance(XValidation.java:151)
at com.rapidminer.operator.validation.ValidationChain.doWork(ValidationChain.java:273)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.operator.execution.SimpleUnitExecutor.execute(SimpleUnitExecutor.java:51)
at com.rapidminer.operator.ExecutionUnit.execute(ExecutionUnit.java:709)
at com.rapidminer.operator.OperatorChain.doWork(OperatorChain.java:379)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.Process.run(Process.java:925)
at com.rapidminer.Process.run(Process.java:848)
at com.rapidminer.Process.run(Process.java:807)
at com.rapidminer.Process.run(Process.java:802)
at com.rapidminer.Process.run(Process.java:792)
at filtration.BuildYouTubeNoiseFiltration.buildYouTubeNoiseFiltration(BuildYouTubeNoiseFiltration.java:30)
at filtration.BuildYouTubeNoiseFiltration.main(BuildYouTubeNoiseFiltration.java:44)
Many thanks,
Ahmad0 -
In addition to the Exceptions:
1) org.xml.sax.SAXParseException: Content is not allowed in prolog
2) java.io.IOException: Malformed XML operator help bundle: org.xml.sax.SAXParseException: Content is not allowed in prolog.
when I use "Resolve relative to the repository_name" when I store a wordlist from a (Process Documents) operator and store a model from an (X-Validation) operator, I get also the following error:
3) com.rapidminer.operator.UserError: Cannot resolve relative repository location 'NoiseFiltrationWordlist'. Process is not associated with a repository.
Would appreciate any help if possible. Here is the all the output:Dec 21, 2011 5:59:17 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
Many thanks,
INFO: Property rapidminer.home is not set. Guessing.
Dec 21, 2011 5:59:17 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Trying parent directory of 'C:\Users\admin2\Documents\My LeedsUni Job\ImREAL\Service Documentation\YouTubeNoiseFiltration\Source Code\Java\YouTubeNoiseFiltration\lib\launcher.jar'...gotcha!
Dec 21, 2011 5:59:17 PM com.rapid_i.Launcher ensureRapidMinerHomeSet
INFO: Trying parent directory of 'C:\Users\admin2\Documents\My LeedsUni Job\ImREAL\Service Documentation\YouTubeNoiseFiltration\Source Code\Java\YouTubeNoiseFiltration\lib\rapidminer.jar'...gotcha!
Dec 21, 2011 5:59:17 PM com.rapidminer.tools.ParameterService init
INFO: Reading configuration resource com/rapidminer/resources/rapidminerrc.
Dec 21, 2011 5:59:18 PM com.rapidminer.PluginInitR initPlugin
INFO: Trying to load R Library...
Dec 21, 2011 5:59:18 PM org.rosuda.JRI.Rengine <clinit>
INFO: Creating Connection to R...
Dec 21, 2011 5:59:18 PM org.rosuda.JRI.Rengine loadLibrary
INFO: Trying to loaded R library from C:\Program Files\R\R-2.12.0\library\rJava\jri\jri.dll
Dec 21, 2011 5:59:18 PM org.rosuda.JRI.Rengine loadLibrary
INFO: Engine ID: class org.rosuda.JRI.Rengine ClassLoader:PluginClassLoader ([file:/C:/Users/admin2/.RapidMiner5/managed/rmx_r-5.1.4.jar])
Dec 21, 2011 5:59:18 PM org.rosuda.JRI.Rengine loadLibrary
INFO: Successfully loaded R library from C:\Program Files\R\R-2.12.0\library\rJava\jri\jri.dll(true)
Dec 21, 2011 5:59:21 PM com.rapidminer.parameter.ParameterTypePassword decryptPassword
WARNING: Password in XML file looks like unencrypted plain text.
Dec 21, 2011 5:59:21 PM com.rapidminer.tools.OperatorService init
INFO: Loading additional operators specified by RapidMiner.PROPERTY_RAPIDMINER_OPERATORS_ADDITIONAL (${RAPIDMINER_OPERATORS_ADDITIONAL})
Dec 21, 2011 5:59:21 PM com.rapidminer.tools.OperatorService init
SEVERE: Cannot find operator description file '${RAPIDMINER_OPERATORS_ADDITIONAL}'
[Fatal Error] :1:1: Content is not allowed in prolog.
Dec 21, 2011 5:59:23 PM com.rapidminer.operator.r.ROperatorFactory readROperatorFragments
WARNING: Could not read R fragments from 'file:/C:/Users/admin2/Documents/My%20LeedsUni%20Job/ImREAL/Service%20Documentation/YouTubeNoiseFiltration/Source%20Code/Java/YouTubeNoiseFiltration/', error: Content is not allowed in prolog.
org.xml.sax.SAXParseException: Content is not allowed in prolog.
at com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:249)
at com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:284)
at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:124)
at com.rapidminer.io.process.XMLTools.parse(XMLTools.java:175)
at com.rapidminer.operator.r.ROperatorFactory.readROperatorFragments(ROperatorFactory.java:166)
at com.rapidminer.operator.r.ROperatorFactory.<init>(ROperatorFactory.java:129)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at java.lang.Class.newInstance0(Class.java:355)
at java.lang.Class.newInstance(Class.java:308)
at com.rapidminer.tools.OperatorService.parseOperators(OperatorService.java:294)
at com.rapidminer.tools.OperatorService.parseOperators(OperatorService.java:232)
at com.rapidminer.tools.OperatorService.registerOperators(OperatorService.java:206)
at com.rapidminer.tools.plugin.Plugin.registerOperators(Plugin.java:471)
at com.rapidminer.tools.plugin.Plugin.registerAllPluginOperators(Plugin.java:725)
at com.rapidminer.tools.OperatorService.init(OperatorService.java:167)
at com.rapidminer.RapidMiner.init(RapidMiner.java:465)
at filtration.BuildYouTubeNoiseFiltration.buildYouTubeNoiseFiltration(BuildYouTubeNoiseFiltration.java:26)
at filtration.BuildYouTubeNoiseFiltration.main(BuildYouTubeNoiseFiltration.java:44)
[Fatal Error] :1:1: Content is not allowed in prolog.
Dec 21, 2011 5:59:24 PM com.rapidminer.operator.r.ROperatorFactory <init>
WARNING: Could not read documentation of user defined R operators from .
java.io.IOException: Malformed XML operator help bundle: org.xml.sax.SAXParseException: Content is not allowed in prolog.
at com.rapidminer.tools.documentation.XMLOperatorDocBundle.<init>(XMLOperatorDocBundle.java:119)
at com.rapidminer.operator.r.ROperatorFactory.<init>(ROperatorFactory.java:150)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at java.lang.Class.newInstance0(Class.java:355)
at java.lang.Class.newInstance(Class.java:308)
at com.rapidminer.tools.OperatorService.parseOperators(OperatorService.java:294)
at com.rapidminer.tools.OperatorService.parseOperators(OperatorService.java:232)
at com.rapidminer.tools.OperatorService.registerOperators(OperatorService.java:206)
at com.rapidminer.tools.plugin.Plugin.registerOperators(Plugin.java:471)
at com.rapidminer.tools.plugin.Plugin.registerAllPluginOperators(Plugin.java:725)
at com.rapidminer.tools.OperatorService.init(OperatorService.java:167)
at com.rapidminer.RapidMiner.init(RapidMiner.java:465)
at filtration.BuildYouTubeNoiseFiltration.buildYouTubeNoiseFiltration(BuildYouTubeNoiseFiltration.java:26)
at filtration.BuildYouTubeNoiseFiltration.main(BuildYouTubeNoiseFiltration.java:44)
Caused by: org.xml.sax.SAXParseException: Content is not allowed in prolog.
at com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:249)
at com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:284)
at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:124)
at com.rapidminer.tools.documentation.XMLOperatorDocBundle.<init>(XMLOperatorDocBundle.java:117)
... 16 more
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.plugin.Plugin registerOperators
INFO: No operator descriptor specified for plugin Community. Trying plugin initializtation class com.rapidminer.community.CommunityPluginInit.
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.plugin.Plugin registerOperators
WARNING: No operator descriptor defined for: Community
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver org.postgresql.Driver not found. Probably the driver is not installed.
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver net.sourceforge.jtds.jdbc.Driver not found. Probably the driver is not installed.
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver org.hsqldb.jdbcDriver not found. Probably the driver is not installed.
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.jdbc.JDBCProperties <init>
WARNING: Missing database driver class name for 'ODBC Bridge (e.g. Access)'
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver net.sourceforge.jtds.jdbc.Driver not found. Probably the driver is not installed.
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver com.ingres.jdbc.IngresDriver not found. Probably the driver is not installed.
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver ca.ingres.jdbc.IngresDriver not found. Probably the driver is not installed.
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.jdbc.JDBCProperties registerDrivers
INFO: JDBC driver oracle.jdbc.driver.OracleDriver not found. Probably the driver is not installed.
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.WrapperLoggingHandler log
INFO: No filename given for result file, using stdout for logging results!
Dec 21, 2011 5:59:24 PM com.rapidminer.Process run
INFO: Process BuildYouTubeNoiseFiltration.rmp starts
Dec 21, 2011 5:59:24 PM com.rapidminer.tools.jdbc.DatabaseHandler executeStatement
INFO: Executing query: 'SELECT `commentID`, `commentText`, `binaryScore`
FROM `comments`'
Dec 21, 2011 5:59:27 PM filtration.BuildYouTubeNoiseFiltration buildYouTubeNoiseFiltration
SEVERE: null
com.rapidminer.operator.UserError: Cannot resolve relative repository location 'NoiseFiltrationWordlist'. Process is not associated with a repository.
at com.rapidminer.Process.resolveRepositoryLocation(Process.java:1211)
at com.rapidminer.operator.Operator.getParameterAsRepositoryLocation(Operator.java:1382)
at com.rapidminer.operator.io.RepositoryStorer.write(RepositoryStorer.java:53)
at com.rapidminer.operator.io.AbstractWriter.doWork(AbstractWriter.java:67)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.operator.execution.SimpleUnitExecutor.execute(SimpleUnitExecutor.java:51)
at com.rapidminer.operator.ExecutionUnit.execute(ExecutionUnit.java:709)
at com.rapidminer.operator.OperatorChain.doWork(OperatorChain.java:379)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.Process.run(Process.java:925)
at com.rapidminer.Process.run(Process.java:848)
at com.rapidminer.Process.run(Process.java:807)
at com.rapidminer.Process.run(Process.java:802)
at com.rapidminer.Process.run(Process.java:792)
at filtration.BuildYouTubeNoiseFiltration.buildYouTubeNoiseFiltration(BuildYouTubeNoiseFiltration.java:31)
at filtration.BuildYouTubeNoiseFiltration.main(BuildYouTubeNoiseFiltration.java:44)
Ahmad0 -
Hi,
does your process make use of R extension operators? It looks similiar to other R extension problems which have been posted in the forums.
If your process does not contain any confidential data, please post it here.
Regards,
Marco0 -
Hi Marco,
No not at all, I am not using using any R operator or R script in my process.
Here is the process:<?xml version="1.0" encoding="UTF-8" standalone="no"?>
Note that he process continues running despite the "Content is not allowed in prolog" error. However, it produces another error later after trying to query the mySQL database table. Here is the part of the command-line output in Netbeans where the second error occurs:
<process version="5.1.017">
<context>
<input/>
<output/>
<macros/>
</context>
<operator activated="true" class="process" compatibility="5.1.017" expanded="true" name="Process">
<parameter key="logverbosity" value="init"/>
<parameter key="random_seed" value="2001"/>
<parameter key="send_mail" value="never"/>
<parameter key="notification_email" value=""/>
<parameter key="process_duration_for_mail" value="30"/>
<parameter key="encoding" value="SYSTEM"/>
<parameter key="parallelize_main_process" value="false"/>
<process expanded="true" height="406" width="681">
<operator activated="true" class="read_database" compatibility="5.1.017" expanded="true" height="60" name="Read Database" width="90" x="45" y="30">
<parameter key="define_connection" value="predefined"/>
<parameter key="connection" value="youtube"/>
<parameter key="database_system" value="MySQL"/>
<parameter key="define_query" value="query"/>
<parameter key="query" value="SELECT `commentID`, `commentText`, `binaryScore` FROM `comments`"/>
<parameter key="use_default_schema" value="true"/>
<parameter key="prepare_statement" value="false"/>
<enumeration key="parameters"/>
<parameter key="datamanagement" value="double_array"/>
</operator>
<operator activated="true" class="rename" compatibility="5.1.017" expanded="true" height="76" name="Rename" width="90" x="179" y="30">
<parameter key="old_name" value="binaryScore"/>
<parameter key="new_name" value="label"/>
<list key="rename_additional_attributes"/>
</operator>
<operator activated="true" class="numerical_to_binominal" compatibility="5.1.017" expanded="true" height="76" name="Numerical to Binominal" width="90" x="313" y="30">
<parameter key="attribute_filter_type" value="single"/>
<parameter key="attribute" value="label"/>
<parameter key="attributes" value=""/>
<parameter key="use_except_expression" value="false"/>
<parameter key="value_type" value="numeric"/>
<parameter key="use_value_type_exception" value="false"/>
<parameter key="except_value_type" value="real"/>
<parameter key="block_type" value="value_series"/>
<parameter key="use_block_type_exception" value="false"/>
<parameter key="except_block_type" value="value_series_end"/>
<parameter key="invert_selection" value="false"/>
<parameter key="include_special_attributes" value="false"/>
<parameter key="min" value="0.0"/>
<parameter key="max" value="0.0"/>
</operator>
<operator activated="true" class="set_role" compatibility="5.1.017" expanded="true" height="76" name="Set Role" width="90" x="447" y="30">
<parameter key="name" value="commentID"/>
<parameter key="target_role" value="id"/>
<list key="set_additional_roles">
<parameter key="label" value="label"/>
<parameter key="commentText" value="regular"/>
</list>
</operator>
<operator activated="true" class="text:data_to_documents" compatibility="5.1.004" expanded="true" height="60" name="Data to Documents" width="90" x="581" y="30">
<parameter key="select_attributes_and_weights" value="false"/>
<list key="specify_weights"/>
</operator>
<operator activated="true" class="text:process_documents" compatibility="5.1.004" expanded="true" height="94" name="Preprocessing" width="90" x="45" y="210">
<parameter key="create_word_vector" value="true"/>
<parameter key="vector_creation" value="TF-IDF"/>
<parameter key="add_meta_information" value="true"/>
<parameter key="keep_text" value="false"/>
<parameter key="prune_method" value="percentual"/>
<parameter key="prunde_below_percent" value="3.0"/>
<parameter key="prune_above_percent" value="95.0"/>
<parameter key="prune_below_rank" value="5.0"/>
<parameter key="prune_above_rank" value="5.0"/>
<parameter key="datamanagement" value="double_sparse_array"/>
<parameter key="parallelize_vector_creation" value="false"/>
<process expanded="true" height="418" width="734">
<operator activated="true" class="web:extract_html_text_content" compatibility="5.1.004" expanded="true" height="60" name="Extract Content" width="90" x="49" y="31">
<parameter key="extract_content" value="true"/>
<parameter key="minimum_text_block_length" value="5"/>
<parameter key="override_content_type_information" value="true"/>
<parameter key="neglegt_span_tags" value="true"/>
<parameter key="neglect_p_tags" value="true"/>
<parameter key="neglect_b_tags" value="true"/>
<parameter key="neglect_i_tags" value="true"/>
<parameter key="neglect_br_tags" value="true"/>
<parameter key="ignore_non_html_tags" value="true"/>
</operator>
<operator activated="true" class="text:transform_cases" compatibility="5.1.004" expanded="true" height="60" name="Transform Cases" width="90" x="45" y="165">
<parameter key="transform_to" value="lower case"/>
</operator>
<operator activated="true" class="text:tokenize" compatibility="5.1.004" expanded="true" height="60" name="Tokenize" width="90" x="45" y="300">
<parameter key="mode" value="linguistic tokens"/>
<parameter key="characters" value=".:"/>
<parameter key="language" value="English"/>
<parameter key="max_token_length" value="3"/>
</operator>
<operator activated="true" class="text:filter_stopwords_english" compatibility="5.1.004" expanded="true" height="60" name="Filter Stopwords (English)" width="90" x="246" y="30"/>
<operator activated="true" class="text:filter_stopwords_dictionary" compatibility="5.1.004" expanded="true" height="60" name="Filter Stopwords (Dictionary)" width="90" x="246" y="165">
<parameter key="file" value="C:\Users\admin2\Documents\NetBeansProjects\YouTubeNoiseFiltration\StopWords_Enhanced.txt"/>
<parameter key="case_sensitive" value="false"/>
<parameter key="encoding" value="SYSTEM"/>
</operator>
<operator activated="true" class="text:filter_by_length" compatibility="5.1.004" expanded="true" height="60" name="Filter Tokens (by Length)" width="90" x="246" y="300">
<parameter key="min_chars" value="2"/>
<parameter key="max_chars" value="100"/>
</operator>
<operator activated="true" class="text:filter_tokens_by_content" compatibility="5.1.004" expanded="true" height="60" name="Filter Tokens (by Content)" width="90" x="447" y="30">
<parameter key="condition" value="matches"/>
<parameter key="regular_expression" value="[a-zA-Z]+"/>
<parameter key="case_sensitive" value="false"/>
<parameter key="invert condition" value="false"/>
</operator>
<operator activated="true" class="text:generate_n_grams_terms" compatibility="5.1.004" expanded="true" height="60" name="Generate n-Grams (Terms)" width="90" x="447" y="165">
<parameter key="max_length" value="2"/>
</operator>
<operator activated="true" class="text:stem_porter" compatibility="5.1.004" expanded="true" height="60" name="Stem (Porter)" width="90" x="447" y="300"/>
<connect from_port="document" to_op="Extract Content" to_port="document"/>
<connect from_op="Extract Content" from_port="document" to_op="Transform Cases" to_port="document"/>
<connect from_op="Transform Cases" from_port="document" to_op="Tokenize" to_port="document"/>
<connect from_op="Tokenize" from_port="document" to_op="Filter Stopwords (English)" to_port="document"/>
<connect from_op="Filter Stopwords (English)" from_port="document" to_op="Filter Stopwords (Dictionary)" to_port="document"/>
<connect from_op="Filter Stopwords (Dictionary)" from_port="document" to_op="Filter Tokens (by Length)" to_port="document"/>
<connect from_op="Filter Tokens (by Length)" from_port="document" to_op="Filter Tokens (by Content)" to_port="document"/>
<connect from_op="Filter Tokens (by Content)" from_port="document" to_op="Generate n-Grams (Terms)" to_port="document"/>
<connect from_op="Generate n-Grams (Terms)" from_port="document" to_op="Stem (Porter)" to_port="document"/>
<connect from_op="Stem (Porter)" from_port="document" to_port="document 1"/>
<portSpacing port="source_document" spacing="0"/>
<portSpacing port="sink_document 1" spacing="0"/>
<portSpacing port="sink_document 2" spacing="0"/>
</process>
</operator>
<operator activated="true" class="store" compatibility="5.1.017" expanded="true" height="60" name="Store" width="90" x="179" y="300">
<parameter key="repository_entry" value="NoiseFiltrationWordlist"/>
</operator>
<operator activated="true" class="select_attributes" compatibility="5.1.017" expanded="true" height="76" name="Select Attributes" width="90" x="246" y="210">
<parameter key="attribute_filter_type" value="no_missing_values"/>
<parameter key="attribute" value=""/>
<parameter key="attributes" value=""/>
<parameter key="use_except_expression" value="false"/>
<parameter key="value_type" value="attribute_value"/>
<parameter key="use_value_type_exception" value="false"/>
<parameter key="except_value_type" value="time"/>
<parameter key="block_type" value="attribute_block"/>
<parameter key="use_block_type_exception" value="false"/>
<parameter key="except_block_type" value="value_matrix_row_start"/>
<parameter key="invert_selection" value="false"/>
<parameter key="include_special_attributes" value="false"/>
</operator>
<operator activated="true" class="x_validation" compatibility="5.1.017" expanded="true" height="112" name="Model Building" width="90" x="447" y="210">
<parameter key="create_complete_model" value="false"/>
<parameter key="average_performances_only" value="true"/>
<parameter key="leave_one_out" value="false"/>
<parameter key="number_of_validations" value="10"/>
<parameter key="sampling_type" value="stratified sampling"/>
<parameter key="use_local_random_seed" value="false"/>
<parameter key="local_random_seed" value="1992"/>
<parameter key="parallelize_training" value="false"/>
<parameter key="parallelize_testing" value="false"/>
<process expanded="true" height="418" width="333">
<operator activated="true" class="naive_bayes" compatibility="5.1.017" expanded="true" height="76" name="Naive Bayes" width="90" x="116" y="55">
<parameter key="laplace_correction" value="true"/>
</operator>
<connect from_port="training" to_op="Naive Bayes" to_port="training set"/>
<connect from_op="Naive Bayes" from_port="model" to_port="model"/>
<portSpacing port="source_training" spacing="0"/>
<portSpacing port="sink_model" spacing="0"/>
<portSpacing port="sink_through 1" spacing="0"/>
</process>
<process expanded="true" height="418" width="333">
<operator activated="true" class="apply_model" compatibility="5.1.017" expanded="true" height="76" name="Apply Model" width="90" x="106" y="27">
<list key="application_parameters"/>
<parameter key="create_view" value="false"/>
</operator>
<operator activated="true" class="performance_classification" compatibility="5.1.017" expanded="true" height="76" name="Performance" width="90" x="99" y="165">
<parameter key="main_criterion" value="accuracy"/>
<parameter key="accuracy" value="true"/>
<parameter key="classification_error" value="true"/>
<parameter key="kappa" value="false"/>
<parameter key="weighted_mean_recall" value="true"/>
<parameter key="weighted_mean_precision" value="true"/>
<parameter key="spearman_rho" value="false"/>
<parameter key="kendall_tau" value="false"/>
<parameter key="absolute_error" value="false"/>
<parameter key="relative_error" value="false"/>
<parameter key="relative_error_lenient" value="false"/>
<parameter key="relative_error_strict" value="false"/>
<parameter key="normalized_absolute_error" value="true"/>
<parameter key="root_mean_squared_error" value="true"/>
<parameter key="root_relative_squared_error" value="false"/>
<parameter key="squared_error" value="false"/>
<parameter key="correlation" value="false"/>
<parameter key="squared_correlation" value="false"/>
<parameter key="cross-entropy" value="false"/>
<parameter key="margin" value="false"/>
<parameter key="soft_margin_loss" value="false"/>
<parameter key="logistic_loss" value="false"/>
<parameter key="skip_undefined_labels" value="true"/>
<parameter key="use_example_weights" value="true"/>
<list key="class_weights"/>
</operator>
<connect from_port="model" to_op="Apply Model" to_port="model"/>
<connect from_port="test set" to_op="Apply Model" to_port="unlabelled data"/>
<connect from_op="Apply Model" from_port="labelled data" to_op="Performance" to_port="labelled data"/>
<connect from_op="Performance" from_port="performance" to_port="averagable 1"/>
<portSpacing port="source_model" spacing="0"/>
<portSpacing port="source_test set" spacing="0"/>
<portSpacing port="source_through 1" spacing="0"/>
<portSpacing port="sink_averagable 1" spacing="0"/>
<portSpacing port="sink_averagable 2" spacing="0"/>
</process>
</operator>
<operator activated="true" class="store" compatibility="5.1.017" expanded="true" height="60" name="Store (2)" width="90" x="581" y="300">
<parameter key="repository_entry" value="NoiseFiltrationModel"/>
</operator>
<connect from_op="Read Database" from_port="output" to_op="Rename" to_port="example set input"/>
<connect from_op="Rename" from_port="example set output" to_op="Numerical to Binominal" to_port="example set input"/>
<connect from_op="Numerical to Binominal" from_port="example set output" to_op="Set Role" to_port="example set input"/>
<connect from_op="Set Role" from_port="example set output" to_op="Data to Documents" to_port="example set"/>
<connect from_op="Data to Documents" from_port="documents" to_op="Preprocessing" to_port="documents 1"/>
<connect from_op="Preprocessing" from_port="example set" to_op="Select Attributes" to_port="example set input"/>
<connect from_op="Preprocessing" from_port="word list" to_op="Store" to_port="input"/>
<connect from_op="Select Attributes" from_port="example set output" to_op="Model Building" to_port="training"/>
<connect from_op="Model Building" from_port="model" to_op="Store (2)" to_port="input"/>
<connect from_op="Model Building" from_port="training" to_port="result 1"/>
<connect from_op="Model Building" from_port="averagable 1" to_port="result 2"/>
<connect from_op="Store (2)" from_port="through" to_port="result 3"/>
<portSpacing port="source_input 1" spacing="0"/>
<portSpacing port="sink_result 1" spacing="0"/>
<portSpacing port="sink_result 2" spacing="0"/>
<portSpacing port="sink_result 3" spacing="0"/>
<portSpacing port="sink_result 4" spacing="0"/>
</process>
</operator>
</process>Jan 12, 2012 4:14:13 PM com.rapidminer.Process run
I think the error now is that rapidminer API does not know where to save the 'NoiseFiltrationWordlist' object that the original process generates because the process is not associated with a repository. How to solve this problem?
INFO: Process BuildYouTubeNoiseFiltration.rmp starts
Jan 12, 2012 4:14:13 PM com.rapidminer.tools.jdbc.DatabaseHandler executeStatement
INFO: Executing query: 'SELECT `commentID`, `commentText`, `binaryScore`
FROM `comments`'
Jan 12, 2012 4:18:58 PM filtration.BuildYouTubeNoiseFiltration buildYouTubeNoiseFiltration
SEVERE: null
com.rapidminer.operator.UserError: Cannot resolve relative repository location 'NoiseFiltrationWordlist'. Process is not associated with a repository.
at com.rapidminer.Process.resolveRepositoryLocation(Process.java:1210)
at com.rapidminer.operator.Operator.getParameterAsRepositoryLocation(Operator.java:1382)
at com.rapidminer.operator.io.RepositoryStorer.write(RepositoryStorer.java:53)
at com.rapidminer.operator.io.AbstractWriter.doWork(AbstractWriter.java:67)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.operator.execution.SimpleUnitExecutor.execute(SimpleUnitExecutor.java:51)
at com.rapidminer.operator.ExecutionUnit.execute(ExecutionUnit.java:709)
at com.rapidminer.operator.OperatorChain.doWork(OperatorChain.java:379)
at com.rapidminer.operator.Operator.execute(Operator.java:833)
at com.rapidminer.Process.run(Process.java:925)
at com.rapidminer.Process.run(Process.java:848)
at com.rapidminer.Process.run(Process.java:807)
at com.rapidminer.Process.run(Process.java:802)
at com.rapidminer.Process.run(Process.java:792)
at filtration.BuildYouTubeNoiseFiltration.buildYouTubeNoiseFiltration(BuildYouTubeNoiseFiltration.java:31)
at filtration.BuildYouTubeNoiseFiltration.main(BuildYouTubeNoiseFiltration.java:44)
Many thanks,
Ahmad0 -
Hi,
either use
to load the process you execute via java so it has a repository location, or make sure your parameters use absolute repository locations instead of relative paths.
RepositoryLocation loc = new RepositoryLocation("//LocalRepository/Data/Iris");
IOObjectEntry entry = (IOObjectEntry) loc.locateEntry();
Regards,
Marco0