-
RapidMiner Studio Spark Cluster Connection issue
Hi I am currently using RapidMiner Studio to help me design my streaming workflows. I have an issue regarding spark connector. What I am trying to achieve is to design a simple workflow and upload it to my local spark cluster. My workflow consists of a spark connector(Retrieve) and a Streaming Nest(Contains a Kafka Source,…
-
Azure HD Insight 3.5 and Radoop: cannot get it to work
Hi, I want to connect with RM Free Edition 9.3 using Radoop Basic to Azure HD Insight 3.5+ and using Spark 2.2. I have everything setup as suggested in the manuals and forums. I imported the cluster info from Azure in RM. In Azure all services, including Hive, Spark etc. are working. Using a test with PowerBI via the…
-
"What are the languages supported by Apache Spark for developing big data applications?"
Can you use Spark to access and analyse data stored in Cassandra databases?
-
Radoop Full Test Fail with Spark (Apache Hadoop 2.8.1, Spark 2.2)
I am facing the same issue with following links, https://community.rapidminer.com/discussion/51889/radoop-full-test-spark-job-test-error-hadoop-2-8-spark-2-1-1/p1. Does anyone have the solution to this problem?
-
"Probem with spark assembly jars"
When I try to run my process composed by these operators: retrieve data from hive, set role, and decision tree (spark). I have this error: the specified spark assembly jar, archive or lib directory does not exist or cannot be read. Please check the assembly jar location on the advanced connection panel and make sure that…
-
"Fp-growth and generation rules with spark"
Hello, In my work, I need to extract association rules by applying the fpgrowth algorithm with spark. I found that fpgrowth and apply association rules operators cannot be used inside the radoop nest. and there are no fpgrowth and association rules operators with spark available. what should I do? Best regards, Asma.
-
"Any plans for Spark without Hadoop (ex. Databricks)?"
Are there any plans to support Spark without Hadoop, for example, to run on Databricks?
-
Spark Radoop connection
Hi everyone, I am using Cloudera and the upgraded to Spark 2.2. I am having trouble when performing a Full Test. So in the configration what should be in "Spark Archive (or libs) path"? I have tried getting the jar from (http://spark.apache.org/downloads.html) I wasn't able to find the "..assembly.jar" file. So I tried…
-
"FPGrowth in spark and docker"
Hello :) I created a spark cluster on my pc via docker image, and I want to apply the FPGrowth on my data. Can you give information about how to apply this algorithme from Rapidminer on my spark cluster, without using cloudera? Thank you in advance.
-
Spark Job Failed - Connection from Cloudera to Radoop
Hello can someone help me. I get the following error message when I connecting Cloudera to Rapidminer Radoop. All services except Spark work and can be connected. What do I have to do that the Spark job runs as well? Thanks in advance. [Jul 6, 2017 8:31:03 PM]: Integration test for 'myCloudera' started. [Jul 6, 2017…
-
How to using Spark 2.1 in HDP 2.6 and radoop
1-copy spark.sh in /etc/profile.d 2-run: source /etc/profile.d/spark.sh In radoop connection->Spark Settings 3-change "Spark Version" to Spark 2.1 4-change "Spark Archive(or libs) path" to local:///usr/hdp/current/spark2-client/jars/*
-
Radoop Full Test (Spark job) test error , Hadoop 2.8 , Spark 2.1.1
I installed Hadoop 2.8 , Spark 2.1.1 Single node in VM and Rapidminer 7.5.001 , Radoop 7.5 I selected "Apache Hadoop 2.2+" in Radoop Connection. I validated Spark installation with SparkPi. Quick Test finished successfully. I got following error in yarn for Full Test(only Spark Job selected) : 17/06/29 02:42:36 INFO…
-
Radoop "Full Test" (Spark job) connection test error with Hadoop 2.8 and Saprk 2.1
I installed hadoop 2.8 , Spark 2.1.0 binaries, Rapidminer 7.5.001 , and Radoop 7.5.0 The Hadoop version in connection is "Apache Hadoop 2.2+" . (connection xml is attached) I validated spark installation with SparkPi. Quick Test finished successfully , but in Spark Job test (Full Test) I got this error in yarn: 17/06/29…
-
"Spark is not working with Cloudera HA cluster"
I configured a Radoop connection to a HA Cloudera cluster and access to Hive and Impala works. But when I try to start a Spark Operator it fails when the Spark Application Master is started: Java HotSpot(TM) 64-Bit Server VM warning: ignoring option MaxPermSize=128m; support was removed in 8.0 17/04/24 10:52:57 INFO…
-
Rapidminer Radoop Connection with Spark 1.6.2 failed
Hi folks I am trying to connect Radoop Connection with Spark 1.6.2 in standalone mode spark, Here i have configured all the required properties based on rapidminer documentation, That configuration screenshots, I have attached below while i am performing 'quicktest', it shows the following error. But when I have selected…
-
Rapidminer with Spark
I realize RapidMiner can be implemented with Spark in the Hadoop environment on a cluster but my question is can RadipMiner be implemented with Spark on a Standalone? If so, can you direct me to some information to help me with the implementation?
-
Problem with Radoop on Spark failed error
Dear All, I am using Radoop and connected to a single node Hadoop cluster. I have also Hive running, and Spark is also on my machine. I run the Quick test of Radoop connection and is successful. Next I have started to follow the Titanic data tutorial to understand more about Radoop and Rapidminer functions. When I run an…