The most recent content from our members.
Dear all, I am a complete newbie with RapidMiner. I need to extract the dominant cycles (frequency peaks) from a time series of a financial instrument, for example the S&P future on 1-minute candlesticks. I have already calculated the average price per minute as (high+low)/2 and then the moving average on 10…
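A minimal sketch of the general idea, written in plain Python rather than as a RapidMiner process: a discrete Fourier transform of the mean-centred series, with the strongest frequency bin converted back to a period in bars. The function name `dominant_period` and the synthetic sine-wave "prices" are assumptions made purely for illustration; real 1-minute data would likely need detrending first.

```python
import cmath
import math

def dominant_period(values):
    """Return the period (in samples) of the strongest cycle, or None."""
    n = len(values)
    mean = sum(values) / n
    centered = [v - mean for v in values]  # remove the constant (DC) component
    best_k, best_power = None, 0.0
    # Scan the positive frequency bins of a naive O(n^2) DFT.
    for k in range(1, n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(centered))
        power = abs(coeff) ** 2
        if power > best_power:
            best_k, best_power = k, power
    return n / best_k if best_k else None

# Synthetic example (not market data): a 20-bar cycle around a level of 100.
prices = [100 + 2 * math.sin(2 * math.pi * t / 20) for t in range(200)]
period = dominant_period(prices)
print(period)
```

The naive transform is fine for a few hundred bars; for longer series an FFT implementation would be the usual choice.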
Can someone tell me how to implement or add semantic analysis to my process? That is, I need to compare not only whether words are similar but also whether two words in a single document have the same meaning.
Hi everybody! I've calculated TF-IDF with "Process Documents from Data" and got a matrix with a word in every column and a document body in every row, where every cell contains the TF-IDF value. Now I filter by cluster (created with k-means) and want to see only the columns with non-zero values. I firstly thought…
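The filtering step described here can be sketched outside RapidMiner as follows: given the rows that belong to one cluster, keep only the term columns where at least one row is non-zero. The helper name `nonzero_columns`, the term list, and the values are all made up for illustration.

```python
def nonzero_columns(rows, terms):
    """Keep only the term columns that have at least one non-zero value."""
    keep = [j for j in range(len(terms)) if any(row[j] != 0 for row in rows)]
    kept_terms = [terms[j] for j in keep]
    kept_rows = [[row[j] for j in keep] for row in rows]
    return kept_terms, kept_rows

# Hypothetical TF-IDF rows for the documents of a single cluster.
terms = ["price", "market", "cycle", "peak"]
tfidf = [
    [0.0, 0.7, 0.0, 0.2],
    [0.0, 0.5, 0.0, 0.0],
]
kept_terms, kept_rows = nonzero_columns(tfidf, terms)
print(kept_terms)  # ['market', 'peak']
```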
Hi guys! After clustering a list of documents with k-means, I would like to analyze the words in each cluster (in order to correlate them with other attributes). For this I summed the TF-IDF values of each word, but I think this solution may be wrong. Would it be more correct to use term…
Dear Community, is it technically or rather mathematically possible to calculate the Cosine Similarity measure based on results derived by SVD Feature Extraction? Or does the distance metric only operate on measures like TF-IDF? Thank you in advance for your help!
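Mathematically, cosine similarity is defined for any two non-zero real-valued vectors, so it can be computed on SVD-reduced document vectors just as on TF-IDF vectors. A minimal sketch, with made-up three-component vectors standing in for SVD output:

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here for all-zero vectors
    return dot / (norm_a * norm_b)

# Hypothetical documents projected onto three SVD components.
doc_a = [0.8, -0.1, 0.3]
doc_b = [1.6, -0.2, 0.6]   # same direction as doc_a, twice the length
doc_c = [-0.3, 0.9, 0.1]

sim_ab = cosine_similarity(doc_a, doc_b)
sim_ac = cosine_similarity(doc_a, doc_c)
print(sim_ab, sim_ac)
```

One difference worth keeping in mind: SVD components can be negative, so the similarity can fall below zero, whereas with non-negative TF-IDF vectors it stays between 0 and 1.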
Hello everyone, do you have a recommendation as to which classification model should be used within Feature Selection (e.g. Optimize Selection or Backward Elimination) to efficiently select attributes, or rather dimensions, from a high-dimensional TF-IDF matrix? Thank you in advance for…
I have a word vector with the attributes Negative and Positive. When I run this process, the result is incorrect. In the picture you can see that the word attribute in column six (กรุณา) has the value two. How can I count the word attribute in the confidence(P) and confidence(N) columns?