Altair RISE
A program to recognize and reward our most engaged community members
Nominate Yourself Now!
Home
Discussions
Community Q&A
Hi ! Can I implement a focused web crawler in Rapidminer using it's operators?
AnumBse16
I'm a newbie. I looked into some operators like crawl web and TF-IDF. I need to implement focused web crawler for my project
Find more posts tagged with
AI Studio
Web Mining
Accepted answers
SGolbert
Hi
@AnumBse16
,
With the Crawl Web operator of the Web Mining extension you can set up crawling rules as regular expressions. Note, however, that the web mining tools of RM are kind of basic and limited to HTML code analysis. You will need an external tool for complex web crawling (Javascript, PHP, authentification, lots of pages, etc.)
Regards,
Sebastian
All comments
Telcontar120
You should definitely download the web mining and text mining extensions, which are free. There are many operators that will support web mining and text processing. You can do almost anything you want in RapidMiner in this area if you are willing to put in some time to learn the platform. There are excellent free video tutorials available and good in-program documentation with sample processes as well.
AnumBse16
@Telcontar120
can i get some guidance I'm confused which data mining operators to use to make a keyword focused web crawler
SGolbert
Hi
@AnumBse16
,
With the Crawl Web operator of the Web Mining extension you can set up crawling rules as regular expressions. Note, however, that the web mining tools of RM are kind of basic and limited to HTML code analysis. You will need an external tool for complex web crawling (Javascript, PHP, authentification, lots of pages, etc.)
Regards,
Sebastian
Quick Links
All Categories
Recent Discussions
Activity
Unanswered
日本語 (Japanese)
한국어(Korean)
Groups