Web Mining
Hai All,
I am new to RM. Currently I am using RM5.0 version. My objective is to crawl web(Using Crawling operator) and I am able to save the URLs by giving the regular expression rules into an excel file.Now the problem is I am not able to see the content related to each URLs. After geting the content I have to eliminate html contents in each page.
Can anyone suggest how to proceed further. It will be great if someone can explain with operator names in process order.
Thanks
I am new to RM. Currently I am using RM5.0 version. My objective is to crawl web(Using Crawling operator) and I am able to save the URLs by giving the regular expression rules into an excel file.Now the problem is I am not able to see the content related to each URLs. After geting the content I have to eliminate html contents in each page.
Can anyone suggest how to proceed further. It will be great if someone can explain with operator names in process order.
Thanks