Crawl Google Search
AtiahKhoirunnisa
Hello everyone,
Can RapidMiner crawl documents from Google Search? For example, I want to retrieve all the documents related to what I search for in Google Search. If I want to find documents or headlines that contain 'Obama', how do I do that?
SGolbert
There are ways to do it, but Google does not permit crawling its search result pages:
Check out:
https://www.google.com/robots.txt
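As a rough illustration of what that file means for a crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_EXCERPT` text is a shortened, assumed excerpt for demonstration, not the full live file, and `is_allowed` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Illustrative excerpt only (an assumption for this sketch); the live file at
# https://www.google.com/robots.txt is much longer and may change over time.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /search
"""


def is_allowed(url, robots_text=ROBOTS_EXCERPT, user_agent="*"):
    """Return True if the robots.txt rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the robots.txt content as a list of lines.
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(user_agent, url)


print(is_allowed("https://www.google.com/search?q=Obama"))  # prints False
```

To check against the real file instead of the excerpt, `RobotFileParser` also offers `set_url()` and `read()`, which download the rules before `can_fetch()` is called.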
If you still want to do it with a smaller chance of getting banned or running into problems, you need to use Selenium, a browser automation tool that drives your own web browser (and is therefore quite slow).
Kind regards,
Sebastian
Marco_Barradas
@AtiahKhoirunnisa
You could use Google's Custom Search Engine and retrieve the information from the JSON it returns:
https://cse.google.com/cse/
Keep in mind that there is a limit on the places you can search for information, and if you need to increase the amount of information you retrieve, you will need to pay for usage of the tool.
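A minimal sketch of that approach in Python, assuming the Custom Search JSON API endpoint `https://www.googleapis.com/customsearch/v1` with its `key`, `cx`, and `q` parameters, and a response whose `items` list carries `title` and `link` fields. The function names are hypothetical, and you would need your own API key and search engine ID:

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoint of the Custom Search JSON API.
API_ENDPOINT = "https://www.googleapis.com/customsearch/v1"


def build_search_url(api_key, engine_id, query, start=1):
    """Build the request URL; start is the 1-based index of the first result."""
    params = {"key": api_key, "cx": engine_id, "q": query, "start": start}
    return f"{API_ENDPOINT}?{urlencode(params)}"


def extract_results(payload):
    """Pull (title, link) pairs out of a decoded JSON response."""
    return [
        (item.get("title", ""), item.get("link", ""))
        for item in payload.get("items", [])
    ]


def search(api_key, engine_id, query, start=1):
    """Call the API over the network and return (title, link) pairs."""
    with urlopen(build_search_url(api_key, engine_id, query, start)) as response:
        return extract_results(json.load(response))
```

With real credentials, something like `search("YOUR_API_KEY", "YOUR_ENGINE_ID", "Obama")` would return one page of results; the resulting pairs could then be written to a CSV file and read into RapidMiner.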
But I guess this could help you.
Best Regards
AtiahKhoirunnisa
Thank you, Golbert and Marco. I will try these first.