Forum OpenACS Q&A: OT: How to search the web...
I have the following problem, tough not directly OACS related, but
interesting anyway. I have to search several sites, on a daily or
weekly basis, for some keywords (i'm looking for news coverage of
childhood/family violence/minors traffic/etc issues). The first idea
was to hit google with the queryes, process the pages and store the
results in a table. Then an operator would walk all the hits and
separate the data from the noise. But in theyr rules of use they
(reasonably) forbid that kind of use and say "don't even ask", and say
than they will block acces from offending IPs upon detection. The
questions are, anyone know a service (it could be even not free...
eeeech) that could solve this problem? English is not my first
language, so i'm not sure i'm being polite enough... but anyone has
experience on how to deal with search services, and if this is an
option? Is there another solution viable, like setting up a crawler
myself or something?
Thanks in advance,
Also, check out spyonit.com - they have something like what you're looking for, but it's not programmable.