Eric Maino

System.Brain.CoreDump();

Utilizing the Google Web Services

Tonight I decided to start playing with the web services that Google has exposed. They seem pretty cool; I just wish you could get more than 10 results back per request. A number closer to 100 would have pleased me much more, but maybe that will come later for *Premium* members.

What I started to build is an engine that indexes queries for me and updates them every 24 hours (or however often you call the update routine). Once I look at a page, it gets cached in a list of past pages, while the pages I have not looked at but Google knows about are kept in my active list. It appears that what I threw together in the last hour will do the trick and let me index the content I would like to stay on top of.
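To make the active/past idea concrete, here is a rough sketch in Python. It is not the code from this post: the names `TrackedQuery`, `SearchFn`, and `fake_search` are made up for illustration, and the search call is a stand-in function rather than the real Google web service. It only shows how an update routine might merge new results into an active list and how viewing a page might move it to the past list.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical search hook: takes a query string, returns a list of result URLs.
# In a real engine this would wrap the web service call.
SearchFn = Callable[[str], List[str]]

REFRESH_SECONDS = 24 * 60 * 60  # the 24-hour refresh interval


@dataclass
class TrackedQuery:
    text: str
    active: List[str] = field(default_factory=list)  # known, not yet viewed
    past: List[str] = field(default_factory=list)    # already viewed
    last_updated: float = float("-inf")              # never updated yet

    def update(self, search: SearchFn, now: float) -> bool:
        """Re-run the query if the refresh interval has elapsed."""
        if now - self.last_updated < REFRESH_SECONDS:
            return False
        for url in search(self.text):
            # Only pages we have never seen go into the active list.
            if url not in self.active and url not in self.past:
                self.active.append(url)
        self.last_updated = now
        return True

    def mark_viewed(self, url: str) -> None:
        """Move a page from the active list to the past list."""
        if url in self.active:
            self.active.remove(url)
            self.past.append(url)


def fake_search(query: str) -> List[str]:
    # Stand-in for the real service; always returns the same two pages.
    return ["http://example.com/a", "http://example.com/b"]
```

Calling `update(fake_search, now=0.0)` on a fresh `TrackedQuery` fills the active list, and `mark_viewed("http://example.com/a")` then moves that page to the past list. Passing `now` in explicitly, instead of reading the clock inside `update`, is just a choice that makes the refresh logic easy to exercise.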

There are a few things I still need to do before I release this code (if anyone is interested).

  • Finish the state management
  • Enable query sharing (it's set up on a per-account basis, with each account having its own list of queries)
  • Enable load balancing for searches
  • Optimize to be sure the memory footprint doesn't get too big

Comments

rick said:

It might be more interesting if you index Feedster searches. Its results are in RSS, can be queried at will with no appkey, and are indexed way more often.
# December 9, 2004 1:50 AM
