New Search Pet Project - NNTP Newsgroup Spider and Search - Wallace B. McClure

Wallace B. McClure


New Search Pet Project - NNTP Newsgroup Spider and Search

So, I started thinking about other things I could search besides the web. This time, I actually thought of something useful. Newsgroups have been another pet project of mine. While my code and database for the Web Search were stuck in Seattle, I started working on a similar project to search NNTP newsgroups. I went looking for NNTP communications libraries and found one called Smilla, along with its accompanying code. Currently, all my code does is connect to an NNTP server and retrieve the newsgroups themselves. I just started working today on retrieving the posted articles.

This will be very interesting. In some ways, it will be similar to the Web Search. In other ways, it will be very different. Using some of the tricks I learned in the Web Search code, I am planning on spawning a thread for each newsgroup, dumping articles into an MSMQ queue, and then picking them up to be inserted into the database. Hopefully, this will be easier than the Web Search because the data will not grow continually like it does with the Web Search. It will be quite fun to see how things go. More coming soon...
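
Here is a rough sketch of the thread-per-newsgroup plan described above. It is written in Python, not the .NET code the post refers to, and it makes several assumptions: `queue.Queue` stands in for MSMQ, an in-memory `sqlite3` database stands in for the real database, and `fetch_articles`, `FAKE_SERVER`, `spider_group`, `store_articles`, and `run_spider` are hypothetical names invented for illustration. No real NNTP server or the Smilla library is involved; the "download" step just returns canned data.

```python
import queue
import sqlite3
import threading

# Hypothetical canned data standing in for a real NNTP server.
FAKE_SERVER = {
    "comp.lang.python": [("<1@example>", "Hello"), ("<2@example>", "Threads")],
    "microsoft.public.dotnet": [("<3@example>", "MSMQ question")],
}

_DONE = object()  # sentinel: one producer thread has finished


def fetch_articles(group):
    """Pretend to download (message_id, subject) pairs for one newsgroup."""
    return FAKE_SERVER.get(group, [])


def spider_group(group, out_queue):
    """Producer: one thread per newsgroup pushes its articles onto the queue."""
    try:
        for message_id, subject in fetch_articles(group):
            out_queue.put((group, message_id, subject))
    finally:
        out_queue.put(_DONE)  # always signal completion, even on error


def store_articles(in_queue, conn, producer_count):
    """Consumer: drain the queue and insert rows until every producer is done."""
    finished = 0
    while finished < producer_count:
        item = in_queue.get()
        if item is _DONE:
            finished += 1
            continue
        conn.execute(
            "INSERT OR IGNORE INTO articles (newsgroup, message_id, subject) "
            "VALUES (?, ?, ?)",
            item,
        )
    conn.commit()


def run_spider(groups, conn):
    """Start one producer per group, store everything, return the row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS articles ("
        "newsgroup TEXT, message_id TEXT, subject TEXT, "
        "PRIMARY KEY (newsgroup, message_id))"
    )
    work = queue.Queue()
    threads = [
        threading.Thread(target=spider_group, args=(group, work))
        for group in groups
    ]
    for thread in threads:
        thread.start()
    # The database is only touched from this (the calling) thread.
    store_articles(work, conn, len(threads))
    for thread in threads:
        thread.join()
    return conn.execute("SELECT COUNT(*) FROM articles").fetchone()[0]


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    print(run_spider(list(FAKE_SERVER), connection))
```

In this sketch the producers never touch the database; only the single consumer does, which keeps the inserts serialized. The `INSERT OR IGNORE` with a primary key on newsgroup and message ID means re-running the spider over the same articles should not create duplicates.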

Wally

Comments

Matt Hawley said:

Sounds pretty cool. Let me know if you need any NNTP help, I've had some experience with it for my NNTP Posting Plugin.
# April 13, 2004 11:10 PM

James Crowley said:

You might also be interested in http://www.developerfusion.com/show/4472/ . This describes connecting/getting newsgroups/posting articles.
# April 14, 2004 8:56 AM

Weblog said:

Ok, I have been working on this NNTP Spider that goes out to a news server and sucks down the content...

# October 3, 2006 2:44 PM