Roll your own real-time twitter search with thrudb

I’ve been using twitter quite a bit lately and really like the simplicity of the service and api. One thing thats missing though is search, but there are some great sites like tweetscan and summize that let you search public tweets in close to real-time.

I decided indexing twitter is a great application for thrudb, specifically the thrudex service. Thrudex is essentially a Thrift service for CLucene with some special sauce added. If you’d like to read about the inner workings read this.

Anyway, I whipped up a demo (in perl) for a realtime twitter search and have indexed a few days of tweets (over 3 million!) . check it out here.

tweetsearch.gif

One of our regular contributers, Thai Duong, was kind enough to port it to python+django for you new school folks.
*Note* this is running on a single dev box, so be forgiving… It’s currently polling the public timeline feed so it’s not going to catch every tweet. but It captures ~85%
We’ve added the code as a tutorial for thrudb here. Take it and build your own service… Any takers on building a ruby version or a cross site social search aggregation ala friendfeed?

[del.icio.us] [Digg] [dzone] [Google] [Mixx] [Reddit] [StumbleUpon]
Writen by jake

5 Responses to “Roll your own real-time twitter search with thrudb”

  1. links for 2008-04-23 « Tom Altman’s Wedia Conversation Says:

    […] THIRD RAIL » Blog Archive » Roll your own real-time twitter search with thrudb (tags: twitter thrudb search tutorial) […]

  2. THIRD RAIL » Blog Archive » 100 Most Popular Words Twittered This Week Says:

    […] « Roll your own real-time twitter search with thrudb […]

  3. THIRD RAIL » Blog Archive » Thurday at Noon is the best time post and be noticed (PST) Says:

    […] happened to me a few times; I stay up late working on a great post and finish at 1am EST.  In a rush of excitement I decide to submit it to reddit or del.icio.us and goto bed fully […]

  4. Blog Post at Noon on Thursday to Hit Digg’s Front Page Says:

    […] happened to me a few times; I stay up late working on a great post and finish at 1am EST.  In a rush of excitement I decide to submit it to reddit or del.icio.us and goto bed fully […]

  5. Blog Post at Noon on Thursday to Hit Digg’s Front Page Says:

    […] happened to me a few times; I stay up late working on a great post and finish at 1am EST.  In a rush of excitement I decide to submit it to reddit or del.icio.us and goto bed fully […]

Leave a Reply