June 26, 2003

blog language stats: verified

i had a nice discussion with Maciej Ceglowski about the blog language stats i was on about (22-jun post). it turns out they had students manually verify the language guesser's output, and it came back 95% correct. so the blogging world does indeed appear to be dominated by english, portuguese, polish, and farsi. wow.

since geography is something i eat, sleep and drink, i downloaded and somewhat cleaned up the blog geography data from the NITLE Blog Census and put together a static map of blog geography. there are also instructions on that page on how to find your blog's location and how to add an ICBM meta tag (yes, that's right, ICBM as in missile; geography is so cool ;-) so you too can be "found".
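
for the curious, here's a rough sketch of what building an ICBM meta tag could look like. it assumes the usual convention of `name="ICBM"` with a `content` of "latitude, longitude" in decimal degrees; the function name `icbm_meta_tag` and the four-decimal rounding are my own choices for illustration, not anything the map page prescribes.

```python
def icbm_meta_tag(lat: float, lon: float) -> str:
    """Build an ICBM meta tag string from decimal-degree coordinates.

    Hypothetical helper: the name and the 4-decimal precision are
    illustrative assumptions, not part of any official tool.
    """
    # Reject coordinates that cannot exist on the globe.
    if not -90.0 <= lat <= 90.0:
        raise ValueError(f"latitude out of range: {lat}")
    if not -180.0 <= lon <= 180.0:
        raise ValueError(f"longitude out of range: {lon}")

    # Latitude first, then longitude, separated by a comma and a space.
    content = f"{lat:.4f}, {lon:.4f}"
    return f'<meta name="ICBM" content="{content}">'


if __name__ == "__main__":
    # Made-up example coordinates; substitute your own blog's location.
    print(icbm_meta_tag(52.5, 13.4))
```

paste the resulting line into the `<head>` of your blog's template, with your own coordinates in place of the made-up ones.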


