themolotov.net: Word Statistics
I generated a bunch of statistics from the word data in all of my posts!
I am by no means a statistician, but I came up with something neat while working on the next 'feature' I'm going to try to create (tag suggestion using Bayesian probabilities!): Word Statistics!
I just grabbed the body of every post (excluding titles, summaries, tags, etc.) and started keeping track of all of the numbers: total words, unique words, appearance count, etc. Lots of stuff! After I created the initial page, I decided that it'd be more efficient to do the calculations once for the initial population of the database, and then do incremental updates and checks every time I make a post, so voilà! Now when I post, I update my word statistics table. The only thing I'm not doing yet is keeping a separate 'Words Stats' table for the aggregate numbers (total word count, unique word count, etc.), so when you visit the Words page, it has to parse everything again to get them.
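For the curious, here is a minimal Python sketch of the populate-once, update-per-post idea. It is not the site's actual code: the names `tokenize`, `WordStats`, `add_post`, and `build_initial` are made up for illustration, the tokenizing rule is an assumption, and an in-memory `Counter` stands in for the word statistics table. It also keeps a running total word count, which is the part the post above says is not stored yet.

```python
import re
from collections import Counter

# Assumption: a "word" is a run of letters, digits, or apostrophes,
# compared case-insensitively. The real site may split words differently.
WORD_RE = re.compile(r"[a-z0-9']+")


def tokenize(body):
    """Split a post body into lowercase words."""
    return WORD_RE.findall(body.lower())


class WordStats:
    """Hypothetical stand-in for the word statistics table."""

    def __init__(self):
        self.counts = Counter()  # word -> number of appearances
        self.total_words = 0     # running total across all posts

    def add_post(self, body):
        """Incremental update: fold one new post into the existing numbers."""
        words = tokenize(body)
        self.counts.update(words)
        self.total_words += len(words)

    @property
    def unique_words(self):
        """Number of distinct words seen so far."""
        return len(self.counts)


def build_initial(posts):
    """One-time population from every existing post body."""
    stats = WordStats()
    for body in posts:
        stats.add_post(body)
    return stats


if __name__ == "__main__":
    # Populate once from the existing posts, then update when a new one is made.
    stats = build_initial(["The cat sat", "the dog sat down"])
    stats.add_post("A new cat")
    print(stats.total_words, stats.unique_words, stats.counts.most_common(3))
```

Because the totals are updated along with the per-word counts, a page that shows them could read the stored values instead of parsing every post again.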