themolotov.net


Skip To Content

themolotov.net: Word Statistics

I created a bunch of statistics after pulling a bunch of word data from all of my posts!

I am by no means a statistician, but I came up with something neat while I was trying to implement the next 'feature' I'm going to try to create (tag suggestion using Bayesian probabilities!). Word Statistics!

I just grabbed all of my posts (excluding titles, summaries, tags, etc.) and started keeping track of all of the numbers: total words, unique words, appearance count, etc. - lots of stuff! After I created the initial page, I decided that it'd be more efficient to do the calculations once for the initial population of the database, and then incremental updates and checks every time I make a post, so viola! Now when I post I update my word statistics table. The only thing that I'm not currently doing is keeping a 'Words Stats' table and keeping things like total word count, unique word count, etc. so when you visit the Words page, it has to parse everything again.

this entry

Mood: accomplished
Music: Tool
Location: work



You are not able to comment on this entry.



All Content Copyright Jon Gartman 2006, unless otherwise noted.
This site is part of the molonet. Generated in about 0.191 seconds.
Spiral out, keep going.