There is a test out on the web, based on an algorithm developed by three Israeli scientists and an American, where you can supply a blog entry (or other types of writing) and have it guess whether the author is male or female. I supplied the longest blog entry I have so far and the test returned 5:2 odds that I am male.
What was interesting to me was that the test was not based on content. Instead, it was simply based on word frequency. Apparently the word "the" is indicative of a male author while the word "and" is indicative of a female author. I suppose that goes to well-known stereotypes: egotistic male authors (making assertions with definite articles) and verbose female authors (can't resist getting one more thing in). The word pair "with" and "around" (female and male respectively) is similarly suggestive of female-male stereotypes: social skills vs. spatial reasoning.
Incidentally, my blog entry used 26 "the"'s while using only 10 "and"'s -- this disparity pretty much explained the 5:2 odds of my being male.
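For the curious, here is a rough Python sketch of that kind of count. It is not the test's actual algorithm, which presumably weighs many more words than these two. The function name `word_counts` and the sample sentence are made up for illustration; the only real numbers are my 26 and 10.

```python
import re
from collections import Counter
from fractions import Fraction


def word_counts(text, words=("the", "and")):
    """Count how often each marker word appears in text (case-insensitive)."""
    # Split into lowercase word tokens; apostrophes stay inside a word.
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    return {w: counts[w] for w in words}


# Hypothetical sample sentence, just to show the counting step.
sample = "The cat and the dog sat on the mat."
counts = word_counts(sample)

# The counts from my own entry: 26 uses of "the" against 10 of "and".
ratio = Fraction(26, 10)

print(counts)
print(f"{ratio.numerator}:{ratio.denominator}", float(ratio))
```

The 26-to-10 ratio reduces to 13:5, or 2.6 to 1, which sits close to the 5:2 (2.5 to 1) odds the test reported. That closeness is only my observation about these two words, not a claim about how the test computes its odds.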