it's a major award
Coach Schiano here. You might remember me from such fine diaries as MGoStatistics, Visualizing the Hennechart (aka the Hennegraph), and some other forgotten gems (the last being a drug-induced haloscan rant of epic proportion). Or you might not. But at least those stats got some front page love from blogmaster-in-chief Brian, despite the purported "diss". PYTHON RULES!
In last week's post, we summarized some word counts over the years to definitively show that Brian is awesome, which he is. What left a bad taste, however, was the weak attempt at the end of that diary to summarize word usage via a single Wordle. Yes, Wordle is awesome, but no one Wordle can this blog describe, as someone famous once said; probably not somebody associated with Wordle, though.
Thus we bring you a deeper analysis of the blog via the simple tool of Word Frequency Analysis (WFA). By simply counting how many times a word is used, great insight into this blog and its content can be achieved. Or, at least, mild amusement can your way be brought. Minimally, sentences can in Yoda style be written.
The results below come from (somewhat arbitrary) comparisons of the frequencies of different words. The conclusions come from my brain. Thus, the former can be trusted, and the latter should likely be dismissed. But hopefully each analysis is clear: a table, with a list of (frequency, word) pairs, where frequency is the number of times that particular word appeared in mgoblog over its entire lifetime, 2004 until present.
And now, for the results! Brace yourselves, this gets ugly.
First, we analyze how often particular sports are mentioned:
Now, an analysis of how often various places are mentioned:
Now we study the popularity of various coaches:
You might find yourself wondering about the dominant mgoblog receiver. If so, we give you the receiver analysis:
Who is mgoblog's favorite running back? Well, this was an easy one to guess:
Onto the quarterback competition:
And now we study two particular schools of football philosophy: Lloydball and Tresselball.
Speaking of football philosophy, we also study the dominance of the spread:
Now we move onto more important matters, like the study mascot names:
Finally, if you'll indulge, we'll get into some slightly more off-topic terms. Let's start with food. What about the food preferences of mgoblog? Sadly, not much data here, making us wonder if Brian eats very much or is rather some kind of blog-creating Cyborg sent from our future UofM overlords to get us through these rough times (possible, no? hmm? HMMM?). But from what we could find:
Being a blog of international repute, mgoblog also mentions some people of differing nationalities:
Brian also uses his fair share of saltier language. For example:
"I suppose it is possible that Germany is a plant biology major and spends his time before the snap screaming "I gonna sprout all up in your ass, mothafucka*" at the quarterback, but it seems unlikely."Classic.
Sorry, one last set of bad words:
Just keep moving folks, keep moving. And let them never be mentioned again. Speaking of which:
Just keep moving folks, keep moving. And let them never be mentioned again. Speaking of which:
We end with some fairly random studies. First, a gender study yielded the following information about the different types of "boys" mentioned on the blog:
And we conclude with some word counts that we noticed "coincidentally" ended up at the same frequency. Or did they?????
The Final Home Game of the Season
Ball State (27-24)
Tuesday 6:35pm ET, Ray Fisher Stadium, Ann Arbor, MI
Notes: Michigan beat Ball State 12-8 at the Coastal Tournament. That made the
all time series 17-1.
Much has happened since these two teams squared off just 2 months ago (formatting on that has changed by itself, taking out all of my paragraphs breaks). Michigan has seen their Preseason-All American return to the lineup. Ball State has turned their season back around after starting 3-9. Both teams are in the thick of tight conference pennant races.
This time around, the stakes are equally unimportant in the long run, but both teams need to keep their momentum going as they enter their last full week of regular season games.
Quick preview after the jump.
That just happened. Michigan wins the series 2-1 on the most stunning senior day in recent memory. This series has it all - A pitcher's duel, a stunning heart breaker, a come-from-behind win on senior day featuring the two co-captains completing a walk off. If you have ever been a baseball fan, this was the series for you.
W – Gerbe (2-0)… Save – Burgoon (9)
Game one was the pitchers' duel. Michigan managed the early lead thanks to a leadoff walk of Patrick Biondi. After going to third on a perfectly placed hit and run by Toth, going right through the hole vacated by the second baseman, LaMarre would knock him in on a would-be double play, but Northwestern's second baseman double clutched, giving LaMarre just enough time to beat out the throw. After Berset's single, Crank would line out deep to left, gaining an easy sacrifice fly, and Michigan led 2-0.
Alan Oaks was on the mound for Michigan and had a pretty good game. In his 6 innings of work, he gave up 7 hits and 3 runs. Two of those came in the form of solo home runs by Northwestern's third baseman Chris Lashmet. The third run also involved Lashmet. In the 6th, he would single and score on a Zach Morton double that screamed past a diving Lorenz and took a strange hop off the wall, evading Ryan LaMarre in left.
LaMarre would lead the response for Michigan, knocking a triple off the center field wall. This set up Chris Berset up for an easy RBI single.
Oaks would open the 7th with a hard hit double, and the bullpen would take over for Oaks after that, with the game tied at 3 a piece. Gerbe would give up a sac bunt to move the runner to third, but Mike Dufek made a great play on a slow roller by the next batter to gun the runner trying to score and preserve the tie.
In the bottom of the 7th, Biondi got the offense started on a two-out rally. His walk was followed by back-to-back singles by Toth and LaMarre to bring in a run. With the lead, 4-3, it was all Burgoon from here on out.
- The Pen – 3 IP, 2 H, 1 BB, 1 K, W, Save
- Anthony Toth – 3/4
- Ryan LaMarre – 2/4, 2 R, 2 RBI, 3B
- Game Time: 2 hours 35 minutes
- Attendance: 1385
- Coley Crank – 0/3 RBI, 3 LOB, SACF
The rest of the series, including the THRILLING CONCLUSION, after the jump.
Just a little bit of change since last week, but ESPN did tweak their ratings a bit. Action since last rankings:
NONE. I think for the first time since I've been doing this, there were no commits in the Big Ten this week. Scout did drop a couple guys down from 3-stars to unranked, which happens to push Wisconsin past Minnesota. Michigan will hopefully have a couple new commits by next Sunday.
|Big Ten Recruiting Class Rankings|
|Rank||School||# of Commits||Rivals 250||Scout Average||ESPN 150|
I'll only make charts for the teams that currently have commits. Rivals 250 means that a given prospect is on the Rivals 250 to Watch, and ESPN 150 means that a prospect is on the Watch List for the ESPNU 150. Scout ratings are on the 5-star scale.
|#1 Ohio State - 10 Commits|
ESPN now deems DE Chase Farris worthy of being on the ESPN150 watch list.
|#2 Notre Dame - 9 Commits|
Nothing new for the Irish.
|#3 Michigan - 4 Commits|
Brennen Beyer goes back on the ESPN150 watchlist.
|#4 Michigan State - 5 Commits|
|#5 Indiana - 8 Commits|
Teams are going to start passing the Hoosiers once they get more commits.
|#6 Wisconsin - 2 Commits|
No change for Wisconsin.
|#7 Minnesota - 2 Commits|
ESPN finally acknowledges that Calvin Phillips isn't committed to the Gophers (which he hasn't been for at least a couple months now).
|#8 Iowa - 3 Commits|
Iowa picks up a decent offensive lineman. They're right on the heels of Minnesota and Wisconsin, passing Northwestern and Illinois.
|#9 Northwestern - 2 Commits|
Both of NU's commits are unranked to the services.
|#10 Illinois - 2 Commits|
Still just two for the Illini.
Summer is upon is, and with it, a bit of a lull in our mgoblogging fervor - there are simply not as many sports to talk about. The great wait for the football season begins.
With this in mind, what better time to celebrate this very blog in some bizarre and uniquely mgobloggish way? Hence I present: MGoStats, a statistical look at this blog over the years since its inception.
It began on December 4th, 2004, with the following post at 6:30am by some guy named Brian:
An inauspicious beginning, to say the least, but thus mgoblog was born. In the years since, we have all come here for a multitude of reasons: to celebrate the highs, commiserate during the lows, but mostly for one single reason, which is to hear what one Brian Cook has to say about all matters Michigan Football (and occasionally other sports).
So I found myself wondering: how much has Brian said over the years? A couple of python scripts later, I had some answers. I wrote a trivial script to download the entire blog (old pages are available through links of the form
http://www.mgoblog.com/?page=X, where higher
X values link to older pages), and then a less trivial script to parse the downloaded content into a more manageable form. The python SGML parser is amazing, for those of you who care about such things.
What I found follows below. Note: there may be some errors, but I believe the results to be in the right ballpark.
Perhaps the single most amazing fact is that Brian himself has written something on the order of 3 million words (or typed about 17 million characters) over about 3600 articles. Wow! That's a lot of content, from his hands to our eyeballs.
|Who||Articles||Words (Millions)||Characters (Millions)|
The table shows these sums, as well as the sums across all contributed articles (including ones from Tim, TomVH, formerlyanonymous, and anyone else who has made the front page). It might be interesting to see how these counts (number of articles, number of words, number of comments made by users) play out on a week-by-week basis. So interesting one could even make a ... chart? Chart. Or actually, Charts.
The first chart I present is the number of articles published per week over the entire existence of mgoblog.
From the chart, one can observe some interesting facts. First, from mgoblog we should expect about 14 articles per week on average over the course of a year. Second, that number is notably higher in the fall (no surprise), and lower in the spring. Finally, and perhaps most interestingly, one can see the growth of the mgoblog community in the orange bars, which represent articles written by somebody other than Brian; this content, which now represents a significant portion of mgoblog, picked up halfway through last year and has continued to get stronger. Brian's efforts at making the blog more than just himself are clearly having an impact.
The second chart just shows the number of words on a per week basis:
The graph reflects the same trends seen above, but in word counts. Even early on, Brian was producing above 10,000 words per week during football season, and last year during the same season, we were spoiled with over 30,000 words per week about the sport and team we love.
Finally, I show the number of comments per article:
The big effect in this graph is the lack of comments before the switch to the new blog infrastructure (e.g., the Haloscan era). The other effect is the growth of the community: the difference in the number of comments in Fall '08 and Fall '09 is likely a sign of the increased importance of this site as a place for the broad UM football community. Aside: the one early outlier which has a large number of comments (Fall '06) is just full of a bunch of comment spam: Unverified Voracity 99 Bonus Guest. Who knows why it's there, but Brian should probably remove those comments.
I was also interested in what the longest articles were, but that should have been obvious: UFRs. Here are the ten longest articles (by number of letters in the article):
- 10. Upon Further Review: Defense vs Notre Dame (by Brian on September/16/2009, 48949 letters long)
- 9. Upon Further Review: Defense vs Iowa (by Brian on October/14/2009, 49477 letters long)
- 8. Upon Further Review: Defense vs Indiana (by Brian on September/30/2009, 49913 letters long)
- 7. Upon Further Review: Offense vs Iowa (by Brian on October/15/2009, 50279 letters long)
- 6. Upon Further Review: Offense vs Illinois (by Brian on November/5/2009, 50421 letters long)
- 5. Upon Further Review: Defense vs Purdue (by Brian on November/11/2009, 51002 letters long)
- 4. Upon Further Review: Offense vs Purdue (by Brian on November/12/2009, 51279 letters long)
- 3. Upon Further Review: Offense vs Notre Dame (by Brian on September/17/2009, 51572 letters long)
- 2. Upon Further Review: Offense vs Western Michigan (by Brian on September/10/2009, 51616 letters long)
- 1. Upon Further Review: Offense vs Indiana (by Brian on October/1/2009, 51721 letters long)
If you remove the UFRs from the list, these ten get the longest billing. A number of previews and various other summaries show up:
- 10. Michigan 2007, Part II: Defense (by Brian on August/31/2007, 28513 letters long)
- 9. Michigan State: Sometimes The Bar Eats You (by Brian on August/13/2007, 28636 letters long)
- 8. Purdue 2007: You're Killing Your Father, Larry (by Brian on August/23/2007, 29656 letters long)
- 7. Purdue 2008: Tiller On A Treadmill (by Brian on July/31/2008, 29964 letters long)
- 6. Illinois Preview: Redact This (by Brian on August/9/2007, 30014 letters long)
- 5. Michigan Preview 2005: A Tale Of Two Units, Part I (by Brian on August/30/2005, 30163 letters long)
- 4. Offense Unit By Unit, 2008 (by Brian on August/26/2008, 33989 letters long)
- 3. Michigan Preview Part I: Offense (by Brian on August/29/2006, 34844 letters long)
- 2. Penn State Preview: Stupefying (by Brian on July/20/2007, 35006 letters long)
- 1. Michigan 2007, Part I: Offense (by Brian on August/30/2007, 38809 letters long)
Most-Commented Upon Articles
I was also interested in the most commented-on articles. They were:
Nothing gets people rev'd up like the Offense's Units, or RAWK MUSIC, I guess.
Finally, I was generally curious as to what words show up in the blog. Sounds like a case for a ... chart? Nope. But close, a wordle:
The word cloud here shows a list of the most popular words used in this blog, with some editing done by y.t. to remove words like "the" (actually the most popular word on the site) and so forth.
Anyhow, that's all for now. An amazing amount of content, built up over the years on the backs of UFRs and other regular features we all know and love. Thanks Brian for all the hard work - it is truly staggering to see the sheer verbiage that has powered the site over the years.
This is an offer that I was expecting to happen sooner, rather than later. QB Brett Hundley out of Chandler High School, in Arizona, received an offer from Michigan today. "They called my coach, Coach Ewan, and then he told me they had offered," said Hundley.
This one is exciting for me, because it's another Michigan recruit in my home state. I saw Brett play this past season, and he's the real deal. He's a 6-foot-4, 210-pound athletic dual threat quarterback. He says his strengths are, "Leadership, mobility, I'm coachable, and I have a pretty strong arm. I do think I need to work on my footwork, though." I would add that academics are a strong point for Brett, since he carries a 3.9 GPA.
Hundley holds offers now from Arizona, ASU, LSU, Oregon, Nebraska, Stanford, UCLA, Washington, and Michigan just to name a few. He hasn't started to focus on recruiting yet, and told me, "No one's in the lead right now. Everything is wide open for me. I'm going to start taking trips in June, or July. After that I'll start all my officials, when I'm allowed to." When asked if Michigan will receive one of those visits, he said, "I'm definitely going to talk to my dad about that one."
Chandler High is in spring practice right now, and I will be heading out to watch this coming week. I'll have some practice footage, and possibly a video interview sometime later this week.