Discussion about this post

User's avatar
Grant Marn's avatar

Several years ago at an analytics conference, Bill James described a process that he believed was more likely to yield greater clarity and useful insights. Specifically, you should always start with a question that is interesting or provocative (preferably a large or significant question) and THEN go find data to answer it. Too often these days, he said he reads some piece where the author clearly went the other direction - they found or created some new data and then went hunting for a question - any question - that it answers. The results are both obvious and suboptimal.

The problems with this approach are many. Your question and data are often too narrow in scope and less likely to be truly insightful or invite further research or questions to advance the thinking. The data leads the question and too often excludes other valuable data that would be more probative either alone or in combination in searching for truth. It's all too siloed and limiting with insufficient collaboration or peer review.

Importantly, this approach reduces the analyst at a human level to simply "the [fill in the blank acronym] person" as in Cheryl is the xABOE% person and not a more sought after broader thinker on issues. In turn, the analyst is too often tied to their own data and overly uses it in future research to the exclusion of different and better sources or approaches.

It strikes me that the NFL analytics community is awash in this precise malady which continues to grow. Too many overly narrow data sources with increasingly illogical acronyms that are sold as presumptively insightful with small sample sizes and even smaller numbers of people who can explain them at a rudimentary level along with their limitations. It's become one black box after another that few really understand but "feel" smarter just by dropping the acronym on everyone..."Terry McLaurin's xQOR is better than Justin Jefferson's. Can you believe it!?"

Well no, actually I can't. Yet, somehow these sorts of abberant conclusions which belie common sense and what we observe week to week ironically add mystique and credibility instead of scrutiny to the purported number. We are instructed that solely because it's presented in numerical form or as a "grade" it's accurate and precise...even laughably to two decimal places.

It's one thing to have this in baseball where performance is more isolated, but it's utterly fatal in football given all the interdependencies at play. This hesitancy in the belief in the proliferating numbers is precisely why film is still around as a counter, and why this debate still exists. Analytics has failed to win the day in football. Imagine people in baseball arguing the need to review game film to reach conclusions - that is a symbol of how far football has to go.

Football requires a broader lens and holistic analytical approach versus the current myopic linking together of a series of unrelated data source acronyms and hoping that it says something...about something.

Thanks for raising the issue.

Expand full comment
Grant Marn's avatar

Robbie, thank you for taking the time to read my comment here and offer such a thoughtful response. I appreciate how valuable time is, and you taking some of yours to provide your reactions here is worth a ton. My kids constantly chide me about commenting on Substack, "dude, you get no likes, no comments or even a thumbs up...and they're so ridiculously long, nobody is reading them. Why do you do it?"

I tell them that I still subscribe to the old notion that ideas are like trees - you plant them knowing that you'll likely never see them to fruition, but you remain hopeful and optimistic that they will somehow in some way positively impact another person in the future. Even if you never know it. Now, I can show them your comment as proof!

In the spirit of your mission, I'll leave you with a true story. In the early 90s, I communicated with an award-winning writer from the Akron Beacon Journal about a particular Cleveland Cavalier player that he was fond of, but that I found unproductive and inefficient. I wrote a very nice letter outlining my valid analytical arguments using some box score data. As background, he is an amazing writer (I have read almost all his books) and an even better person from the many people who know him personally that I have spoken with. When his letter back to me came in the mail, I couldn't have been more excited.

When I opened it up, his response was very polite but quite short. It noted my admission that I had not seen this player play a lot due to the limited number of NBA games being broadcast in those days. He said that he, on the other hand, was fortunate as a writer to have seen every game he had played in. He concluded by noting that since I hadn't really watched him play, I wasn't able to really comment on his value.

While completely professional and polite, it still stung. Beyond that, his comment bothered me on a deeper level literally for decades - not because it was dismissive (it wasn't really - his view was the prevailing one at that time), but because it seemed so wrong from someone I viewed as so knowledgeable. It didn't seem to help that a few years later, the player at issue was washed out of the League and labelled a "bust."

What resonated long after, was the key question he raised - do you need to "see" a performance to evaluate it?

In the 25 years since his letter found its way into my mailbox, the world has moved decidedly towards my 1991 worldview. Analytics has become a huge industry in sports, and everyone speaks now of numbers as synonymous with truth. If you don't worship at the altar of numbers cloaked under the label of "analytics" you are a foolish Neanderthal not worthy of consideration.

Yet, at a moment in time when I should have been able to declare my intellectual victory, I was troubled. Some of it what that most of the numbers were not analytics at all but statistics (and nobody seemed to know the difference), while others were just middle of the bell curve crude averages partially comprised of wholly irrelevant use cases than the one before the coach or manager in the game.

Beyond that, most were calculated by people who never saw a single minute of the performance that generated that number. For example, I read somewhere people calculated updated WAR stats overnight in some nondescript office building long after the games were complete, and the stadium lights turned off. They went about their business with the utmost confidence that they were truth tellers...oracles who weren't required by the gods to watch the contests.

I wasn't so sure.

Age robs of you many things, but what it gives back is perspective. A few years ago, I found myself at an airport when my flight was delayed. On the television in the lobby was one of those morning "hot take" shows that I never watch. Held captive, I watched as one panel member trotted out some statistics that I was very familiar with and had personally used in evaluating players - the kind of analysis I might use today if writing a letter to a sportswriter.

When he finished, the other panelist simply said "nice numbers, but you act as if I haven't actually watched the games. Because I have, I know that those numbers are complete garbage." That struck me and took me back 30 years in an instant.

What he said I agreed with. I had watched his games too and found the numbers to be misleading at best. From that point forward, I started paying closer attention to how numbers were being used and how they were presented to the public.

I don't dislike numbers - to the contrary. They are invaluable in helping us see the penumbras of truth in the complex interactions that take place within a game. What's troubling though is that too many of them are foisted upon the public without any real understanding of their calculation or limitations by a far too often smug and sanctimonious media intelligentsia.

Strangely, I increasingly disagree with the "numbers people," not because they are using numbers, but because they are misusing numbers in my view. It's taken 30 plus years, but I've finally arrived at a place of perspective where I see the other point - the others side of the argument.

Numbers are terrific when used carefully and appropriately and offered on bended knee, but you really need to see the games too. At a minimum, I know this. Whether it changes your opinion or not, watching sports - particularly with other people - is a whole lot more fun that silently updating WAR numbers while oblivious to what happened.

And isn't that the point of sports after all? Thanks again for the time.

Expand full comment
5 more comments...

No posts