Basebology (The Study of Baseball): Charlie Manuel flouts statistics (or does he?)

Friday, March 21, 2008

Charlie Manuel flouts statistics (or does he?)

From this BBTF thread about this Philadelphia Inquirer article:

[Phillies Manager Charlie Manuel] will continue to consult statistics when considering matchups, but he will trust his eyes more. So what if a hitter is 0 for 6 against a particular pitcher? Manuel saw those six outs, and they were all hard-hit line drives. Eyes win. And so does the gut.

This is totally illustrative of the inability of the mainstream to grasp what statistical analysis is. Manuel is here portrayed as flouting statistics to go with his gut feeling, but in reality he is demonstrating an implicit understanding of proper use of statistical analysis. 0 for 6 tells you absolutely nothing. No self-respecting statistician would try to draw any sort of conclusion ever on the basis of 6 trials.

Now, you shouldn't actually use the fact that your guy hit the ball hard six times either. That's just another statistic. However, I suspect that what Manuel is really saying here is "I know based on my vast experience observing baseball players that my player is well equipped to deal with this pitcher, I just don't have the numbers to back it up yet." And you know what? That's what you have to do in that situation.

Now if you find that your opinion is constantly leading you to choose hitters for specific situations who are generally inferior, it may be time to question your baseball experience. It should take a lot of experience to choose Neifi Perez over Albert Pujols, for example. Pujols should be your choice 99.9999% of the time, not because he has performed better or worse in a specific situation a small number of times, but because he has performed well in a general sense many, many times. Therefore, the number of situations in which Perez is a better hitter must be exceedingly small (or non-existent). If it were not, the general gap between them would be smaller as hitters.

People who do not understand statistical analysis are the ones who constantly abuse it. The people who think stats are garbage and say that "spreadsheets don't play baseball" or that "players aren't stat generating robots" are the same people who turn around and use small samples both as the tools of confirmation bias and as a straw man to attack those number-loving geeks.

For the love of God, people, please stop using small samples for any reason whatsoever.

** EDIT ** Changed use of the word "flaunt" to "flout" because I care about the English language.

1 comment:

Robert Lynch said...: I agree mostly with what you have to say about sample sizes, but both Melky and Abreu were one for one with two walks yesterday, which can only mean that Melky will become a second Bobby Abreu!; March 21, 2008 at 10:22 PM

Post a Comment

Key Stats

ARP
Adjusted Runs Prevented

ARP measures the amount of runs that a relief pitcher prevented from scoring above what an average relief pitcher would have prevented. ARP is adjusted for the situation in which the pitcher was used.

ISO
Isolated Power

ISO is the ratio of extra bases that a player has accumulated to the number of at bats he has received. ISO is essentially a player's SLG minus his batting average. This has the effect of giving a player credit only for extra base hits. ISO is not a useful measure of player value on its own, but is a very effective measure of a player's extra base ability.

OBP
On Base Percentage

OBP is the ratio of the number of times a player reached base safely to the number of opportunities he had to reach base. It effectively measures a player's skill at not making outs. Since outs are a teams most precious commodity, OBP measures perhaps the most valuable and fundamental skill a player can have.

OPS
On Base Plus Slugging Percentage

OPS is a crude metric that simply sums a player's on base and slugging percentages. It is probably the most popular non-traditional measure of overall batting performance due to its simplicity. However, it has drawn criticism from performance analysts for its inaccuracy relative to other advanced metrics and because it works by adding two numbers with different denominators together to produce a conceptually meaningless quantity. It is best used as a quick and dirty estimator of batting prowess.

SLG
Slugging Percentage

SLG is the ratio of total bases that a player has accumulated to the number of at bats he has received. It is essentially a weighted batting average that gives a player more credit for extra base hits.

UZR
Ultimate Zone Rating

UZR is a defensive metric that uses play-by-play data to determine how good a player's defense is. On Fangraphs, it is denominated in runs saved above average.

VORP
Value Over Replacement Player

VORP measures the amount of runs that a player contributed above what a "replacement player" at the same position would produce. VORP considers only offensive contributions.

WARP
Wins Above Replacement Player

WARP measures the amount of wins that a player contributed above what a "replacement player" at the same position would produce. WARP considers both offensive and defensive contributions.

WXRL
Win Expectancy added above Replacement adjusted for Lineup

WXRL measures the amount of wins that a relief pitcher contributed above what a "replacement player" would produce. WXRL differs from WARP because it is adjusted for both the game situation in which the pitcher was used and the hitters that the pitcher faced.

Basebology (The Study of Baseball)

Friday, March 21, 2008

Charlie Manuel flouts statistics (or does he?)

1 comment:

Blog Archive

Key Stats

Contributors