Discussion Question: Least Favorite Stathead Stat
Discussion Question: What is your least-favorite stathead stat, and why?
By "stathead stat," I mean something developed within the last 10 years or so. OPS doesn't count as a stathead stat. If it was something people were using in 1995, it doesn't count.
My own opinion: I don't know why, but the obsession with BABIP gets on my nerves sometimes, even though I use it myself.
75 comments
|
2 recs |
Do you like this story?
Comments
damn
I thought this said “least favorite stathead” and left it at that. Could of had an f’ing field day with that one.
baseball rules.
I saw and thought
The exact same thing.
WTY's ERA+ = 123 : - /
by Figgi4life on May 6, 2010 1:10 AM EDT via mobile up reply actions
BABIP
My own opinion: I don’t know why, but the obsession with BABIP gets on my nerves sometimes, even though I use it myself.
I 100% agree. I think people overuse this stat and if a player is struggling they automatically turn to it and say it’s the cause. If a player is struggling he might not be making solid contact, which can lead to a lower BABIP. BABIP isn’t all luck, it’s caused by multiple things.
BABIP
Do statheads say that there is a norm BABIP for all hitters to revert to? That’s new to me…
I don't think so
I think they say that there is sort of normal level for each hitter that he will trend towards. So, if a Player A normally averages a BABIP of say .300 but is currently at .200 he could be experiencing bad luck (or maybe he is not making good contact, etc.).
Wilbur Wood touches on this below. As an aside, I remember one of the first baseball cards I ever got was a Wilbur Wood card.
Keep Moving Forward.
Agreed
A friend of mine uses BABIP against Shin Soo Choo who has always had a high BABIP. He says Choo is bound to regress eventually… hasn’t Choo proven he truly is this good?
PPPPPPUNTO 4 MVP 2010
Matt Kemp
is the poster boy for this. There’s something to be said for some guys hitting the ball hard, all the time, and a correlation existing between that and a higher BABIP.
by blackoutyears on May 8, 2010 3:59 PM EDT up reply actions
thoughts
BABIP would rate right up there . . .for a while it seemed like people were trying to use line drive rate as a means to predict BABIP which would have been even worse, but it seems like that’s calmed down as we increasingly recognize that a “line drive” is pretty subjective. So I’d say BABIP is pretty terrible.
I tend to think that defense-independent ERA calculations are rather misleading too. They’re useful tools to evaluate a player, but at the end of the day I think plain old ERA is just as important if not more so. There are players who definitely constantly pitch above or below defense-independent ERA, and I suspect a lot of it has to do with mental makeup.
There are players who definitely constantly pitch above or below defense-independent ERA
Which ones?
Javy Vazquez
Career ERA = 4.24
Career FIP = 3.87
Career xFIP = 3.67
by auclairkeithbc on May 6, 2010 2:58 PM EDT up reply actions
pretty consistent too
2003 and 2007 were the only years his FIP was higher than his ERA, but only by 0.07 and 0.06 respectively.
by auclairkeithbc on May 6, 2010 3:00 PM EDT up reply actions
ha
well, it wasn’t my point. that was just someone i knew off the top of my head, though i had to look up the specific numbers of course.
by auclairkeithbc on May 6, 2010 6:15 PM EDT up reply actions
A sampling
Career FIP – ERA:
Ryan Franklin: .75
Jarrod Washburn: .50
Barry Zito: .47
Steve Trachsel: .45
…
Jeremy Bonderman: -.66
Glendon Rusch: -.75
Len Barker: -.94
And the xFIP gaps are even wider for these guys (where the data is available of course).
+1
I am not a big fan of xFIP although I generally do like FIP. However, people need to realize that if a guy underperforms his FIP for 3-4 straight years there is likely a reason. He may not be good pitching from the stretch, or he may just get flustered when the going gets tough.
"When a true genius appears in the world, you may know him by this sign, that the dunces are all in confederacy against him."
-Jonathan Swift
by King Billy Royal on May 6, 2010 8:59 PM EDT up reply actions
Interesting
I’m of the mind that it’s a better predictive tool than an evaluative one.
by blackoutyears on May 8, 2010 4:01 PM EDT up reply actions
I think its UZR-- if it counts
I dont think most realize it needs multiple years of data to figure out the correct prognosis on a player’s defense…
by lakersdodgersyankees4life on May 5, 2010 10:47 PM EDT reply actions
I totally agree
it also doesn’t account well for first basemen or catchers
That's why they call them business sox
How does it not account for first basemen well?
www.zekeishungry.com
by thejd44 on May 8, 2010 6:26 PM EDT up reply actions
BABIP is interesting
I’m interested to see how the next frontier-PitchFX Data-effects the way we look at batted ball data. I have a feeling we’ll go beyond line drives and start talking about velocity off the bat, area-of-field selectivity, and all sorts of things I can’t conceive of.
Exciting stuff.
http://oursaviorchuck.ytmnd.com/
It's a shame you said no OPS...
cause that is the one that bothers me the most. I think it’s a great stat, but I think too many quote it as gospel. They try to paint every player with the OPS brush and don’t account for different types of players. Lead-off/speed types of players are OPS outliers I believe.
But of the newer stats that I don’t much care for, I’d have to go with FIP. I like to check out all the stats a pitcher puts together, and ERA is one of them. FIP just takes into account homers, walks, strikeouts, etc. I can see all that. But it doesn’t tell me how a pitcher does in a jam. Give me two pitchers with similar FIPs and I will want to see an ERA. I want to know if one of them is allowing less runs, because it could be indicative of how they do with runners on, pitching out of the stretch, etc. I like old-fashioned ERA, and then I can look at the component stats to see if it was a bit of an illusion or not.
Adam Dunn: Proof that even sabermetrics doesn't have it right.
OPS
I rarely see someone seriously painting all players with the same brush. For instance; common sense would tell you that comparing a SS to a 1B based off of OPS is stupid….thus you wont see many people who actually know WTF they are talking about arguing that ‘X’ 1B is better than ‘Y’ SS because ‘X’ has a much better OPS. So far in 2010 there are 18 first basemen with .800+ OPS compared to just 6 shortstops. I mean a logical person would come to the conclusion that OPS is only valuable in quickly comparing similar players.
I have seen many people on this site...
use OPS when discussing players like Bret Gardner, Michael Brantley, Jacoby Ellsbury, Nyger Morgan, Elvis Andrus, etc. Most of these speed types of players will have a sub-800 OPS.
Adam Dunn: Proof that even sabermetrics doesn't have it right.
Shoot...
meant that as a reply to metafour.
Adam Dunn: Proof that even sabermetrics doesn't have it right.
carl crawford. . .
i’ve never really heard anyone try to bad mouth Crawford, highest OPS is 830, and he is a leadoff/speed guy that doesn’t even get on base at a highly desirable rate
The Flaw in BABIP%
Lets talk about 2 concepts.
- BABIP%.
- “Mean Reversion Tendencies”
Assume that we are talking about players that have a large enough sample to have both established their own MLB BABIP% of say 32%, along with a track record of MLB production and say that such hitter is a 28 year old lifetime 300 hitter.
I constantly hear statheads and fantasy analysts say things like the above player, who is only hitting 210 has been extremely unlucky due to a 24% BABIP or if he’s hitting 347 simply say that he has been extremely lucky because of an unsustainable 39% BABPIP. Assume that the K rate hasn’t changed dramatically. This assumes that said player is making the same type of contact at all times and it is simply luck that defines the outcomes. However, in watching said player in both a cold streak and a hot streak one can clearly tell the difference in the type of contact that the player is making in a prolonged hot or cold streak. Routine grounders pulled to SS vs ball scolded up the middle. Balls driven into the gap vs. fly balls to the OF with more hang time than a Ray Guy punt. Hot streaks end and usually slumps do as well and there is an old adage that eventually the state line resembles the back of the player’s baseball card which is a “mean reversion” at work.
While clearly there is some “luck” involved with cheap hits and hard hit outs, however it is a gross oversimplification of the game to simply look at BABIP% and state that player X has been very lucky or unlucky.
Fat man is no more,
Bursting on through Heaven's Door
Come on in, says Bill
+1
At a certain point, how a pitcher is pitching should matter more than how he “should” pitching.
BABIP is also over-used.
--Pablo Zevallos of yankeesfuture.wordpress.com
by Pablo Zevallos on May 6, 2010 12:06 PM EDT up reply actions
xFIP does not bother me.
I think it is useful, but maybe I just buy into the DIPS theory a little more especially when you can segregate types of balls in play.
Keep Moving Forward.
xFIP as a predictor doesn't bother me
in that it’s probably one of the better (best?) ones out there. My issue is with the normalization itself and what exactly it tells us.
xFIP treats all pitchers as having the same ability to prevent home runs on fly balls. This is an invalid assumption, therefore, as the sample size becomes larger, the predictive value of xFIP becomes smaller.
Beyond the Box Score / Capitol Avenue Club / shwitter: @CapitolAvenue
I don't like defensive stats
for the one reason that there is so much variance between different stats comparing the same players. Whenever there is a discussion about defense, there are inevitably the couple of posters who will pick and choose which stat fits their viewpoint and then act like that is infallible, while another defensive metric they often use says something else.
R.I.P Jazz #6
I find it hard to rely on any one of the new defensive stats
However, I think if you look at all of them (UZR, +/-, PMR, Total Zone, etc) then you get a better reading. Lee Panas advocates this approach in recent book Beyond Batting Average (worth getting for the $7 pdf download in my opinion).
Keep Moving Forward.
it's not that I necessarily hate the stats all together
it’s just that there always is the asshole who calls you ignorant and an idiot for not agreeing that the defensive metric he just used is the absolute word of god.
R.I.P Jazz #6
wOBA
For some reason this one bugs me. It is just a linear weights rate stat artificially re-jiggered to look like OBP. So, if someone says Player X has a wOBA of .400 this does not tell me much other than I know that a .400 OBP is good so this must be good, I guess.
I have the same problem with Clay Davenport’s eqAv. This does not do much for me for some reason. I much prefer the old style (as in Bill James) measures of RC/27 outs. I just find this more meaningful to me personally.
I do understand there some advantages to the Tango (wOBA) and Davenport (eqAv) derived stats, but I have been able to fully embrace them.
Keep Moving Forward.
I have a problem with wOBA too
more because I just don’t know a lot about it, so if I see a guy has a .308 wOBA I have no idea if that means he’s good or not.
by Chris Redman is my hero on May 6, 2010 6:27 PM EDT up reply actions
Except he just TOLD you what it means
It lines up with OBP.
Not understanding a stat really isn’t a good reason to dislike it.
www.zekeishungry.com
by thejd44 on May 8, 2010 6:28 PM EDT up reply actions
Bill James' RC lags behind superior run-estimators such as BaseRuns
Patriot, Tango, and studes from The Hardball Times have all shown how much more accurate BaseRuns is than RC.
Aaron King is still my homeboy... iffy mechanics and all
McFAQ for all you newcomers out there.
unemployment #'s
oops wrong site
"The key to winning baseball games is pitching, fundamentals, and three run homers."
I agree with you...
the idea that someone has given up looking for work is no longer “unemployed” is basically designed to pad the stats to look good.
Not looking for work does sort of imply "not in the workforce," doesn't it?
By your implied definition (not working = unemployed), shouldn’t 12 year olds and 85 year olds in nursing homes also be considered unemployed?
www.zekeishungry.com
by thejd44 on May 8, 2010 6:30 PM EDT up reply actions
Try this
The basic concepts involved in identifying the employed and unemployed are quite simple:
* People with jobs are employed.
* People who are jobless, looking for jobs, and available for work are unemployed.
* People who are neither employed nor unemployed are not in the labor force.
your pre-teens and octegenarians would clearly be unavailable for work. What’s being referred to above is the idea that most unemployment percentages do not include those who want to work but have stopped looking after extended failure to secure it. The rates you see in the news include only those who are eligible and still looking. Most make a distinction between U3 and U4 unemployment at the least. Again, reflected in the above comments.
by blackoutyears on May 10, 2010 1:06 PM EDT up reply actions
UZR
Mostly because fangraphs had a whole series of Jason Bay posts emphasizing his low UZR’s, and some people pointed out that for 2009 at least the UZR looked unreasonably low given whatever defense metrics and even the trad stats indicated. They then changed certain things for UZR, park effects etc., and Bay went from -13 last year to +1.9 (he still was bad in prior years).
Overall uzr is great and adds to the sum store of knowledge, and it is better now that it has been updated. May they continue to improve it. But that large discrepancy between the number under the old system for Bay (and Ellsbury) and “reality” sort of poisoned the well for me. If it was that far off, it was harder to take seriously. And I generally think over time it is pretty accurate.
Epstein
Made comments about Ellsbury’s stats under “their” (the Red Sox) metrics (developed by Tom Tippett, I believe) not being as bad as what UZR showed somewhere. Sorry I cannot remember where or I would link. I wonder if the Red Sox system was already adjusting for the Fenway Park better than UZR was.
These stats are still being refined. I think they are useful and interesting, but like anything you cannot look at one piece of the data and come to a conclusion.
Keep Moving Forward.
Fenway
does seem to gum up the UZR works, doesn’t it? That and the seeming inability of the stat to accurately measure 1B defense gives me pause.
by blackoutyears on May 8, 2010 4:11 PM EDT up reply actions
I don't think it's just UZR though
I think it messes with all the advanced metrics.
www.zekeishungry.com
by thejd44 on May 8, 2010 6:30 PM EDT up reply actions
Most of the defensive and baserunning metrics
they all seem to have a long ways to go before they are dependable means of calculation
Not really a new stat, but...
Statheads love using RA instead of ERA. Give it a rest. In the long haul, they have got to be pretty correlated, and in the attempt to mainly simplify things, RA just makes things more complicated.
WAR!
Not so much WAR in and of itself, but forgetting that there are a lot of moving parts causing a lack of precision—especially when it is taken as gospel as to whether a player has over/underperformed his salary. At the very least, it is often forgotten that the precision with WAR is generally overstated — the bigger picture can be seen, but the details really can’t.
by SagehenMacGyver47 on May 6, 2010 1:53 PM EDT reply actions
I'm with you...
When calculating WAR for a player, do Offensive Statistics (50%) + Defensive Statistics (50%) = WAR? It seems like uber-stat teams like the A’s and Mariners are going with Defensive Heavy players these days. Is it because they weighing “WAR” to heavily?
Definitely WAR for me
Waaay too many people use it as a be-all end-all stat
"The A's have to be setting some record this year for simultaneously maximizing team quality and player anonymity. I guess that’s sort of their thing though." - Luke in MN
Agreed
"When a true genius appears in the world, you may know him by this sign, that the dunces are all in confederacy against him."
-Jonathan Swift
by King Billy Royal on May 6, 2010 9:00 PM EDT up reply actions
WAR! What is it good for?
Absolutely nothin’! Say it again, y’all!
Adam Dunn: Proof that even sabermetrics doesn't have it right.
by Boxkutter on May 7, 2010 2:02 AM EDT up reply actions 1 recs
WPA
I mean, it’s fun to graph out a game, but some people try using it like it actually means something. It’s a junk stat, nothing more.
Bad Left Hook - The SB Nation boxing blog
"Baseball is played on the field, not on a calculator."
+10000
I came here to post Win Expectancy as my least favorite stat
I agree exactly that its fun to see how things changed after the fact, but it has little predictive value, imo, and to make arguments later based on what the win expectancy was is silly. Its a statistic likely with very high variance, that means little in real time, I think
To everyone saying "WAR"
“WAR” is not a statistic. Wins Above Replacement is just a model, built from a set of assumptions.
There are many, many different types of WAR, they all define the parameters differently. For instance, Fangraphs’ WAR uses wOBA as the offensive metric and UZR as the defensive metric. Rally uses Total Zone. Etc..
If you don’t like the way somebody has defined one of the parameters, that’s OK. Do something else. If you prefer +/- to UZR (I do), use +/-. If you’d like to build your metric with WPA/LI instead of context neutral linear weights, do that. The model, however (for pos. players, batting + defense + positional + replacement, for pitchers, (((((LgRAA/9) – (RAA/9))/run environment ) + .500 – Rep) * IP/9), is valid. If there’s a good argument this model doesn’t work, I haven’t seen it.
If you’re going to say “WAR”, you need to specify which type of WAR and which parameter you disagree with.
Beyond the Box Score / Capitol Avenue Club / shwitter: @CapitolAvenue
The criticism applies to all formulations, since they rely on defensive metrics. Fangraphs’ is the most known, and particularly egregious, since they also base their pitcher’s WAR on FIP, which should be used as a predictive tool only.
The criticism applies to all formulations, since they rely on defensive metrics
No, no, no, no, no, no, NO, NO, NO, NO, NO, NO, NO, NO!
This is sooo wrong. First of all, nobody in the world knows what all types of WAR are comprised of. I’ve got 14 or 15 different versions saved in spreadsheets on my lap top right now, most of which nobody else ever seen. Some use UZR, some use Total Zone, some use +/-, some use a regressed version of the Fans’ Scouting Report. Some use nothing more than a WAG from me. Some ignore defense all together. However you want to do it, it’s up to you. If you don’t think any defensive metrics are good enough, do something else. That’s fine.
If you want to criticize a particular version of WAR, go right ahead. But if you’re going to criticize “WAR”, you need to be prepared to discuss why the model doesn’t work. And I’m yet to see any good reason why it doesn’t work. Bonus would be to present an improved model. The only ways to improve the model would be to a) simplify it (which is practically impossible) or b) make it more intelligent (which is very distinctly possible). I’m yet to see a model that does either of those things.
Beyond the Box Score / Capitol Avenue Club / shwitter: @CapitolAvenue
by PWHjort on May 8, 2010 10:54 AM EDT up reply actions 1 recs
I didn't realize that we were being so literal.
Maybe one day WAR can become a stat:
a numerical value that characterizes the sample from which it was derived.
Yes, some comments lumped all WAR calcs together when saying they dislike the use of UZR, and you’re right about that. And I and another stepped a bit out of bounds by saying we didn’t like the fast and loose application of WAR (whichever version one might use) to make broader statements, instead of criticizing the stat itself (which I think we’re both fine with).
by SagehenMacGyver47 on May 6, 2010 7:33 PM EDT up reply actions
UZR
I think we still have a long way to go with defensive metrics
Relive Royals History at royalsretro.blogspot.com
Most of the problems cited here aren't with the stats themselves, but with misuse of those stats
www.zekeishungry.com
by thejd44 on May 8, 2010 6:33 PM EDT reply actions

by 
















