Question on BABIP
what is a league average number for BABIP? i was looking at the stats for the leading mvp candidates in the NL and noticed how much lower the BABIP's of beltran and pujols are compared to most of the other guys in the top ten for RC/game. beltran is at .277 and pujols is at .275, while howard is at .346, as are many others. what is a baseline average for BABIP over the course of a season? at what point is a player unlucky or lucky on balls in play? thanks in advance for any help.
0 recs |
21 comments
|
Comments
Response
Remember that it's tough to determine whether a batter is getting lucky or unlucky with BABIP until you consider their line-drive percentage. A guy who hits 90% popups and has a BABIP of .220 is not unlucky, he's just not a good baseball player
yes
you can find batted ball type data for hitters at hardballtimes.
Luck?
by TT on Sep 5, 2006 5:38 PM EDT reply actions
well, that's precisely it
that's the whole point. batting average has a lot to do with luck. (i'll define "luck" as the sum of all factors other than application of a repeatable skill by the hitter.) hitting the ball harder certainly helps, but some hard-hit balls are right at somebody, and some base hits are not hard-hit balls. a season's worth of plate appearances for one batter isn't necessarily enough to even out the breaks.
the idea behind looking at BABIP is to figure out which hitters are hitting the ball hard but having it go right at a fielder more often than others, and which guys have an inflated average because their bloops and bleeders are finding holes and their ground balls have eyes and their popups are falling in between three fielders, none of whom can be charged with an error.
as for your claim about research, well, it depends on your definition of luck. here's a link to a related study that seems to support the notion that hitting line drives is higher on the skill-to-luck spectrum than plain batting average:
http://www.hardballtimes.com/main/article/if-line-drives-could-speak/
siiigh
by nms on Sep 5, 2006 6:28 PM EDT up reply actions
I think that's his point
yes
BABIP
How does BABIP tell you any more about that than batting average? You just eliminate the home runs and strikeouts from the equation. How does that tell you whether it was luck or that they hit the ball hard?
here's a link to a related study that seems to support the notion that hitting line drives is higher on the skill-to-luck spectrum than plain batting average:
The article seems to assume that, but presents no evidence for it. In fact it seeme to present the opposite case. Palmeiro, in fact, struggled with the same "bad luck" the following year and is out of baseball. So did Hidalgo.
But I doubt anyone would be surprised to learn that good hitters tend to hit line drives more often than bad hitters. Or, perhaps more to the point, those that hit a lot of soft line drives don't hang around the major leagues very long.
And, not surprisingly, the article finds a lot of speedy players are "lucky" and a lot of slow players are "unlucky".
But hitting infield singles is obviously not a "luck" issue either since guys who have speed are more likely to beat out a ball than guys who aren't. In fact, players with speed often consciously work at making use of that tool by hitting the ball on the ground. And its not really surprising that guys who are slow and hit a lot of groundballs don't tend to hang around the major leagues very long either.
The problem here is that in an effort, I suppose, to simplify, the idea is that you can isolate one aspect of the game and pretend the other aspects don't effect its value. Moreover there is a pretense that players are not consciously adapting their games to get the complimentary benefits of their skills. I think either one is rarely, if ever, true.
by TT on Sep 5, 2006 8:28 PM EDT up reply actions
response
in this particular context, removing HR and K just takes out irrelevant events. we know that a strikeout isn't going to drop in for a lucky hit. we know (usually) that a home run is not going to be caught. any other AB could go either way, so we only look at those.
of course you're right--you still don't know how often they hit the ball hard. that's why the batted ball type data is useful.
you're also dead-on about the article itself not being particularly convincing. to be honest, most of it is useless--i only linked to it because the introduction to that article has references to several relevant studies. sorry i didn't make that clearer. the study in the body of the article has some serious weaknesses (which you've already pointed out). i wish they had at least tried to account for speed.
by the way--xBABIP?
expected BABIP = GB% * (league BA on GB) + FB% * (league BA on FB) + LD% * (league BA on LD)
where the league averages are around .276, .212, and .743 for GB, FB and LD respectively.
you could make this slightly smarter by removing both HR and infield flies from FB, since all HR are hits and pretty much no IF are hits. then if you really wanted to do things right you could also park-adjust, since some parks are tougher to field in than others.
but even the dumb version seems to have some merit, and yet i've never seen anybody compute this stat, which i called xBABIP in that other thread. i suspect it might get us closer to isolating luck from skill, since probably most people agree that generating a certain type of batted ball is a skill for both pitchers and hitters.
Yeah
Actually I made a spreadsheet that runs a Markov chain with batted ball probabilities to create a BIP-derived ERA. I could send it to you if you want...it's pretty cool.
neat
the thing is, you could compute the same stat for both pitchers and hitters, and see who's been "hit-lucky" or "hit-unlucky" (where the working assumption is that the ability to produce a certain distribution of batted ball types constitutes "skill", and "luck" encompasses everything else).
this might turn out to be valuable for player projection systems. or it might not--i really don't know whether it's true that producing batted-ball types is a repeatable skill (though i strongly suspect it is), or that it is the only relevant repeatable hitting/pitching skill other than K/BB/HR (i sort of suspect that it isn't, but i really don't know). the most surprising thing to me is that nobody has yet done this on BP or HBT or anywhere else that has come to my attention.
hmm
http://www.vivaelbirdos.com/story/2006/1/21/13956/0281
except he ran the numbers for juan encarnacion, not chien-ming wang.
and over on HBT, studes made a similar attempt:
http://www.hardballtimes.com/main/article/im-batty-for-baseball-stats/
except instead of using GB%, FB%, and LD%, he uses LD%, FB% and ... K rate. it's a head-scratcher at first, but he was optimizing by maximum correlation coefficient rather than by inductive reasoning. apparently for hitters, K rate is important because guys that swing harder strike out more but hit the ball harder when they don't K.
this weekend if i have some time i'll flesh out my idea some more. maybe i'll crunch xBABIP for the last couple of years and see if it's any good.
Some thoughts...
Second, I didn't run Markov chains, but did some various regressions, and found that if you don't have batted ball type, you can come up with a pretty reasonable xBABIP using just 2 factors that each have a strong positive correllation to BABIP:
- (non-HR)XBH%, and
- (speed score).
That said, and getting back to the original query... I don't have any idea why Pujols and Beltran would have low BABIPs, but they do appear to have lower LD% than I might expect (annually, not just this year).
LD%
by DavidWrightismyGod on Sep 7, 2006 9:09 AM EDT up reply actions
my guess
i'll second your comment that this is a fascinating discussion.
Pujols Beltran
Beltran's is .83
Those are low, but far from league lowest.
Question
by CrimsonLiederhosen on Sep 7, 2006 11:52 AM EDT up reply actions

by 










