Minor League Ball: An SB Nation Community

Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
New Blog: Cowboy Altitude for Wyoming Fans!

Question on BABIP

what is a league average number for BABIP? i was looking at the stats for the leading mvp candidates in the NL and noticed how much lower the BABIP's of beltran and pujols are compared to most of the other guys in the top ten for RC/game. beltran is at .277 and pujols is at .275, while howard is at .346, as are many others. what is a baseline average for BABIP over the course of a season? at what point is a player unlucky or lucky on balls in play? thanks in advance for any help.

0 recs  |  Comment 21 comments |

Story-email Email Printer Print

Comments

Display:

Response
I believe the average BABIP in the majors is from .300 - .310 or so.  The BABIP in the minors is higher due to sketchy field conditions and worse fielding, however.  

Remember that it's tough to determine whether a batter is getting lucky or unlucky with BABIP until you consider their line-drive percentage.  A guy who hits 90% popups and has a BABIP of .220 is not unlucky, he's just not a good baseball player

by Jgaztambide on Sep 5, 2006 3:14 PM EDT reply actions  

yes
the expected BABIP for a hitter is their LD% (line drive percentage) plus .120 or so. so if a guy is hitting 18% of his balls in play for line drives, then he should have a BABIP of .300. if it's significantly higher or lower than that, it's most likely due to luck.

you can find batted ball type data for hitters at hardballtimes.

by jpahk on Sep 5, 2006 3:27 PM EDT up reply actions  

from the '06 Baseball Prospectus:
"A typical league-average BABIP is about .290."

by The Colonel on Sep 5, 2006 3:16 PM EDT reply actions  

Luck?
As far as I can tell there is no rearch for any claim that BABIP for a hitter has any more to do with "luck" than Batting Average does. Hard hit balls tend to go for hits whether they are ground balls, line drives or fly balls. Weakly hit balls tend to be outs.

by TT on Sep 5, 2006 5:38 PM EDT reply actions  

well, that's precisely it
As far as I can tell there is no rearch for any claim that BABIP for a hitter has any more to do with "luck" than Batting Average does.

that's the whole point. batting average has a lot to do with luck. (i'll define "luck" as the sum of all factors other than application of a repeatable skill by the hitter.) hitting the ball harder certainly helps, but some hard-hit balls are right at somebody, and some base hits are not hard-hit balls. a season's worth of plate appearances for one batter isn't necessarily enough to even out the breaks.

the idea behind looking at BABIP is to figure out which hitters are hitting the ball hard but having it go right at a fielder more often than others, and which guys have an inflated average because their bloops and bleeders are finding holes and their ground balls have eyes and their popups are falling in between three fielders, none of whom can be charged with an error.

as for your claim about research, well, it depends on your definition of luck. here's a link to a related study that seems to support the notion that hitting line drives is higher on the skill-to-luck spectrum than plain batting average:

http://www.hardballtimes.com/main/article/if-line-drives-could-speak/

by jpahk on Sep 5, 2006 5:56 PM EDT up reply actions  

siiigh
but hitters with "unluckyily low" or "luckily high" BABIPs tend to repeat those BABIPs within a general range.  It IS a skill and it IS repeatable.

by nms on Sep 5, 2006 6:28 PM EDT up reply actions  

I think that's his point
I don't want to speak for jpahk, but I believe his last comment regarding LD% covers what you're trying to say.  The ability to hit lines drives for example is somewhat repeatable and reduces some of the element of luck in BA.

by count sutton on Sep 5, 2006 8:13 PM EDT up reply actions  

yes
that's pretty much what i was getting at. i guess i didn't state it very clearly, did i? hitting the ball hard is a skill, but unfortunately you can't look at a guy's stat line and tell how many balls he hit hard. but since there are people nice enough to count line drives and publish the data for free, that's a nice start.

by jpahk on Sep 6, 2006 1:11 AM EDT up reply actions  

BABIP
the idea behind looking at BABIP is to figure out which hitters are hitting the ball hard but having it go right at a fielder

How does BABIP tell you any more about that than batting average? You just eliminate the home runs and strikeouts from the equation. How does that tell you whether it was luck or that they hit the ball hard?

here's a link to a related study that seems to support the notion that hitting line drives is higher on the skill-to-luck spectrum than plain batting average:

The article seems to assume that, but presents no evidence for it. In fact it seeme to present the opposite case. Palmeiro, in fact, struggled with the same "bad luck" the following year and is out of baseball. So did Hidalgo.

But I doubt anyone would be surprised to learn that good hitters tend to hit line drives more often than bad hitters. Or, perhaps more to the point, those that hit a lot of soft line drives don't hang around the major leagues very long.

And, not surprisingly, the article finds a lot of speedy players are "lucky" and a lot of slow players are "unlucky".

But hitting infield singles is obviously not a "luck" issue either since guys who have speed are more likely to beat out a ball than guys who aren't. In fact, players with speed often consciously work at making use of that tool by hitting the ball on the ground. And its not really surprising that guys who are slow and hit a lot of groundballs don't tend to hang around the major leagues very long either.

The problem here is that in an effort, I suppose, to simplify, the idea is that you can isolate one aspect of the game and pretend the other aspects don't effect its value. Moreover there is a pretense that players are not consciously adapting their games to get the complimentary benefits of their skills. I think either one is rarely, if ever, true.

by TT on Sep 5, 2006 8:28 PM EDT up reply actions  

response
How does BABIP tell you any more about that than batting average? You just eliminate the home runs and strikeouts from the equation. How does that tell you whether it was luck or that they hit the ball hard?

in this particular context, removing HR and K just takes out irrelevant events. we know that a strikeout isn't going to drop in for a lucky hit. we know (usually) that a home run is not going to be caught. any other AB could go either way, so we only look at those.

of course you're right--you still don't know how often they hit the ball hard. that's why the batted ball type data is useful.

you're also dead-on about the article itself not being particularly convincing. to be honest, most of it is useless--i only linked to it because the introduction to that article has references to several relevant studies. sorry i didn't make that clearer. the study in the body of the article has some serious weaknesses (which you've already pointed out). i wish they had at least tried to account for speed.

by jpahk on Sep 6, 2006 1:17 AM EDT up reply actions  

by the way--xBABIP?
over in the wang crystal ball thread, i tried to "predict" what his BABIP would be from his GB%, FB% and LD%. has anybody ever seen a systematic study along these lines? some of the articles i linked have used LD% + .120 as an estimator of BABIP, but why not use all three batted ball frequencies:

expected BABIP = GB% * (league BA on GB) + FB% * (league BA on FB) + LD% * (league BA on LD)

where the league averages are around .276, .212, and .743 for GB, FB and LD respectively.

you could make this slightly smarter by removing both HR and infield flies from FB, since all HR are hits and pretty much no IF are hits. then if you really wanted to do things right you could also park-adjust, since some parks are tougher to field in than others.

but even the dumb version seems to have some merit, and yet i've never seen anybody compute this stat, which i called xBABIP in that other thread. i suspect it might get us closer to isolating luck from skill, since probably most people agree that generating a certain type of batted ball is a skill for both pitchers and hitters.

by jpahk on Sep 6, 2006 1:23 AM EDT reply actions  

Yeah
I did something similar to this.

Actually I made a spreadsheet that runs a Markov chain with batted ball probabilities to create a BIP-derived ERA.  I could send it to you if you want...it's pretty cool.

by limozeen on Sep 6, 2006 3:26 AM EDT up reply actions  

neat
it sounds cool. what are you chaining? i don't see where any iteration would come in. but i'd definitely like to take a look at it.

the thing is, you could compute the same stat for both pitchers and hitters, and see who's been "hit-lucky" or "hit-unlucky" (where the working assumption is that the ability to produce a certain distribution of batted ball types constitutes "skill", and "luck" encompasses everything else).

this might turn out to be valuable for player projection systems. or it might not--i really don't know whether it's true that producing batted-ball types is a repeatable skill (though i strongly suspect it is), or that it is the only relevant repeatable hitting/pitching skill other than K/BB/HR (i sort of suspect that it isn't, but i really don't know). the most surprising thing to me is that nobody has yet done this on BP or HBT or anywhere else that has come to my attention.

by jpahk on Sep 6, 2006 4:12 AM EDT up reply actions  

hmm
i did some poking around. somebody else had the exact same idea as me here:

http://www.vivaelbirdos.com/story/2006/1/21/13956/0281

except he ran the numbers for juan encarnacion, not chien-ming wang.

and over on HBT, studes made a similar attempt:

http://www.hardballtimes.com/main/article/im-batty-for-baseball-stats/

except instead of using GB%, FB%, and LD%, he uses LD%, FB% and ... K rate. it's a head-scratcher at first, but he was optimizing by maximum correlation coefficient rather than by inductive reasoning. apparently for hitters, K rate is important because guys that swing harder strike out more but hit the ball harder when they don't K.

this weekend if i have some time i'll flesh out my idea some more. maybe i'll crunch xBABIP for the last couple of years and see if it's any good.

by jpahk on Sep 6, 2006 4:23 AM EDT up reply actions  

Some thoughts...
First, BABIP for a league can be easily computed, it's just (H-HR)/(AB-HR-K).  For the AL this year, it's .307.

Second, I didn't run Markov chains, but did some various regressions, and found that if you don't have batted ball type, you can come up with a pretty reasonable xBABIP using just 2 factors that each have a strong positive correllation to BABIP:

  1. (non-HR)XBH%, and
  2. (speed score).  
My thought process was that two things led to a higher percentage of hits on BABIP, and those are 1) hitting the ball hard, and 2) running quickly.  Obviously, for pitchers, the footspeed factor should balance out to somewhere near league average, and I think I'll go check and see how good of an indicator extra-base hit percentage is.

That said, and getting back to the original query... I don't have any idea why Pujols and Beltran would have low BABIPs, but they do appear to have lower LD% than I might expect (annually, not just this year).

by BobbyMac on Sep 7, 2006 6:25 AM EDT reply actions  

LD%
after reading the above replies mentioning LD% i went back and looked at their numbers, and indeed they have low LD%, which when you add the .120 recommended shows beltran and pujols to only be marginally unlucky this year. then i wondered why their LD% was so much lower than the other elite hitters. does anyone have any ideas? i am fascinated by this entire discussion so far. thanks for all the good feedback.

by DavidWrightismyGod on Sep 7, 2006 9:09 AM EDT up reply actions  

my guess
is that they're lofting the ball more in an effort to hit home runs. and far be it from me to say that this is a bad strategy, because these guys are killing the ball!

i'll second your comment that this is a fascinating discussion.

by jpahk on Sep 7, 2006 9:27 AM EDT up reply actions  

Pujols Beltran
Pujols GB/FB is .88
Beltran's is .83

Those are low, but far from league lowest.

by limozeen on Sep 7, 2006 9:39 AM EDT up reply actions  

Question
Does the GB:FB ratio for baters ignore homeruns like the GB:FB ratio typically used for pitchers does?  This is a pet peeve of mine, the other day Tavarez had a GB:FB ratio of 14:0 even though he gave up a HR to Thome.  Ridiculous.
God rested one day out of 7, Felix rests 4 out of 5.

by CrimsonLiederhosen on Sep 7, 2006 11:52 AM EDT up reply actions  

Yes
Only counts hits inside the park.  Hitting out of the park isn't considered to be a part of luck, and thus is irrelevant to the typical G/F analysis.

by Brickhaus on Sep 8, 2006 3:38 AM EDT up reply actions  

Comments For This Post Are Closed


User Tools

Minor League Ball: Where the Future of Baseball is Discussed
Start posting on Minor League Ball »

Join SB Nation and dive into communities focused on all your favorite teams.

Connect_with_facebook

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Zackgreinke2_small
milb 9/5
Hagerstown_suns_small
Milb 9/4
Small
Sept 3 minor league thread
Dustinpedroia_small
Strasburg
Sdc10109_small
Minor Leagues August 27Th, 2010

Recent FanPosts

Small
Mariners Prospects age 25-30 projections
Small
The Colby Rasmus Situation
Small
Twins purchase contract of Ben Revere
2001-bowman-chrome-auto-340-pujols_small
HR Power+Contact
Small
2010 Rookie Evaluations: AL East
Small
2010 Rookie Evaluations: AL Central
Bullpen_banter_logo_small
BB Top 10 1B Prospects

+ New FanPost All FanPosts >

SBNation.com Recent Stories

Texas Rangers Michael Young, left, is held back by Texas Rangers third base coach Dave Anderson (obscured) after being called out at third by third base umpire Alfonso Marquez, front right, to end a baseball game against the Minnesota Twins, Sunday, Sept. 5, 2010, in Minneapolis.  Rangers manager Ron Washington (wearing sunglasses) looks on. The Twins won 6-5. (AP Photo/Paul Battaglia) +1 updates

Twins Top Rangers 6-5 Thanks To Controversial Ump Decision

LOS ANGELES CA - SEPTEMBER 04:  Jamey Carroll #14 of the Los Angeles Dodgers is pulled off the bag as Darren Ford #34 of the San Francisco Giants steals second base in the eighth inning at Dodger Stadium on September 4 2010 in Los Angeles California. The Giants defeated the Dodgers 5-4.  (Photo by Jeff Gross/Getty Images) +1 updates

Dodgers' Slide Continues With 3-0 Loss To Rival Giants

BOSTON - SEPTEMBER 05: Jonathan Papelbon #58 of the Boston Red Sox heads for the dugout after he is pulled in the ninth inning against the Chicago White Sox on September 5 2010 at Fenway Park in Boston Massachusetts.  (Photo by Elsa/Getty Images)

White Sox Score Four In 9th, Hand Jonathan Papelbon His Seventh Blown Save Of 2010

More from SBNation.com >


Managers

Bert_small John Sickels

Jeri_avatar_small mssickels

Authors

Small SethSpeaks

Favicon1_small ravensfan3

Moderators

Small mrkupe


Site Meter