What Are Sabermetrics?

 ....................

No disrespect, I just don't think we can completely figure out baseball based off of sabermetrics alone.

Jack Zduriencik does not primarily use Baseball Prospectus or Fangraphs to decide whether Shawn Kelley is coming north.  Statistics are backwards-looking!  What Zduriencik needs is a reliever who will get outs in the 2012 season.

Most people would understand "sabermetrics," as Andrew uses it, to mean "the statistics you can find on Baseball Prospectus and Fangraphs."  So, the comment would bring derision in some quarters.  But let's remember that Jack Zduriencik and Eric Wedge are in profound agreement with this.  Neither do they think that you can make roster decisions based on 2009-11 statistics alone.  

.........

It is not because we fail to understand WAR that we, at SSI, refuse to limit ourselves to its implications.  Tony Blengino his ownself comes, as the years go by, to see himself more and more as a scout/saber dual class.  Most of the stats-based, "pure sabermetrician" types who get hired, wind up blossoming into stats/scouting blended analysts.  Every GM who ever lived --- > used methods that transcend WAR.

My own approach is based on organizing our thinking about any given baseball question -- directing our attention to the right questions.  For example:  what are the causes of Felix' lack of velocity, and if he does lose velocity, what does that mean?  What patterns are there here, and what precedents apply to the problem?  

It's a chess paradigm.  Whatever position you've got, it's been played before.  Felix Hernandez' velocity loss has been seen before, in Pedro Martinez and in other great pitchers.  Find the games that have looked like this before, and review them from the standpoint of the masters who won those games.

We try to begin, sincerely, with questions, as opposed to beginning with what are in essence position statements.  If we find a good clear question, it is not so hard to find historical patterns that illuminate the questions.  

............

By the way, WAR is not synonymous with Sabermetrics.  Neither are statistics.  Remember, you can join Bill James Online for $3 a month, and exploit his pattern recognition as if he were an Ask.com webstie.  In 2008, Bill James was asked to define sabermetrics for a group of risk analysts.  We break pattern to include a good portion of his remarks, so that the essence of his definition won't be lost.

...............

BILL JAMES, 2008

Good afternoon, and thank you for inviting me to speak to you for a few minutes here.    I have been asked to say a few words about sabermetrics, about what I do for the Red Sox, and about how this might relate to risk management, if it does.

            Sabermetrics is descended from traditional sportswriting.   Sportswriting consists of two types of things—reporting, and analysis.  Sabermetrics came from that part of sportswriting which consists of analysis, argument, evaluation, opinion and baloney.   I can tell you very precisely when and how we parted ways with traditional analysis.

            Sportswriters discuss a range of questions which are much the same from generation to generation.   Who is the Most Valuable Player?   Who should go into the Hall of Fame?   Who will win the pennant?   What factors are important in winning the pennant?   If Boston won the pennant, why did they win it?   If Kansas City finished last, why did they finish last?   How has baseball changed over the last few years?   Who is the best third baseman in baseball today?   Who is better, Mike Lowell or Eric Chavez?

            The questions that we deal with in our work are the same as the questions that are discussed by sports columnists and by radio talk show hosts every day.   To the best of my knowledge, there is no difference whatsoever in the underlying issues that we discuss.   The difference between us is very simple.  Sportswriters always or almost always begin their analysis with a position on the issue.   We always begin our analysis with the question itself. 

            If you find a sportswriter debating who should be the National League’s Most Valuable Player this season, his article will probably begin by asserting a position on the issue, and then will argue for that position.   If you find 100 articles by sportswriters debating issues of this type, in all likelihood all 100 articles will do this.  

            What we do is simply to begin by asking “Who is the National League’s Most Valuable Player this season?” rather than to begin by stating that “Albert Pujols is the National League’s Most Valuable Player this season, and let me tell you why.”  That’s all.   That is the entire difference between sabermetrics and traditional sportswriting.  It isn’t the use of statistics.   It isn’t the use of formulas.   It is merely the habit of beginning with a question, rather than beginning with an answer.

            From this very small difference, profound changes arise.   A person who begins by asserting a position on an issue naturally focuses on what he KNOWS.   “What facts do I know, he asks himself, which will help me to explain why Mike Lowell is better than Eric Chavez?”

            The person who begins with the question itself. ...who is better, Mike Lowell or Eric Chavez. . ..the person who begins with the question itself naturally focuses not on what he DOES know, but on what he does NOT know.   I misstated that just a little bit.   The person who begins with the question itself naturally focuses first on what he NEEDS to know.    As soon as he begins to think through what he needs to know, however, it will become apparent that he does not know many of the things that he needs to know to really answer the question.

            Every large general question in baseball can be broken down into smaller and more specific questions, which can be broken down into yet smaller and yet more specific questions, which can be broken down into yet smaller and yet more specific questions.   Eventually you reach the level at which the questions have small and definite answers, even though you may not know what those answers are.  The question of who is better, Mike Lowell or Eric Chavez, for example, leads to the question “What are the elements of a third baseman’s job?”   Well, there is hitting, fielding, baserunning, and off-the-field contributions if you want to count those.  

            The question of who is a better fielder breaks down into 20 other questions.   Who is quicker?  Who is more reliable?  Who makes more mistakes?  Who has a better arm?  Who is better at anticipating the play, and positioning himself correctly?   Who is better at going to his left, or to his right?  Who is better at applying the tag if he needs to make a tag on a runner?

            At that level, the questions that we ask ourselves are actually very much like the questions that scouts ask.   Scouts try to answer them by their expert judgment.   We are not experts, and so we try to find ways to answer them that do not rely on our judgment.  In that way, sabermetrics forms a kind of transition between journalism, from which it descended, and scouting.   In any case, the question of who is better at applying the tag if he needs to make a tag play leads to the question “How many times a year does a third baseman have to make a tag play?”    That is a question that has an actual answer.   I don’t know what the answer is, but there is an answer to that question.   In a few years, somebody will figure out a way to get that answer. 

            The question of who is better, Mike Lowell or Eric Chavez, contains within it probably several hundred smaller questions.   The person who begins by asking that question naturally realizes that he does not know the answers to most of those questions.   He then proceeds inevitably to ask, “How can I find the best answer to that question?” 

            When a person begins with the question itself he inevitably winds up confronting his own ignorance, and trying to find ways to fill in the gaps in his knowledge.   The person who begins with a position on the issue never sees his own ignorance, and, in fact, deliberately avoids seeing his own ignorance.  The person who begins with a position on the issue and argues for that position naturally tries to hide his ignorance of the other internal issues, since the things that he doesn’t know are a weakness in his argument.   The person who begins with the question itself, on the other hand, inevitably winds up reveling in his own ignorance, celebrating his ignorance, and sharing it freely with the world at large.  

            But the person who begins with a position on the issue, by this process, becomes a borrower from the Bank of Knowledge.   He borrows from the things that he knows, and uses them to construct an argument.

            The person who begins with the issue itself, on the other hand, eventually becomes a contributor to the Bank of Knowledge.   Forced to confront his own ignorance, he is forced to find ways to figure out the information that he is missing—ways to count things that haven’t been counted, or ways to estimate the parameters of things that are unknown.    Through this process, he winds up knowing things that were not known before.  

            This is essentially what we do:  We try to construct knowledge to fill in some of the spaces in our massive ignorance.   We are not people who know things.   We are people who are honest enough to admit that we don’t understand things, and frankly, we don’t believe that you understand them, either.  

            Because we have been involved in the effort to figure out the things that we don’t understand for a long time now, we have developed an inventory of a few hundred standard methods that can be used to analyze a new question.  

        .....

When our GM or one of his associates asks me a question, I almost never give him an immediate answer—nor does he expect me to.   If he wanted an immediate answer, he would ask an expert, or he would ask a scout.   What I always say, when asked anything, is “I’ll study the issue.”

            I shouldn’t say “never”.   Occasionally I’ll slip up and give an immediate answer.   A year ago, for example, Theo asked me “How confident are you that Dustin Pedroia will hit what your projections show that he will hit?”   Without thinking about it, I said “Absolutely 100%.”  I knew three things that caused that to slip out.   First, I knew that our hitting projections for minor leaguers have a very high degree of reliability—much higher than most people assume that they do.   Second, I knew that our projections for Dustin Pedroia were actually very conservative, that he was actually a better hitter than we were saying that he was.  Third, I knew that, even though Pedroia had hit .191 as a late-season callup in 2006, he had made contact with more than 90% of his swings.  If you swing as hard as he does and you make contact with every swing, how can you possibly fail to hit?

            That’s risk management, of course. . . .knowing how reliable the projections are.   But that’s also very much the exception to the rule, actually—the exception to two rules.   The first rule is:  We are not experts.   We are not people who “know” things about baseball players.   We are people who know how to study things about baseball players.   When Theo asks me a question about a player or about the game in general, he is not asking me for what I know.   He is asking me to study the issue. 

            And the other rule is, we are never certain.  I said that I was 100% certain that Pedroia would hit in the majors, but in reality, we are never 100% certain of anything.   The talk show hosts are 100% certain.  The sports columnists are 100% certain.   We are just doing the best we can.   Our methods are always flawed, and our answers are usually tenative and muddled.  

            But the difference between knowledge and [baloney] is that knowledge moves forward, whereas [baloney] moves in circles.   When I develop a new method to analyze a baseball question, someone will point out the flaws in that method, and then someone else will suggest an entirely different method to approach the problem, and then somebody else will point out the flaws in that entirely different method, and then somebody else will figure out a way to combine the best features of both methods into one method that is better than either one.  We wind up with methods that get better over time. 

           ....

            We are no more statisticians than we are historians, or scouts, or accountants, or computer programmers.   I suspect that everything we do is much the same as what many of you do.  We look to the past, and we try to organize the things we have seen so that they make some sense.   We ask ourselves “how many of those were there?” and “how many of those others were there?” and “How many of them ended well?” and “How many of them ended badly?”, just as I would imagine most of you do.  

            Often people wonder if we will run out of things to study, and what I always say is that there will never be a shortage of ignorance.   I have realized recently that some people take this wrong, and that when I say that there will never be a shortage of ignorance people think that I am referring to some other group of people, some “ignorant” group of people, when in reality what I am talking about is myself.   I will never understand baseball; I will never understand 1% of what I need to understand.   My view of our work is that we are attacking a mountain range of ignorance with a spoon and a used toothbrush.   The things that we do not know are inexhaustible.  

- Bill James, Article 671 BJOL

Comments

1
Andrew Long's picture

Interesting stuff, Jemanji! I've more or less thought of sabermetrics as the statistics that come out of formulas that should help us determine the most valuable players. James puts it in a new light saying the mere difference in this and sports writing is that sabermetrics begin with a question. Putting sabermetrics in a box labeled 'statistics' may not be fair on my part.
I still have a problem with how some people interpret the results of the formula. A few weeks ago, Beavan and Iwakuma were compared on the 'big blog' (as you folks like to say) and they wanted us to believe they were the same pitcher this spring. I had a big problem with them dismissing the amount of hard-hit balls that landed for base hits there. It's that kind of example that has me questioning the people who base opinions solely off of what Fangraphs tells us.
Well said in this paragraph, "We try to begin, sincerely, with questions, as opposed to beginning with what are in essence position statements. If we find a good clear question, it is not so hard to find historical patterns that illuminate the questions." That is what gives rich content to a site.
Curious to hear what you think of the ol' 'team chemistry' debate? How much can it really help or hurt? A week or so ago I wrote (admittedly, a muddled post) that questioned some of the 'fangraphs is the only way' believers, thinking along the lines of chemistry and special players. http://goodguyssports.com/2012/03/27/tim-tebow-and-baseball-sabermetrics/
Good stuff!

2
ghost's picture

They don't throw the same pitches, they don't throw at the same velocity, they aren't the same height/build/temperament at all, one has demonstrated excellent fastball command and the other is questionable...but other than that...they're completely identical. *eyeroll*

3
ghost's picture

The Times has kindly given us a concise listing of the minor league rosters down to Clinton...so it's time to do a bit of depth chart prospecting. Doc, G et al, this is my best crack at it...would welcome your opinions...(first letter grade is current MLB value, second is potential for development, will leave off the letter grade for guys with no experience)
CATCHERS:
1) Miguel Olivo (C / C)
2) Jesus Montero (B / A+)
3) John Jaso (C- / C+)
4) Adam Moore (D+ / C+)
5) Guillermo Quiroz (F / F)
6) Jesus Sucre ( - / C-)
7) Brandon Bantz ( - / D)
8) John Hicks ( - / D+)
9) Steve Baron ( - / D)
10) Ralph Henriquez ( - / D-)
11) Jack Marder ( - / F)
12) Michael Dowd ( - / F)
POSITION GRADE: B (this will heavily depend on how much PT Montero sees behind the plate in future seasons - if he sticks there, the grade goes up)
FIRST BASEMEN:
1) Justin Smoak (B- / A-)
2) Alex Liddi (C- / B)
3) Mike Carp (B / B)
4) Rich Poythress ( - / C)
5) Johan Limonta ( - / D)
6) Dennis Raben ( - C+)
7) Steven Proscia ( - / B)
8) Joe Dunigan ( - / C)
9) Mickey Wiswall ( - / D+)
10) Jharmidy De Jesus ( - / D)
POSITION GRADE: B (Smoak has a lot to prove this year...behind him there are two or three solid options but after that...a lot of ugly mush...this position needs more depth)
SECOND BASEMEN:
1) Dustin Ackley (A- / A+)
2) Kyle Seager (C+ / B)
3) Munenori Kawasaki (C / C)
4) Luis Rodriguez (D / D)
5) Carlos Triunfel ( - / C)
6) Stefen Romero ( - / C+)
7) Brad Miller ( - / B-)
8) Gabriel Noriega ( - / C)
9) Daniel Paolini ( - / B-)
POSITION GRADE: B+ (if we're scoring just Ackley, the grade is higher, but the problem is that we don't have prospect depth behind him and filling his shoes with Seager just opens up a hole at another position)
THIRD BASEMEN:
1) Kyle Seager (C+ / B)
2) Alex Liddi (C- / B)
3) Chone Figgins (D / D)
4) Vinny Catricala ( - / A-)
5) Francisco Martinez ( - / B+)
6) Leury Bonilla ( - / F)
7) Ramon Morla ( - / B+)
8) Mario Martinez ( - / D+)
POSITION GRADE: B (the major leaguers aren't quite B-caliber yet...all of them save Figgins have that upside and the two guys right below them have even higher upside. Long term, this position is covered nicely)
SHORTSTOPS:
1) Brendan Ryan (C+ / C+)
2) Munenoria Kawasaki (C / C+)
3) Kyle Seager (D+ / B-)
4) Luis Rodriguez (D+ / D+)
5) Nick Franklin ( - / B+)
6) Carlos Triunfel ( - / C)
7) Brad Miller ( - / B-)
8) Anthony Phillips ( - / F)
POSITION GRADE: C+ (we love to talk about Nick Franklin and Brad Miller, but last season ought to give pause before assumptions are made about imminent stardom...until Franklin solidifies his prospect status with a top season in the high minors, this position looks a bit wobbly, especially given Ryan's health questions)
CENTER FIELDERS:
1) Franklin Gutierrez (B- / B-)
2) Michael Saunders (B- / B+)
3) Chone Figgins (F / F)
4) Trayvon Robinson (D- / B)
5) Darren Ford (D / D+)
6) Daniel Carroll ( - / B-)
7) Denny Almonte ( - / D)
8) James Jones ( - / B)
9) Jabari Blash ( - / C)
POSITION GRADE: B- (I love the work Saunders has done to improve his game but I need to see it over the course of a season and see how he responds to slumps - beyond him, the ranks are a bit mushy if you don't like Robinson's K rate)
CORNER OUTFIELDERS:
1) Ichiro! (B- / B)
2) Mike Carp (B- / B)
3) Michael Saunders (C+ / B-)
4) Casper Wells (C- / C)
5) Chone Figgins (F / F)
6) Carlos Peguero (D / B-)
7) Johermyn Chavez ( - / C)
8) Chih-Hsien Chiang ( - / B)
9) Mike Wilson ( D / D)
10) Julio Morban ( - / B)
11) Michael McGee ( - / C-)
12) Guillermo Pimentel ( - / B+)
13) Kevin Rivers ( - / F)
14) Jamal Austin ( - / D)
POSITION GRADE: C- (once Ichiro departs, the club has little to fill his shoes in RF...Saunders and Robinson are not ideal there, and neither is Casper Wells...they should think seriously about trading volume for star power and acquiring a right fielder with pop)
START PITCHING:
1) King Felix Hernandez (A+ / A+)
2) Jason Vargas (B- / B)
3) Hector Noesi (C- / C+)
4) Blake Beavan (D+ / B-)
5) Kevin Millwood (D+ / D+)
6) Erasmo Ramirez ( - / B)
7) Hisashi Iwakuma (D / B-)
8) Charlie Furbush (D+ / C+)
9) Forrest Snow ( - / C+)
10) Taijuan Walker ( - / A)
11) Danny Hultzen ( - / A-)
12) James Paxton ( - / A-)
13) Andrew Carraway ( - / B)
POSITION GRADE: A- (I didn't want to go further down the depth chart for the starters because I think most of the rest are going to get traded or converted to relievers...there's no way anyone else is getting through that brick wall of awesome unless a catastrophe injury plague descends on AA Jackson - we are well WELL covered here with even the secondary prospects all showing signs of improvement)
RELIEVERS:
1) Brandon League (B+ / B+)
2) Tom Wilhelmson (B- / A-)
3) George Sherrill (F / D)
4) Steve Delabar (C / B)
5) Erasmo Ramirez ( - / B)
6) Hisashi Iwakuma (D / B)
7) Lucas Luetge (C / B-)
8) Charlie Furbush (C / B-)
9) Shawn Kelley (C+ / B-)
10) Chance Ruffin (D+ / B)
11) Forrest Snow ( - / B+)
12) Carter Capps ( - / A)
13) Stephen Pryon ( - / A-)
14) Mauricio Robles ( - / B)
15) Matt Fox ( - / C)
16) Cesar Jimenez ( C- / C)
17) Steven Hensley ( - / B-)
18) Brandon Maurer ( - / B)
19) Carson Smith ( - / A-)
POSITION GRADE: B- (The big league pen is frighteningly bad...but immediately below that group is a large cluster of fast-moving power arms that has the potential to make us all feel much safer with a lead late in games)
Overall, the depth chart is much MUCH more promising than it has been in a decade...thoughts from the peanut gallery?

4

On the run today, but thath high-quality H20 amig-O... should provide an interesting mosh deck for us in the peanut gallery...

5

The infield looks great for every position. The outfield only looks so-so.
Per MLB Trade Rumors, The best free agent outfielders for 2013 will be Hamilton, Ethier, Quentin, and Swisher. Andre Ethier looks good because he never gets hurt, he has a good average and on base percentage and he already plays in a slight pitcher's park. The complaint with this guy is that he doesn't have big time power.
Swisher and Quentin look good. Quentin is a beast who is always good for 25 bombs at US Cellular field. That field has a 400 foot center field and is on the sea, and is in Chicaco, which has cold air, so that has to count for something. The Mariners haven't had anything close to 25 bombs since Russ Branyan hit 31 in 2009. 
Plus, Quentin could be Jesus Montero's twin brother, so he also looks like Babe Ruth's long lost latin grandson.  Therefore, he automatically has good baseball face, and would be a good fit on the team.

Plus, he's from LA, so he'd play better when he faced the hated Angels, as he'd want to play well in front of the home crowd.
Nick Swisher doesn't make your heart beat faster. His OBP is good but the rest of his game is mediocre, he plays in a serious hitter's park, with the best lineup protection in baseball.
Hamilton is old, gets injured alot, and has drama and other issues, but it would be sweet if the Rangers could slight him somehow, and he signed with the Mariners with the intention of torturing the Rangers at every opportunity. We could use a guy like that. A guy that makes them feel like we felt when Vlad came to the plate and we were nursing a slim lead.
It will be interesting to see which of the internal candidates, Saunders, Trayyyvon, "video game" Chiang, or Catricala, step up to make this outfield discussion obsolete by this time next year.

7

Great job, ghost, on the chart. Have a couple of questions/comments.
Carlton Tanabe at Clinton has been mentioned by other sites as a possible break-out prospect at C. Notice he isn't on your list. Any thoughts about him, or about Tyler Marlette or Marcus Littlewood, whom John Sickels also noted as possible good C prospects? Realize this was mostly off the published rosters, but those three may be better than the other Ms MiLB C coming up.
Also note that Noriega isn't listed at SS. I've read that he is the best glove in the system - he just can't hit. Would his rating be the same as at 2B? Seems to me the glove is more important at SS (which might only get him a C+, but, hey, give the kid some props for what he does do!).
Finally, Erasmo Ramirez impresses me as having more upside starting - B+?

8
ghost's picture

First on E-Ram - I think what you give a player in terms of grades depends on what you think of as what each grade represents. I see E-Ram as having perennial 3.80 ERA, 6 K/9, 2 BB/9 potential...is that a B+? I think solid #3 starters are B grades...but what does everyone else think?
I left Littlewood off my list...he should probably be ahead of the everyone below Brad Miller at short and second....my bad...he wasn't on the roster list and I had a brain cramp.
I don't know enough about Tanabe to have had a confident rating of him...and, at least for now, he has to be well down the depth chart because he has almost no playing time yet...but he has an outstanding baseball personality and Z is SKY HIGH on him, so that has to count for something...would think he could quickly move ahead of all catchers on my list minus the four big leaguers and Moore if he has the breakout year people are expecting.

Add comment

Filtered HTML

  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <strong> <cite> <blockquote> <code> <ul> <ol> <li> <dl> <dt> <dd><p><br>
  • Lines and paragraphs break automatically.

Plain text

  • No HTML tags allowed.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Lines and paragraphs break automatically.

shout_filter

  • Allowed HTML tags: <a> <em> <strong> <cite> <blockquote> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
  • Web page addresses and e-mail addresses turn into links automatically.