Thanks to Geoff Baker for linking us to Bill James' system for predicting "UP" ballplayer seasons.
I'll use this system heavily in roto. I expect it to be my biggest "secret weapon" since the days when Ron Shandler used to be secret.
.
Q. Why a new UP/DWN system, considering that PECOTA already issues the 75%, 50%, 25%, etc. probabilities?
A. An Excel spreadsheet will calculate the math perfectly, of course ... AFTER a human programmer has told it what he thinks defines a "comparable player."
Underlying PECOTA's calculations are the arbitrary opinions of a Baseball Prospectus' employee's opinion of what makes two ballplayers comparable. And, especially, that employee's opinion of how those 12-14 factors should be weighted against each other.
Who is Carl Crawford's best comp: Johnny Damon or Chone Figgins? Are BB more important than SB? If BB and SB each comprise 1/12 of the formula, why are both weighted the same?
PECOTA is cool, but the assumptions underlying it are very, very simple and very, very arbitrary. Take it from a chess-AI geek. It's a good system, and a very useful one. But if you're thinking PECOTA's simple little forecasts are From On High, beyond question, think again. :- )
..............
Bill James' system has -- as far as I can tell -- vastly more intelligent human intuition applied, in terms of the underlying criteria used to define "comparable player." (I know, I know, the player is mostly compared to himself; that's a whole other subject. Bear with me.)
...............
PECOTA, at least a few years ago, picked a dozen criteria out of a hat and threw them all into the mix (equally, at least at that time) to find "most-comparable players" who were, at best, roughly comparable.
Bill James is a 60-year-old historian who has spent his life comparing baseball players to each other, and he applied his finely-honed judgment to come up with criteria that he felt mattered most.
If James' underlying comp-criteria are better, then his system will be more accurate.
.
Q. Is there any evidence for believing that James' system will accurately forecast UP and DOWN seasons?
A. The money chart is in this article, page 83. Take a look.
If you want a system to show more accuracy than that, you'll have to call the 31st century. Bill would probably be the best guy to call about whether that is do-able, too.
.
Q. What does it mean to have an UP season?
A. It means that you had the same* OPS as the year before, and at least 80% as many at-bats.
So bear in mind, if Russell Branyan is forecasted for a DOWN season, that would become the case IF he had (1) <400 plate appearances, OR (2) -20 to his OPS or more.
For example, if Branyan hit .260/.345/.495 in 145 games with 42 homers and 105 RBI, that would be a DOWN season per James' system.
.
Q. Does that make any sense?
A. Yes, because if you take 100 players having DOWN seasons, only a few are going to have the type of "in practical terms, UP" season above.
.
Q. Is it "fair" to take 7 or 8 criteria and kind of mold them like Silly Putty until your prophecies come out accurate?
A. This is "fuzzy logic." It's state-of-the-art for dealing with math problems that can't be captured otherwise.
For example, I want to know whether the phrase "Stunning" sells more eBay items -- for jewelry? For dresses? For black dresses? I can't isolate the variables.
So what I do, I take one phrase -- "Stunning" or "Throws Left Handed" or "Born In Peurto Rico" and I simply track that to see if gains or losses are reflected.
Then I cascade other search phrases -- "No Reserve" or "Versace" or "95 MPH" or whatever -- and form a 2x2 grid, then a 3x3 -- going back and changing "Spectacular" for "Stunning" and so on. (This actually is how Cindy and I built a PowerSeller business.)
Trial-and-error "points systems" -- in which you assume values in columns A, B, C and proceed on those assumptions -- are often the starting point for capturing variables that can't be captured algebraically. You start molding the data like Play-Doh until you're, Eureka!, coming up with some predictive validity.
............
Bill wasn't throwing spaghetti against the wall. He took a bunch of criteria that he knew were important, and he experimented until his hypothesis was matching up beautifully to baseball history.
He'll probably improve and revise his system in the future. Right now I'm guessing that his system is a big step forward. I'll certainly be using it a lot in Roto.
.
Q. How might the system be improved?
A. First question I would ask Bill: how do you think it would affect your system, if you added the John Benson criterion "had a great/terrible second half"?
James' system doesn't do that. And if we just plugged in 1H/2H, the predictive validity might collapse. So we can't just bolt it on.
But tell you what: you got a roto draft pick who gets UP from James, and had a great 2H, you got a sleeper pick, bro'.
I get the hitters who passed both, you get the hitters who failed both, you'll lose. And you won't even know why. ;- )
Cheers,
Dr D

