There was quite a popular tweet from a canine about New York’s improved play this season when Jamison Olave was playing.
There are obviously confounding factors at play here, not to mention small sample sizes. There were only seven matches this season in which Olave did not start, and eight in which he played 45 minutes or less. Any data obtained from these games is going to be subject to A) small sample sizes, B) lots of variance in the response variable (goals or wins), and C) no control for quality of opponent or location of the match.
To deal with the small sample size/variance problem, I’m going to use our now semi-famous data set on shot location origins. Steven Fenn kindly showed the world their predictive value, and to me that means that expected goals for and against are the most stable stat available for such an analysis. To control for New York’s opponents—when Olave was both in and out of the starting XI—I have included each of New York’s opponent’s expected goals data in the linear regression, while also accounting for whether or not the Red Bulls were at home. Blah, blah, blah, to the results!
Looking at the defensive side, New York allowed shots leading to 0.24 fewer expected goals against in games that Olave started. That seems to indicate New York’s need for Olave, but the p-value was a kind-of-high 26 percent. Overall, New York’s expected goal differential climbed 0.19 goals in those games that Olave started, though again, the p-value was quite high at 46 percent.*
Now for your shitty conclusion, courtesy of shitty p-values: Olave’s influence on New York’s level of play this season was questionable. There is some suggestion that he helped reduce goal-scoring against, however there is a reasonable chance that that difference was due to other, not-measured-here variables. What I am more comfortable claiming is that he does not make a 0.86-goal difference on the defensive side.
The point is this. New York’s shot creation and goal scoring ability, for and against, are more a function of whether or not the Red Bulls are home, and against whom they are playing. Not as much whether Olave starts. Obviously putting an inferior player into the starting XI isn’t going to help New York out. But, as I always question, do we really know how to value soccer players at all? Maybe Olave just doesn’t make that much of a difference. After all, he’s only one of eleven players.
*For those curious, the number of minutes Olave played was a worse predictor variable than the simple binary variable of whether or not he started. Controlling for the strength of opponent was necessary since perhaps Mr. Petke was more likely to sit Olave against a worse opponent at home, or something like that.