The value of continuity
Posted: Thu Dec 12, 2013 9:28 am
Hello guys. This is my first time posting here but it isn't my first time reading. I've been trying to acclimate myself with the level of discussion that happens here and to be entirely honest, I'm overwhelmed. I've forgotten a lot about the mathematical theory that is mostly used here and that's why I find myself constantly trying to look at my books when I read stuff here. This is why it took me so long to post here - I've been "shy" (to a degree) to comment on anything because compared to you guys, I can be considered undereducated. I've been trying to play catch up (been reading books when I can and studying some programming in between) but I realized that there's no better way to get acclimated with this than just to dive right into the pool, swimming lessons or not. So here goes,
I'm a big fan of plus/minus based metrics and I think they're a huge key to evaluating players in the NBA in terms of current production and equivalent contract, future potential and what not. Two things:
If we've constructed a plus/minus model for all relevant metrics as it relates to a player, then we can in essence use this to breakdown a player's APM as it relates to points per possession. Example:
Player X's Improved APM (whatever method it supposedly becomes, whether it's the statistical or the regularized one) on a points per 100 basis is bad but it supposedly happens that in reality his APM in the "effective field goal department" (i.e. adjusting for teammates and quality of opponents and homecourt, his team is X1 points better at making shots when Player X is on the court), but his APM in the turnover department is bad then at least you have a basis for basing where his "APM-PPP" comes from since it's been long understood that efficiency differential comes from the four factors (eight for offense and defense).
I haven't thought about whether it's doable, whether there are certain things that can hinder the study, intrinsic ideas that I have yet to consider and what not. Just an idea.
2. Another thing where I think PM can be used is in evaluating continuity. I've always held a belief that the more players play together, the better they play and the sum of the parts truly becomes greater than the whole. Just looking at certain recent "cores" from teams (numbers are taken from bball ref unless otherwise stated):
KD/Russ/ibaka
2010/11 — KD/Russ/Ibaka > +3 points per 100
2011/12 — KD/Russ/Ibaka > +6 points per 100
2012/13 — KD/Russ/Ibaka > +9 points per 100
George/Hibbert
2010/11 — George/Hibbert > +0.7 points per 100
2011/12 — George/Hibbert > +6.7 points per 100
2012/13 — George/Hibbert > +8.5 points per 100
2013/14 — George/Hibbert > +17 points per 100
Griffin/Jordan
2008/09 – -5.6 points per 100
2009/10 – +4.4 points per 100
2010/11 – +4.2 points per 100
2011/12 – +6.4 points per 100
2012/13 – +11 points per 100
Kidd/Terry/Dirk
2008-09 > +5.5
2009-10 > +7.1
2010-11 > 15.5
Bad Boys 2.0 (Billups/Hamilton/Big Ben):
2002-03 > +4.8
2003-04 > +7.2
2004-05 > +9.4
Horford/Smith/Johnson:
2007-08 > 0.5
2008-09 > +1.9
2009-10 >8.0
Kobe/Lamar:
04-05 > -1.2
05-06 > 5.2
06-07 >2.4
07-08 > +9.5
Even considering that each iteration had a significant addition (except for OKC) in the last year of the date I used (West and Hill for Indiana, Paul for Clippers, TC for Dallas, Sheed for Detroit, Crawford for Atlanta, Pau for the Lakers), there’s seems to be an upward trend.
My idea is instead of using each player as a single element in the regression (example: EffDiff = A1+A2+A3+A4+A5 - B1 - B2 - B3 - B4 - B5 + HC+e), we clump them together. (EffDiff = A12 + A3 + A4 + A5 - B1 - B2 - B3 - B4 - B5 + HC + e). The sample size is definitely smaller and instead of tens of thousands of combinations, we may only have a couple of thousands.
Will this work especially for players who play a lot together over multiple season?
Please do tell if both these ideas are bad. I just thought with the emergence of NBAWowy, give it a couple of years and you could do a study on both. Just an idea.
I'm a big fan of plus/minus based metrics and I think they're a huge key to evaluating players in the NBA in terms of current production and equivalent contract, future potential and what not. Two things:
If we've constructed a plus/minus model for all relevant metrics as it relates to a player, then we can in essence use this to breakdown a player's APM as it relates to points per possession. Example:
Player X's Improved APM (whatever method it supposedly becomes, whether it's the statistical or the regularized one) on a points per 100 basis is bad but it supposedly happens that in reality his APM in the "effective field goal department" (i.e. adjusting for teammates and quality of opponents and homecourt, his team is X1 points better at making shots when Player X is on the court), but his APM in the turnover department is bad then at least you have a basis for basing where his "APM-PPP" comes from since it's been long understood that efficiency differential comes from the four factors (eight for offense and defense).
I haven't thought about whether it's doable, whether there are certain things that can hinder the study, intrinsic ideas that I have yet to consider and what not. Just an idea.
2. Another thing where I think PM can be used is in evaluating continuity. I've always held a belief that the more players play together, the better they play and the sum of the parts truly becomes greater than the whole. Just looking at certain recent "cores" from teams (numbers are taken from bball ref unless otherwise stated):
KD/Russ/ibaka
2010/11 — KD/Russ/Ibaka > +3 points per 100
2011/12 — KD/Russ/Ibaka > +6 points per 100
2012/13 — KD/Russ/Ibaka > +9 points per 100
George/Hibbert
2010/11 — George/Hibbert > +0.7 points per 100
2011/12 — George/Hibbert > +6.7 points per 100
2012/13 — George/Hibbert > +8.5 points per 100
2013/14 — George/Hibbert > +17 points per 100
Griffin/Jordan
2008/09 – -5.6 points per 100
2009/10 – +4.4 points per 100
2010/11 – +4.2 points per 100
2011/12 – +6.4 points per 100
2012/13 – +11 points per 100
Kidd/Terry/Dirk
2008-09 > +5.5
2009-10 > +7.1
2010-11 > 15.5
Bad Boys 2.0 (Billups/Hamilton/Big Ben):
2002-03 > +4.8
2003-04 > +7.2
2004-05 > +9.4
Horford/Smith/Johnson:
2007-08 > 0.5
2008-09 > +1.9
2009-10 >8.0
Kobe/Lamar:
04-05 > -1.2
05-06 > 5.2
06-07 >2.4
07-08 > +9.5
Even considering that each iteration had a significant addition (except for OKC) in the last year of the date I used (West and Hill for Indiana, Paul for Clippers, TC for Dallas, Sheed for Detroit, Crawford for Atlanta, Pau for the Lakers), there’s seems to be an upward trend.
My idea is instead of using each player as a single element in the regression (example: EffDiff = A1+A2+A3+A4+A5 - B1 - B2 - B3 - B4 - B5 + HC+e), we clump them together. (EffDiff = A12 + A3 + A4 + A5 - B1 - B2 - B3 - B4 - B5 + HC + e). The sample size is definitely smaller and instead of tens of thousands of combinations, we may only have a couple of thousands.
Will this work especially for players who play a lot together over multiple season?
Please do tell if both these ideas are bad. I just thought with the emergence of NBAWowy, give it a couple of years and you could do a study on both. Just an idea.