Need help with RAPM article for European blog

wolf · Post by **wolf** » Mon Aug 19, 2013 4:05 pm

Hey APBRcommunity,

I am an avid basketball fan from Germany who is not going to become a contributor on this site due to the fact that I have neglected math at school, but I’m often visiting your forum because of the intriguing models, ideas and discussions – even if I cannot fully grasp most of the advanced math stuff.

I am writing this because I need your help. The basketball discussion in Germany is generally not very educated because it does not rely much on statistics as it does on eye test and points per game. Just to give you an idea of how bad it is – recently, the official German contractor for NBA content took a poll on the 'most underpaid players by position' with Monta Ellis leading the way at the Shooting Guard position....

Well, I am a contributing writer on a German basketball blog that has a nice followership, and tries to go against the stream of the NBA ‘narrative’ by promoting educated discussion and the usage of advanced statistics like USG%, TS% an so on.

I have become interested in Advanced Plus/Minus Statistics, especially RAPM because of its defensive potential. I have been thinking about realizing an introductory article about (R)APM for our website that comments on the models strengths, field of operation, limitations, future potential, etc. I have googled a number of posts regarding the ridge regression (here, sportskeptic, godismyjudgeok.com, realgmforums, ..) that is used to erase the problem of co-linearity and think that I have a decent idea of how it works (for somebody who is just plain dumb in the mathematical field) but I still have a few questions. I would be very grateful if somebody could answer those and I will put references to this forum, and several stat-databases like the one of J.E., Daniel Myers or Evan Zamir (if that’s okay for them – if you have more databases that you think should be featuredd just drop a link).

This is how I understand the ridge regression:
Usual APM includes the problem of co-linearity. Some players work in pairs or trios all the time and it’s hard to say which player is 'more' responsible for those lineups results. The problem with APM is that it rates all players equal. If Roy Hibbert, Paul George and D.J. Augustin play in a lineup, APM gives them a neutral ‘zero’ before they start logging any minutes. The trick with RAPM is that it gives each player a certain range of expected performance. Thus, Hibbert and George would ‘go’ into those lineups with a bigger defensive potential right away. This range (or prior?) depends on past offensive and defensive performance.

Is that more or less correct?
I know that the use of advanced statistics, especially one-number-stats like RAPM is a controversial issue with some conservative NBA folks. I want to be precise in my article but also be as clear and comprehensible as possible – the average German basketball fan is less ‘prepared’ for such an approach as the average US-American fan, and I believe that making it too complicated would just alienate our readership for this topic. I know that this might be a little offensive towards the scientific method/significance behind the approach, but as I said - this is supposed to be an introductory article that hopefully will open a few people eyes towards more detalined and informed basketball evaluation.

Oh man, I'm sorry for writing so much.

Here are some questions that I have:

#1 – Why exactly is the ridge regression a useful method for basketball? Is it because the regression has proven to be successful with similar non-basketball problems where you have to resolve which element is mainly responsible for a certain effect?

#2 – While usual Plus Minus says “With player A on the court, team 1 is X points better/worse than with another player of team 1 on the field”, (R)APM says “With player A in his role on the court, team 1 is X points better/worse than with an average NBA player (0) on the field”. Is this assumption correct? Is that why it is a regression instead of just a point differential tracker? This would imply that it is also not a strict ranking metric like WP claims to be, but more a efficiency ranking based on role (at least on offense)?

#3 – As I understood, RAPM seems to be very flexible in its scope (primer?). You can include all sorts of data – box score numbers, advanced stats, shot location data, and so on. Is it possible to build RAPM-models for all kinds of different, specific purposes? For example one model for 3-and-D players, another model for bigs who are exceptional at passing, et cetera. Do you know whether the coach evaluation system which the Mavs build to find Rick Carlisle was APM-based?

#4 – What do you think can the future bring for a statistic like RAPM? I remember Kirk Goldsberry writing in his paper about post defense that his approach and findings might be a first, small step towards a bigger development in rating big men defense. Do you see something similar with RAPM now that so much of the chaotic, fast paced NBA action can be tracked more detailed by playbyplay-data and camera tracking? Any idea how that could look like?

#5 - Do you know of anybody working for an NBA team using specifically RAPM?

Phew, I hope some of you find the time to help me out a little.

J.E. · Post by **J.E.** » Mon Aug 19, 2013 8:04 pm

What's up german APBR buddy

that is used to erase the problem of co-linearity

that problem can't really be erased as it's in the data. Ridge regression just deals with it better (usually) than standard ordinary least squares regression

Unfortunately it seems you're a little confused about APM/RAPM/ASPM

Usual APM includes the problem of co-linearity. Some players work in pairs or trios all the time and it’s hard to say which player is 'more' responsible for those lineups results. The problem with APM is that it rates all players equal. If Roy Hibbert, Paul George and D.J. Augustin play in a lineup, APM gives them a neutral ‘zero’ before they start logging any minutes. The trick with RAPM is that it gives each player a certain range of expected performance. Thus, Hibbert and George would ‘go’ into those lineups with a bigger defensive potential right away. This range (or prior?) depends on past offensive and defensive performance.

Hopefully this doesn't come off as rude, but this is pretty much all incorrect. APM doesn't have any starting points (nobody gets a "zero"). Original/vanilla (whatever you want to call it) RAPM has starting points, but the starting point is the same for everyone (0). Then there's "RAPM informed RAPM", where everyone's starting point is last seasons' RAPM rating; and there' "ASPM informed RAPM" where the starting point comes from a different (usually BoxScore based) metric.

Why exactly is the ridge regression a useful method for basketball?

There's multi-collinearity in the data and ridge regression is one way of somewhat dealing with this

As I understood, RAPM seems to be very flexible in its scope (primer?). You can include all sorts of data – box score numbers, advanced stats, shot location data, and so on.

Standard/original RAPM doesn't use any data except for matchupdata. You could feed ASPM with all sorts of data though

Here are two links that give further insight into APM vs RAPM
http://www.d3coder.com/thecity/advanced-stats-primer/ (near the bottom)
http://godismyjudgeok.com/DStats/2011/n ... ilization/

If you want to, I can proof-read some of your stuff before you put it online

Crow · Post by **Crow** » Thu Aug 22, 2013 9:26 pm

My quick responses:

#2 – While usual Plus Minus says “With player A on the court, team 1 is X points better/worse than with another player of team 1 on the field”, (R)APM says “With player A in his role on the court, team 1 is X points better/worse than with an average NBA player (0) on the field”. Is this assumption correct? Yes Is that why it is a regression instead of just a point differential tracker? Yes This would imply that it is also not a strict ranking metric like WP claims to be, but more a efficiency ranking based on role (at least on offense)? This is one way to look at. clearly it covers all impacts observed in the boxscore from actions / inactions.

#3 – As I understood, RAPM seems to be very flexible in its scope (primer?). You can include all sorts of data – box score numbers, advanced stats, shot location data, and so on. Is it possible to build RAPM-models for all kinds of different, specific purposes? there have been efforts to develop RAPM at the 4 factors level (Joe Sill, JE and Evan Zamir) and for other stats such as various shooting measures (Evan Zamir)

For example one model for 3-and-D players, another model for bigs who are exceptional at passing, et cetera. I haven't heard anybody doing a separate model by player type or role. But one could and should look at averages and highs and lows for types / roles. Do you know whether the coach evaluation system which the Mavs build to find Rick Carlisle was APM-based? Not sure but possibly at least part of it. One could try to talk to Wayne Winston at his site. http://waynewinston.com/wordpress/He was a longtime stat advisor to Cuban.

#4 – What do you think can the future bring for a statistic like RAPM? I think blends (in the method and after the fact) of different versions of RAPM or RAPM with Statistical Plus-Minus, Box Score stats, possibly height could help reduce average errors. Dan Rosenbaum started with a blend in his efforts and then most who followed went with "pure" APM.

#5 - Do you know of anybody working for an NBA team using specifically RAPM? Rosenbaum was / probably still is with Cavs as is another stat guy David Lewin. Houston has Eli Witus. Washington has / had Joe Sill. Aaron Barzilai is now with Philly after Memphis. I hear Wayne Winston may be consulting with the Knicks. Steve Ilardi was with the Suns. JE is /was with some team, I believe.

Speaking of Eli, I see he is now Vice President of Basketball Operations at Houston Rockets.
http://www.linkedin.com/pub/eli-witus/57/a85/b69

APBRmetrics

Need help with RAPM article for European blog

Need help with RAPM article for European blog

Re: Need help with RAPM article for European blog

Re: Need help with RAPM article for European blog