To frame the discussion, there are a few different types of risk metrics we can use for the project. I think we have been considering either an odds ratio or relative risk ratio or one of the many permutations.
The problem is the above ratios are mostly designed around experimental data, for instance, that uses treatment and control groups or cohort studies of individuals. However, our use case is for point predictions--the risk of an individual compared to the group. The typical intent of odds ratios are looking whether or not the relative risk is higher for one group compared to the other. If relatively higher, then it is bad.
This may be hard to translate this to an individual prediction. The incidence of lead poisoning is low. So, what is an "unacceptable" odds ratio? Anything more than the average? But why accept that -- aren't we trying to eliminate any incidence? We need to explicitly define our objective here.
Instead of relative risks, we could use an absolute risk such as the raw probability. Again, this may not be useful for medical staff where they will likely and always see low numeric numbers (e.g., 5%) that represent higher-than-average risk.
Of course, we could interpret raw scores into quantiles (e.g., top 25%). But again, will need a business rule to determine thresholds on when follow-ups should be conducted.
We could potentially reference known resource limitations. It is expensive and burdensome on families to undergo lead testing. Do we suspect there is a maximum number of additional inspections that could be completed this year? This could correspond to quantiles where we recommend action.
To frame the discussion, there are a few different types of risk metrics we can use for the project. I think we have been considering either an odds ratio or relative risk ratio or one of the many permutations.
The problem is the above ratios are mostly designed around experimental data, for instance, that uses treatment and control groups or cohort studies of individuals. However, our use case is for point predictions--the risk of an individual compared to the group. The typical intent of odds ratios are looking whether or not the relative risk is higher for one group compared to the other. If relatively higher, then it is bad.
This may be hard to translate this to an individual prediction. The incidence of lead poisoning is low. So, what is an "unacceptable" odds ratio? Anything more than the average? But why accept that -- aren't we trying to eliminate any incidence? We need to explicitly define our objective here.
Instead of relative risks, we could use an absolute risk such as the raw probability. Again, this may not be useful for medical staff where they will likely and always see low numeric numbers (e.g., 5%) that represent higher-than-average risk.
Of course, we could interpret raw scores into quantiles (e.g., top 25%). But again, will need a business rule to determine thresholds on when follow-ups should be conducted.
We could potentially reference known resource limitations. It is expensive and burdensome on families to undergo lead testing. Do we suspect there is a maximum number of additional inspections that could be completed this year? This could correspond to quantiles where we recommend action.