Here are 7 steps for using probability to formalize medical decisions. For more on this topic, see papers on (dynamic) treatment regimes, reinforcement learning, or decision analysis.
1. Gather information. Suppose we have some patient covariates S that are continuous, for simplicity.
2. Decide how you might be able to act. Suppose we have a binary treatment, A, that will be taken according to . Draw
from the conditional distribution
which we call a “treatment policy”.
3. Understand how baseline covariates and treatments affect the patient. Suppose we have some future, post-treatment covariates, , which depend on the initial covariates and the treatment. Draw
from
.
4. Think about what is truly desired. Define a reward, that you wish to maximize. For example,
might pick out one covariate that we want to be large (for simplicity, assume the function
is not random).
5. Relate the treatment policy, to the reward,
Write the expectation
.
6. Find the treatment policy that gives the most reward, in expectation. Estimate , where
7. Doing all of this in practice is difficult. However, the framework provides a way in which data can be used to make healthcare decisions, which will ultimately lead to better outcomes for patients.
Leave a comment