Groupe d’études et de recherche en analyse des décisions

# Choice of estimators based on different observations: Modified AIC and LCV criteria

## Benoît Liquet

In epidemiology, we often wish to assess the risk incurred by estimators on a particular set of information, while the estimators themselves may use a larger set of information. Two examples are studied. The first occurs when we construct a model for an event that happens if a continuous variable is above a certain threshold: we can compare estimators based on the observation of the event alone or on the whole continuous variable. The second is that of predicting survival based only on survival information or using, in addition, information on a disease. We develop modified AIC and LCV criteria to compare estimators in this non-standard situation. We show that a normalized difference of AIC has a bias equal to $$o(n^{-1})$$ if the estimators are based on well-specified models; a normalized difference of LCV always has a bias equal to $$o(n^{-1})$$.

A simulation study shows that both criteria work well, although the normalized difference of LCV tends to perform better and is more robust. Moreover, in the case of well-specified models, the difference of risks reduces to the difference of statistical risks, which can be estimated rather precisely.
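
The threshold example can be illustrated with a small sketch. This is not the modified AIC or LCV criterion of the talk; it is a plain leave-one-out likelihood cross-validation, written under assumptions of our own: the continuous variable is simulated as standard normal, the threshold is set to 1.0, and the function names (`p_from_events`, `p_from_continuous`, `loo_cv_loss`) are hypothetical. Both estimators predict the probability of the event, and both are scored on the event alone, even though the second one uses the whole continuous variable.

```python
import math
import random


def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def bernoulli_loglik(z, p):
    """Log-likelihood of a binary outcome z under event probability p."""
    eps = 1e-12  # clip to avoid log(0)
    p = min(max(p, eps), 1.0 - eps)
    return math.log(p) if z == 1 else math.log(1.0 - p)


def p_from_events(z_train):
    """Estimator using only the observed events: the empirical frequency."""
    return sum(z_train) / len(z_train)


def p_from_continuous(y_train, threshold):
    """Estimator using the continuous variable: fit a normal model,
    then compute the implied probability of exceeding the threshold."""
    n = len(y_train)
    mu = sum(y_train) / n
    sigma = math.sqrt(sum((y - mu) ** 2 for y in y_train) / n)
    return 1.0 - normal_cdf((threshold - mu) / sigma)


def loo_cv_loss(y, threshold):
    """Leave-one-out mean negative log-likelihood of the event,
    for the event-only and the continuous-variable estimators."""
    n = len(y)
    z = [1 if v > threshold else 0 for v in y]
    loss_event = 0.0
    loss_cont = 0.0
    for i in range(n):
        y_train = y[:i] + y[i + 1:]
        z_train = z[:i] + z[i + 1:]
        loss_event -= bernoulli_loglik(z[i], p_from_events(z_train))
        loss_cont -= bernoulli_loglik(z[i], p_from_continuous(y_train, threshold))
    return loss_event / n, loss_cont / n


random.seed(0)  # assumed simulation setting, for reproducibility only
y = [random.gauss(0.0, 1.0) for _ in range(200)]
lcv_event, lcv_cont = loo_cv_loss(y, threshold=1.0)
print("event-only estimator:", lcv_event)
print("continuous-variable estimator:", lcv_cont)
```

The smaller of the two printed losses indicates the estimator that this simple criterion would prefer for predicting the event; the criteria discussed in the talk refine this kind of comparison with bias corrections.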