183 – Multivariate Data Analysis and Genetics
Long-Term Survival Probabilities and Kaplan-Meier Estimator
Ion Grama
University of South Brittany
Jean-Marie Tricot
University of South Brittany
Jean-Francois Petiot
University of South Brittany
The nonparametric Kaplan-Meier estimator is a standard tool for estimating a survival time distribution in a right censoring schema. Our goal is to analyze this estimator in the long term, particularly when the censoring rate is high. We combine the Kaplan-Meier estimator and a parametric-based model into one construction which we call semiparametric Kaplan-Meier estimator. Our estimator incorporates a threshold $t$ in such a way that the survival function is estimated by the Kaplan-Meier estimator on $[0,t]$ and by the exponential distribution on $(t,\infty ).$ Our main result is that with an appropriate choice of the threshold $t$ such an estimate is consistent. Rates of convergence are obtained which in particular cases turn to be nearly optimal. A data driven multiple testing procedure for choosing the threshold $t$ is proposed. As byproduct it provides a goodness-of-fit test for the parametric-based part of the model. Our numerical simulations show that the proposed technique improves Kaplan-Meier's in the long as well as in the mid term.