Abstract:
|
When large amounts of survival data arrive in streams, conventional estimation methods may become computationally infeasible since they require storage of all the risk sets at each accumulation point. We develop online updating methods for carrying out survival analysis under the Cox proportional hazards models. Specifically, we propose online-updating estimators as well as their corresponding standard errors for both the regression coefficients and the baseline hazard function. An extensive simulation study is conducted to investigate the empirical performance of the proposed estimators. A large colon cancer data set from the Surveillance, Epidemiology, and End Results (SEER) program is analyzed to further demonstrate the proposed methodologies.
|