Abstract:
|
As RNA-seq rapidly develops and costs continue to decrease, more samples will continue to be sequenced. Thus, the determination of sample size becomes an important issue in planning the experimental design. Some current methods for calculating the required sample size of a study are based on the hypothesis testing framework, assuming the counts for each gene come from the Poisson or negative binomial distributions. However, these methods are limited in terms of accommodating covariates. To deal with this issue, we propose an estimating procedure based on the generalized linear model. By constructing a representative exemplary dataset and estimating the conditional power, the proposed method is easy to use and requires no complicated mathematical approximations or formulas. Most attractively, the downstream analysis can be based on the many currently existing R/Bioconductor packages. Finally, the proposed method is applied to two real-world studies.
|
ASA Meetings Department
732 North Washington Street, Alexandria, VA 22314
(703) 684-1221 • meetings@amstat.org
Copyright © American Statistical Association.