Abstract:
|
Sample size and power calculations are essential components of experimental design in biomedical research. Estimating power for RNA-Seq differential expression analysis is especially challenging under complex experimental designs. Moreover, the dependency among genes should be taken into account to obtain accurate results. We therefore propose a simulation-based approach to power estimation that uses the negative binomial distribution and assumes a gene-level generalized linear model, accounting for the dependence between a gene's expression level and its variance (dispersion). We compare the performance of the likelihood ratio test (LRT) and the Wald test under different scenarios, using the simulated exact distribution of each test statistic under the null hypothesis to control false positives.
|
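The idea of simulation-based power estimation with a simulated null distribution can be illustrated with a minimal sketch. It is not the method proposed here: it assumes a single gene, a two-group design, a known dispersion `phi`, and a simple Wald-type statistic on the log fold change instead of a fitted generalized linear model. All function names and parameter values (`rnbinom`, `wald_stat`, `estimate_power`, `mu`, `fold`, `phi`) are hypothetical and chosen only for illustration.

```python
import math
import random


def rnbinom(rng, mu, phi):
    """Draw one negative binomial count with mean mu and dispersion phi
    (variance = mu + phi * mu**2) using the gamma-Poisson mixture."""
    lam = rng.gammavariate(1.0 / phi, mu * phi)  # gamma mean = mu
    # Knuth's Poisson sampler; adequate for the moderate means used here.
    limit = math.exp(-lam)
    k, p = 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


def wald_stat(x0, x1, phi):
    """Wald-type statistic for the log fold change between two groups.
    Assumption: phi is known; a 0.5 pseudo-count avoids log(0)."""
    m0 = sum(x0) / len(x0) + 0.5
    m1 = sum(x1) / len(x1) + 0.5
    # Delta-method variance of a log group mean: (1/mu + phi) / n.
    se = math.sqrt((1.0 / m0 + phi) / len(x0) + (1.0 / m1 + phi) / len(x1))
    return (math.log(m1) - math.log(m0)) / se


def estimate_power(mu, fold, phi, n, alpha=0.05, n_sim=2000, seed=1):
    """Estimate power for n samples per group, calibrating the critical
    value on the simulated null distribution of the statistic."""
    rng = random.Random(seed)

    def sample(mean):
        return [rnbinom(rng, mean, phi) for _ in range(n)]

    # Step 1: simulate the statistic under the null (fold change of 1).
    null = sorted(abs(wald_stat(sample(mu), sample(mu), phi))
                  for _ in range(n_sim))
    crit = null[min(n_sim - 1, int((1 - alpha) * n_sim))]

    # Step 2: simulate under the alternative and count rejections.
    hits = sum(abs(wald_stat(sample(mu), sample(mu * fold), phi)) > crit
               for _ in range(n_sim))
    return hits / n_sim


if __name__ == "__main__":
    for n in (3, 5, 10):
        print(n, estimate_power(mu=20, fold=2.0, phi=0.1, n=n))
```

Because the critical value is taken from the simulated null distribution rather than from an asymptotic approximation, the false positive rate stays close to `alpha` even for small group sizes, at the cost of extra simulation.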