The basic approach to power analysis is to imagine that we are doing
an identical study many times. For example, we pretend that we are
trying to estimate , and
for a given population by taking
samples of a given number of MZ and DZ twins. Each sample would give
somewhat different estimates of the parameters, depending on how many
twins we study, and how big
, and
are in the study
population. Suppose we did a very large number of studies and
tabulated all the estimates of the shared environmental component,
. In some of the studies, even though there was some shared
environment in the population, we would find estimates of
that
were not significant. In these cases we would commit ``type II
errors.'' That is, we would not find a
significant effect of the shared environment even though the value of
in the population was truly greater than zero. Assuming we were
using a
test for 1 d.f. to test the significance of the
shared environment, and we had decided to use the conventional 5%
significance level, the probability of Type II error would be the
expected proportion of samples in which we mistakenly decided in favor
of the null hypothesis that
. These cases would be those in
which the observed value of
was less than 3.84, the 5%
critical value for 1 d.f. The other samples in which
was
greater than 3.84 are those in which we would decide, correctly, that
there was a significant shared environmental effect in the population.
The expected proportion of samples in which we decide correctly
against the null hypothesis is the power of the test.
Designing a genetic study boils down to deciding on the numbers and types of relationships needed to achieve a given power for the test of potentially important genetic and environmental factors. There is no general solution to the problem of power. The answers will depend on the specific values we contemplate for all the factors listed above. Before doing any power study, therefore, we have to decide the following questions in each specific case:
It is essential to remember that the sample size we obtain in step five only applies to the particular effect, design, sample sizes, and even to the distribution of sample sizes among the different types of relationship assumed in a specific power calculation. To explore the question of power fully, it often will be necessary to consider a number, sometimes a large number, of different designs and population values for the relevant effects of genes and environment.