Next: 4 Power for the
 Up: 7 Power and Sample
 Previous: 2 Factors Contributing to
     Index 
3 Steps in Power Analysis
The basic approach to power analysis is to imagine that we are doing
an identical study many times.  For example, we pretend that we are
trying to estimate 
, and 
 for a given population by taking
samples of a given number of MZ and DZ twins.  Each sample would give
somewhat different estimates of the parameters, depending on how many
twins we study, and how big 
, and 
 are in the study
population.  Suppose we did a very large number of studies and
tabulated all the estimates of the shared environmental component,
.  In some of the studies, even though there was some shared
environment in the population, we would find estimates of 
 that
were not significant.  In these cases we would commit ``type II
errors.''  That is, we would not find a
significant effect of the shared environment even though the value of
 in the population was truly greater than zero.  Assuming we were
using a 
 test for 1 df to test the significance of the
shared environment, and we had decided to use the conventional 5%
significance level, the probability of Type II error would be the
expected proportion of samples in which we mistakenly decided in favor
of the null hypothesis that 
.  These cases would be those in
which the observed value of 
 was less than 3.84, the 5%
critical value for 1 df.  The other samples in which 
 was
greater than 3.84 are those in which we would decide, correctly, that
there was a significant shared environmental effect in the population.
The expected proportion of samples in which we decide correctly
against the null hypothesis is the power of the test.
Designing a genetic study boils down to deciding on the numbers and
types of relationships needed to achieve a given power for the test of
potentially important genetic and environmental factors.  There is no
general solution to the problem of power.  The answers will depend on
the specific values we contemplate for all the factors listed above.
Before doing any power study, therefore, we have to decide the
following questions in each specific case:
- What kinds of relationships are to be considered?
 
- What significance level is to be used in hypothesis testing? 
 
- What values are we assuming for the various effects of interest
  in the population being studied?
 
- What power do we want to strive for in designing the study?
 
When we have answered these questions exactly, then we can conduct a
power analysis for the specified set of conditions by following some
basic steps:
- Obtain expected covariance matrices for each set of
  relationships by substituting the assumed values of the population
  parameters in the model for each relationship.
 
- Assign some initial arbitrary sample sizes to each separate
  group of relatives.
 
- Use Mx to analyze the expected covariance matrices just as
  we would to analyze real data and obtain the 
 value for
  testing the specific hypothesis of interest.
 
- Find out (from statistical tables) how big that 
 has to
  be to guarantee the power we need.
 
- Use a simple formula (given below) to multiply our assumed
  sample size and solve for the sample size we need.
 
It is essential to remember that the sample size we obtain in step
five only applies to the particular effect, design, sample sizes, and
even to the distribution of sample sizes among the different types of
relationship assumed in a specific power calculation.  To explore the
question of power fully, it often will be necessary to consider a
number, sometimes a large number, of different designs and population
values for the relevant effects of genes and environment.
 
 
  
 
 Next: 4 Power for the
 Up: 7 Power and Sample
 Previous: 2 Factors Contributing to
     Index 
Jeff Lessem
2002-03-21