![]() |
This intuitive idea of opportunities for
failure translates directly into the concept of degrees of freedom. When we
use a bivariate normal liability model to predict the proportions in a
contingency table with rows and
columns, we use
thresholds for
the rows,
thresholds for the columns, and one parameter for the
correlation in liability, giving
in total. The table itself contains
proportions, neglecting the total sample size as above. Therefore we
have degrees of freedom equal to:
In principle, models could be fitted by maximum likelihood directly to contingency tables, employing the observed and expected cell proportions. This approach is general and flexible, especially for the multigroup case -- the programs LISCOMP (Muthén, 1987) and Mx (Neale, 1991) use the method -- but it is currently limited by computational considerations. When we move from two variables to larger examples involving many variables, integration of the multivariate normal distribution (which has to be done numerically) becomes extremely time-consuming, perhaps increasing by a factor of ten or so for each additional variable.
An alternative approach to this problem is to PRELIS 2 to compute each correlation in a pairwise fashion, and to compute a weight matrix. The weight matrix is an estimate of the variances and covariances of the correlations. The variances of the correlations certainly have some intuitive appeal, being a measure of how precisely each correlation is estimated. However, the idea of a correlation correlating with another correlation may seem strange to a newcomer to the field. Yet this covariation between correlations is precisely what we need in order to represent how much additional information the second correlation supplies over and above that provided by the first correlation. Armed with these two types of summary statistics -- the correlation matrix and the covariances of the correlations, we may fit models using a structural equation modeling package such as Mx or LISREL, and make statistical inferences from the goodness of fit of the model.
It is also possible to use the bivariate normal liability distribution to
infer the patterns of statistics that would be observed if an ordinal and
continuous variables were correlated.
Essentially, there are specific predictions made about the expected
mean and variance of the continuous variable in each of the categories of the
ordinal variable. For example, the continuous variable means are predicted to increase
monotonically across the categories if there is a correlation between the
liabilities. An observed pattern of a high mean in category 1, low in category
2 and high again in category 3 would not be consistent with the model.
The number of parameters used to describe this model for an ordinal variable
with categories is
, since we use
for the thresholds, one each
for the mean and variance of the continuous variable, and one for the
covariance between the two variables. The observed statistics involved are the
proportions in the cells (less one because the final proportion may be obtained
by subtraction from 1) and the mean and variance of the continuous variable in
each category. Therefore we have: