Some LISREL users have experienced problems with the concepts
of a covariance matrix and an asymptotic covariance matrix.
One of the misconceptions is that the ACM replaces the usual
covariance matrix (or correlation matrix) as the input to
LISREL/SIMPLIS when dealing with non-normal data.
To illustrate the differences between an ACM and the sample
covariance matrix, consider the following 3 small datasets, each
representing ten measurements on the three variables Y1, Y2 and
Y3:
The number of non-duplicated elements of a covariance matrix
equals k = p*(p+1)/2, where p is the number of variables. For the
datasets above p equals 3 and therefore k = 3*4/2=6, as can
easily be verified from the printouts above.
These non-duplicated elements can be presented as a row of
numbers:
S11, S21, S22, S31, S32, S33
For continuous data the formula to calculate these variances
and covariances is illustrated by considering the covariance
between Y1 and Y2
S12 = SUM[Y1 -Mean(Y1)][Y2 -Mean(Y2)]/(n-1), where
Mean(Y1) = SUM(Y1)/n, Mean(Y2) = SUM(Y2)/n;
and where n denotes the number of observations.
The 3 covariance matrices can therefore be summarized as 3
rows of a data matrix with six variables, these being:
Now, suppose that instead of 3 datasets, we had 500 datasets
based on the same 3 variables Y1, Y2, Y3. From these datasets one
can calculate 500 sample covariance matrices, which, in turn, can
be represented as a dataset with 6 columns and 500 rows.
One estimate of the asymptotic covariance of Y1, Y2 and Y3
would be to use the sample covariance matrix of S11, S21, . . .,
S33.
This ACM matrix has k*(k+1)/2 non-duplicated elements, where
k=p*(p+1)/2. In our case k=6 and therefore the ACM has 21
non-duplicated elements. It is left to the reader to verify that
k, if the number of observed variables (p) equals 80 is equal to
5,250,420. For the sake of interest it should further be noted
that it requires 8 bytes of memory to store one element of the
ACM. Hence, one would require approximately 40 megabytes of disk
space to store an ACM based on 80 observed variables.
In practice, a researcher would not have the luxury of having
a large number of datasets at his disposal. The ACM is therefore
calculated from the information in a single dataset and the
formula used depends on whether the variables are continuous or
ordinal or a mixture of continuous and ordinal variables.
An alternative approach to the calculation of an ACM is to use
the bootstrap method discussed in the PRELIS manual. According to
this method, the researcher randomly draws a large number of
datasets from the original raw data. Note that drawing is done
with replacement meaning that the same row of the raw dataset can
be selected more than once. To illustrate, one bootstrap sample
of size 10 from dataset 1 is shown below:
In LISREL the chi-square statistic corresponds to testing the
null hypothesis
H0: The population covariance matrix can be estimated by
SIGMAO, against the alternative hypothesis
H1: The population covariance matrix is estimated by the
sample covariance matrix S.
Note that the covariance matrix SIGMAO is a function of the
postulated structural equation model, and hence, the elements of
SIGMAO are determined by the values of the estimated parameters.
A perfect fit, indicated by a chi-square value of 0, implies
that SIGMA0 equals S. Goodness of fit, generally speaking, is a
measure of similarity between these two matrices. Fit statistics
are therefore based on some function of the differences between
the sample covariances and the fitted covariances.
For example, in weighted least squares we minimize a function
F,
In the notation above Wij represents a typical element of the
ACM.
Under normality, it is well known that the ACM can be
expressed as the so-called Kronecker product of the population
covariance matrix with itself. It is also well known that the
inverse of the ACM in the case of multivariate normality is the
Kronecker product of inverse(SIGMA) and inverse(SIGMA).
By specifying that the method of maximization is normal
maximum likelihood, one avoids the necessity of inverting the
usually large k x k ACM matrix obtained by PRELIS.
In general, there are two ways in which the ACM can be used
when dealing with non-normal data. It should, however, be pointed
out that in both cases LISREL/SIMPLIS additionally requires the
sample covariance/correlation matrix as input. For example:
LISREL:
CM = socmob.cov
AC = socmob.acm
SIMPLIS:
Covariance matrix from file socmob.cov
Asymptotic covariance matrix from file socmob.acm
The two approaches to the use of the ACM are:
(1) Use the ACM as a weight matrix that has to be inverted by
specifying the method of optimization as weighted least squares.
(2) Specify the method of optimization as normal maximum
likelihood, in which case the ACM is not inverted, but is used as
a multiplying factor in an expression containing the normal
theory weight matrix (inverse(SIGMA0) Kronecker inverse(SIGMA0) )
to correct for bias in standard errors and fit statistics.
Final Remarks
1) When an ACM matrix is used as input to LISREL, it does not
replace the sample covariance matrix, but is used as a weight
matrix in the WLS procedure, or as a matrix which adjusts the
normal-theory weight matrix in the sense that the chi-square
statistic and standard-errors are less biased.
2) The ACM of a correlation matrix: A correlation matrix has
p*(p-1)/2 non-duplicated correlations, since all the diagonal
elements are equal to one. For the Y1, Y2, Y3 example these
correlations are (3 x 2)/2 = 3 namely R21, R31 and R32, and hence
the number of non-duplicated elements of the ACM is 3 x 4/2 = 6.
The ACM of a correlation matrix therefore has less elements than
that of a covariance matrix.
3) From the above, it is clear that if an ACM is calculated in
PRELIS, the appropriate moment matrix (covariances or
correlations) must be selected prior to requesting the
calculation of an ACM.