Let @A be a set of factors, on an observation set Y, that meet the following criteria:
|
Then @A is called an #~{orthogonal set} of factors
As we shall see, orthogonal sets of factor have some properties that make the statistical analysis possible. We'll start by looking at how such a set lends itself to a decomposition of &reals.^~n which is governed by the factors in the set.
We will now demonstrate how we can associate certain quantities with the individual factors. For the time being this is rather abstract, but it will soon become clear how these quantities can be used in analysing the data.
Let @A be an orthogonal set of factors as defined above, and let ~p_F be the orthogonal projection onto the linear space associated with the factor F, with the associated matrix P_F. Obviously
prod{(P_F + (I &minus. P_F)),F &in. @A, _ } _ = _ I, _ _ _ _ _ _ where I is the identity map
But by expanding the product:
prod{(P_F + (I &minus. P_F)),F &in. @A, _ } _ = _ sum{ prod{P_F,F&in.M, _ }prod{(I &minus. P_F),F&nin.M, _ } ,M &subseteq. @A, _ } _ _ _ _ _ [i.e. sum taken over all subsets of @A.]
_ _ _ _ _ _ _ _ _ _ _ = _ sum{Q_M , M &subseteq. @A, _ }, _ _ _ _ _ _ where _ _ _ _ _ _ Q_M _ = _ prod{P_F,F&in.M, _ }prod{(I &minus. P_F),F&nin.M, _ }
Now if _ G >= F _ then _ P_FP_G = P_GP_F = P_F , _ and if _ F &in. M _ but _ G &nin. M , _ then _ P_F (I &minus. P_G) _ is a factor of Q_M , _ and
P_F (I &minus. P_G) _ = _ P_F - P_FP_G _ = _ 0 _ _ => _ _ Q_M _ = _ 0.
Also, if _ F &in. M , _ G &in. M , _ but _ F&min.G &nin. M , _ then _ P_FP_G (I &minus. P_{F&min.G}) _ is a factor of Q_M , _ and
P_FP_G (I &minus. P_{F&min.G}) _ = _ P_FP_G (I &minus. P_FP_G) _ = 0 _ _ => _ _ Q_M _ = _ 0.
In the sum, &sum. Q_M we need only consider non-zero Q_M, i.e. we need only sum over sets M, where
This means that the minimum element of M specifies Q_M. _ So for any factor G, write _ Q_G = Q_M , _ where _ M = \{ F | F &ge. G \} _ i.e:
Q_G _ = _ prod{P_F,F&ge.G, _ }prod{(I &minus. P_F),F!&ge.G, _ }
where !&ge. represents 'not greater than or equal to'. [Note this is not synonymous with < as this is only a partial ordering.]
Now _ P_FP_G = P_G _ for any _ F &ge. G . _ So
prod{P_F,F&ge.G, _ } _ = _ P_G
So we have:
Q_G _ = _ P_G prod{(I &minus. P_F),F!&ge.G, _ } I _ = _ sum{ Q_G ,G &in. @A, _ } |
Now Q_G is the product of orthogonal projections [or strictly speaking, their associated matrices], and so Q_G is an orthogonal projection for any factor G &in. &Delta.. _ Let _ V_G = im Q_G , _ then V_G is a subspace of &reals.^{~n}, _ and _ V_G &perp. V_F _ any two factors G ≠ F &in. &Delta.. _ So
&reals.^{~n} _ = _ oplus{ V_G ,G &in. &Delta., } V_G _ = _ im Q_G |
Note that if F !&ge. G then P_F Q_G = 0, _ so
P_F _ = _ P_F &mult. I _ = _ P_F &mult. sum{ Q_G ,G &in. &Delta., _ } _ = _ sum{P_FQ_G ,G &le. F, _ } _ = _ sum{Q_G ,G &le. F, _ }
Since P_F Q_G = Q_G when G &le. F. We have:
P_F _ = _ sum{Q_G ,G &le. F, _ } L_F _ = _ oplus{ V_G ,G &le. F, _ } |
The above decomposition is unique, for suppose
I _ = _ sum{ R_G ,G &in. &Delta., _ }
&reals.^{~n} _ = _ oplus{ W_G ,G &in. &Delta., _ }, _ _ _ _ _ _ W_G _ = _ Im R_G
Let &Theta. &subset. &Delta. be the subset of all factors, G, in the design for which R_G ≠ Q_G. _ Let F &in. &Theta. where F is not the minimal element of &Delta..
Now
P_F _ = _ sum{ Q_G,G &le. F, _ } _ _ _ _ _ _ => _ _ _ _ _ _ sum{ Q_G,G < F, _ } _ = _ P_F - Q_F _ _ != _ _ P_F - R_F _ = _ sum{ R_G,G < F, _ }
So &exist. some G < F such that Q_G ≠ R_G, i.e. G &in. &Theta.. So either &Theta. contains the minimal element of &Delta. or &Theta. is empty. Let M be the minimal element then Q_M = P_M = R_M , so M &nin. &Theta. &imply. &Theta. is empty.
Q_G _ = _ P_G prod{(I &minus. P_F),F!&ge.G, _ } _ = _ P_G prod{(P_G &minus. P_{F&min.G}),F!&ge.G, _ }
Now if _ F < G _ then _ P_{F&min.G} = P_F.
If _ F !< G _ then _ F&min.G = H , _ where H < G, _ so _ P_G &minus. P_H _ will be a factor of the above product.
Note also that if _ F < G , _ then _ ( P_G - P_F ) ( P_G - P_F ) _ = _ P_G - P_F. _ So:
Q_G _ = _ P_G prod{( P_G - P_F ),F<.G, _ } |
This can sometimes make calculations simpler, e.g.
Q_A _ = _ P_A ( P_A - P_{A&min.B} ) _ = _ P_A - P_{A&min.B}
A ~#{design} is a set of factors on the same finite observation set Y.
In terms of the statistical analysis, the design is the set of factors that we decide that we are going to consider before the experiment starts, or before the observations are made. Subsequently we will look at hypotheses that involve a subset of the design. So the design is the maximal set of factors that will be considered in an experiment or survey.
A design, &Delta., on Y is an ~#{orthogonal design} if it is an orhtogonal set as defined above, i.e.
|
Consider a model with two factors, A and B, which is orthogonal, e.g. there are an equal number of observations in each cross-cell, i.e. in each level of A # B.
Let &Delta. _ = _ \{ I, A # B, A, B, A&min.B \}
Q_{A&min.B} _ = _ P_{A&min.B}
Q_A _ = _ P_A ( I - P_B ) ( I - P_{A&min.B} )
_ _ _ = _ P_A _ - _ P_A P_B _ - _ P_A P_{A&min.B} _ + _ P_A P_B P_{A&min.B} _ _ = _ _ P_A _ - _ P_{A&min.B}
[ remembering that _ P_AP_B _ = _ P_{A&min.B} _ for orthogonal factors. ]
Q_B _ = _ P_B ( I - P_A ) ( I - P_{A&min.B} ) _ _ = _ _ P_B _ - _ P_{A&min.B}
Q_{A # B} _ = _ P_{A # B} ( I - P_A ) ( I - P_B ) ( I - P_{A&min.B} )
_ _ _ = _ P_{A # B} _ - _ P_{A # B} P_A _ - _ P_{A # B} P_B _ - _ P_{A # B} P_{A&min.B}
_ _ _ _ _ + _ P_{A # B} P_A P_B _ + _ P_{A # B} P_B P_{A&min.B} _ + _ P_{A # B} P_B P_{A&min.B} _ - _ P_{A # B} P_A P_B P_{A&min.B}
_ _ _ = _ P_{A # B} _ - _ P_A _ - _ P_B _ - _ P_{A&min.B} _ + _ P_{A&min.B} _ + _ P_{A&min.B} _ + _ P_{A&min.B} _ - _ P_{A&min.B}
_ _ _ = _ P_{A # B} _ - _ P_A _ - _ P_B _ + _ P_{A&min.B}
In fact, for any factor, G, in an orthogonal design, Q_G can be expressed in the form
Q_G _ = _ sum{&alpha._G^F P_F, F &le. G, _ }, _ _ _ _ _ _ where &alpha._G^F &in. \{ -1 , +1 \} |