MINQUE

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

In statistics, the theory of minimum norm quadratic unbiased estimation (MINQUE)[1][2][3] was developed by C. R. Rao. MINQUE is a theory alongside other estimation methods in estimation theory, such as the method of moments or maximum likelihood estimation. Similar to the theory of best linear unbiased estimation, MINQUE is specifically concerned with linear regression models.[1] The method was originally conceived to estimate heteroscedastic error variance in multiple linear regression.[1] MINQUE estimators also provide an alternative to maximum likelihood estimators or restricted maximum likelihood estimators for variance components in mixed effects models.[3] MINQUE estimators are quadratic forms of the response variable and are used to estimate a linear function of the variances.

Principles

[edit | edit source]

We are concerned with a mixed effects model for the random vector 𝐘n with the following linear structure.

𝐘=𝐗𝜷+𝐔1𝝃1++𝐔k𝝃k

Here, 𝐗n×m is a design matrix for the fixed effects, 𝜷m represents the unknown fixed-effect parameters, 𝐔in×ci is a design matrix for the i-th random-effect component, and 𝝃ici is a random vector for the i-th random-effect component. The random effects are assumed to have zero mean (𝔼[𝝃i]=𝟎) and be uncorrelated (𝕍[𝝃i]=σi2𝐈ci). Furthermore, any two random effect vectors are also uncorrelated (𝕍[𝝃i,𝝃j]=𝟎ij). The unknown variances σ12,,σk2 represent the variance components of the model.

This is a general model that captures commonly used linear regression models.

  1. Gauss-Markov Model[3]: If we consider a one-component model where 𝐔1=𝐈n, then the model is equivalent to the Gauss-Markov model 𝐘=𝐗𝜷+𝝐 with 𝔼[𝝐]=𝟎 and 𝕍[𝝐]=σ12𝐈n.
  2. Heteroscedastic Model[1]: Each set of random variables in 𝐘 that shares a common variance can be modeled as an individual variance component with an appropriate 𝐔i.

A compact representation for the model is the following, where 𝐔=[𝐔1𝐔k] and 𝝃=[𝝃1𝝃k].

𝐘=𝐗𝜷+𝐔𝝃

Note that this model makes no distributional assumptions about 𝐘 other than the first and second moments.[3]

𝔼[𝐘]=𝐗𝜷

𝕍[𝐘]=σ12𝐔1𝐔1++σk2𝐔k𝐔kσ12𝐕1++σk2𝐕k

The goal in MINQUE is to estimate θ=i=1kpiσi2 using a quadratic form θ^=𝐘𝐀𝐘. MINQUE estimators are derived by identifying a matrix 𝐀 such that the estimator has some desirable properties,[2][3] described below.

Optimal Estimator Properties to Constrain MINQUE

[edit | edit source]

Invariance to translation of the fixed effects

[edit | edit source]

Consider a new fixed-effect parameter 𝜸=𝜷𝜷0, which represents a translation of the original fixed effect. The new, equivalent model is now the following.

𝐘𝐗𝜷0=𝐗𝜸+𝐔𝝃

Under this equivalent model, the MINQUE estimator is now (𝐘𝐗𝜷0)𝐀(𝐘𝐗𝜷0). Rao argued that since the underlying models are equivalent, this estimator should be equal to 𝐘𝐀𝐘.[2][3] This can be achieved by constraining 𝐀 such that 𝐀𝐗=𝟎, which ensures that all terms other than 𝐘𝐀𝐘 in the expansion of the quadratic form are zero.

Unbiased estimation

[edit | edit source]

Suppose that we constrain 𝐀𝐗=𝟎, as argued in the section above. Then, the MINQUE estimator has the following form

θ^=𝐘𝐀𝐘=(𝐗𝜷+𝐔𝝃)𝐀(𝐗𝜷+𝐔𝝃)=𝝃𝐔𝐀𝐔𝝃

To ensure that this estimator is unbiased, the expectation of the estimator 𝔼[θ^] must equal the parameter of interest, θ. Below, the expectation of the estimator can be decomposed for each component since the components are uncorrelated with each other. Furthermore, the cyclic property of the trace is used to evaluate the expectation with respect to 𝝃i.

𝔼[θ^]=𝔼[𝝃𝐔𝐀𝐔𝝃]=i=1k𝔼[𝝃i𝐔i𝐀𝐔i𝝃i]=i=1kσi2Tr[𝐔i𝐀𝐔i]

To ensure that this estimator is unbiased, Rao suggested setting i=1kσi2Tr[𝐔i𝐀𝐔i]=i=1kpiσi2, which can be accomplished by constraining 𝐀 such that Tr[𝐔i𝐀𝐔i]=Tr[𝐀𝐕i]=pi for all components.[3]

Minimum Norm

[edit | edit source]

Rao argues that if 𝝃 were observed, a "natural" estimator for θ would be the following[2][3] since 𝔼[𝝃i𝝃i]=ciσi2. Here, 𝜟 is defined as a diagonal matrix.

p1c1𝝃1𝝃1++pkck𝝃k𝝃k=𝝃[diag(p1ci,,pkck)]𝝃𝝃𝜟𝝃

The difference between the proposed estimator and the natural estimator is 𝝃(𝐔𝐀𝐔𝜟)𝝃. This difference can be minimized by minimizing the norm of the matrix 𝐔𝐀𝐔𝜟.

Procedure

[edit | edit source]

Given the constraints and optimization strategy derived from the optimal properties above, the MINQUE estimator θ^ for θ=i=1kpiσi2 is derived by choosing a matrix 𝐀 that minimizes 𝐔𝐀𝐔𝜟, subject to the constraints

  1. 𝐀𝐗=𝟎, and
  2. Tr[𝐀𝐕i]=pi.

Examples of Estimators

[edit | edit source]

Standard Estimator for Homoscedastic Error

[edit | edit source]

In the Gauss-Markov model, the error variance σ2 is estimated using the following.

s2=1nm(𝐘𝐗𝜷^)(𝐘𝐗𝜷^)

This estimator is unbiased and can be shown to minimize the Euclidean norm of the form 𝐔𝐀𝐔𝜟.[1] Thus, the standard estimator for error variance in the Gauss-Markov model is a MINQUE estimator.

Random Variables with Common Mean and Heteroscedastic Error

[edit | edit source]

For random variables Y1,,Yn with a common mean and different variances σ12,,σn2, the MINQUE estimator for σi2 is nn2(YiY)2s2n2, where Y=1ni=1nYi and s2=1n1i=1n(YiY)2.[1]

Estimator for Variance Components

[edit | edit source]

Rao proposed a MINQUE estimator for the variance components model based on minimizing the Euclidean norm.[2] The Euclidean norm 2 is the square root of the sum of squares of all elements in the matrix. When evaluating this norm below, 𝐕=𝐕1++𝐕k=𝐔𝐔. Furthermore, using the cyclic property of traces, Tr[𝐔𝐀𝐔𝜟]=Tr[𝐀𝐔𝜟𝐔]=Tr[i=1kpici𝐀𝐕i]=Tr[𝜟𝜟].

𝐔𝐀𝐔𝜟22=(𝐔𝐀𝐔𝜟)(𝐔𝐀𝐔𝜟)=Tr[𝐔𝐀𝐔𝐔𝐀𝐔]Tr[2𝐔𝐀𝐔𝜟]+Tr[𝜟𝜟]=Tr[𝐀𝐕𝐀𝐕]Tr[𝜟𝜟]

Note that since Tr[𝜟𝜟] does not depend on 𝐀, the MINQUE with the Euclidean norm is obtained by identifying the matrix 𝐀 that minimizes Tr[𝐀𝐕𝐀𝐕], subject to the MINQUE constraints discussed above.

Rao showed that the matrix 𝐀 that satisfies this optimization problem is

𝐀=i=1kλi𝐑𝐕i𝐑,

where 𝐑=𝐕1(𝐈𝐏), 𝐏=𝐗(𝐗𝐕1𝐗)𝐗𝐕1 is the projection matrix into the column space of 𝐗, and () represents the generalized inverse of a matrix.

Therefore, the MINQUE estimator is the following, where the vectors 𝝀 and 𝐐 are defined based on the sum.

θ^=𝐘𝐀𝐘=i=1kλi𝐘𝐑𝐕i𝐑𝐘i=1kλiQi𝝀𝐐

The vector 𝝀 is obtained by using the constraint Tr[𝐀𝐕i]=pi. That is, the vector represents the solution to the following system of equations j{1,,k}.

Tr[𝐀𝐕j]=pjTr[i=1kλi𝐑𝐕i𝐑𝐕j]=pji=1kλiTr[𝐑𝐕i𝐑𝐕j]=pj

This can be written as a matrix product 𝐒𝝀=𝐩, where 𝐩=[p1pk] and 𝐒 is the following.

𝐒=[Tr[𝐑𝐕1𝐑𝐕1]Tr[𝐑𝐕k𝐑𝐕1]Tr[𝐑𝐕1𝐑𝐕k]Tr[𝐑𝐕k𝐑𝐕k]]

Then, 𝝀=𝐒𝐩. This implies that the MINQUE is θ^=𝝀𝐐=𝐩(𝐒)𝐐=𝐩𝐒𝐐. Note that θ=i=1kpiσi2=𝐩𝝈, where 𝝈=[σ12σk2]. Therefore, the estimator for the variance components is 𝝈^=𝐒𝐐.

Extensions

[edit | edit source]

MINQUE estimators can be obtained without the invariance criteria, in which case the estimator is only unbiased and minimizes the norm.[2] Such estimators have slightly different constraints on the minimization problem.

The model can be extended to estimate covariance components.[3] In such a model, the random effects of a component are assumed to have a common covariance structure 𝕍[𝝃i]=𝜮. A MINQUE estimator for a mixture of variance and covariance components was also proposed.[3] In this model, 𝕍[𝝃i]=𝜮 for i{1,,s} and 𝕍[𝝃i]=σi2𝐈ci for i{s+1,,k}.

References

[edit | edit source]
  1. ^ a b c d e f Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ a b c d e f Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  3. ^ a b c d e f g h i j Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).