Least Squares Estimation

The model for an incomplete block design is

\[ \begin{gathered} Y_{h i}=\mu+\theta_h+\tau_i+\epsilon_{h i}, \\ \epsilon_{h i} \sim N\left(0, \sigma^2\right),\\ h=1, \ldots, b ; i=1, \ldots, v ; \quad(h, i) \text{ in the design},\\ \epsilon_{h i} \text{'s are mutually independent} \end{gathered} \] The least squares estimates for \(\theta_h\) and \(\tau_i\) minimize \[ \sum_{(h,i) \text{ in the design}}(Y_{hi}-\mu-\theta_h-\tau_i)^2 \] subject to \(\sum \tau_i=\sum \theta_h=0\).

One major difference is that the sum is not over all pairs \((h, i)\) but only over those that appeared in the design. The least squares estimators for the treatment parameters in the model for an incomplete block design must include an adjustment for blocks. This means that the least squares estimator for the pairwise comparison \(\tau_p-\tau_i\) is not the unadjusted estimator \(\bar{Y}_{. p}-\bar{Y}_{. i}\) as it would be for a randomized complete block design.

The least squares estimates are a solution to \[ r(k-1) \hat{\tau}_i-\sum_{p \neq i} \lambda_{p i} \hat{\tau}_p=k Q_i, \quad \text { for } i=1, \ldots, v \\ \sum_i \hat{\tau}_i=0, \sum_h \hat{\theta}_h=0. \] where \(\lambda_{pi}\) is the number of blocks containing both treatments \(p\) and \(i\), and \[ Q_i=T_i-\frac{1}{k} \sum_{h=1}^b n_{h i} B_h, \quad T_i=\sum_{h=1}^b n_{h i} y_{h i}, \quad B_h=\sum_{i=1}^v n_{h i} y_{h i}. \]

Confidence intervals for a contrast \(\sum_i c_i\tau_i\) are of the form \[ \text{estimate}\pm w \text{(standard error)}. \]

For a single \(100(1-\alpha)\) confidence interval, \(w=t_{\alpha/2, df}\) where \(df\) is the number of degrees of freedom for error.

For simultaneous confidence intervals, the Bonferroni and Scheffé methods can be used for all incomplete block designs. Tukey’s and Dunnett’s methods can be used for balanced incomplete block designs.

Analysis of Variance

\[ \begin{array}{lcccc} \hline \text { Source of variation } & \text { Degrees of freedom } & \text { Sum of squares } & \text { Mean square } & \text { Ratio } \\ \hline \text { Blocks (adj) } & b-1 & s s \theta_{\text {adj }} & \mathrm{ms} \theta_{\text {adj }} & - \\ \hline \text { Blocks (unadj) } & b-1 & s s \theta & - & - \\ \text { Treatments (adj) } & v-1 & s s T_{\text {adj }} & m s T_{\text {adj }} & \frac{m s T_{\text {adj }}}{m s E} \\ \text { Error } & b k-b-v+1 & s s E & m s E & \\ \text { Total } & b k-1 & s s t o t & \end{array} \] where \[ \begin{array}{ll} s s \theta=\sum_{h=1}^b B_h^2 / k-G^2 /(b k) & s s E=s s t o t-s s \theta-s s T_{\text {adj }} \\ s s T_{\text {adj }}=\sum_{i=1}^v Q_i \hat{\tau}_i & \text { sstot }=\sum_{h=1}^b \sum_{i=1}^v n_{h i} y_{h i}^2-G^2 /(b k) \\ Q_i=T_i-\sum_{h=1}^b n_{h i} B_h / k . & s s \theta_{\mathrm{adj}}=s s t o t-s s E-\left(\sum_{i=1}^v T_i^2 / r-G^2 /(b k)\right) \end{array} \]

Analysis of Balanced Incomplete Block Designs

For the balanced incomplete block design the estimates of treatment effects have a simple form: \[ \hat{\tau}_i=\frac{k}{\lambda v} Q_i, \quad \text { for } i=1, \ldots, v, \] where \(\lambda\) is the number of times that every pair of treatments occurs together in a block, and \(Q_i\) is the adjusted treatment total. Thus, the sum of squares for treatments adjusted for blocks in becomes \[ s s T_{\mathrm{adj}}=\sum_{i=1}^v \frac{k}{\lambda v} Q_i^2, \] and the least squares estimator of contrast \(\sum c_i \tau_i\) is \[ \sum_{i=1}^v c_i \hat{\tau}_i=\frac{k}{\lambda v} \sum_{i=1}^v c_i Q_i. \]

It can be shown that the corresponding variance can be calculated as \[ \operatorname{Var}\left(\sum_{i=1}^v c_i \hat{\tau}_i\right)=\sum_{i=1}^v c_i^2\left(\frac{k}{\lambda v}\right) \sigma^2. \]

The simultaneous confidence intervals for a set of contrasts \(\sum c_i\tau_i\) are given by \[ \frac{k}{\lambda v} \sum_{i=1}^v c_i Q_i \pm w \sqrt{\sum_{i=1}^v c_i^2\left(\frac{k}{\lambda v}\right) m s E} \] where the critical coefficients for the four methods are, respectively, \[ \begin{gathered} w_B=t_{b k-b-v+1, \alpha / 2 m} ; w_S=\sqrt{(v-1) F_{v-1, b k-b-v+1, \alpha}} ; \\ w_T=q_{v, b k-b-v+1, \alpha} / \sqrt{2} ; w_{D 2}=|t|_{v-1, b k-b-v+1, \alpha}^{(0.5)} \end{gathered} \]

Sample Size

Given the number of treatments \(v\) and the block size \(k\), how many blocks \(b\) are required to achieve confidence intervals of a specified length or a hypothesis test of specified power? Since for most purposes the balanced incomplete block design is the best incomplete block design when it is available, we start by calculating \(b\) and the treatment replication \(r=b k / v\) for this design. Then if a balanced incomplete block design cannot be found with \(b\) and \(r\) close to the calculated values, a group divisible, cyclic, or other incomplete block design can be considered. Since balanced incomplete block designs are the most efficient, other incomplete block designs would generally require \(b\) and \(r\) to be a little larger.

Suppose we use Tukey’s method for all pairwise comparisons. Then \[ msd=(q_{v, df,\alpha}/\sqrt 2)\times \sqrt{2 (mse) \frac{k}{\lambda v}} \] where \(\lambda=r(k-1)/(v-1)\), \(df=vr-b-v+1\) and \(b=vr/k\).

Consider the case when \(v=5, k=3\) and \(mse\) is bounded by 2. How many replicates \(r\) per treatment are sufficient for the 95% simultaneous confidence intervals of all pairwise comparisons to have a width less than 3?

data size;
input r @@;
alpha=0.05;
mse=2;
v=5;
k=3;
b=v*r/k;
df=v*r-b-v+1;
lambda=r*(k-1)/(v-1);
q=probmc('range',.,1-alpha,df,v);
msd=(q/sqrt(2))*sqrt(2*mse*k/(lambda*v));
lines;
15 16 17 18
;
proc print;
var r msd;
run;