Chapter 3: Tests for Comparing Several Normal Quantiles and Pairwise Confidence Intervals

Justin Dunnam

1. Introduction

Comparing several groups or populations is a fundamental problem in statistics and is often addressed by testing equality of means. However, the mean or median does not fully characterize a distribution, and important differences may occur in other parts of the distribution. In many applications – particularly in medical and reliability studies—the behavior of a large proportion of the population, reflected in specific percentiles, may be more relevant than the mean. For example, a smaller upper quartile for treatment time under one therapy indicates that most patients experience faster recovery, even if mean times are similar. As noted by [cox1985testing] and [li2012comparison], distributions may share similar means while differing substantially in their tails, motivating inference procedures based on quantiles. The problem of estimating/testing the difference between percentiles of two independent normal distributions has received some attention in the literature. However, the paper by [li2012comparison] seems to be the first one considered the problem of testing equality of quantiles of several normal populations. [malekzadeh2023simultaneous] have proposed simultaneous CIs for quantile differences of several normal populations. They presumed that the pivotal quantity found in [chakraborti2007confidence] is different or better than the classical NCT CI, and proposed methods of computing critical values to find the Chakraborti-Li CI. We see that the pairwise fiducial CIs and parametric bootstrap CIs based on the classical NCT pivotal quantity for a quantile and those proposed in [malekzadeh2023simultaneous] are essentially the same.

The rest of this chapter is organized as follows. In the following section, we describe the generalized variable test (GVT) by [li2012comparison]. We have enhanced the GVT by deriving theoretical expressions of some quantities. This closed-form expressions are easy to compute and thereby avoid additional simulation used in [li2012comparison]. Then we describe the modified LRT proposed in [abdollahnezhad2018testing] and a new modified MLRT. We also outline a parametric bootstrap (PB) approach for testing the equality of the normal quantiles.

In Section 3, we evaluate and compare the tests in terms of type I error rates and powers. Available pairwise CIs and simplified version of them are described in Section 4. These pairwise CIs are evaluated and compared using Monte Carlo simulation. In Section 5, we illustrate the tests and simultaneous CIs using two examples with real data.

2. Tests for Equality of Quantiles

Let Xi1,,Xini be a sample from a normal distribution with parameters μi and σi2, say, N(μi,σi2), i=1,,k. The pth quantile of the ith normal distribution is given by ξi=μi+zpσi, where zp is the standard normal quantile. The problem of interest is to test the equality of the quantiles. Specifically, we like to test

(1) H0:ξ1==ξkvs.Ha:ξiξjfor some ij.

Let (X¯i,Si2) denote the (mean, variance) based on a sample of size ni, and let (x¯i,si2) be an observed value of (X¯i,Si2) and mi=ni1, i=1,,k.

2.1. Fiducial Quantities for Quantiles

Let (X¯,S2) denote the (mean, variance) based on a sample of size n from a N(μ,σ2) distribution. Let (x¯,s2) be an observed value of (X¯,S2), and let ξ=μ+zpσ. Using the stochastic representations that X¯=μ+Zσ/n and S2=σ2U2, where ZN(0,1) independently of U2=χm2/m, we see that

T=ξX¯S/n=dzpσ+ZσnσU=Z+zpnU=dtm(zpn),

where tm(δ) denotes the noncentral t random variable with df = m and the noncentrality parameter δ. To get the 2nd equation, we used the results that μX¯σnZ.

Applying the [dawid1982functional] approach, we find a fiducial quantity (FQ) for ξ by solving the “equation” (2.1) for ξ and then replacing (X¯,S) by the observed value (x¯,s), and is given by

(2) Qξ=x¯+tm(zpn)sn.

The FQ for ξ given in [li2012comparison] is given by

(3) Qξ=x¯ZUsn+zpsU=x¯+(Z+zpnU)sn,

where ZN(0,1) independently of U2χm2/m. To get the 2nd equation in the above, we used the fact that Z and Z are identically distributed. Note that the term within the brackets has a tm(zpn) distribution. [li2012comparison] have used the expression (3) for Qξ and unable to find the mean and the variance of Qξ which are required to develop a test for equality of quantiles. The expression for Qξ in (2) is simple and its moments can be found using the moments of a tm(δ) distribution.

2.2. Generalized Variable Test

We shall now describe the generalized variable test of [li2012comparison] using the FQ (2). Let

(4) 𝝃=(ξ1,,ξk),𝑸ξ=(Qξ1,,Qξk)and𝐇=(𝐈k1,𝟏)(k1)×k,

where

(5) Qξi=x¯i+tmi(zpni)sini,i=1,,k,

and 𝐈l denotes the identity matrix of order l and 𝟏 is a (k1)×1 vector of ones. In terms of these notations, the null hypothesis in (1) can be written as

(6) H0:𝐇𝝃=𝟎vs.𝐇𝝃𝟎.

For a given (x¯1,,x¯k) and (s12,,sk2), let 𝝁ξ=E(𝑸ξ)and𝚺ξ=Cov(𝑸ξ). The mean vector and covariance matrix can be found using the moments of noncentral t random variables. Recall that

(7) E(tm(δ))=m/2Γ((m1)/2)Γ(m/2)δandVar(tm(δ))=mm2(1+δ2)[E(tm(δ))]2.

Let ai=mi/2Γ((mi1)/2)Γ(mi/2),i=1,,k. Then

(8) μξi=E(𝑸ξi)=x¯i+E(tmi(zpni))sini=x¯i+zpaisi,i=1,,k,

and

(9) σξi2=Var(𝑸ξi)=Var(tmi(zpni))si2ni=[mimi2(1+zp2ni)ai2zp2ni]si2ni,i=1,,k,

In terms of these quantities, the generalized test variable T is given by

(10) T=(𝑸ξ𝝁ξ)𝐇(𝐇𝚺ξ𝐇)1𝐇(𝑸ξ𝝁ξ).

Note that, for a given (𝐱¯,𝐬), 𝝁ξ=(μξ1,,μξk) and 𝚺ξ=diag(σξ12,,σξk2) can be computed numerically. Let T0=𝝁ξ𝐇(𝐇𝚺ξ𝐇)1𝐇𝝁ξ. The generalized p-value for testing H0 in (6) is given by

p-value=P(TT0|H0).

The above generalized p-value can be estimated using the following Algorithm 3.1.

Algorithm 3.1

For a given set of k independent samples from normal populations,

  1. (1)

    compute the means x¯i’s and variances si2’s.

  2. (2)

    Generate noncentral t random variables tm1(zpn1),,tmk(zpnk), and compute 𝑸ξ=(Qξ1,,Qξk) using (5).

  3. (3)

    Compute 𝝁ξ using (8) and 𝚺ξ using (9).

  4. (4)

    Compute T=(𝑸ξ𝝁ξ)𝐇(𝐇𝚺ξ𝐇)1𝐇(𝑸ξ𝝁ξ).

  5. (5)

    Repeat steps 2, 3, and 4 for large number of times, say, 10,000.

  6. (6)

    Compute T0=𝝁ξ𝐇(𝐇𝚺ξ𝐇)1𝐇𝝁ξ.

  7. (7)

    Find the proportion of times TT0. This proportion is an estimate of the generalized p-value.

The null hypothesis (1) is rejected at the level α, if the generalized p-value is less than α.

2.3. Likelihood Ratio Test

Define

(11) X¯i=1nij=1niXijandσ^i2=1nij=1ni(XijX¯i)2,i=1,,k.

The log-likelihood function is given by

(12) lnL(μ1,,μk;σ12,,σk2)=12i=1kniln(σi2)12i=1kniσ^i2+ni(X¯iμi)2σi2.

The MLEs that maximize the above lnL are given by μ^i=X¯i and σ^i2, i=1,,k.

The log-likelihood function under H0:ξ1==ξk is given by

(13) lnL(ξzpσ1,,ξzpσk;σ12,,σk2)=12i=1kniln(σi2)12i=1kniσ^i2+ni(X¯iξ+zpσi)2σi2,

where ξ is the common unknown quantile under H0 in (1). The values of (ξ,σ12,,σk2) that maximize (13) are the constrained MLEs, and let us denote the constrained MLEs by (ξ^c,σ^1c2,,σ^kc2). Details on calculation of the constrained MLEs and an algorithm are given in the appendix. The LRT statistic is expressed as

(14) Λ = 2[lnL(μ^1,,μ^k;σ^12,,σ^k2)lnL(ξ^czpσ^1c,,ξ^czpσ^kc;σ^1c2,,σ^kc2)]
= i=1kniσ^ic2[σ^i2+(X¯i+zpσ^icξ^c)2]+i=1kniln(σ^ic2σ^i2)i=1kni.

For a given level of significance α, the LRT rejects the null hypothesis when Λ>χk1;1α2, where χm;q2 denotes the 100q percentile of the chi-square distribution with df = m.

AJ Modified LRT

In general, the LRT is not accurate for small samples. To improve the LRT, [abdollahnezhad2018testing] have proposed a modification using the general theory of [skovgaard2001likelihood]. To describe this modification, let

𝜷=(μ1/σ12,,μk/σk2,1/σ12,,1/σk2),

and

𝝉=(n1μ1,,nkμk,0.5n1(μ12+σ12),,0.5nk(μk2+σk2))

Let

^𝜷=(x¯1/σ^12,,x¯k/σ^k2,1/σ^12,,1/σ^k2)

be the MLE of 𝜷 and

^𝝉=(n1x¯1,,nkx¯k,0.5n1(x¯12+σ^12),,0.5nk(x¯k2+σ^k2))

be the MLE of 𝝉. The constrained MLE of 𝜷 is given by ^𝜷c, this can be found by replacing the parameters with the constrained MLEs σ^ic and μ^ic=ξ^czpσ^ic, i=1,,k. Similarly, we can find the constrained MLE ^𝝉c. Furthermore, define

𝚺=Var(^𝝉)=(diag(niσi2)diag(niμiσi2)diag(niμiσi2)diag(niσi2(μi2+0.5σi2))).

Let ^𝚺 and ^𝚺c denote the MLE and the constrained MLE of 𝚺, respectively. Let

γ={(^𝝉^𝝉c)^𝚺c1(^𝝉^𝝉c)}k/2Λk/21(^𝜷^𝜷c)(^𝝉^𝝉c)(|^𝚺c||^𝚺|)1/2.

Then Λ^=Λ(11Λln(γ))2χk12,approximately. The AJ-MLRT rejects the null hypothesis if Λ^>χk1;1α2.

DK Modified LRT

We can also find another improved version of the likelihood approach, referred to as the DK modified LRT (DK-MLRT), which can be obtained by approximating the distribution of Λ/μΛ by the moment matching method, where μΛ=E(Λ). Let σΛ2=var(Λ). This approximation is based on the results by [diciccio2001simple] on improving the usual LRT and Welch’s approximate degrees of freedom solution for the Behrens-Fisher problem. We approximate the distribution of Λ/μΛ by the distribution of χν2/ν, where the degrees of freedom ν is found so that var(Λ/μΛ)=σΛ2/μΛ2=var(χν2/ν). This moment matching method yields ν=2μΛ2/σΛ2. In general, it is difficult to find μΛ and σΛ2 theoretically, but they can be replaced by Monte Carlo estimate (μ^Λ,σ^Λ2) as shown in Algorithm 3.2. For an observed value Λ0 of Λ/μ^Λ, the p-value of the DK-MLRT is given by

(15) P(1ν^χν^2>Λμ^Λ),

which can be estimated as follows.

Algorithm 3.2

For a given set of k samples, calculate (X¯i,σ^i2), i=1,,k.

  1. (1)

    Calculate the constrained MLEs ξ^c and (σ^1c2,,σ^kc2), and the LRT statistic Λ in (14).

  2. (2)

    Generate a sample of size ni from the N(ξ^czpσ^ic,σ^ic2), i=1,,k.

  3. (3)

    Calculate the LRT statistic for the samples generated in the previous step.

  4. (4)

    Repeat steps 2 and 3 for a large number of times, say, 10,000.

  5. (5)

    Find the mean μ^Λ and the variance σ^Λ2 of these 10,000 simulated LRT statistics, and compute ν^=2μ^Λ2/σ^Λ2 and let Λ0=Λ/μ^Λ.

  6. (6)

    The p-value of the MLRT is estimated by P(1ν^χν^2>Λμ^Λ).

2.4. The Parametric Bootstrap Test

Let ξ^i=X¯i+zpSi, i=1,,k. It is easy to see that an estimate of the variance of ξ^i is given by

V^(ξ^i)=Si2(1ni+zp2(1cmi2))withcmi=2miΓ((mi+1)/2)Γ(mi/2),i=1,,k.

Following the lines of [krishnamoorthy2007parametric], who have developed a PB test for equality of normal means, we can develop a test statistic for testing H0 in (1) as

(16) Tξ(X¯1,,X¯k;S12,,Sk2) = i=1kξ^i2V^(ξ^i)(i=1k1V^(ξ^i)ξ^i)2i=1k1V^(ξ^i)
= i=1k1V^(ξ^i)(ξ^i2ξ^¯2)

where ξ^¯=i=1kWiξ^i, Wi=[v(ξ^i)]1j=1k[v(ξ^i)]1 and

The parametric bootstrap involves sampling from the estimated models where the model parameters are replaced with the sample estimates. The PB test statistic is the same as the statistic Tξ in (16) with (X¯i,Si) replaced by (X¯i,Si), i=1,,k, which are the statistics based on bootstrap samples generated from N(μ^ic,σ^ic2), i=1,,k, distributions. Here, μ^ic=ξ^czpσ^ic, i=1,,k, where ξ^c and σ^ic2 are the constrained MLEs derived in Section 2.3. To express the PB statistic, let

X¯iN(μ^ic,σ^ic2)independently ofSi2σ^ic2χmi2mi,i=1,,k.

Also, let ξ^i=X¯i+zpSi, i=1,,k. Then the PB statistic can be expressed as

(17) Tξ(X¯1,,X¯k;S12,,Sk2)=i=1k1V^(ξ^i)(ξ^i2ξ^¯2),

where ξ^¯=i=1kWiξ^i, Wi=[V^(ξ^i)]1j=1k[v(ξ^i)]1 and V^(ξ^i)=Si2(1n+zp2(1cmi2)) with cmi=2miΓ((mi+1)/2)Γ(mi/2), i=1,,k.

The PB test for the equality of the quantiles rejects the null hypothesis (1) if

P[Tξ(X¯1,,X¯k;S12,,Sk2)Tξ(x¯1,,x¯k;s12,,sk2)]<α,

where (x¯i,si2) is an observed value of (X¯i,Si2), i=1,,k.

The p-value of the above PB test can be computed using the following Algorithm 3.3.

Algorithm 3.3

For a given set of (x¯1,,x¯k,s12,,sk2) and the sample sizes,

  1. (1)

    compute the test statistic Tξ(x¯1,,x¯k;s12,,sk2) using (16).

  2. (2)

    Compute the constrained MLEs, ξ^c and σic2, i=1,,k. Set μ^ic=ξ^czpσ^ic, i=1,,k.

  3. (3)

    Generate X¯iN(μ^ic,σ^ic2) and Si2σ^ic2χmi2mi, i=1,,k.

  4. (4)

    Compute the PB statistic Tξ(X¯1,,X¯k;S12,,Sk2) in (17).

  5. (5)

    Repeat steps 3 and 4 for a large number of times, say, 10,000.

  6. (6)

    The proportion of Tξ’s that are greater than Tξ is an estimate of the PB p-value for testing H0 in (1)

3. Type I Error Rates and Power Studies

To assess the statistical properties of the tests, we estimated the type I error rates and powers of the tests using Monte Carlo simulation. To estimate the error rates of the GVT, we first generated 10,000 sets of k samples from independent normal populations with assumed means and variances. For each set of samples, we used another 10,000 simulation runs to estimate the p-values using Algorithms 3.1 and 3.3. The PB test was evaluated similarly. To estimate the error rates of the DK MLRT, we used the estimates of μ^Λ and σ^Λ2 based on simulation with 1000 runs. The estimates of the type I error rates of the AJ MLRT are based on 10,000 simulation runs. The error rates and powers of (1) generalized variable test (GVT), (2) Abdollahnezhada and Jafari’s LRT (AJ MLRT), (3) Our new MLRT (DK MLRT) and (4) the parametric bootstrap test (PB test).

In Table 1, we reported the estimated sizes of all the tests for p=0.10,0.25,0.75,0.90, and k=3,5,10. We first observe from this table that the GVT test is conservative having type I error rates less than the nominal level 0.05 for almost all cases. In particular, the GVT is too conservative for k=10. The AJ MLRT and DK MLRT perform similar in controlling type I error rates around 0.05, except that the former test is little conservative for k=10. In general, we see that the DK MLRT controls the error rates around the nominal level for all sample size and parameter configurations considered in Table 1.

The powers of tests were also estimated for p=0.25,0.50,0.75, k=3,5,10 and for some values of sample sizes ranging from 5 to 20. The powers are presented in Table 2. Since the GVT is in general conservative, the powers of the GVT are smaller than those of the other tests in most cases. For example, see the powers of the GVT for the case of k=3. There is no clear-cut winner between the AJ-MLRT (2) and the DK-MLRT (3) for k5. In particular, the AJ-MLRT seems to have larger powers than the DK-MLRT when p=0.25 but smaller powers than DK-MLRT when p=0.75. These tests perform similar for the case p=0.5. However, for k=10, the DK-MLRT appears to have better power property than the AJ-MLRT for p=0.25,0.5 and 0.75. The DK-MLRT is preferable to the AJ-MLRT when the number of groups being compared is ten or more. The PB test (4) appears to be better than the AJ-MLRT and the DK-MLRT for k10 and p0.5. However, the PB test is much less powerful than the DK-MLRT when p=0.75. For the cases where p0.5, the PB test can be recommended for applications.

We now review the performances of the tests when p=0.5, that is, for testing equality of the normal means. The PB test simplifies to the one for testing the equality of the means given in [krishnamoorthy2007parametric]. The Monte Carlo simulation studies in their paper indicated that the PB test controls the type I error rates always very close to the nominal level. The PB test is better than the popular Welch test ([welch1951comparison]) for k10. All the tests control the type I error rates within the nominal level 0.05 for k10. The PB test is more powerful than the AJ MLRT for almost all the cases. The PB test is also somewhat more powerful than DK MLRT. To compare several normal means, the PB test is referrable to all other tests.

4. Simultaneous CIs for Pairwise Differences

Once the null hypothesis in (1) is rejected, then it is desired to find the population quantiles that are significantly different. This can be found by examining simultaneous confidence intervals for all possible pairs of differences ξiξj, i<j, where ξi=μi+zpσi, i=1,,k.

4.1. One-Sample Confidence Intervals

Before proceeding to develop simultaneous CIs, we need to choose an appropriate CI for a normal quantile. The classical CI, referred to as the noncentral t interval, is based on the pivotal quantity n(X¯ξ)/S which follows a noncentral t distribution (e.g., see Owen, 1968 and Section 5.3.1.1 of [lawless2011statistical]). Since the CI based on n(X¯ξ)/S and the one based on n(ξX¯)/S are the same, we consider the latter version of the pivotal quantity. Letting m=n1 and using the stochastic representations that X¯=dμ+ZσnandS2=dσ2U2, where ZN(0,1) independently of U2χm2/m distribution, we see that

(18) ξX¯S/n=dzpnZU=dtm(zpn),

where tm(δ) denotes the noncentral t random variable with degrees of freedom (df) m and the noncentrality parameter δ. To get the 2nd equation in (18), we used the result that Z and Z are identically distributed. On the basis of the above distributional result, the 12α CI for ηp is given by

(19) (X¯+tm;α(zpn)Sn,X¯+tm;1α(zpn)Sn),

where tm;α(δ) denotes the 100α percentile of tm(δ).

[chakraborti2007confidence] have proposed a pivotal quantity T based on the minimum variance unbiased estimator of ξ given by ξ^u=X¯+zpcnS,withcn=m/2Γ(m/2)/Γ(n/2). In Chapter LABEL:ch2, we have shown that the CI based on T is the same as the one in (19). To show this, we first note that the variance estimate of the estimator ξ^u is given by V^(ξ^u)=S2n(1+nzp2(cn21)), and the pivotal quantity T can be expressed as

(20) T=ξ(X¯+zpcnS)Sn1+nzp2(cn21)=n(ξX¯)/Szpncn1+nzp2(cn21).

Since T is a one-to-one function of the usual pivotal quantity n(ξX¯)/S, the CI based on T should be the same as the classical CI in (19). For more details, see Chapter LABEL:ch2.

[malekzadeh2023simultaneous] have proposed several methods of constructing pairwise CIs for ξiξj, i<j. Most of these CIs are based on the pivotal quantities of the type in (20). In view of the above discussion, we see that these simultaneous CIs based on the noncentral pivotal quantity (18) and those based on T in (20) are essentially the same.

4.2. FG Method for Simultaneous Pairwise CIs

This procedure is based on the fiducial approach given in [malekzadeh2023simultaneous], and these authors call these resulting CIs as fiducial generalized (FG) CIs. To describe this method, let ai=mi/2Γ(mi/2)/Γ(ni/2) and bi=1/ni+zp2(ai21), i=1,,k. Let ξ^iu=X¯i+zpaiSi so that E(ξ^iu)=ξi=μi+zpσi and var(ξ^iu)=bi2σ2. Let ξ^iju=ξ^iuξ^ju, i<j. Using the relation that tm(δ)tm(δ), it is not difficult to check that the FQ Ri for ξi given in their paper can be expressed as Fi=x¯i+tmi(zpni)sini. Let ξ^iju=ξ^iuξ^ju and Fij=FiFj, i<j. The difference ξ^ijuFij can be simplified as

(21) ξ^ijuFij=zpsi(ai1nitmi(zpni))zpsj(aj1njtmj(zpnj)),i<j.

Let TF=max1i<jk|ξ^ijuFijvij|, where vij=bisi2+bjsj2. Let t1αF denote the 100(1α) percentile of the conditional distribution of TF given (x¯i,si)’s. Then a 1α simultaneous CIs for ξij’s are given by

(22) ξ^iuξ^ju±t1αFbisi2+bjsj2,i<j.

We refer to the above CIs as FGU CIs, because they are based on unbiased estimates of ξi’s.

In view of our discussion in Section 4.1, we can develop pairwise CIs on the basis of the noncentral t pivotal quantity in (18) instead of the one in (20) based on ξ^u. Such pairwise simultaneous CIs are obtained from (21) and (22) by substituting ai=1 and bi=1/ni, i=1,,k, and the simultaneous CIs can be expressed as

(23) ξ^iξ^j±t1αFsi2ni+sj2nj,i<j,

where t1αF is the 100(1α) percentile of TF=max1i<jk|ξ^ijFijsi2/ni+sj2/nj|, and ξ^ijFij is the expression (21) with ai=1, i=1,,k. The above CIs, referred at as the FG CIs, are somewhat simpler than the ones in (22), because computation of them does not involve repeated calculation of the gamma function. Our simulation studies in Section 4.5 indicate that these pairwise CIs in (22) and those in (23) are very similar. Also, see Examples in Section 5. However, it seems to be prove theoretically.

Table 1. Sizes of the tests at α=0.05
p=0.1 p=0.25 p=0.75 p=0.90
(μ1,μ2,μ3)=(1,1,1)
(n1,,nk) (1) (2) (3) (4) (1) (2) (3) (4) (1) (2) (3) (4) (1) (2) (3) (4)
ξ=4 ξ=2 ξ=2 ξ=2
(5, 10, 11) .025 .055 .050 .049 .026 .050 .049 .053 .032 .051 .049 .051 .026 .056 .052 .048
(8, 9, 13) .030 .053 .049 .051 .028 .051 .052 .050 .033 .048 .050 .049 .024 .052 .050 .052
(9,20,5) .037 .056 .049 .051 .032 .053 .051 .051 .030 .048 .049 .054 .032 .051 .047 .050
(10,10,10) .023 .053 .049 .049 .033 .045 .049 .047 .038 .048 .048 .047 .022 .046 .048 .051
(10,50,100) .045 .049 .049 .052 .052 .049 .047 .047 .049 .051 .049 .049 .042 .053 .050 .048
(μ1,μ2,μ3)=(0,1,1)
ξ=3 ξ=4 ξ=3 ξ=2
(5,10,11) .024 .052 .050 .051 .028 .051 .051 .050 .037 .051 .049 .048 .029 .052 .047 .048
(8,9,13) .023 .051 .054 .051 .032 .044 .050 .048 .039 .045 .051 .053 .032 .054 .049 .053
(9,20,5) .030 .054 .050 .053 .034 .053 .046 .052 .043 .056 .052 .051 .037 .053 .049 .047
(10,10,10) .025 .053 .048 .049 .028 .049 .051 .046 .046 .049 .049 .048 .025 .049 .051 .050
(10,50,100) .043 .049 .051 .046 .042 .051 .048 .051 .055 .054 .050 .050 .044 .054 .047 .047
(μ1,μ2,μ3)=(0,1,2)
ξ=1 ξ=3 ξ=4 ξ=3
(5,10,11) .021 .053 .050 .047 .025 .045 .048 .048 .027 .053 .047 .047 .032 .053 .051 .049
(8,9,13) .032 .053 .045 .047 .034 .046 .049 .048 .033 .050 .052 .050 .029 .054 .053 .045
(9,20,5) .029 .059 .050 .051 .033 .055 .047 .054 .039 .045 .054 .048 .027 .050 .053 .049
(10,10,10) .033 .052 .048 .050 .029 .045 .051 .049 .032 .044 .047 .047 .024 .052 .047 .052
(10,50,100) .044 .049 .049 .051 .042 .051 .051 .052 .041 .050 .048 .051 .049 .050 .048 .053
(μ1,,μ5)=(1,1,1,1,1)
ξ=3 ξ=2 ξ=4 ξ=5
(10, 30, 50, 80, 100) .047 .052 .049 .049 .025 .047 .048 .048 .041 .047 .051 .048 .049 .054 .050 .053
(10, 10, 10, 10, 10) .030 .050 .047 .049 .034 .052 .049 .051 .025 .047 .047 .049 .032 .047 .050 .049
(5, 10, 15, 20, 30) .035 .056 .049 .048 .033 .053 .047 .047 .042 .056 .051 .052 .038 .050 .049 .050
(μ1,,μ5)=(1,0,1,0,1)
(10, 30, 50, 80, 100) .047 .051 .051 .056 .028 .047 .052 .051 .048 .047 .048 .053 .046 .048 .049 .047
(10, 10, 10, 10, 10) .025 .048 .049 .051 .030 .049 .051 .051 .031 .046 .049 .050 .031 .048 .053 .051
(5, 10, 15, 20, 30) .033 .052 .050 .053 .042 .053 .051 .052 .037 .053 .051 .051 .031 .051 .048 .050
(μ1,,μ5)=(2,1,3,2,3)
(10, 30, 50, 80, 100) .039 .049 .051 .051 .028 .049 .052 .045 .045 .048 .047 .053 .051 .049 .049 .047
(10, 10, 10, 10, 10) .024 .054 .051 .052 .030 .046 .053 .051 .050 .046 .046 .049 .032 .046 .051 .054
(5, 10, 15, 20, 30) .031 .049 .050 .053 .042 .053 .051 .047 .045 .053 .049 .047 .043 .057 .051 .049

1 – GVT; 2 – AJ MLRT; 3 – DK MLRT; 4 – PB test

Table 1 continued.
p=0.1 p=0.25 p=0.75 p=0.90 (μ1,,μ10)=(1,,1) (n1,,nk) (1) (2) (3) (4) (1) (2) (3) (4) (1) (2) (3) (4) (1) (2) (3) (4) ξ=3 ξ=2 ξ=7 ξ=5 𝐧𝟏 .008 .034 .052 .039 .015 .036 .051 .040 .018 .034 .051 .043 .009 .035 .041 .052 𝐧𝟐 .030 .043 .049 .049 .032 .044 .050 .051 .032 .048 .047 .049 .030 .046 .048 .052 𝐧𝟑 .016 .038 .050 .050 .025 .043 .036 .050 .031 .040 .051 .049 .021 .035 .048 .049 𝐧𝟒 .033 .044 .047 .049 .038 .043 .049 .051 .041 .043 .051 .051 .031 .040 .051 .049 (μ1,,μ10)=(1,0,1,0,1,0,1,0,1,0) 𝐧𝟏 .010 .033 .046 .037 .016 .039 .047 .044 .017 .038 .051 .041 .007 .035 .043 .052 𝐧𝟐 .030 .043 .048 .045 .041 .050 .052 .046 .037 .044 .047 .046 .032 .043 .047 .052 𝐧𝟑 .016 .037 .051 .053 .026 .042 .038 .051 .023 .042 .051 .053 .022 .034 .048 .048 𝐧𝟒 .031 .038 .047 .050 .045 .046 .049 .048 .038 .042 .051 .051 .032 .041 .051 .051 (μ1,,μ10)=(2,1,4,2,3,1,1,3,1,1) 𝐧𝟏 .008 .039 .047 .041 .014 .037 .052 .042 .017 .040 .051 .039 .022 .033 .043 .049 𝐧𝟐 .032 .048 .050 .048 .047 .046 .049 .048 .044 .051 .047 .047 .041 .042 .050 .048 𝐧𝟑 .016 .035 .051 .050 .024 .040 .051 .051 .022 .041 .051 .051 .032 .038 .052 .048 𝐧𝟒 .031 .042 .051 .048 .037 .044 .051 .050 .031 .044 .051 .051 .033 .040 .051 .050

NOTE: 𝐧𝟏=(5,,5); 𝐧𝟐=(15,,15); 𝐧𝟑=(4,4,4,5,5,5,10,10,10,10); 𝐧𝟒=(4,4,4,12,12,12,15,15,15,15)
1 – GVT; 2 – AJ MLRT; 3 – DK MLRT; 4 – PB test

Table 2. Powers of the tests for equality of quantiles at α=0.05
𝝈=(1,1,1)
p=0.25 p=0.5 p=0.75
(n1,,nk) (μ1,,μk) (1) (2) (3) (4) (1) (2) (3) (4) (1) (2) (3) (4)
(5, 10, 11) (1,1,1) .031 .048 .051 .051 .037 .048 .049 .053 .029 .048 .048 .052
(1.5,1,1) .075 .089 .092 .131 .085 .099 .108 .105 .040 .095 .110 .074
(2.5,1,1) .359 .383 .357 .501 .516 .563 .557 .608 .328 .559 .662 .543
(1,1,2.5) .680 .757 .702 .744 .855 .865 .870 .859 .716 .754 .791 .752
(1,3,1) .899 .923 .893 .929 .977 .986 .986 .981 .932 .953 .973 .949
(1,1,3) .915 .944 .917 .949 .982 .989 .990 .987 .944 .950 .964 .950
(10,10,10) (1,1,1) .032 .046 .052 .048 .042 .046 .048 .052 .032 .049 .048 .047
(1.5,1,1) .101 .135 .142 .142 .134 .152 .174 .174 .103 .134 .156 .146
(2.5,1,1) .699 .744 .710 .791 .856 .888 .890 .891 .775 .838 .880 .853
(1,1,2.5) .720 .735 .717 .795 .875 .883 .896 .897 .800 .843 .879 .854
(1,3,1) .930 .934 .911 .956 .989 .990 .989 .992 .969 .979 .986 .984
(1,1,3) .938 .934 .907 .956 .989 .989 .992 .993 .967 .978 .989 .989
(10,15,20) (1,1,1) .034 .049 .053 .052 .040 .049 .052 .050 .038 .051 .052 .052
(1.5,1,1) .150 .148 .149 .194 .160 .181 .203 .203 .124 .161 .179 .150
(2.5,1,1) .750 .781 .749 .842 .923 .930 .924 .941 .890 .935 .950 .928
(1,1,2.5) .963 .972 .962 .975 .991 .993 .994 .991 .938 .971 .976 .977
(1,3,1) .994 .994 .991 .999 .999 1 1 1 .949 .999 1 1
(1,1,3) .999 .999 .999 1 1 1 1 1 .999 .999 1 1
𝝈=(1,1,1,1,1)
(5,5,5,10,10) (1,1,1,1,1) .029 .052 .050 .048 .033 .047 .052 .051 .025 .049 .047 .047
(1.5,1,1.5,1,1) .069 .094 .111 .116 .078 .111 .132 .112 .037 .101 .118 .076
(2.5,1,1,1,1) .242 .260 .244 .357 .379 .413 .386 .483 .208 .460 .540 .360
(1,1,1,1,2.5) .489 .591 .504 .548 .714 .789 .762 .730 .590 .692 .808 .629
(1,3,1,1,1) .392 .412 .368 .560 .604 .639 .573 .708 .395 .752 .808 .627
(1,1,1,1,3) .799 .836 .747 .835 .940 .969 .954 .954 .883 .938 .975 .910
(5,10,11,13,15) (1,1,1,1,1) .033 .048 .051 .051 .034 .047 .050 .050 .032 .051 .053 .048
(1.5,1,1.5,1,1) .111 .142 .152 .164 .163 .177 .180 .167 .083 .159 .174 .128
(2.5,1,1,1,1) .277 .286 .264 .463 .434 .432 .442 .575 .274 .532 .605 .440
(1,1,1,1,2.5) .851 .853 .780 .841 .954 .960 .959 .958 .906 .933 .962 .912
(1,3,1,1,1) .895 .875 .797 .915 .983 .986 .974 .983 .975 .991 .996 .979
(1,1,1,1,3) .982 .983 .961 .983 .999 .999 .999 .999 .995 .998 .999 .995
(5,5,7,15,15) (1,1,1,1,1) .030 .051 .048 .049 .037 .048 .047 .054 .034 .052 .052 .054
(1.5,1,1.5,1,1) .094 .107 .122 .146 .109 .136 .160 .142 .056 .127 .141 .084
(2.5,1,1,1,1) .259 .278 .258 .399 .402 .437 .395 .513 .255 .499 .566 .390
(1,1,1,1,2.5) .772 .746 .760 .772 .919 .951 .936 .961 .828 .884 .938 .845
(1,3,1,1,1) .425 .428 .432 .592 .631 .667 .610 .737 .503 .798 .837 .643
(1,1,1,1,3) .967 .980 .942 .966 .999 .999 .998 .997 .985 .994 .998 .986

NOTE: 1 – GVT; 2 – AJ MLRT; 3 – DK MLRT; 4 – PB test

Table 2 continued.
𝝈=(1,,1); k=10 (μ4,,μ10)=(1,,1) p=0.25 p=0.5 p=0.75 (n1,,nk) (μ1,,μk) (1) (2) (3) (4) (1) (2) (3) (4) (1) (2) (3) (4) 𝐧𝟏 (μ1,μ2,μ3) (1,1,1) .014 .034 .049 .047 .023 .040 .050 .046 .020 .040 .050 .044 (1.5,2,1) .047 .085 .119 .122 .107 .139 .175 .148 .047 .117 .188 .109 (2.5,2,1) .145 .191 .249 .261 .280 .337 .371 .375 .112 .345 .506 .282 (2.5,2,3) .327 .420 .521 .536 .600 .709 .761 .726 .321 .685 .858 .602 (3.5,2,3) .519 .591 .681 .743 .823 .899 .914 .910 .566 .920 .982 .841 (5,1,4) .927 .902 .901 .982 .998 .994 .996 .999 .989 .999 1 .998 𝐧𝟐 (μ1,μ2,μ3) (1,1,1) .028 .042 .051 .048 .028 .047 .050 .050 .032 .041 .050 .051 (1.5,2,1) .073 .082 .114 .160 .102 .128 .147 .171 .046 .120 .173 .090 (2.5,2,1) .157 .147 .233 .318 .239 .278 .320 .400 .097 .322 .459 .225 (2.5,2,3) .323 .289 .461 .573 .542 .578 .689 .719 .236 .729 .942 .538 (3.5,2,3) .472 .420 .594 .729 .735 .776 .853 .897 .461 .928 .968 .760 (5,1,4) .772 .719 .782 .989 .963 .969 .974 .998 .906 .999 1 .993

NOTE: 𝐧𝟏=(5,,5); 𝐧𝟐=(4,4,4,7,7,7,8,9,10,11); NOTE: 1 – GVT; 2 – AJ MLRT; 3 – DK MLRT; 4 – PB test

4.3. Parametric Bootstrap Method

We shall now see the PB pairwise CIs developed in [malekzadeh2023simultaneous]. Let Ti=ξ^iuξibiSi, where ξ^iu’s, ai’s and bi’s are as defined in the preceding Section 4.2. Let ZiN(0,1) independently of Ui2=χmi2/mi, i=1,..,k. Then Ti is distributed as Ti=dZi/ni+zp(aiUi1)biUi. Define

(24) Tij=sibiUiTisjbjUjTjbisi2Ui2+bjsj2Uj2,i<j.

Let T=maxi<j|Tij| and let t1αB denote the 100(1α) percentile of T. Then

(25) ξ^iuξ^ju±t1αBbiSi2+bjSj2,i<j,

are 1α simultaneous CIs for ξiξj, i<j.

As in the preceding section, we can take ai=1 and bi=1/ni for i=1,,k. In this case ξ^iu becomes ξ^i=X¯i+zpSi and the Ti and Tij simplify to

ti=Zi+zpni(Ui1)Uiandtij=siniUitisjnjUjtjsi2niUi2+sj2njUj,i<j,

respectively. Let t1α denote the 100(1α) percentile of maxi<j|tij|. Then

(26) ξ^iξ^j±t1αsi2ni+sj2nj,i<j,

are 1α simultaneous CIs for ξiξj, i<j.

4.4. Bonferroni Simultaneous Fiducial CIs

On the basis of a normal approximation to the noncentral t distribution, in Chapter LABEL:ch2, we have developed a fiducial CI for the difference between two quantiles. This CI is not only simple, but also very satisfactory even for small samples. To describe their CI for ξiξj, let

(27) h(x;ni,pi)=cmizp+xcmi2ni+12mi(zp2x2n)cmi2x22miwithcmi=1+14mi.

For 0<α0.5, let

(28) Dij;αsizpcmisjzpcmjsi2[zpcmih(zα;ni,p)]2+sj2[zpcmjh(z1α;nj,p)]2,i<j,

where zα denotes the α quantile of the standard normal distribution. Furthermore, let

(29) Dij;1α=sizpcmisjzpcmj+si2[zpcmih(z1α;ni,p)]2+sj2[zpcmjh(zα;nj,p)]2,i<j.

The 100(12α)% approximate fiducial CI for ξiξj is given by

(30) (x¯ix¯j+Dij;α,x¯ix¯j+Dij;1α).

The simulation study in Chapter LABEL:ch2 indicated that the above CI is very satisfactory even for small sample sizes.

On the basis of the CI in (30), the 100(1α)% simultaneous Bonferroni CIs for all ξiξj, i<j, can be expressed as

(31) (x¯ix¯j+Dij;α,x¯ix¯j+Dij;1α),i<j,j=2,,k,

where α=αk(k1).

4.5. Coverage and Precision Studies

To judge the properties of the pairwise CIs and to compare them, we carried out simulation studies as follows. To estimate the fiducial generalized CIs FGU and FG, we first generated 10,000 samples from different normal distributions, then we used simulation consisting of 10,000 runs to find simultaneous CIs based on each set of samples. The percentage of the 10,000 simultaneous CIs that include ξiξj for all i<j, is a Monte Carlo estimate of the coverage probability. To understand and to compare the precisions of the different methods, we also estimates expected volume1/K, where K is the number of simultaneous CIs. The coverage probabilities and precisions of the PB simultaneous CIs are estimated similarly. In our simulation study, we take the parameter values (without loss of generality) as 𝝁=(1,,1) and 𝝈=(σ1,σ2,,σk) with σ1=1 and σj1, j=2,,k. Furthermore, as argued in Chapter LABEL:ch2, we can take p[0.5,1). This is because, if (Li,Ui) is a CI based on (X¯i,Si2,X¯j,Sj2) for ξi,pξj,p, then (Ui,Li) is the CI based on (X¯i,Si2,X¯j,Sj2) for ξi,1pξj,1p.

In Table 3, we present the coverage probabilities and volumes1/3 of FGU CIs based on unbiased estimators of ξi and FG CIs based on ξ^i=X¯i+zpSi. As expected, both estimated coverage probabilities and volumes1/3 in Table 3 clearly indicate that the FGU and FG CIs are very similar. Minor differences between them could be due to simulation errors. In Table 4, we compare the Bonferroni (BF), fiducial CIs based on ξ^ (FG) and the PB CIs. An examination of table values shows that the PB CIs are better than other CIs in terms of coverage probabilities and precisions. Their coverage probabilities are very close to the nominal level 0.90, and their averages of volumes1/3 are smaller than those of the other CIs. The fiducial (FG) CIs and the PB CIs are practically the same for larger sample sizes. Bonferroni CIs are conservative having coverage probabilities larger than the nominal level for all the cases considered. The expected volumes of the BF CIs are somewhat larger than those of the PB abd FG CIs, but not much larger. Considering the simplicity of the BF CIs, they can be used in applications when sample sizes are 30 or more.

Coverage probabilities and averages of the volumes1/6 of BF, FG and PB CIs for the case of k=4 are reported in Table 5. Note that, when k=4, there are six simultaneous CIs for the differences ξiξj, i<j. Performances and comparisons of these CIs are very similar as in the case of k=3. We once again see that the PB CIs are narrower than the other CIs for all cases.

Overall, we see that calculation of the PB CIs and the FG CIs involve simulation and they share similar computational difficulties. Nevertheless, the PB CIs are better than the FG CIs in terms coverage probability and precision. In fact, the PB CIs are preferable to other CIs for all cases. For large sample sizes or when the study involves a large number of groups, the BF CIs are straightforward to compute.

Table 3. Coverage probabilities and average of volume1/3 of 90% simultaneous fiducial CIs
k=3
𝐧=(10,10,10) 𝐧=(10,15,15)
𝝈 p=0.75 p=0.90 p=0.75 p=0.90
FGU CI FG CI FGU CI FG CI FGU CI FG CI FGU CI FG CI
(1,1,1) .931(2.31) .930(2.31) .946(2.95) .947(2.95) .926(1.99) .928(1.99) .934(2.46) .937(2.51)
(1,.9,.8) .931(2.11) .933(2.11) .947(2.64) .950(2.64) .921(1.79) .922(1.79) .937(2.29) .940(2.26)
(1,.9,.6) .931(1.96) .931(1.96) .939(2.46) .941(2.47) .926(1.71) .926(1.71) .926(2.13) .929(2.14)
(1,.9,.1) .899(1.73) .900(1.74) .898(2.17) .899(2.22) .897(1.51) .898(1.52) .899(1.92) .900(1.95)
(1,.5,.1) .904(1.34) .904(1.34) .900(1.68) .900(1.71) .902(1.18) .903(1.19) .905(1.49) .904(1.53)
(1,.1,.1) .918(0.83) .929(0.83) .924(1.05) .927(1.06) .900(0.73) .899(0.73) .918(0.94) .918(0.94)
𝐧=(20,15,25) 𝐧=(30,35,30)
(1,1,1) .919(1.58) .920(1.58) .920(1.94) .922(1.94) .909(1.18) .909(1.18) .919(1.48) .920(1.48)
(1,.9,.8) .918(1.41) .918(1.41) .925(1.78) .927(1.78) .910(1.07) .911(1.07) .914(1.32) .915(1.32)
(1,.9,.6) .911(1.35) .913(1.35) .927(1.69) .929(1.70) .913(1.00) .914(1.00) .917(1.22) .918(1.22)
(1,.9,.1) .904(1.23) .905(1.24) .899(1.53) .899(1.54) .904(0.87) .904(0.87) .900(1.07) .901(1.08)
(1,.5,.1) .900(0.94) .900(0.95) .895(1.15) .895(1.16) .903(0.68) .903(0.68) .905(0.83) .905(0.84)
(1,.1,.1) .908(0.55) .908(0.55) .912(0.68) .912(0.69) .906(0.42) .907(0.42) .913(0.53) .914(0.53)
Table 4. Coverage probabilities and average of volume1/3 of 90% simultaneous CIs
k=3
𝐧=(10,10,10)
𝝈 p=0.50 p=0.75 p=0.90
BF-CIs FG CIs PB CIs BF-CIs FG CIs PB CIs BF-CIs FG CIs PB CIs
(1,1,1) .926(2.08) .924(2.06) .897(1.93) .928(2.44) .930(2.32) .899(2.13) .929(2.44) .953(2.92) .906(2.57)
(1,.9,.8) .926(1.88) .915(1.83) .899(1.74) .928(2.20) .935(2.12) .903(1.92) .928(2.20) .958(2.70) .911(2.31)
(1,.9,.6) .924(1.75) .919(1.72) .901(1.62) .927(2.05) .925(1.93) .901(1.80) .929(2.05) .945(2.48) .897(2.18)
(1,.9,.1) .910(1.55) .901(1.50) .894(1.47) .918(1.80) .904(1.74) .895(1.71) .925(1.80) .905(2.27) .893(2.17)
(1,.5,.1) .911(1.20) .900(1.17) .885(1.13) .920(1.39) .908(1.35) .896(1.32) .926(1.39) .903(1.71) .891(1.67)
(1,.1,.1) .927(0.76) .914(0.73) .896(0.69) .930(0.87) .921(0.83) .897(0.78) .933(0.87) .927(1.06) .905(0.98)
𝐧=(10,15,15)
(1,1,1) .922(1.79) .918(1.76) .904(1.69) .923(2.07) .919(1.97) .905(1.87) .926(2.07) .944(2.51) .912(2.27)
(1,.9,.8) .922(1.63) .912(1.59) .906(1.54) .923(1.89) .924(1.81) .905(1.71) .926(1.89) .936(2.26) .905(2.08)
(1,.9,.6) .920(1.54) .911(1.50) .897(1.45) .924(1.77) .920(1.70) .903(1.62) .926(1.77) .932(2.14) .905(1.98)
(1,.9,.1) .911(1.39) .901(1.35) .903(1.34) .917(1.59) .896(1.54) .899(1.52) .925(1.59) .905(1.95) .905(1.95)
(1,.5,.1) .915(1.09) .901(1.04) .896(1.05) .921(1.25) .906(1.19) .897(1.19) .927(1.25) .905(1.53) .900(1.53)
(1,.1,.1) .926(0.70) .898(0.65) .904(0.65) .930(0.80) .919(0.75) .901(0.73) .934(0.80) .923(0.97) .904(0.91)
𝐧=(20,15,25)
(1,1,1) .920(1.43) .908(1.39) .896(1.36) .922(1.63) .915(1.56) .899(1.50) .923(1.63) .924(1.94) .898(1.82)
(1,.9,.8) .920(1.30) .917(1.26) .899(1.23) .921(1.47) .914(1.42) .903(1.37) .921(1.47) .925(1.78) .909(1.66)
(1,.9,.6) .920(1.23) .904(1.19) .904(1.17) .921(1.40) .908(1.34) .900(1.31) .923(1.40) .920(1.66) .904(1.60)
(1,.9,.1) .913(1.13) .905(1.10) .905(1.10) .919(1.28) .905(1.24) .902(1.23) .923(1.28) .900(1.55) .901(1.55)
(1,.5,.1) .915(0.86) .896(0.83) .903(0.83) .919(0.98) .904(0.93) .901(0.93) .923(0.98) .899(1.18) .904(1.18)
(1,.1,.1) .928(0.52) .900(0.48) .895(0.48) .930(0.58) .911(0.55) .902(0.54) .931(0.58) .905(0.68) .899(0.66)
𝐧=(30,35,30)
(1,1,1) .919(1.10) .907(1.06) .902(1.04) .918(1.23) .913(1.18) .899(1.16) .920(1.16) .918(1.47) .904(1.42)
(1,.9,.8) .919(0.99) .898(0.94) .899(0.93) .919(1.12) .910(1.07) .895(1.05) .918(1.05) .920(1.32) .904(1.27)
(1,.9,.6) .918(0.93) .895(0.88) .901(0.87) .920(1.05) .907(1.00) .900(0.98) .922(0.98) .912(1.23) .903(1.19)
(1,.9,.1) .918(0.84) .898(0.77) .898(0.77) .919(0.95) .891(0.86) .905(0.88) .923(0.88) .898(1.07) .906(1.08)
(1,.5,.1) .919(0.65) .905(0.61) .897(0.60) .922(0.73) .898(0.68) .905(0.68) .924(0.68) .909(0.84) .901(0.83)
(1,.1,.1) .928(0.41) .907(0.38) .895(0.37) .929(0.45) .910(0.43) .903(0.42) .931(0.42) .913(0.53) .902(0.52)
Table 5. Coverage probabilities and average of volume1/6 of 90% simultaneous CIs
k=4
𝐧=(10,10,10,10)
𝝈 p=0.50 p=0.75 p=0.90
BF-CIs FG CIs PB CIs BF-CIs FG CIs PB CIs BF-CIs FG CIs PB CIs
(1,1,1,1) .941(2.44) .917(2.22) .896(2.18) .943(2.92) .935(2.58) .910(2.42) .947(3.91) .952(3.27) .917(2.92)
(1,.9,.8,.9) .941(2.20) .912(2.03) .890(1.97) .944(2.63) .934(2.32) .909(2.19) .947(3.52) .949(2.95) .915(2.63)
(1,.9,.6,.4) .938(1.79) .910(1.70) .895(1.60) .942(2.13) .926(1.94) .896(1.81) .947(2.83) .943(2.52) .902(2.22)
(1,.9,.4,.1) .930(1.53) .903(1.48) .898(1.39) .938(1.80) .911(1.73) .905(1.62) .945(2.36) .910(2.25) .888(2.05)
(1,.5,.1,.1) .938(1.25) .906(0.93) .898(0.88) .945(1.46) .919(1.07) .906(1.02) .950(1.92) .919(1.40) .904(1.28)
(1,.1,.1,.1) .943(0.64) .914(0.59) .899(0.56) .947(0.75) .928(0.68) .906(0.64) .950(0.99) .944(0.87) .915(0.79)
𝐧=(10,15,10,15)
(1,1,1,1) .936(2.16) .905(2.01) .899(1.99) .940(2.56) .923(2.28) .905(2.20) .943(3.37) .944(2.89) .902(2.61)
(1,.9,.8,.9) .936(1.95) .904(1.79) .903(1.80) .939(2.30) .921(2.04) .909(1.98) .944(3.03) .939(2.60) .906(2.36)
(1,.9,.6,.4) .934(1.61) .915(1.53) .902(1.50) .937(1.89) .928(1.75) .906(1.67) .943(2.49) .927(2.21) .892(2.02)
(1,.9,.4,.1) .928(1.39) .905(1.33) .903(1.32) .935(1.62) .906(1.51) .903(1.50) .943(2.11) .912(1.97) .894(1.86)
(1,.5,.1,.1) .937(1.11) .905(0.83) .909(0.82) .941(1.28) .907(0.94) .907(0.92) .947(1.66) .917(1.22) .901(1.15)
(1,.1,.1,.1) .941(0.59) .904(0.53) .911(0.53) .945(0.69) .926(0.62) .907(0.59) .949(0.90) .932(0.79) .904(0.73)
𝐧=(20,15,20,25)
(1,1,1,1) .930(1.62) .902(1.54) .907(1.54) .933(1.86) .915(1.75) .907(2.60) .934(2.38) .922(2.16) .895(2.00)
(1,.9,.8,.9) .930(1.47) .901(1.38) .905(1.38) .931(1.68) .916(1.56) .901(2.34) .935(2.14) .926(1.93) .898(1.81)
(1,.9,.6,.4) .930(1.23) .907(1.20) .908(1.16) .932(1.41) .916(1.37) .892(2.03) .935(1.79) .918(1.69) .903(1.55)
(1,.9,.4,.1) .929(1.08) .903(1.05) .902(1.01) .933(1.22) .904(1.17) .901(1.89) .938(1.55) .905(1.51) .898(1.40)
(1,.5,.1,.1) .939(0.87) .907(0.65) .906(0.64) .941(0.99) .903(0.73) .902(1.16) .944(1.25) .911(0.92) .898(0.87)
(1,.1,.1,.1) .940(0.43) .915(0.40) .904(0.39) .942(0.49) .915(0.44) .903(0.73) .944(0.61) .918(0.55) .898(0.52)
𝐧=(30,35,25,30)
(1,1,1,1) .928(1.29) .895(1.22) .892(1.21) .930(1.46) .891(1.33) .898( 1.33) .929(1.83) .907(1.64) .897(1.62)
(1,.9,.8,.9) .927(1.15) .897(1.07) .892(1.08) .928(1.31) .891(1.19) .901( 1.19) .928(1.64) .907(1.48) .904(1.46)
(1,.9,.6,.4) .928(0.94) .905(0.90) .898(0.87) .929(1.06) .909(1.01) .902( 0.98) .932(1.32) .918(1.24) .905(1.20)
(1,.9,.4,.1) .928(0.80) .902(0.78) .892(0.74) .933(0.90) .904(0.87) .900( 0.84) .936(1.12) .904(1.07) .904(1.04)
(1,.5,.1,.1) .937(0.64) .901(0.48) .896(0.47) .939(0.72) .912(0.54) .895( 0.52) .942(0.90) .912(0.67) .906(0.65)
(1,.1,.1,.1) .938(0.34) .908(0.31) .896(0.31) .939(0.38) .909(0.35) .900( 0.34) .941(0.48) .907(0.43) .895(0.42)

5. Examples

Example 3.5.1. This example and the associated data were taken from [li2012comparison]. The hypothesis that Vitamin D protects against colon cancer emerged from a study by [garland1980sunlight]. The result of the study is appeared to support the hypothesis that there is a relationship between Vitamin D and Colorectal Cancer (CRC). However, the effects of Vitamin D supplementation on incidence and mortality of CRC remain inconclusive. To investigate further, a Vitamin D study was conducted in Roswell Park Cancer Center where CRC patients were given a 6-month treatment with vitamin D supplements. The purpose of the study was to find if the vitamin D supplement treatment could sufficiently increase the serum 1, 25-D3 and 24, 25-D3 levels at the end of the study period. Subjects were divided into three groups according to the baseline serum 25-D3 level of each subject, namely, (i) vitamin D3 deficient if serum 25-D3 level less than 20 ng/ml, (ii) insufficient if it was 20 and <32, and (iii) sufficient if 32. Tests for normality of the data by [li2012comparison] indicated that the data fit normal distributions. The summary statistics of serum 1, 25-D3 and 24, 25-D3 vitamin D3 metabolites Li et al. (2012) are reproduced here in Table 6.

Table 6. Descriptive statistics for the data from a vitamin D study conducted in Roswell Park Cancer Institute
   Group    Variable    Size (ni)    Mean (x¯i)    SD (si)
   Vitamin D3 sufficient    1, 25-D3    16    62.39    17.99
   24, 25-D3    17    4.65    1.98
   Vitamin D3 insufficient    1, 25-D3    22    72.60    23.52
   24, 25-D3    22    3.62    1.17
   Vitamin D3 deficient    1, 25-D3    9    70.13    19.67
   24, 25-D3    9    2.66    1.35

A hypothesis of interest here is that if all of these three vitamin D groups would reach the same serum 1, 25-D3 and 24, 25-D3 vitamin levels. [li2012comparison] have noted that a thorough understanding about how the distributions differ across three groups is to compare the quantiles, especially the common quantiles such as 1st quartile, median, and third quartile.

Table 7. The p-values for testing the equality of percentiles for the vitamin D study
serum level 1,25-D3 serum level 24,25-D3
p GVT PB AJ-MLRT DK-MLRT GVT PB AJ-MLRT DK-MLRT
.05 .973 .942 .851 .898 .297 .300 .276 .260
.10 .914 .886 .881 .844 .251 .235 .211 .174
.25 .653(.643) .642 .689 .615 .123(.123) .103 .092 .067
.50 .317(.306) .316 .320 .316 .032(.034) .027 .028 .025
.75 .197(.193) .198 .196 .212 .019(.024) .015 .008 .005
.90 .193 .194 .206 .208 .028 .019 .005 .003
.95 .211 .205 .227 .218 .031 .022 .005 .003

Note: The values in parentheses are the p-values of the GVT given in Table 7 of Li et al. (2012)

We estimated the p-values of the generalized variable test (GVT), the PB test and the MLRTs and reported them in Table 7. In serum level 1, 25-D3, no significant differences among the groups in terms of quartiles and medians. All the tests produced p-values larger than commonly used practical nominal levels. However, all the tests indicate that significant differences exist in medians (or means) and 3rd quartiles among groups for 24,25-D3. The DK-MLRT produced p-values that are smaller than the corresponding AJ-MLRT for testing all percentiles considered in the table. We also see that the GVT produced a larger p-value of 0.123 testing the equality of the first quartiles, because this test is conservative for most cases. All the tests, as indicated by our earlier simulation studies, produced similar p-values for testing the equality of group means (p=0.5) in both serum levels. In particular, all tests indicate that there is no significant difference among group means in 1,25-D3 and they provide evidence to conclude that the group means are significantly different in 24,25-D3 level.

We shall now compute various 95% simultaneous CIs for the differences ξ1,.7ξ2,.7, ξ1,.7ξ3,.7 and ξ2,.7ξ3,.7. We chose 70th percentiles so that we can compare our results with those given in Table 9 of [malekzadeh2023simultaneous]. These simultaneous CIs are given in Table 8. We first observe that three simultaneous CIs by all methods include zero, and so percentiles from these three groups are not significantly different. Furthermore, we notice that the FG and FGU simultaneous CIs are very similar with same volume1/3, and the PB and PBU CIs are in good agreement with practically the same volume1/3.

Table 8. 95% simultaneous pairwise CIs
Serum Vitamin Level 1,25-D3
Difference FGU FG PBU PB BF
ξ1,.7ξ2,.7 (-32.38, 6.19) (-32.44, 6.23) (-30.89, 4.69 ) (-30.88, 4.66 ) (-31.99, 5.86 )
ξ1,.7ξ3,.7 (-31.64, 14.06) (-31.44, 14.20) (-29.86, 12.29) (-29.83, 12.31) (-36.28, 13.02)
ξ2,.7ξ3,.7 (-19.39, 28.01) (-19.20, 28.18) (-17.55, 26.17) (-17.48, 26.16) (-23.67, 26.70)
critical value 2.67 2.87 2.47 2.64 —–
volume1/3 43.72 43.72 40.32 40.18 45.47
Serum Vitamin Level 24,25-D3
ξ1,.7ξ2,.7 (-0.09, 3.02) (-0.11, 3.02) (0.02, 2.91) (0.01, 2.90) (0.07, 3.12)
ξ1,.7ξ3,.7 (0.42, 4.21) (0.42, 4.22) (0.55, 4.08) (0.57, 4.08 ) (0.23, 4.22)
ξ2,.7ξ3,.7 (-0.63, 2.33) (-0.62, 2.35) (-0.53, 2.23) (-0.51, 2.24) (-0.98, 2.20)
critical value 2.68 2.88 2.49 2.67 —–
volume1/3 3.27 3.28 3.04 3.03 3.38

Example 3.5.2. Operating room anesthesia involves clinical and managerial decision making that relies on communication over periods of less than 5 minutes. [ledolter2011analysis] have noted that the latency data including times for anesthesia providers to respond to messages well described by lognormal models. These authors have used the generalized variable approach to compare several lognormal means based on data consisting of 472 messages from four groups.

Group 1: no prior message and the message was not anchored,
Group 2: no prior message and the message was anchored,
Group 3: with prior message and the message was not anchored, and
Group 4: with prior message and the message was anchored.

The summary statistics reported in Table 8 of [li2012comparison] are reproduced here in Table 9. In Table 10, we reported the p-values of the tests for the equality of 100p percentiles for several values of p ranging from 0.05 to 0.95. All the tests indicate that differences exist among percentiles for p0.30. As the GVT is conservative, it produced a little larger p-value than those of the other tests for testing the equality of medians.

Table 9. Descriptive statistics for the log time to acknowledge in [ledolter2011analysis]
Group Size Mean Q1 Median Q3 SD Min Max
1. No prior message, not anchored 245 0.117 -0.288 0.122 0.525 0.580 -1.715 1.564
2. No prior message, anchored 125 0.111 -0.248 0.068 0.470 0.607 -1.833 1.575
3. Prior message, not anchored 65 -0.019 -0.446 0.039 0.419 0.627 -1.427 1.515
4. Prior message, anchored 37 -0.112 -0.511 0.058 0.270 0.788 -2.040 1.358
Table 10. The estimated p-values for testing the equality of pth quantiles of the log time to acknowledge
  p   GVT   PB   AJ-MLRT   DK-MLRT   Welch
  .05   .030   .031   .008   .008
  .10   .028   .030   .010   .010
  .25   .042(.045)   .044   .027   .026   —
  .50   .182(.186)   .177   .180   .177   .177
  .75   .683(.680)   .659   .653   .648   —
  .90   .876   .875   .834   .872
  .95   .849   .860   .846   .873

Note: The values in parentheses are the p-values of the GVT given in Table 7 of Li et al. (2012)

We also constructed 95% pairwise CIs for all six differences of the 10th percentiles and reported them in Table 11. Since all the tests for equality indicated that the 10th percentiles are significantly different (see Table 10), we see that pairwise CIs due to all the methods indicate that the 10th percentiles of the groups 1 and 4 are significantly different at the level 0.05. The BF CIs also indicate that the groups 2 and 4 are somewhat different while other CIs indicate that they are not significantly different.

Table 11. 95% simultaneous pairwise CIs for the difference ξi,.10ξj,.10
Difference FGU FG PBU PB BF
ξ1,.10ξ2,.10 (-0.19, 0.27) (-0.19, 0.27) (-0.19, 0.27) (-0.19, 0.27) (-0.19, 0.29)
ξ1,.10ξ3,.10 (-0.11, 0.50 (-0.11, 0.50) (-0.11, 0.50) (-0.11, 0.50) (-0.09, 0.55)
ξ1,.10ξ4,.10 ( 0.02, 0.98) ( 0.01, 0.98) ( 0.02, 0.97) ( 0.02, 0.98) ( 0.07, 1.11)
ξ2,.10ξ3,.10 (-0.18, 0.49) (-0.18, 0.49) (-0.18, 0.49) (-0.18, 0.49) (-0.17, 0.53)
ξ2,.10ξ4,.10 (-0.04, 0.95) (-0.04, 0.95) (-0.04, 0.95) (-0.04, 0.96) ( 0.00, 1.07)
ξ3,.10ξ4,.10 (-0.23, 0.84) (-0.23, 0.83) (-0.24, 0.84) (-0.23, 0.84) (-0.22, 0.94)
critical value 2.60 3.53 2.61 3.54
volume1/6 0.67 0.67 0.76 0.76 0.80