Journal of Statistical Theory and Applications

Volume 19, Issue 2, June 2020, Pages 212 - 222

Measure of Departure from Marginal Homogeneity for the Analysis of Collapsed Square Contingency Tables with Ordered Categories

Authors
Kouji Yamamoto1, *, Itsumi Iwama2, Sadao Tomizawa2
1Department of Biostatistics, Yokohama City University School of Medicine 3-9, Fukuura, Kanazawa-ku, Yokohama, Kanagawa, 236-0004, Japan
2Department of Information Sciences, Faculty of Science and Technology, Tokyo University of Science 2641, Yamazaki, Noda, Chiba, 278-8510, Japan
*Corresponding author. Email: kouji_y@yokohama-cu.ac.jp
Corresponding Author
Kouji Yamamoto
Received 2 February 2019, Accepted 20 October 2019, Available Online 26 May 2020.
DOI
10.2991/jsta.d.200507.001How to use a DOI?
Keywords
Collapsed table; Marginal homogeneity; Ordered category
Abstract

For square contingency tables with ordered categories, there would be some situations that one would like to analyze them by using collapsed 3×3 tables combining some adjacent categories in the original table. This paper considers the marginal homogeneity for collapsed tables and proposes a measure which represents the degree of departure from the marginal homogeneity. The proposed measure lies between 0 and 1, and it takes zero when the marginal homogeneity holds. Examples are given.

Copyright
© 2020 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

1. INTRODUCTION

Consider an r×r square contingency table with the same row and column ordinal classifications. Let pij denote the probability that an observation will fall in the ith row and jth column of the table (i=1,,r;j=1,,r), and let X and Y denote the row and column variables, respectively. The marginal homogeneity (MH) model is defined by

Pr(X=i)=Pr(Y=i)(i=1,,r),
namely
pi=pi(i=1,,r),
where pi=t=1rpit and pi=t=1rpti ([1], [2, p. 282]). This model is also expressed as
Pr(X=i|XY)=Pr(Y=i|XY)(i=1,,r),
namely
pic=pic(i=1,,r),
where
pic=(pipii)δ,pic=(pipii)δandδ=ijpij.

This indicates that the conditional row marginal distribution is identical to the conditional column marginal distribution under the condition that an observation will fall in one of the off-diagonal cells of the table.

We consider the (r1)(r2)2 (being r12) ways of collapsing the r×r original table with ordered categories into a 3×3 table by choosing cut points after the sth and tth rows and after the sth and tth columns for 1s<tr1. We define each collapsed 3×3 table as the Tst table (1s<tr1). This process means that r categories in the r×r original table get into 3 categories (for example, A, B and C) by dividing at the cut points s and t and then a Tst table (1s<tr1) has the new 3 categories. In a collapsed Tst table (1s<tr1), let Gij(s,t) indicate the corresponding probability for row value i (i=1,2,3) and column value j (j=1,2,3); that is,

G11(s,t)=i=1sj=1spij,G12(s,t)=i=1sj=s+1tpij,G13(s,t)=i=1sj=t+1rpij,G21(s,t)=i=s+1tj=1spij,G22(s,t)=i=s+1tj=s+1tpij,G23(s,t)=i=s+1tj=t+1rpij,G31(s,t)=i=t+1rj=1spij,G32(s,t)=i=t+1rj=s+1tpij,G33(s,t)=i=t+1rj=t+1rpij.

Then, the MH model is expressed as

Gi(s,t)=Gi(s,t)(i=1,2,3),
where
Gi(s,t)=j=13Gij(s,t),Gi(s,t)=j=13Gji(s,t),
for all s and t (1s<tr1); see [3].

When the MH model does not hold, we are interested in measuring the degree of departure from MH. For square contingency tables with nominal categories, Tomizawa and Makii [4] proposed a measure which expresses the degree of departure from MH. In addition, for square contingency tables with ordered categories, Tomizawa, Miyamoto and Ashihara [5] proposed the measure Γ(λ), which represents the degree of departure from MH. See Appendix A for the measure Γ(λ).

Moreover, when the MH does not hold, we are interested in measuring the degree of departure from MH for every collapsed table Tst (1s<tr1). The purpose of this paper is to propose a new measure which expresses the degree of departure from MH by adopting collapsed 3 ×3 tables. Section 2 considers such a measure and Section 3 gives an approximate variance and a confidence interval for the measure. Section 4 shows examples using the proposed measure. Section 5 gives the concluding remarks.

2. MEASURE OF DEPARTURE FROM MH FOR COLLAPSED 3 × 3 TABLES

Let for 1s<tr1,

δst=i=13j=1ij3Gij(s,t),
and
Gic(s,t)=1δstGi(s,t)Gii(s,t),Gic(s,t)=1δstGi(s,t)Gii(s,t),
πst(i)c=12Gic(s,t)+Gic(s,t)(i=1,2,3).

Assume that Gic(s,t)+Gic(s,t) are all positive. The MH model is also represented as

Gic(s,t)=Gic(s,t)=πst(i)c(i=1,2,3),
for 1s<tr1.

Consider a measure defined by

Ω(λ)=1r121s<tr1Ωst(λ)(λ>1),
where
Ωst(λ)=λ(λ+1)2(2λ1)Ist(λ){Gic(s,t),Gic(s,t)};{πst(i)c,πst(i)c},
Ist(λ)(;)=1λ(λ+1)i=13Gic(s,t)Gic(s,t)πst(i)cλ1+Gic(s,t)Gic(s,t)πst(i)cλ1,
and the value at λ=0 is taken to be continuous limit as λ0. Namely
Ω(0)=limλ0Ω(λ)=1r121s<tr1Ωst(0),
where
Ωst(0)=12log2Ist(0){Gic(s,t),Gic(s,t)};{πst(i)c,πst(i)c},
Ist(0)(;)=i=13Gic(s,t)logGic(s,t)πst(i)c+Gic(s,t)logGic(s,t)πst(i)c.

The submeasure Ωst(λ) (1s<tr1) describes the degree of departure from the MH model for the collapsed Tst table. Note that Ist(λ)Gic(s,t),Gic(s,t);πst(i)c,πst(i)c is the power-divergence between Gic(s,t),Gic(s,t) and πst(i)c,πst(i)c, and especially, Ist(0)() is the Kullback–Leibler information. Also, note that a real value λ (>1) is chosen by users. See [6] for the power-divergence.

Let for 1s<tr1,

G1(i)c(s,t)=Gic(s,t)Gic(s,t)+Gic(s,t),G2(i)c(s,t)=Gic(s,t)Gic(s,t)+Gic(s,t)(i=1,2,3).

Note that G1(i)c(s,t)+G2(i)c(s,t)=1. The MH model may be expressed as

G1(i)c(s,t)=G2(i)c(s,t)=12(i=1,2,3),
for 1s<tr1.

Then, Ωst(λ) (1s<tr1) may also be defined by

Ωst(λ)=λ(λ+1)2λ1i=13πst(i)cIst(i)(λ)Gk(i)c(s,t);12(λ>1),
where
Ist(i)(λ)(;)=1λ(λ+1)G1(i)c(s,t)G1(i)c(s,t)12λ1+G2(i)c(s,t)G2(i)c(s,t)12λ1,
and the value at λ=0 is taken to be continuous limit as λ0. Namely,
Ωst(0)=1log2i=13πst(i)cIst(i)(0)Gk(i)c(s,t);12,
Ist(i)(0)(;)=G1(i)c(s,t)logG1(i)c(s,t)12+G2(i)c(s,t)logG2(i)c(s,t)12.

Therefore, Ωst(λ) represents the weighted sum of the power-divergence Ist(i)(λ)Gk(i)c(s,t);12. Moreover, Ωst(λ) can be expressed as

Ωst(λ)=i=13πst(i)c1λ2λ2λ1Hst(i)(λ)Gk(i)c(s,t)(λ>1),
where
Hst(i)(λ)Gk(i)c(s,t)=1λ1G1(i)c(s,t)λ+1G2(i)c(s,t)λ+1,
and the value at λ=0 is taken to be continuous limit as λ0. Namely
Ωst(0)=i=13πst(i)c11log2Hst(i)(0)Gk(i)c(s,t),
where
Hst(i)(0)()=G1(i)c(s,t)logG1(i)c(s,t)G2(i)c(s,t)logG2(i)c(s,t).

Note that Hst(i)(λ)Gk(i)(s,t) is Patil and Taillie’s [7] diversity index of degree λ for G1(i)c(s,t),G2(i)c(s,t), which includes the Shannon entropy (when λ = 0). Therefore, Ωst(λ) represents the weighted sum of the diversity index Hst(i)(λ)({Gk(i)c(s,t)}). Note that for each λ>1,

  1. 0Hst(i)(λ)()(2λ1)(λ2λ),

  2. Hst(i)(λ)()=0 if and only if G1(i)c(s,t)=1 (then G2(i)c(s,t)=0) or G2(i)c(s,t)=1 (then G1(i)c(s,t)=0),

  3. Hst(i)(λ)() = (2λ1)(λ2λ) if and only if G1(i)c(s,t)=G2(i)c(s,t) =12, that is Gic(s,t) = Gic(s,t).

Thus, we conclude that the measure Ω(λ) lies between 0 and 1, and the submeasure Ωst(λ) also lies between 0 and 1. For each λ>1,

  • d.

    there is a structure of MH in the r×r table if and only if Ω(λ) = 0,

  • e.

    the degree of departure from MH in the r×r table is the largest, in the sense that G1(i)c(s,t)=1 (then G2(i)c(s,t)=0) or G2(i)c(s,t)=1 (then G1(i)c(s,t)=0) for i=1,2,3 and 1s<tr1 if and only if Ω(λ)=1,

and for fixed s and t (1s<tr1),
  1. f.

    there is a structure of MH in a collapsed 3×3 table Tst if and only if Ωst(λ)=0,

  2. g.

    the degree of departure from MH in a collapsed 3×3 table Tst is the largest, in the sense that G1(i)c(s,t)=1 (then G2(i)c(s,t)=0) or G2(i)c(s,t)=1 (then G1(i)c(s,t)=0) for i=1,2,3 if and only if Ωst(λ)=1.

3. APPROXIMATE CONFIDENCE INTERVAL FOR MEASURE

Let nij denote the observed frequency in the ith row and jth column of the r×r table (i=1,,r;j=1,,r). Assuming that a multinomial distribution applies to the table, we shall consider an approximate standard error and a large sample confidence interval for the measure Ω(λ), using the delta method. The sample version of Ω(λ), that is, Ω̂(λ), is given by Ω(λ) with {pij} replaced by {p̂ij}, where p̂ij=nijn and n=nij. Using the delta method, n(Ω̂(λ)Ω(λ)) has asymptotically (as n) a normal distribution with mean 0 and variance σ2[Ω(λ)]. See Appendix B for the details of σ2[Ω(λ)] and Appendix C for the details of the submeasure Ωst(λ).

Let σ̂2[Ω(λ)] denote σ2[Ω(λ)] with {pij} replaced by {p̂ij}. Then σ̂[Ω(λ)]n is an estimated approximate standard error for Ω̂(λ), and Ω̂(λ)±zp2σ̂[Ω(λ)]n is an appriximate 100(1p) percent confidence interval for Ω(λ), where zp2 is the percentage point from the standard normal distribution corresponding to a two-tail probability equal to p.

4. EXAMPLES

Consider the data in Table 1. These data describe the cross-classification of father’s and son’s occupational status categories in Japan and British.

Son's Status
Father’s Status (1) (2) (3) (4) (5) Total
(a) Japan (1) 29 43 25 31 4 132
(2) 23 159 89 38 14 323
(3) 11 69 184 34 10 308
(4) 42 147 148 184 17 538
(5) 42 176 377 114 298 1007

Total 147 594 823 401 343 2308

(b) British (1) 50 45 8 18 8 129
(2) 28 174 84 154 55 495
(3) 11 78 110 223 96 518
(4) 14 150 185 714 447 1510
(5) 3 42 72 320 411 848

Total 106 489 459 1429 1017 3500
Table 1

Cross-classification of father’s and his son’s social class in (a) Japan in 1975 [8, p. 151] and (b) British [2, p. 100].

Since the confidence intervals for Ω(λ) applied to the data in Tables 1a and 1b do not include zero for all λ (see Table 2), these would indicate that there is not the structure of MH in both tables.

Values of λ Ω̂(λ) Standard Error Confidence Interval
(a) For Table 1a
−0.5 0.2332 0.0135 (0.2067, 0.2596)
0.0 0.3463 0.0176 (0.3119, 0.3807)
1.0 0.4226 0.0190 (0.3854, 0.4598)
1.5 0.4277 0.0190 (0.3904, 0.4649)
2.0 0.4226 0.0190 (0.3854, 0.4598)

(b) For Table 1b
−0.5 0.0045 0.0016 (0.0013, 0.0076)
0.0 0.0075 0.0027 (0.0022, 0.0128)
1.0 0.0104 0.0037 (0.0031, 0.0176)
1.5 0.0106 0.0038 (0.0032, 0.0181)
2.0 0.0104 0.0037 (0.0031, 0.0176)
Table 2

Estimates of measure Ω(λ), approximate standard errors for Ω̂(λ) and approximate 95% confidence intervals for Ω(λ), applied to Tables 1a and 1b.

When the degrees of departure from MH in Tables 1a and 1b are compared using the confidence intervals for Ω(λ), it is greater in Table 1a than in Table 1b. Then, we can see that the degree of departure from MH is greater for Table 1a than for Table 1b. Therefore, the difference between the father’s classification distribution and his son’s distribution is greater in Japan than in British.

We shall further analyze the data in Tables 1a and 1b using the submeasure Ωst(λ) (1s<tr1). We see from Table 3 that for Table 1a, (i) the degree of departure from MH in the collapsed table T12 is the smallest and (ii) those in the other collapsed tables are greater than that in the T12 table. Thus, it is seen that (i) when we combine the categories (3) to (5) in Table 1a, the degree of departure from MH for the collapsed table is slightest, and (ii) when we combine the categories in the other patterns in Table 1a, those for the collapsed tables are greater than that for case (i).

Estimated Submeasures Values of λ For Table 1a Confidence Interval For Table 1b Confidence Interval
Ω̂12(λ) −0.5 0.0731 (0.0491, 0.0971) 0.0016 (−0.0014, 0.0047)
0.0 0.1200 (0.0818, 0.1581) 0.0028 (−0.0024, 0.0079)
1.0 0.1608 (0.1113, 0.2103) 0.0038 (−0.0033, 0.0109)
1.5 0.1643 (0.1139, 0.2146) 0.0039 (−0.0034, 0.0112)
2.0 0.1608 (0.1113, 0.2103) 0.0038 (−0.0033, 0.0109)

Ω̂13(λ) −0.5 0.2763 (0.2395, 0.3131) 0.0030 (−0.0008, 0.0067)
0.0 0.4166 (0.3679, 0.4654) 0.0050 (−0.0013, 0.0114)
1.0 0.5130 (0.4608, 0.5652) 0.0070 (−0.0018, 0.0157)
1.5 0.5193 (0.4672, 0.5714) 0.0071 (−0.0019, 0.0161)
2.0 0.5130 (0.4608, 0.5652) 0.0070 (−0.0018, 0.0157)

Ω̂14(λ) −0.5 0.3327 (0.2893, 0.3762) 0.0092 (0.0025, 0.0159)
0.0 0.4809 (0.4284, 0.5335) 0.0154 (0.0042, 0.0267)
1.0 0.5720 (0.5194, 0.6246) 0.0213 (0.0058, 0.0368)
1.5 0.5775 (0.5251, 0.6298) 0.0218 (0.0060, 0.0377)
2.0 0.5720 (0.5194, 0.6246) 0.0213 (0.0058, 0.0368)

Ω̂23(λ) −0.5 0.2211 (0.1915, 0.2507) 0.0024 (−0.0010, 0.0058)
0.0 0.3394 (0.2985, 0.3803) 0.0041 (−0.0016, 0.0098)
1.0 0.4256 (0.3796, 0.4717) 0.0056 (−0.0023, 0.0135)
1.5 0.4317 (0.3855, 0.4779) 0.0058 (−0.0023, 0.0139)
2.0 0.4256 (0.3796, 0.4717) 0.0056 (−0.0023, 0.0135)

Ω̂24(λ) −0.5 0.2008 (0.1743, 0.2273) 0.0058 (0.0014, 0.0103)
0.0 0.2925 (0.2596, 0.3255) 0.0098 (0.0023, 0.0173)
1.0 0.3528 (0.3177, 0.3879) 0.0135 (0.0033, 0.0238)
1.5 0.3569 (0.3217, 0.3920) 0.0139 (0.0033, 0.0244)
2.0 0.3528 (0.3177, 0.3879) 0.0135 (0.0033, 0.0238)

Ω̂34(λ) −0.5 0.2949 (0.2598, 0.3300) 0.0047 (0.0013, 0.0082)
0.0 0.4282 (0.3848, 0.4716) 0.0080 (0.0022, 0.0137)
1.0 0.5114 (0.4666, 0.5562) 0.0110 (0.0031, 0.0189)
1.5 0.5165 (0.4718, 0.5612) 0.0113 (0.0031, 0.0194)
2.0 0.5114 (0.4666, 0.5562) 0.0110 (0.0031, 0.0189)
Table 3

Estimates of submeasure Ωst(λ) applied to Tables 1a and 1b.

However, there is no possibility to decide in which collapsed table the degree of departure from MH is largest, because the values in the confidence intervals for them overlap each other except for Ω12(λ). So, we can get a conclusion, for Table 1a, that even if we get the original 5 categories into the 3 categories in any pattern to make us interpret the original table more easily, the marginal distributions for each collapsed table are different between father and his son’s social classes. Moreover, if we consider the original categories (1) as the high class, (2) as the middle class and (3) to (5) as the low class, social mobility between father and his son’s social classes is less than that for the other classification patterns. We also see from Table 3 that for Table 1b, the confidence intervals for the submeasures Ω12(λ), Ω13(λ) and Ω23(λ) include zero, on the other hand, those for the other submeasures do not include zero. These results mean that when we combine the categories (3) to (5) in Table 1b, the marginal distribution for father’s social class is equal to that for his son’s one. Note that we can also obtain similar interpretations when we combine the categories (2) to (3) and (4) to (5), and (1) to (2) and (4) to (5) in Table 1b. Thus, if we consider the original categories (1) as the high class, (2) as the middle class and (3) to (5) as the low class, we could not see that social mobility between father’s social class and his son’s one happened. Also note that we are able to get a similar interpretations when we consider combining the categories (2) to (3) and (4) to (5), and (1) to (2) and (4) to (5) in Table 1b. As a result, for Table 1b, if we regard only the category (5) as the low class, when we consider the same specified category between father and his son, the probability that fathers whose son’s social class is lower than him is different from the probability that sons whose father’s social class is higher than him.

We can also see from Table 4 that the degree of departure from MH is greater for Table 1a than for Table 1b by using the measure Γ(λ) proposed by Tomizawa et al. [5]. However, when we would like to analyze these data in more detail, for example, by using collapsed tables as described above, it is impossible to do it by using the measure Γ(λ). So, in such a situation, the proposed measure Ω(λ) would be useful.

Values of λ For Table 1a For Table 1b
−0.5 0.2730 0.0061
0.0 0.3990 0.0103
1.0 0.4799 0.0143
1.5 0.4850 0.0146
2.0 0.4799 0.0143
Table 4

Estimates of measure Γ(λ) applied to Tables 1a and 1b.

Furthermore, we can know more about the degree of departure from MH in Tables 1a and 1b. From Ω̂(λ), for example, with λ=1, (i) for Table 1a, the degree of departure from MH is estimated to be 42.26 percent of the maximum degree of departure from MH, and (ii) for Table 1b, the degree of departure from MH is estimated to be 1.04 percent of the maximum degree of departure from MH.

5. CONCLUDING REMARKS

The measure Ω(λ) always ranges from 0 to 1 independent of the dimension r and sample size n. Thus, it may be useful for comparing the degrees of departure from MH in several tables.

When the MH model does not hold, we are interested in (i) seeing what degree the departure from MH is for the original r×r table, (ii) seeing what degree the departure from MH is for Tst tables (1s<tr1) and (iii) seeing in which Tst table (1s<tr1) the degree of departure from MH is the largest. We recommend to use the proposed measure Ω(λ) for (i) and the proposed submeasure Ωst(λ) for (ii) and (iii). The submeasure may also be used in the case that you would like to just analyze one collapsed table. Note that the measure Ω(λ) is not invarianrt under the arbitrary permutations of row and column categories except the reverse order, so this measure should be applied only for ordinal data.

Consider the artificial data in Table 5. Let G2 indicate the likelihood ratio statistic for goodness-of-fit of MH model. Table 6a gives the values of G2 applied to these data. We shall compare the values of G2 for Tables 5a and 5b. We see that the value of G2 for Table 5a is greater than that for Table 5b. In contrast, for any fixed λ(>1), the value of Ω̂(λ) is greater for Table 5b than for Table 5a (see Table 6b). In terms of G^ic(s,t)/G^ic(s,t), (i=1,2,3, 1s<tr1) (see Table 5), it seems natural to conclude that the degree of departure from MH is less for Table 5a than for Table 5b. Therefore we recommend using Ω̂(λ) for comparing the degrees of departure from MH among several tables. To many readers, it might seem that G2n is also a reasonable measure for representing the degree of departure from MH. However, G2n is not such a measure for us. For instance, consider the artificial data in Tables 5b and 5c. The values of G2n are 0.0261 for Table 5b and 0.1284 for Table 5c. Therefore, the value of G2n is less for Table 5b than for Table 5c. On the other hand, for any fixed λ(>1), the value of Ω̂(λ) for Table 5b is equal to that for Table 5c (see Table 6b). Moreover, G^ic/G^ic, i=1,2,3 for Table 5b is identical to that for Table 5c (see Table 5). Therefore, it seems natural to get a conclusion that there are no differences between Tables 5b and 5c for the degree of departure from MH. As a result, Ω̂(λ) may also be more desirable to measure the degree of departure from MH than G2n.

(a) n = 2814
(1) (2) (3) (4) Total
(1) 251 266 37 42 596
(2) 140 329 271 98 838
(3) 72 76 224 189 561
(4) 32 20 310 457 819
Total 495 691 842 786 2814

Note:

G^1c(1,2)G^1c(1,2)=1.41,G^2c(1,2)G^2c(1,2)=1.41,G^3c(1,2)G^3c(1,2)=0.45G^1c(1,3)G^1c(1,3)=1.41,G^2c(1,3)G^2c(1,3)=0.79,G^3c(1,3)G^3c(1,3)=1.10G^1c(2,3)G^1c(2,3)=2.24,G^2c(2,3)G^2c(2,3)=0.55,G^3c(2,3)G^3c(2,3)=1.10

(b) n = 2906
(1) 687 14 20 10 731
(2) 95 278 9 31 413
(3) 45 35 898 11 989
(4) 24 13 30 706 773
Total 851 340 957 758 2906

Note:

G^1c(1,2)G^1c(1,2)=0.27,G^2c(1,2)G^2c(1,2)=2.18,G^3c(1,2)G^3c(1,2)=1.67G^1c(1,3)G^1c(1,3)=0.27,G^2c(1,3)G^2c(1,3)=2.36,G^3c(1,3)G^3c(1,3)=1.29G^1c(2,3)G^1c(2,3)=0.60,G^2c(2,3)G^2c(2,3)=1.54,G^3c(2,3)G^3c(2,3)=1.29

(c) n = 591
(1) 68 14 20 10 112
(2) 95 27 9 31 162
(3) 45 35 89 11 180
(4) 24 13 30 70 137
Total 232 89 148 122 591

Note:

G^1c(1,2)G^1c(1,2)=0.27,G^2c(1,2)G^2c(1,2)=2.18,G^3c(1,2)G^3c(1,2)=1.67G^1c(1,3)G^1c(1,3)=0.27,G^2c(1,3)G^2c(1,3)=2.36,G^3c(1,3)G^3c(1,3)=1.29G^1c(2,3)G^1c(2,3)=0.60,G^2c(2,3)G^2c(2,3)=1.54,G^3c(2,3)G^3c(2,3)=1.29

Table 5

Artificial data (n is sample size).

For Table 5a For Table 5b For Table 5c
(a) G2
114.778 75.94 75.94
(b) Ω̂(λ)
−0.5 0.0232 0.0653 0.0653
0.0 0.0386 0.1060 0.1060
λ 1.0 0.0525 0.1405 0.1405
1.5 0.0537 0.1434 0.1434
2.0 0.0525 0.1405 0.1405
Table 6

Values of G2 and Ω(λ) applied to Tables 5a, 5b and 5c.

Finally, the readers may be interested in considering a measure based on unconditional marginal probabilities (say, Γ(λ)), instead of the proposed measure Ω(λ) based on conditional marginal probabilities. The measure Γ(λ) takes the minimum value 0, when there is the structure of MH in the original table. On the other hand, we cannot define the maximum value, because it is impossible to define the structure of the furthest departure from MH. Thus, when the readers wants to consider the degree of departure from MH in addition to define the maximum departure from MH, we recommend using the measure Ω(λ). If not, each measure, Γ(λ) and Ω(λ), is enough to use.

CONFLICT OF INTEREST

Authors have no conflict of interest to declare.

APPENDIX A

The measure to represent the degree of departure from MH proposed by Tomizawa et al. [5], is given as follows: assuming that {G1(i)+G2(i)0},

Γ(λ)=λ(λ+1)2λ1I(λ)G1(i),G2(i);Qi,Qi(λ>1),
where
I(λ)(;)=1λ(λ+1)i=1r1G1(i)G1(i)Qiλ1+G2(i)G2(i)Qiλ1,
with
G1(i)=s=1it=i+1rpst,G2(i)=s=i+1rt=1ipst,Δ=i=1r1(G1(i)+G2(i)),
and
G1(i)=G1(i)Δ,G2(i)=G2(i)Δ,Qi=12G1(i)+G2(i)(i=1,,r1),
and the value at λ=0 is taken to be continuous limit as λ0.

APPENDIX B

Using the delta method, n(Ω̂(λ)Ω(λ)) has aymptotically variance σ2[Ω(λ)] as follows:

σ2Ω(λ)=k=1rl=1rpklΔkl(λ)2(λ>1),
where
Δkl(λ)=1r12s=1r2t=s+1r11δstAkl(λ)(s,t)Ωst(λ)Bkl(s,t),Akl(λ)(s,t)=i=13Ckl(i)1λ2λ2λ1Hst(i)(λ)Gk(i)c(s,t)+δstπst(i)cDkl(i)(λ)(λ0),i=13Ckl(i)11log2Hst(i)(0)Gk(i)c(s,t)+δstπst(i)cDkl(i)(0)(λ=0),Bkl=1i=13Ekl(i)(s,t),Ckl(i)(s,t)=12Fkl(i)(s,t)+Jkl(i)(s,t)2Ekl(i)(s,t),
Dkl(i)(λ)(s,t)=2λ(λ+1)2(2λ1)1δstπst(i)cG1(i)c(s,t)λFkl(i)(s,t)Ekl(i)(s,t)2Ckl(i)(s,t)G1(i)c(s,t)+G2(i)c(s,t)λJkl(i)(s,t)Ekl(i)(s,t)2Ckl(i)(s,t)G2(i)c(s,t)(λ0),12log21δstπst(i)clogG1(i)c(s,t)+1Fkl(i)(s,t)Ekl(i)(s,t)2Ckl(i)(s,t)G1(i)c(s,t)+logG2(i)c(s,t)+1Jkl(i)(s,t)Ekl(i)(s,t)2Ckl(i)(s,t)G2(i)c(s,t)(λ=0),Ekl(i)(s,t)=I(1ks,1ls)(i=1),I(s+1kt,s+1lt)(i=2),I(t+1kr,t+1lr)(i=3),Fkl(i)(s,t)=I(1ks)(i=1),I(s+1kt)(i=2),I(t+1kr)(i=3),Jkl(i)(s,t)=I(1ls)(i=1),I(s+1lt)(i=2),I(t+1lr)(i=3),
where I() is the indicator function.

APPENDIX C

Using the delta method, nΩ̂st(λ)Ωst(λ) (1s<tr1) has aymptotically variance σ2Ωst(λ) as follows:

σ2Ωst(λ)=1δst2i=13j=1ij3Gij(s,t)ηij(λ)(s,t)2δstΩst(λ)2(λ>1),
where
ηij(λ)(s,t)=12(2λ1)2λG1(i)c(s,t)λ+λG2(i)c(s,t)λG1(i)c(s,t)λG2(i)c(s,t)λ+G2(j)c(s,t)λλG1(j)c(s,t)λG1(j)c(s,t)λG2(j)c(s,t)λ2(λ0),112log2logG1(i)c(s,t)logG2(j)c(s,t)(λ=0).

REFERENCES

2.Y.M.M. Bishop, S.E. Fienberg, and P.W. Holland, Discrete Multivariate Analysis: Theory and Practice, The MIT Press, Cambridge, MA, USA, 1975.
4.S. Tomizawa and T. Makii, J. Stat. Res., Vol. 35, 2001, pp. 1-24.
8.K. Hashimoto, Gendai Nihon no Kaikyuu Kouzou (Class Structure in Modern Japan: Theory, Method and Quantitative Analysis), Toshindo Press, Tokyo, Japan, 1999. (in Japanese)
Journal
Journal of Statistical Theory and Applications
Volume-Issue
19 - 2
Pages
212 - 222
Publication Date
2020/05/26
ISSN (Online)
2214-1766
ISSN (Print)
1538-7887
DOI
10.2991/jsta.d.200507.001How to use a DOI?
Copyright
© 2020 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - JOUR
AU  - Kouji Yamamoto
AU  - Itsumi Iwama
AU  - Sadao Tomizawa
PY  - 2020
DA  - 2020/05/26
TI  - Measure of Departure from Marginal Homogeneity for the Analysis of Collapsed Square Contingency Tables with Ordered Categories
JO  - Journal of Statistical Theory and Applications
SP  - 212
EP  - 222
VL  - 19
IS  - 2
SN  - 2214-1766
UR  - https://doi.org/10.2991/jsta.d.200507.001
DO  - 10.2991/jsta.d.200507.001
ID  - Yamamoto2020
ER  -