Statistical formula for the cell chi-square test

Professional > Table scripting > Statistical tests > Cell chi-square test > Statistical formula for the cell chi-square test

For each cell, a two-by-two contingency table is established, with the actual cell from the cross table as the upper left cell, and the other cells derived by all three possible negations of the cross table cell:

	Total	Column	NOT column
Total	f	c	f-c
Row	r	x	r-x
NOT row	f-r	c-x	f-r-c+x

The two-by-two contingency table has one degree of freedom; the other three cells are determined when one of the four cells is known (x).

The following table shows the notation used in this topic.

Notation	Description
	The observed values.
	The expected values given the null hypothesis.

The first index points to the two-by-two table rows and the second index to the columns:

Null hypothesis

The deviation between the observed values O i,j and the expected values E i,j are not significant and are at random. That is the variables Row/NOT row and Column/NOT column are independent.

χ2 statistic

The null hypothesis is rejected if the p value for the χ2 statistic is smaller than the specified significance level. The p value is calculated from the chi-square distribution with 1 degree of freedom.

Yates' correction for small samples

In the underlying theoretical assumptions of the test there is an assumption of continuity that becomes dubious with very small samples and where some of the expected values are very small. In order to avoid a situation where rejection is done when assumptions of continuity are dubious the following correction is applied:

χ2 statistic with Yates' correction when at least one of the expected cell values

(E1,1, E1,2, E2,1, E2,2) < 5 :

The consequence of the correction may be that an uncorrected rejection is cancelled.