AP Statistics Curriculum 2007 Contingency Indep

From Socr


Revision as of 02:07, 4 March 2008

General Advanced-Placement (AP) Statistics Curriculum - Contingency Tables: Independence and Homogeneity

Contingency Tables: Independence and Homogeneity

The chi-square test may also be used to assess independence and association between variables.

Motivational example

Suppose 200 randomly selected cancer patients were asked whether their primary diagnosis was brain cancer and whether they owned a cell phone before their diagnosis. The results are presented in the table below.

Suppose we want to analyze the association, if any, between brain cancer and cell phone use. The 2x2 table below lists two possible outcomes for each variable (each variable is dichotomous). We have the following population parameters:

P(CP|BC) = true probability of owning a cell phone (CP) given that the patient had brain cancer (BC). This probability may be estimated by the sample proportion $\hat{P}(CP|BC) = 18/25 = 0.72$.

P(CP|NBC) = true probability of owning a cell phone given that the patient had another cancer (NBC), which is estimated by the sample proportion $\hat{P}(CP|NBC) = 80/175 \approx 0.46$.

		Brain cancer
		Yes	No	Total
Cell Phone Use	Yes	18	80	98
	No	7	95	102
	Total	25	175	200

Does it seem like there is an association between brain cancer and cell phone use? Of the 25 brain cancer patients, 18 owned a cell phone before their diagnosis, so 18/25 = 0.72 is the estimated probability of owning a cell phone given that the patient has brain cancer.

Of the 175 patients with other cancers, 80 owned a cell phone before their diagnosis, so 80/175 ≈ 0.46 is the estimated probability of owning a cell phone given that the patient has another cancer.
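The two sample proportions quoted above can be recomputed directly from the table counts. The following is a minimal Python sketch; the variable names are illustrative and are not part of the SOCR materials.

```python
# Observed counts from the 2x2 table (cell phone ownership by cancer type).
cp_and_bc = 18     # owned a cell phone, brain cancer
cp_and_nbc = 80    # owned a cell phone, other cancer
bc_total = 25      # all brain cancer patients
nbc_total = 175    # all patients with other cancers

# Sample estimates of the conditional probabilities P(CP|BC) and P(CP|NBC).
p_cp_given_bc = cp_and_bc / bc_total
p_cp_given_nbc = cp_and_nbc / nbc_total

print(f"Estimated P(CP|BC)  = {p_cp_given_bc:.2f}")
print(f"Estimated P(CP|NBC) = {p_cp_given_nbc:.2f}")
```

The gap between the two proportions is only descriptive; whether it is larger than chance variation would produce is the question the chi-square test addresses.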

Calculations

The hypotheses:

$H_o$: there is no association between variable 1 and variable 2 (independence)

$P(BC|CP)=P(BC)$, that is, brain cancer (BC) is independent of cell phone (CP) usage.

$H_a$: there is an association between variable 1 and variable 2 (dependence)

$P(BC|CP)={P(BC \cap CP) \over P(CP) } \not= P(BC).$
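To make the two hypotheses concrete, the sample versions of $P(BC|CP)$ and $P(BC)$ can be compared using the counts in the motivational table. This is only an illustrative Python sketch with hypothetical variable names; sample proportions will rarely be exactly equal even when the null hypothesis holds, so the comparison does not by itself decide between the hypotheses.

```python
# Counts taken from the 2x2 table in the motivational example.
bc_and_cp = 18     # brain cancer patients who owned a cell phone
cp_total = 98      # all cell phone owners
bc_total = 25      # all brain cancer patients
grand_total = 200  # all patients in the sample

# Sample estimate of P(BC|CP) = P(BC and CP) / P(CP).
p_bc_given_cp = (bc_and_cp / grand_total) / (cp_total / grand_total)

# Sample estimate of the marginal probability P(BC).
p_bc = bc_total / grand_total

print(f"Estimated P(BC|CP) = {p_bc_given_cp:.3f}")
print(f"Estimated P(BC)    = {p_bc:.3f}")
```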

Test statistics:

The test statistic:

$\chi_o^2 = \sum_{\text{all categories}}{(O-E)^2 \over E} \sim \chi_{(df)}^2$, where O and E are the observed and expected cell counts and df = (# rows - 1)(# columns - 1).

Expected cell counts can be calculated by

$E = {(\text{row total})(\text{column total}) \over \text{grand total}}$
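The expected-count formula and the test statistic can be applied to the motivational table as follows. This is a minimal Python sketch using only the standard library; the names `observed`, `expected`, and `chi_sq` are illustrative choices, not SOCR identifiers.

```python
# Observed counts: rows = cell phone use (Yes, No),
# columns = brain cancer (Yes, No).
observed = [[18, 80],
            [7, 95]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

# Expected count for each cell: (row total)(column total) / grand total.
expected = [[r * c / grand_total for c in col_totals] for r in row_totals]

# Chi-square statistic: sum of (O - E)^2 / E over all cells.
chi_sq = sum((o - e) ** 2 / e
             for obs_row, exp_row in zip(observed, expected)
             for o, e in zip(obs_row, exp_row))

# Degrees of freedom: (# rows - 1)(# columns - 1).
df = (len(observed) - 1) * (len(observed[0]) - 1)

print("Expected counts:", expected)
print(f"Chi-square statistic = {chi_sq:.3f}, df = {df}")
```

For this table the expected counts are 12.25, 85.75, 12.75, and 89.25, and the statistic is approximately 6.048 with 1 degree of freedom.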

P-values and critical values for the Chi-Square distribution may be easily computed using SOCR Distributions.
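As a cross-check on a value read from SOCR Distributions, the p-value for a 2x2 table can also be sketched in Python. The example below relies on the identity $P(\chi_{(1)}^2 > x) = \text{erfc}(\sqrt{x/2})$, which holds only for 1 degree of freedom. The function name `chi2_sf_df1` is hypothetical, the statistic 6.048 is the rounded value obtained by applying the formulas above to the motivational table, and the significance level of 0.05 is an assumption not stated in the original text.

```python
import math

def chi2_sf_df1(x):
    """Upper-tail probability P(X > x) for a chi-square variable with df = 1.

    Valid only for 1 degree of freedom (e.g. a 2x2 contingency table).
    """
    return math.erfc(math.sqrt(x / 2.0))

# Rounded statistic for the motivational 2x2 table.
chi_sq = 6.048

p_value = chi2_sf_df1(chi_sq)

# Assumed significance level (not given in the source text).
alpha = 0.05
reject_null = p_value < alpha

print(f"p-value = {p_value:.4f}")
print("Reject independence at alpha = 0.05:", reject_null)
```

Under these assumptions the p-value is roughly 0.014, so the hypothesis of independence would be rejected at the 0.05 level. For tables with more than one degree of freedom this shortcut does not apply, and a general chi-square calculator such as SOCR Distributions is needed.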

Results:

SOCR Chi-square Calculations:

Examples

Applications

References

TBD

SOCR Home page: http://www.socr.ucla.edu
