AP Statistics Curriculum 2007 NonParam 2MedianPair

From Socr

(Difference between revisions)

Revision as of 06:31, 24 February 2008

General Advance-Placement (AP) Statistics Curriculum - Differences of Medians of Two Paired Samples

Distribution-free (or non-parametric) statistical methods provide alternative to the (standard) parametric tests that we saw earlier, and are applicable when the distribution of the data is unknown.

Motivational Clinical Example

Whitley and Ball reported on the relative risk of mortality from 16 studies of septic patients. The outcome measure of interest was whether the patients developed complications of acute renal failure. The relative risk calculated in each study compared the risk of dying between patients with and without renal failure. A relative risk of 1.0 means no effect, and relative risk $\not= 1$ suggests beneficial or detrimental effect of developing acute renal failure in sepsis. The main goal of the study was to assess the cumulative evidence in these 16 studies to determine whether developing acute renal failure as a complication of sepsis impacts patient mortality. The data of this study is included below.

Study	Relative Risk	Sign (Relative Risk - 1)
1	0.75	-
2	2.03	+
3	2.29	+
4	2.11	+
5	0.80	-
6	1.50	+
7	0.79	-
8	1.01	+
9	1.23	+
10	1.48	+
11	2.45	+
12	1.02	+
13	1.03	+
14	1.30	+
15	1.54	+
16	1.27	+

We see the clear analogy of this study design to the paired or one-sample studies we saw before. However, if we were to plot these data (Relative Risk) we can see that their distribution is hardly symmetric, unimodal and bell-shaped (i.e., not Normal). Therefore, we can not use the paired T-test to test a Null-hypothesis that the mean Relative risk is 1 using this parametric test.

The Sign-Test

The sign test is a non-parametric alternative the one-sample and paired T-test. The sign test has no requirements for the data to be Normally distributed. It assigns a positive (+) or negative (-) sign to each observation according to whether it is greater or less than some hypothesized value. Then it measures the difference between the $\pm$ signs and how distinct is this difference from what we would expect to observe by chance alone. For example, if there were no effect of developing acute renal failure on the outcome from sepsis, about half of the 16 studies above would be expected to have a relative risk less than 1.0 (a "-" sign) and the remaining 8 would be expected to have a relative risk greater than 1.0 (a "+" sign). In the actual data, 3 studies had "-" signs and the remaining 13 studies had "+" signs. Intuitively, this difference of 10 appears large to be simply due to random variation. If so, the effect of developing acute renal failure would be significant on the outcome from sepsis.

Calculations

Suppose we fix a significance level of $α = 0.05$ . And consider the following two hypotheses:

H o :ν + = 8

(equivalent to

ν - = 8

): The effect of developing acute renal failure is not significant on the outcome from sepsis.

$H_1: \nu_+ \not=8$ : The effect of developing acute renal failure is significant on the outcome from sepsis.

Define the following test-statistics

B s = max N +, N -

, where

N +

and

N -

are the number of positive and negative signs, respectively.

Then the distribution of $B_s \sim Binomial(n=16, p=8/16=0.5)$ .

For our data, $B s = max N +, N - = max13,3 = 13$ and the probability that such binomial variable exceeds 13 is $P (B i n (16,0.5,13)) = 0.010635$ . Therefore, we can reject the null hypothesis $H o$ and regard as significant the effect of developing acute renal failure on the outcome from sepsis.

The Sign test using SOCR Analyses

It is much quicker to use SOCR Analyses to compute the statistical significance of the sign test. This SOCR Sign test activity may also be helpful in understanding how to use the sign test method in SOCR.

Example

A set of 12 identical twins are given psychological tests to determine whether the first born of the set tends to be more aggressive than the second born. Each twin is scored according to aggressiveness; a higher score indicates greater aggressiveness. Because of the natural pairing in a set of twins these data can be considered paired.

Twin-Index	1^st Born	2^nd Born	Sign
1	86	88	-
2	71	77	-
3	77	76	+
4	68	64	+
5	91	96	-
6	72	72	0 (Drop)
7	77	65	+
8	91	90	+
9	70	65	+
10	71	80	-
11	88	81	+
12	87	72	+

We first plot the data using the SOCR Line Chart. Visually there does not seem to be a strong effect of the order of birth on baby's aggression.

Next we can use the SOCR Sign Test Analysis to quantitatively evaluate the evidence to reject the null hypothesis that there is no birth-order effect on baby's aggressiveness.

Clearly the p-value reported is 0.274, and our data can not reject the null hypothesis.

References

Whitley, E. and Ball, J. (2002) Statistics review 6: Nonparametric methods. Critical Care, 6(6): 509–513.

SOCR Home page: http://www.socr.ucla.edu

Translate this page:

(default)	Deutsch	Español	Français	Italiano	Português	日本語	България	الامارات العربية المتحدة	Suomi	इस भाषा में	Norge
한국어	中文	繁体中文	Русский	Nederlands	Ελληνικά	Hrvatska	Česká republika	Danmark	Polska	România	Sverige

@@ Line 4: / Line 4: @@
 ==Motivational Clinical Example==
-[[AP_Statistics_Curriculum_2007_NonParam_2MedianPair#References | Whitley and Ball reported]] on the relative risk of mortality from 16 studies of septic patients. The outcome measure of interest was whether the patients developed complications of acute renal failure. The relative risk calculated in each study compared the risk of dying between patients ''with'' and ''without'' renal failure. A relative risk of 1.0 means no effect, and relative risk <math>\not= 1</math> suggests beneficial or detrimental effect of developing acute renal failure in sepsis. The main goal of the study was to assess the ''cummulative evidence'' in these 16 studies to determine whether developing acute renal failure as a complication of sepsis impacts patient mortality. The data of this study is included below.
+[[AP_Statistics_Curriculum_2007_NonParam_2MedianPair#References | Whitley and Ball reported]] on the relative risk of mortality from 16 studies of septic patients. The outcome measure of interest was whether the patients developed complications of acute renal failure. The relative risk calculated in each study compared the risk of dying between patients ''with'' and ''without'' renal failure. A relative risk of 1.0 means no effect, and relative risk <math>\not= 1</math> suggests beneficial or detrimental effect of developing acute renal failure in sepsis. The main goal of the study was to assess the ''cumulative evidence'' in these 16 studies to determine whether developing acute renal failure as a complication of sepsis impacts patient mortality. The data of this study is included below.
 <center>
-{| class="wikitable" style="text-align:center; width:75%" border="1"
+{| class="wikitable" style="text-align:center; width:35%" border="1"
 |-
 | '''Study''' || '''Relative Risk''' || '''Sign''' (Relative Risk - 1)
@@ Line 49: / Line 49: @@
 <center>[[Image:SOCR_EBook_Dinov_NonParam_SignTest_022308_Fig1.jpg|600px]]</center>
+==The Sign-Test==
+The '''sign test''' is a non-parametric alternative the [[AP_Statistics_Curriculum_2007_Infer_2Means_Dep | one-sample and paired T-test]]. The sign test has no requirements for the data to be Normally distributed. It assigns a positive (+) or negative (-) sign to each observation according to whether it is greater or less than some hypothesized value. Then it measures the difference between the <math>\pm</math> signs and how distinct is this difference from what we would expect to observe by chance alone. For example, if there were no effect of developing acute renal failure on the outcome from sepsis, about half of the 16 studies above would be expected to have a relative risk less than 1.0 (a "-" sign) and the remaining 8 would be expected to have a relative risk greater than 1.0 (a "+" sign). In the actual data, 3 studies had "-" signs and the remaining 13 studies had "+" signs. Intuitively, this difference of 10 appears large to be simply due to random variation. If so, the effect of developing acute renal failure would be significant on the outcome from sepsis.
-==Approach==
-TBD
-==Model Validation==
+===Calculations===
-TBD
+Suppose we fix a significance level of <math>\alpha= 0.05</math>. And consider the following two hypotheses:
-==Computational Resources: Internet-based SOCR Tools==
+: <math>H_o: \nu_+=8</math> (equivalent to <math>\nu_-=8</math>):  The effect of developing acute renal failure is not significant on the outcome from sepsis.
-TBD
+: <math>H_1: \nu_+ \not=8</math>:  The effect of developing acute renal failure is significant on the outcome from sepsis.
-==Examples==
+Define the following test-statistics
-TBD
+:<math>B_s = \max{N_+, N_-}</math>, where <math>N_+</math> and <math>N_-</math> are the number of positive and negative signs, respectively.
-==Hands-on Activities==
+Then the distribution of <math>B_s \sim Binomial(n=16, p=8/16=0.5)</math>.
-TBD
-<hr>
+For our data, <math>B_s = \max{N_+,N_-}=\max{13,3}=13</math> and the probability that such [[AP_Statistics_Curriculum_2007_Distrib_Binomial |binomial variable]] exceeds 13 is <math>P(Bin(16,0.5,13))=0.010635</math>. Therefore, we can reject the null hypothesis <math>H_o</math> and regard as significant the effect of developing acute renal failure on the outcome from sepsis.
+<center>[[Image:SOCR_EBook_Dinov_NonParam_SignTest_022308_Fig2.jpg|600px]]</center>
+==The Sign test using SOCR Analyses==
+It is much quicker to use [http://socr.ucla.edu/htmls/SOCR_Analyses.html SOCR Analyses] to compute the statistical significance of the sign test. This [[SOCR_EduMaterials_AnalysisActivities_TwoPairedSign | SOCR Sign test activity]] may also be helpful in understanding how to use the sign test method in SOCR.
+===Example===
+A set of 12 identical twins are given psychological tests to determine whether the ''first born'' of the set tends to be more aggressive than the ''second born''.  Each twin is scored according to aggressiveness; a higher score indicates greater aggressiveness. Because of the natural pairing in a set of twins these data can be considered paired.
+<center>
+{| class="wikitable" style="text-align:center; width:40%" border="1"
+|-
+| Twin-Index || 1<sup>st</sup> Born || 2<sup>nd</sup> Born || Sign
+|-
+| 1 || 86 || 88 || -
+|-
+| 2 || 71 || 77 || -
+|-
+| 3 || 77 || 76 || +
+|-
+| 4 || 68 || 64 || +
+|-
+| 5 || 91 || 96 || -
+|-
+| 6 || 72 || 72 || 0 (Drop)
+|-
+| 7 || 77 || 65 || +
+|-
+| 8 || 91 || 90 || +
+|-
+| 9 || 70 || 65 || +
+|-
+| 10 || 71 || 80 || -
+|-
+| 11 || 88 || 81 || +
+|-
+| 12 || 87 || 72 || +
+|}
+</center>
+We first plot the data using [[SOCR_EduMaterials_Activities_LineChart | the SOCR Line Chart]]. Visually there does not seem to be a strong effect of the order of birth on baby's aggression.
+<center>[[Image:SOCR_EBook_Dinov_NonParam_SignTest_022308_Fig3.jpg|600px]]</center>
+Next we can use the [[SOCR_EduMaterials_AnalysisActivities_TwoPairedSign | SOCR Sign Test Analysis]] to quantitatively evaluate the evidence to reject the null hypothesis that there is no birth-order effect on baby's aggressiveness.
+<center>[[Image:SOCR_EBook_Dinov_NonParam_SignTest_022308_Fig4.jpg|600px]]</center>
+Clearly the p-value reported is 0.274, and our data can not reject the null hypothesis.
 ==References==

AP Statistics Curriculum 2007 NonParam 2MedianPair

From Socr

Revision as of 06:31, 24 February 2008

Contents

Motivational Clinical Example

The Sign-Test

Calculations

The Sign test using SOCR Analyses

Example

References

Views

Personal tools

Navigation

Search

Toolbox