AP Statistics Curriculum 2007 Limits Norm2Bin

(Difference between revisions)
 Revision as of 04:55, 26 October 2009 (view source)IvoDinov (Talk | contribs) (added a link to the Problems set)← Older edit Current revision as of 02:40, 12 February 2011 (view source)IvoDinov (Talk | contribs) m (→Normal Approximation to Binomial Distribution) (One intermediate revision not shown) Line 2: Line 2: === Normal Approximation to Binomial Distribution=== === Normal Approximation to Binomial Distribution=== - Suppose [[AP_Statistics_Curriculum_2007_Distrib_Binomial | $Y\sim Binomial(n, p)$]] and $Y=Y_1+ Y_2+ Y_3+\cdots+ Y_n$, where [[AP_Statistics_Curriculum_2007_Distrib_Binomial | $Y_k~Bernoulli(p)$]] , $E(Y_k)=p$  & $Var(Y_k)=p(1-p)$. Then  $E(Y)=np$,  $Var(Y)=np(1-p)$ and $SD(Y)= \sqrt{np(1-p)}$. If we use the [[AP_Statistics_Curriculum_2007_Normal_Prob | Normal Standardization formula]] for ''Y'' we get $Z={Y-np\over \sqrt{np(1-p)}}$. + Suppose [[AP_Statistics_Curriculum_2007_Distrib_Binomial | $Y\sim Binomial(n, p)$]] and $Y=Y_1+ Y_2+ Y_3+\cdots+ Y_n$, where [[AP_Statistics_Curriculum_2007_Distrib_Binomial | $Y_k \sim Bernoulli(p)$]] , $E(Y_k)=p$  & $Var(Y_k)=p(1-p)$. Then  $E(Y)=np$,  $Var(Y)=np(1-p)$ and $SD(Y)= \sqrt{np(1-p)}$. If we use the [[AP_Statistics_Curriculum_2007_Normal_Prob | Normal Standardization formula]] for ''Y'' we get $Z={Y-np\over \sqrt{np(1-p)}}$. By [[AP_Statistics_Curriculum_2007_Limits_CLT | CLT]], $Z \sim N(0, 1)$ and $Y \sim N [\mu=np, \sigma^2={np(1-p)}]$. By [[AP_Statistics_Curriculum_2007_Limits_CLT | CLT]], $Z \sim N(0, 1)$ and $Y \sim N [\mu=np, \sigma^2={np(1-p)}]$. Line 38: Line 38: But we can also ''approximate'' this probability using the normal distribution.  We will need the mean and the standard deviation of this [[About_pages_for_SOCR_Distributions | normal distribution]].  These are But we can also ''approximate'' this probability using the normal distribution.  We will need the mean and the standard deviation of this [[About_pages_for_SOCR_Distributions | normal distribution]].  These are $\mu=np=80\frac{4}{52}=6.154$ and $\mu=np=80\frac{4}{52}=6.154$ and - $\sigma=\sqrt{80 \frac{4}{52}\frac{48}{52}}=2.383.$  Of course this can be obtained directly from the SOCR Binomial applet.  Now, all you need to do is to select the SOCR normal distribution applet and enter for the mean 6.154, and for the standard deviation 2.383.  To obtain the desire probability in the right cut-off box enter 7.5 (using the continuity correction for better approximation).  The approximate probability is $P(X \ge 8) \approx 0.2861$ (see figure below). + $\sigma=\sqrt{80 \frac{4}{52}\frac{48}{52}}=2.383.$  Of course this can be obtained directly from the SOCR Binomial applet.  Now, all you need to do is to select the SOCR normal distribution applet and enter for the mean 6.154, and for the standard deviation 2.383.  To obtain the desire probability in the right cut-off box, enter 7.5 (using the continuity correction for better approximation).  The approximate probability is $P(X \ge 8) \approx 0.2861$ (see figure below).
[[Image: SOCR_Activities_ExploreDistributions_Christou_figure10.jpg|600px]]
[[Image: SOCR_Activities_ExploreDistributions_Christou_figure10.jpg|600px]]

+ ===[[EBook_Problems_Limits_Norm2Bin|Problems]]=== ===[[EBook_Problems_Limits_Norm2Bin|Problems]]===

General Advance-Placement (AP) Statistics Curriculum - Normal Distribution as Approximation to Binomial Distribution

Normal Approximation to Binomial Distribution

Suppose $Y\sim Binomial(n, p)$ and $Y=Y_1+ Y_2+ Y_3+\cdots+ Y_n$, where Yk˜Bernoulli(p) , E(Yk) = p & Var(Yk) = p(1 − p). Then E(Y) = np, Var(Y) = np(1 − p) and $SD(Y)= \sqrt{np(1-p)}$. If we use the Normal Standardization formula for Y we get $Z={Y-np\over \sqrt{np(1-p)}}$.

By CLT, $Z \sim N(0, 1)$ and $Y \sim N [\mu=np, \sigma^2={np(1-p)}]$.

• Note: Normal approximation to Binomial is reasonable when p and (1-p) are NOT too small relative to n:
$np\geq 10$
n(1 − p) > 10

Example

The Roulette Game has 38 slots: 18 red, 18 black and 2 neutral. Suppose we play 100 games betting on red each time. Would observing 58 wins in the 100 games be considered atypical (suspicious)?

To answer this question, we need to compute the probability $P(Y\geq58)$, where $Y\sim Binomial(100, 0.47)$, as P(Win)=P(Red outcome) = 18/38=0.47.

Since $np=47 \geq 10$ and n(1 − p) = 53 > 10, Normal approximation is justified. Let $Z={(Y-np)\over \sqrt{np(1-p)}} ={(58-100*0.47)\over \sqrt{100*0.47*0.53}}=2.2$. Thus we can approximate $P(Y\geq 58) \approx P(Z\geq 2.2) = 0.0139$. The last equation can be directly computed using the SOCR Normal Distribution Calculator. You can also use the SOCR Binomial Distribution Calculator to compute the exact probability $P(Y\geq 58) = 0.0177$.

So why is the Normal approximation to Binomial distribution necessary in practice?

Binomial approximation by Normal distribution is useful when no access to the online SOCR resources is available (but we can use printed version of the Normal Table or when N is large and binomial probabilities are difficult to compute precisely!

Activities

Graph and comment on the shape of binomial with n = 20,p = 0.1 and n = 20,p = 0.9. Now, keep n = 20 but change p = 0.45. What do you observe now? How about when n = 80,p = 0.1. See the four figures below.

What is your conclusion on the shape of the Binomial distribution in relation to its parameters n,p? Clearly when n is large and p small or large the result is a bell-shaped distribution. When n is small (10-20) we still get approximately a bell-shaped distribution as long as $p \approx 0.5$. Because of this feature of the Binomial distribution we can approximate Binomial distributions using the normal distribution when the above requirements hold. Here is one example: Eighty cards are drawn with replacement from the standard 52-card deck. Find the exact probability that at least 8 aces are obtained. This can be computed using the formula $P(X \ge 8)=\sum_{x=8}^{80}(\frac{4}{52})^x (\frac{48}{52})^{80-x}=0.2725$.

Much easier we can use SOCR to compute this probability (see figure below).

But we can also approximate this probability using the normal distribution. We will need the mean and the standard deviation of this normal distribution. These are $\mu=np=80\frac{4}{52}=6.154$ and $\sigma=\sqrt{80 \frac{4}{52}\frac{48}{52}}=2.383.$ Of course this can be obtained directly from the SOCR Binomial applet. Now, all you need to do is to select the SOCR normal distribution applet and enter for the mean 6.154, and for the standard deviation 2.383. To obtain the desire probability in the right cut-off box, enter 7.5 (using the continuity correction for better approximation). The approximate probability is $P(X \ge 8) \approx 0.2861$ (see figure below).