SOCR Courses 2011 2012 Stat13 1 Lab1


Revision as of 23:05, 3 April 2013

Stats 13.1 - Laboratory Activity 1

Histogram Activity

This is an exploratory data analysis SOCR activity that illustrates how to generate and interpret the histogram of quantitative data. In a nutshell, a histogram of a dataset is a graphical visualization of the tabulated frequencies, or counts, of data within an equispaced partition of the range of the data. A histogram shows what proportion of the measurements falls into each of the categories (bins) defined by that partition.
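To make the definition concrete, here is a minimal Python sketch of the tabulation behind a histogram. It is not part of the SOCR applets: the function name `equispaced_histogram` and the small dataset are assumptions made up for illustration. The range of the data is split into equal-width bins, the values in each bin are counted, and the counts are divided by the sample size to obtain proportions.

```python
def equispaced_histogram(data, num_bins):
    """Count how many values fall into each of num_bins equal-width bins.

    Hypothetical helper for illustration; assumes the data contain at
    least two distinct values so that the bin width is positive.
    """
    lo, hi = min(data), max(data)
    if hi == lo:
        raise ValueError("data must contain at least two distinct values")
    width = (hi - lo) / num_bins
    counts = [0] * num_bins
    for x in data:
        # The maximum value is placed in the last bin rather than a new one.
        index = min(int((x - lo) / width), num_bins - 1)
        counts[index] += 1
    edges = [lo + i * width for i in range(num_bins + 1)]
    return edges, counts


# Made-up data, used only to show the tabulation.
data = [2, 3, 3, 5, 7, 8, 8, 8, 9, 10]
edges, counts = equispaced_histogram(data, 4)
proportions = [c / len(data) for c in counts]

print(edges)        # bin boundaries
print(counts)       # number of values per bin
print(proportions)  # share of the sample per bin
```

The bar heights of a histogram drawn from these data would be the values in `counts` (or in `proportions`, if the vertical axis shows relative frequency).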

Go to the SOCR Charts (use the Charts tab at the top of the page). Once the page comes up, go to the left side of the page and drag the gray bar to the right.

1. Histogram from Categories and Frequencies

  • In the area on the left, click the arrow next to Bar Charts then XYPlots. Then click on HistogramChartDemo3.
  • Click on the SNAPSHOT button and save a copy of this original histogram.
  • Click on the DATA tab to view the default data. Notice that the chart requires the user to enter the counts/frequencies of observations within each of the range categories (in this default data case, year).
  • Using the SHOW ALL tab you can see all three (graph, data and mapping) in the same view.
  • Try revising some of the numbers in the second (frequency) column and click the UPDATE CHART button to see the effect of these changes on the histogram. Change the frequency for at least 6 years.
  • Click on the GRAPH button to return to view just the histogram. Take a SNAPSHOT and print off your new histogram.

2. Simple Histogram from Raw Data

  • Click on HistogramChartDemo1.
  • Scroll down to find the Bin Size adjustment bar.
  • Change the bin size.

3. Histogram from Simulated Data

  • Let's first get some data: Go to the Modeler tab at the top of the page (It's best if you open a new page for this).
  • Click on the Data Generation button in the center of the screen. From the drop down bar choose the Normal Distribution and change the number of samples to 20 and the standard deviation to 100. Make sure the Raw Data box on the left is checked.
  • Hit Sample then click on the Data tab to see the data you generated.
  • Copy these data by highlighting them with the mouse and then using the Copy button at the top.
  • Go back to your histogram plots window and click on the DATA section. Wipe out the old data by pressing the CLEAR button and then click on the first cell under the column C1. Click on the PASTE button on the left to paste your new data into the window. Double click on 'C1' and relabel your column of data as 'Data', then hit the RETURN key on your keyboard.
  • Click UPDATE CHART. Go back to your histogram and take a SNAPSHOT of your new histogram.
  • Change the bin size of your histogram and take SNAPSHOTs of it with both a smaller and a larger bin size.
  • Redo the previous steps, but this time generate 150 samples.
  • Remember to take a SNAPSHOT of your new histogram with the original bin size, with a smaller bin size and with a larger bin size.
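
For readers who want to repeat the experiment outside the SOCR applets, the steps above can be roughly sketched in Python using only the standard library. This is an illustration, not the SOCR Modeler's own procedure. The mean of 0, the seed, the bin widths of 25, 50 and 100, and the helper names `simulate_normal`, `bin_counts` and `print_histogram` are all assumptions; the lab itself specifies only the Normal Distribution, the standard deviation of 100, and the sample sizes of 20 and 150.

```python
import math
import random
from collections import Counter


def simulate_normal(n, mean=0.0, sd=100.0, seed=None):
    """Draw n values from a normal distribution (the mean is an assumption)."""
    rng = random.Random(seed)
    return [rng.gauss(mean, sd) for _ in range(n)]


def bin_counts(data, bin_width):
    """Map the left edge of each non-empty bin to its count.

    Bins are [k * bin_width, (k + 1) * bin_width) for integer k.
    """
    counts = Counter(math.floor(x / bin_width) * bin_width for x in data)
    return dict(sorted(counts.items()))


def print_histogram(data, bin_width):
    """Print a text histogram; empty bins are skipped in this sketch."""
    for left, count in bin_counts(data, bin_width).items():
        print(f"[{left:7.1f}, {left + bin_width:7.1f}) {'*' * count}")


# Sample sizes follow the lab; bin widths and the seed are arbitrary choices.
for n in (20, 150):
    sample = simulate_normal(n, seed=13)
    for bin_width in (25, 50, 100):
        print(f"n = {n}, bin width = {bin_width}")
        print_histogram(sample, bin_width)
```

Changing the seed, or setting it to `None`, produces a different sample on each run, much like pressing Sample again in the Modeler.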

4. Questions

Answer each question fully, using appropriate terminology and referring to your snapshots where relevant.

1) What is the effect of the width/size of the histogram bin on the shape of the resulting histogram? Would the shape of the histogram change significantly if we alter the bin-size?

2) How do the sample size and bin size interact?

3) Would you expect the shape of the sample histogram to look like the shape of the population distribution the data sample came from?



