SOCR Data

From Socr

(Difference between revisions)
Jump to: navigation, search
(added the Monthly US Economics data including monetary-base data, interest, CPI, S&P, Unemployment, Inflation, etc. (1959-2009))
(29 intermediate revisions not shown)
Line 13: Line 13:
The following collections include a number of real observed datasets from different disciplines, acquired using different techniques and applicable in different situations.
The following collections include a number of real observed datasets from different disciplines, acquired using different techniques and applicable in different situations.
-
* [http://gcmd.nasa.gov/ Climate Change Data]
+
=== [http://gcmd.nasa.gov/ Climate Change Data]===
-
** [[SOCR_Data_Dinov_042108_Antarctic_IceThicknessMawson | Antarctic Ice Thickness at Mawson, Davis and Casey (01/Apr/1954 to 15/Jan/2002). Number of data points is 1636.]]
+
* [[SOCR_Data_Dinov_042108_Antarctic_IceThicknessMawson | Antarctic Ice Thickness at Mawson, Davis and Casey (01/Apr/1954 to 15/Jan/2002). Number of data points is 1636.]]
-
** [[SOCR_Data_Dinov_071108_OilGasData | Energy Resources, Production and Consumption Dataset]]
+
* [[SOCR_Data_Dinov_071108_OilGasData | Energy Resources, Production and Consumption Dataset]]
-
** [[SOCR_Data_Dinov_121608_OzoneData | California Ozone Data (1980-2006)]]
+
* [[SOCR_Data_121608_OzoneData | California Ozone Data (1980-2006)]]
-
** [http://www.stat.ucla.edu/~nchristo/statistics_c173_c273/ca_ozone.txt CA Ozone - 08/08/2005]
+
* [[SOCR_Data_121608_CA_US_OzoneData | California and US Ozone Data Snapshot]]
-
* Population Data
+
 
-
** [[SOCR_Data_Dinov_020108_HeightsWeights | 25,000 Records of Human Heights (in) and Weights (lbs)]]
+
=== Population Data===
-
** [[SOCR_Data_Dinov_011709_PRB_Data | Population Data by Country 2000-2006]]
+
* Human Height and Weight data
-
** [[SOCR_Data_LA_Neighborhoods_Data | Los Angeles City Neighborhoods Data (from US Census)]]
+
** [[SOCR_Data_Dinov_020108_HeightsWeights | 25,000 Records of (Adolescent) Human Heights (in) and Weights (lbs)]]
-
* Economy Data
+
** [[SOCR_Data_MLB_HeightsWeights | Major League Baseball Player Height and Weight Data]]
-
** Consumer Price Index (CPI)
+
* [[SOCR_Data_Dinov_011709_PRB_Data | Population Data by Country 2000-2006]]
-
*** [[SOCR_Data_Dinov_021808_ConsumerPriceIndex | Consumer Price Index (1981-2006) - Fuel and Food Data]]
+
* [[SOCR_Data_LA_Neighborhoods_Data | Los Angeles City Neighborhoods Data (from US Census)]]
-
*** [[SOCR_Data_Dinov_021808_ConsumerPriceIndex3Way | Consumer Price Index (1981-2007) - One-, Two- or Three-Way ANOVA Data by items, months and years]]
+
* [[SOCR_Data_2008_World_CountriesRankings | Ranking of the top 100 Countries in the World based on political, economic, health, and quality-of-life factors]]
-
*** [[SOCR_Data_Dinov_010309_HousingPriceIndex | Housing Price Index (2000-2006) (motion charts)]]
+
 
-
*** [[SOCR_Data_Dinov_091609_SnP_HomePriceIndex | S&P Home Price Index (1991-2009) (motion charts)]]
+
=== Economic, Business and Stock Market Data===
-
** [http://www.eoddata.com Stock Market Data]
+
==== Consumer Price Index (CPI)====
-
*** [[SOCR_Data_Dinov_070108_JAVA | Sun Microsystems (Java) Stock price (2007-2008)]]
+
* [[SOCR_Data_Dinov_021808_ConsumerPriceIndex | Consumer Price Index (1981-2006) - Fuel and Food Data]]
-
*** [[SOCR_Data_Dinov_070108_SP500_0608 | S&P 500 (2007-2008)]]
+
* [[SOCR_Data_Dinov_021808_ConsumerPriceIndex3Way | Consumer Price Index (1981-2007) - One-, Two- or Three-Way ANOVA Data by items, months and years]]
-
*** [[SOCR_Data_Dinov_101709_USEconomy | US Economy by Sectors (1997-2007) and 2007-2009 Recession Data]]
+
* [[SOCR_Data_Dinov_010309_HousingPriceIndex | Housing Price Index (2000-2006) (motion charts)]]
-
** Monetary-Base Data
+
* [[SOCR_Data_Dinov_091609_SnP_HomePriceIndex | S&P Home Price Index (1991-2009) (motion charts)]]
-
*** [[SOCR_Data_MonetaryBase1959_2009 | US Federal Reserve monetary-base data (1959-2009)]]
+
==== [http://www.eoddata.com Stock Market Data]====
-
*** [[SOCR_Data_MonetaryBaseStocksInterest1959_2009 | Monthly US Economics data including monetary-base data, interest, CPI, S&P, Unemployment, Inflation, etc. (1959-2009)]]
+
* [[SOCR_Data_Dinov_070108_JAVA | Sun Microsystems (Java) Stock price (2007-2008)]]
 +
* [[SOCR_Data_Dinov_070108_SP500_0608 | S&P 500 (2007-2008)]]
 +
* [[SOCR_Data_Dinov_101709_USEconomy | US Economy by Sectors (1997-2007) and 2007-2009 Recession Data]]
 +
* [[SOCR_Data_Fortune500_1955_2008 | Ranking, Profits and Income of Fortune500 Companies (1955-2008) Dataset]]
 +
==== Monetary-Base Data====
 +
* [[SOCR_Data_MonetaryBase1959_2009 | US Federal Reserve monetary-base data (1959-2009)]]
 +
* [[SOCR_Data_MonetaryBaseStocksInterest1959_2009 | Monthly US Economics data including monetary-base data, interest, CPI, HPI, S&P, Unemployment, Inflation, etc. (1959-2009)]]
 +
* [[SOCR_Data_WorldInflation2002_2012 | Monthly Monetary Inflation for Several Countries (2002-2012)]]
 +
 
 +
==== Budgets and Deficits Data====
 +
* [[SOCR_Data_US_BudgetsDeficits_1849_2016 | US Federal Budget and Deficit data (1849-2016)]]
 +
====Sector Data, Population Perception Trends data====
 +
* [[SOCR_Data_GoogleTrends_2005_2011|Google Web-Search Trends and Stock Market Data (2005-2011)]]
 +
====World Peace====
 +
* [[SOCR_Data_GlobalPeaceIndex_2001_2011|Global Peace Index Data (2001-2011)]]
 +
* [[SOCR_Data_WealthOfNations_1800_2009|Wealth of Nations Data (1800-2009)]]
 +
 
 +
=== Neuroimaging Data===
 +
* [[SOCR_Data_July2009_ID_NI | Neuroimaging study of 27 Alzheimer's disease (AD) subjects, 35 normal controls (NC), and 42 mild cognitive impairment subjects (MCI)]]
 +
* [http://www.stat.ucla.edu/%7Edinov/courses_students.dir/04/Spring/Stat233.dir/HWs.dir/AD_NeuroPsychImagingData1.html Alzheimer's Disease neuroimaging Data]
 +
* [[SOCR_Data_June2008_ID_NI | Neuroimaging study of super-resolution image enhancing]]
 +
* [[SOCR_Data_April2009_ID_NI | Neuroimaging study of Prefrontal Cortex Volume across Species and Tissue Types]]
 +
* [[SOCR_Data_Oct2009_ID_NI | Normal and Schizophrenia Children Neuroimaging study]]
 +
* [[SOCR_Data_April2011_NI_IBS_Pain | A large Neuroimaging study of pain including visceral pain, irritable bowel syndrome, ulcerative colitis, and Crohn's disease]]
 +
 
 +
=== Biomedical Data===
 +
* [[SOCR_Data_PD_BiomedBigMetadata|Human Health: Predictive Big Data Analytics, Modeling and Visualization of Clinical, Genetic and Imaging Data for Parkinson’s Disease]]
 +
* [[SOCR_Data_AMI_NY_1993_HeartAttacks| 1993 New York State Heart Attack Patients: Acute Myocardial Infarction (AMI), N=12,844]]
 +
* [[SOCR_Data_AD_BiomedBigMetadata|Human Health: Modeling and Analysis of Clinical, Genetic and Imaging Data of Alzheimer’s Disease]]
 +
* [[SOCR_Data_Dinov_032708_AllometricPlanRels | Allometric  relationship between population density, body mass and metabolic activity in Plants]]
 +
* [[SOCR_Data_052511_IrisSepalPetalClasses | Fisher's multivariate dataset on iris sepal and petal length]]
 +
* [[SOCR_Data_BMI_Regression | Body Density & Body Mass Index (BMI) Data]]
 +
* [[SOCR_Data_KneePainData_041409 | Knee Pain Centroid Locations Data]]
 +
* [[SOCR_Data_NIPS_InfantVitK_ShotData | Neonate Infant Pain Score (NIPS) Data (Vitamin K shots)]]
 +
 
 +
===Healthcare and Health Science Data===
 +
* [https://umich.instructure.com/courses/38100/files/ A number of case-studies including Big and Heterogeneous clinical, nursing, and healthcare datasets].
 +
 
 +
=== [[SOCR_US_CensusData | US Census Data]]===
 +
* [[SOCR_Data_LA_Neighborhoods_Data | Los Angeles County Neighborhoods Data (from US Census)]]
 +
* [[SOCR_Data_2011_US_JobsRanking | 2011 US Jobs Ranking (200 Best to Worst Jobs in the USA for 2011)]]
 +
 
 +
=== [http://www.presidency.ucsb.edu/ US Elections Data]===
 +
* [[SOCR_Data_Dinov_11_08_08_PresidentialElections | US Electoral College vs. Popular Vote Presidential Elections Mandate Data (1828-2008)]]
 +
* [[SOCR_Data_US_Elections_Counties2004 | US Elections and Counties data for 2004]]
 +
 
 +
===Other Data===
 +
* [[SOCR_Data_Brain2BodyWeight | Brain to Body Weight Dataset]]
* [[SOCR_Data_Dinov_021708_Earthquakes | California Earthquakes Data]] (1969-2007)
* [[SOCR_Data_Dinov_021708_Earthquakes | California Earthquakes Data]] (1969-2007)
 +
* [[SOCR_Data_CaliforniaLottery2011 | California Lottery]] (1992-2011)
 +
* [[SOCR_Data_Dinov_072108_H_Index_Pubs | Faculty Publications]]
* [[SOCR_Data_Dinov_030708_APExamScores | 2007 Advanced Placement (AP) Exam Scores by Discipline]]
* [[SOCR_Data_Dinov_030708_APExamScores | 2007 Advanced Placement (AP) Exam Scores by Discipline]]
* [http://math.whatcom.ctc.edu/content/Links.phtml?cat=18 Online Math Center: A large archive of data from different scientific observations]
* [http://math.whatcom.ctc.edu/content/Links.phtml?cat=18 Online Math Center: A large archive of data from different scientific observations]
* [[NISER_081107_ID_Data | Largemouth Bass Mercury Contamination Dataset]]
* [[NISER_081107_ID_Data | Largemouth Bass Mercury Contamination Dataset]]
-
* [[SOCR_Data_Dinov_032708_AllometricPlanRels | Allometric  relationship between population density, body mass and metabolic activity in Plants]]
+
* [[SOCR_LetterFrequencyData | Latin Letters Frequency Data]]
-
* Neuroimaging Data
+
-
** [[SOCR_Data_July2009_ID_NI | Neuroimaging study of 27 Alzheimer's disease (AD) subjects, 35 normal controls (NC), and 42 mild cognitive impairment subjects (MCI)]]
+
-
** [http://www.stat.ucla.edu/%7Edinov/courses_students.dir/04/Spring/Stat233.dir/HWs.dir/AD_NeuroPsychImagingData1.html Alzheimer's Disease neuroimaging Data]
+
-
** [[SOCR_Data_June2008_ID_NI | Neuroimaging study of super-resolution image enhancing]]
+
-
** [[SOCR_Data_April2009_ID_NI | Neuroimaging study of Prefrontal Cortex Volume across Species and Tissue Types]]
+
-
** [[SOCR_Data_Oct2009_ID_NI | Normal and Schizophrenia Children Neuroimaging study]]
+
* [http://wiki.stat.ucla.edu/niser/index.php/NISER_Data NISER Datasets]
* [http://wiki.stat.ucla.edu/niser/index.php/NISER_Data NISER Datasets]
* [[SOCR_061708_NC_Data_Aquifer | Texas Wolfcamp aquifer data]]
* [[SOCR_061708_NC_Data_Aquifer | Texas Wolfcamp aquifer data]]
* [[SOCR_012708_ID_Data_HotDogs | Hot Dog Calorie and Sodium Dataset]]
* [[SOCR_012708_ID_Data_HotDogs | Hot Dog Calorie and Sodium Dataset]]
-
* Biomedical Data
+
* [http://search.datacite.org/ui DataCite DOI Archive for diverse types of datasets]
-
** [[SOCR_Data_BMI_Regression | Body Density & Body Mass Index (BMI) Data]]
+
 
-
** [[SOCR_Data_KneePainData_041409 | Knee Pain Centroid Locations Data]]
+
===SOCR Course Data and Case-Studies===
-
* [[SOCR_Data_Dinov_072108_H_Index_Pubs | Faculty Publications]]
+
* [https://umich.instructure.com/courses/38100 UMich HS 853, Fall 2015]
-
* [[SOCR_US_CensusData | US Census Data]]
+
** [https://umich.instructure.com/courses/38100/files General Resources]
-
** [[SOCR_Data_LA_Neighborhoods_Data | Los Angeles County Neighborhoods Data (from US Census)]]
+
** [https://umich.instructure.com/courses/38100/files/folder/data Small Datasets]
-
* [http://www.presidency.ucsb.edu/ US Elections Data]
+
** [https://umich.instructure.com/courses/38100/files/folder/Case_Studies Biomedical and Health Science Case-Studies]
-
** [[SOCR_Data_Dinov_11_08_08_PresidentialElections | US Electoral College vs. Popular Vote Presidential Elections Mandate Data (1828-2008)]]
+
* [https://umich.instructure.com/courses/90136 UMich HS 853, Fall 2016]
-
* [[SOCR_LetterFrequencyData | Latin Letters Frequency Data]]
+
** [https://umich.instructure.com/courses/90136/files General Resources including data, case-studies, lecture notes and code]
 +
* [https://umich.instructure.com/courses/143011/ UMich Data Science and Predictive Analytics (HS 650)]
 +
** [https://umich.instructure.com/courses/143011/files General Resources including data, case-studies, lecture notes and code]
 +
 
 +
===External Data Archives===
 +
* [http://www.nature.com/sdata/archive Nature Scientific Data]
 +
* [https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/Medicare-Provider-Charge-Data/Physician-and-Other-Supplier.html Centers for Medicare & Medicaid Services (CMS)]
 +
* [http://www.census.gov/developers/ US Census] and [https://usa.ipums.org/usa/ US Census Graphical API Interface]
 +
* [http://www.ncbi.nlm.nih.gov/gap database of Genotypes and Phenotypes (dbGaP)]
 +
 
 +
==Machine Interfaces to Downloading [[SOCR Data]]==
 +
In addition to human interactions with the [[SOCR Data]], we provide several machine interfaces to consume and process these data.
 +
 
 +
* [[SOCR Data]] can be copy pasted directly from the Wiki HTML pages into any of the [http://socr.umich.edu/html/ana/ SOCR Java applets].
 +
 
 +
* SOCR Data can also be loaded into an R computational environment automatically using the protocol below illustrated with the case of a [[SOCR_Data_PD_BiomedBigMetadata|Parkinson's Disease dataset]]:
 +
 
 +
library(rvest)
 +
# Loading required package: xml2
 +
 +
wiki_url <- read_html("http://wiki.socr.umich.edu/index.php/SOCR_Data_PD_BiomedBigMetadata")
 +
html_nodes(wiki_url, "#content")
 +
 +
pd_data <- html_table(html_nodes(wiki_url,"table")<math>[[1]]</math>)
 +
head(pd_data); summary(pd_data)
<hr>
<hr>

Revision as of 17:48, 23 December 2017

Contents

SOCR Educational Materials - SOCR Data

The links below contain a number of datasets that may be used for demonstration purposes in probability and statistics education. There are two types of data - simulated (computer-generated using random sampling) and observed (research, observationally or experimentally acquired).

SOCR Data

Simulated data

The SOCR resources provide a number of mechanisms to simulate data using computer random-number generators. Here are some of the most commonly used SOCR generators of simulated data:

Observed data

The following collections include a number of real observed datasets from different disciplines, acquired using different techniques and applicable in different situations.

Climate Change Data

Population Data

Economic, Business and Stock Market Data

Consumer Price Index (CPI)

Stock Market Data

Monetary-Base Data

Budgets and Deficits Data

Sector Data, Population Perception Trends data

World Peace

Neuroimaging Data

Biomedical Data

Healthcare and Health Science Data

US Census Data

US Elections Data

Other Data

SOCR Course Data and Case-Studies

External Data Archives

Machine Interfaces to Downloading SOCR Data

In addition to human interactions with the SOCR Data, we provide several machine interfaces to consume and process these data.

  • SOCR Data can be copy pasted directly from the Wiki HTML pages into any of the SOCR Java applets.
  • SOCR Data can also be loaded into an R computational environment automatically using the protocol below illustrated with the case of a Parkinson's Disease dataset:
library(rvest)
# Loading required package: xml2

wiki_url <- read_html("http://wiki.socr.umich.edu/index.php/SOCR_Data_PD_BiomedBigMetadata")
html_nodes(wiki_url, "#content")

pd_data <- html_table(html_nodes(wiki_url,"table")[[1]])
head(pd_data); summary(pd_data)



Translate this page:

(default)

Deutsch

Español

Français

Italiano

Português

日本語

България

الامارات العربية المتحدة

Suomi

इस भाषा में

Norge

한국어

中文

繁体中文

Русский

Nederlands

Ελληνικά

Hrvatska

Česká republika

Danmark

Polska

România

Sverige

Personal tools