Title: SI 544
1SI 544
- Introductory Statistics and Data Analysis
2Welcome and Outline
- This is 644
- Im Paul Resnick
- GSI Jahna Otterbacher
- Today
- Course Intro
- Course Logistics
- Chapter 1 topic intro
3Course Objectives
- Characterize population data
- Draw conclusions and inferences
- Critique others claims
- Look for correlations while controlling for
confounds
4Statistical Tasks for Informationists
- Decide what to preserve
- Evaluate user interface alternatives
- Redesign services based on usage history
- Assess demand for products or services
- Estimate cost of service provision
5More Tasks for Informationists
- Evaluate program outcomes
- Evaluate product and policy effectiveness
- Interpret industry trend data
- Assess policy compliance
- Present government data to lay audiences
- Conduct academic research
6Course Coverage
- Descriptive statistics
- Inferential Statistics
- Sampling distributions confidence intervals,
hypothesis tests and p-values - Estimating population mean
- Comparing population means
- Analysis of variance
- Univariate and multivariate OLS Regression
- Analysis of categorical data
- Data Collection
- Experimental design
7Books
- McClave and Sincich, 10th edition
- Student solution manual (odd numbered problems)
8Course Format
- Before class
- Read textbook
- Do assigned exercises, ungraded
- In class
- Socratic review of basics
- Present tricky points, supplemental material
- Work through examples
9Grading
- Class prep and participation 8
- 7 problem sets 42
- Best 6 scores count
- Midterm (Oct. 27, in class) 15
- Final (Dec. 21, 1030AM-1230PM) 35
10Math
- No calculus required
- Some references in passing for the benefit of
those who know it - Algebra required
- Some links to online resources in syllabus
- More mathematically rigorous approaches to this
material are available on campus - Statistics department
- Economics department
11Software
- Well be using Stata
- Intercooled version available in DIAD Lab
- You can buy it at academic discount, if you want
- You can use whatever you like
- Most but not all of what we cover can be done
using Excels data analysis toolpak - But we dont support other packages
- Youre on your own
12Study Groups
13Study Groups
14Questions on Logistics?
15Learning Objectives Chapter 1
- 1. Define Statistics
- 2. Describe the Uses of Statistics
- 3. Distinguish Descriptive Inferential
Statistics - Define Population, Sample, Parameter,
Statistic - Identify data types
- Identify data sources
16What is Statistics?
17What is Statistics?
- The practice (science?) of data analysis
- Summarizing data and drawing inferences about the
larger population from which it was drawn
18Statistical Methods
Statistical
Methods
Descriptive
Inferential
Statistics
Statistics
19Descriptive Statistics
- 1. Involves
- Collecting Data
- Presenting Data
- Characterizing Data
- 2. Purpose
- Describe Data
50
25
0
Q1
Q2
Q3
Q4
?X 30.5 S2 113
20Inferential Statistics
- 1. Involves
- Estimation
- Hypothesis Testing
- 2. Purpose
- Make Decisions Based on Population Characteristics
Population?
21Key Terms
- 1. Population (Universe)
- All Items of Interest
- 2. Sample
- Portion of Population
- 3. Parameter
- Summary Measure about Population
- 4. Statistic
- Summary Measure about Sample
22Key Terms
- 1. Population (Universe)
- All Items of Interest
- 2. Sample
- Portion of Population
- 3. Parameter
- Summary Measure about Population
- 4. Statistic
- Summary Measure about Sample
- P in Population Parameter
- S in Sample Statistic
23Data Types
- Quantitative
- Discrete
- Continuous
- Qualitative
- Nominal (categorical)
- Ordinal (rank ordered categories)
24Exercise 1.13
- Data types
- Bacteria count
- Occupations of shoppers
- Marital status
- Time (in months) since last auto maintenance
25Exercise Data About Us
- Quantitative
- Discrete
- Continuous
- Qualitative
- Nominal (categorical)
- Ordinal (rank ordered categories)
- Fill out the questionnaire with your data
well use it in later classes
26Data Sources
- Published source
- Designed experiment
- Survey
- Observational study
- Exercise data sources of these types youve
encountered
27Sampling
- Representative sample
- Same characteristics as the population
- Random sample
- Every subset of the population has an equal
chance of being selected
28Exercise 1.25
29End of Chapter
Any blank slides that follow are blank
intentionally.