Political Science 5 Lecture 8, 21904 - PowerPoint PPT Presentation

1 / 29
About This Presentation
Title:

Political Science 5 Lecture 8, 21904

Description:

How to Draw a Random sample. What if the entire nation is your ... How to Draw a Random Sample. Select the primary sampling units (counties, cities, etc) ... – PowerPoint PPT presentation

Number of Views:59
Avg rating:3.0/5.0
Slides: 30
Provided by: thadko4
Category:
Tags: draw | how | lecture | political | science | to

less

Transcript and Presenter's Notes

Title: Political Science 5 Lecture 8, 21904


1
Political Science 5Lecture 8, 2/19/04
  • - Midterm 1 on Tuesday, 3/2
  • - Review next Thursday
  • Homework 2 due a week from today
  • Homework 1 will be returned on Tuesday

2
What is a sample?
  • A researcher usually wants to generalize about a
    whole class of individuals. This is called the
    population.
  • However, studying the whole population is usually
    impractical. Only part of it can be examined,
    and this is called the sample.
  • Researchers make generalizations from the part to
    the whole in technical terms, they make
    inferences from the sample to the population.

3
Sample vs. the Population
  • An observational study simply observes cases,
    without attempting to impose a treatment and
    without requiring any quasi- or natural
    experimental design.
  • Researchers can ask their cases questions in
    order to measure some variable.
  • Most of the time, researchers look closely at a
    small sample of the overall population.

4
Sample vs. the Population
  • A population is the entire group of cases about
    which you want information.
  • A sample is a subset of the population which is
    used to gain information about the whole
    population.

Population
Sample
5
Parameters
  • Usually, there are some numerical facts about the
    population which the investigators want to know.
    Such numerical facts are called parameters. In
    forecasting a presidential election in the United
    States, two relevant parameters are
  • The average age of all eligible voters
  • The percentage of all eligible voters who are
    currently registered to vote.
  • Ordinarily, parameters like these cannot be
    determined exactly, but can only be estimated
    from a sample. Then a major issue is accuracy
    how close are the estimates going to be?

6
Statistics
  • Parameters are estimated by statistics, or
    numbers which can be computed from a sample.
  • For instance, with a sample of 10,000 Americans,
    a researcher could calculate the following two
    statistics in order to estimate the parameters
    mentioned above
  • The average age of eligible voters in the sample
  • The percentage of the eligible voters in the
    sample who are currently registered to vote
  • Statistics are what researchers know parameters
    are what they want to know.

7
Sample vs. the Population
  • A parameter is a number describing a population.
    It is a usually a mystery.
  • A statistic is a number describing a sample.
    Statistics vary from sample to sample.
  • If our sample is representative of the
    population, sample statistics will closely
    approximate population parameters.

8
Estimating Parameters from the Sample
  • Estimating parameters from the sample is
    justified when the sample represents the
    population.
  • This is impossible to check by just looking at
    the sample. Why? To see whether the sample is
    like the population, the researcher would have to
    know the facts about the population that they are
    trying to estimate--a vicious circle.
  • So, you have to look at how the sample was
    chosen. Some methods tend to do badly others
    are more likely to give representative samples.
  • Thus, the method of choosing a sample matters a
    lot, and the best methods involve the planned
    introduction of chance.

9
How to Draw a Good Sample
  • Nonrandom methods of drawing a sample
  • Haphazard (or convenience) sample
  • Voluntary response sample
  • Quota sample
  • Example Internet Polls
  • Surveys of a Sub-Population

10
Nonrandom methods of drawing a sample (Note
These are Bad!)
  • A voluntary response sample includes the members
    of the population who voice their desire to be
    included in the sample.
  • 1936 Literary Digest Poll mailed 10 million
    ballots to magazine readers to volunteer
    participate in their Presidential election
    survey. 2 million surveys came back, predicting
    that FDR would lose 43-57. In reality, FDR won,
    61-39.
  • Poll was plagued by both selection bias and
    non-response bias.
  • When a sampling procedure is biased, taking a
    large sample does not help. This just repeats
    the basic mistake on a larger scale.

11
Nonrandom methods of drawing a sample (Note
These are Bad!)
  • A haphazard sample studies the segment of the
    population that is easiest for the researcher to
    reach.
  • Polls only of people who have telephones. (Less
    of a problem than it used to be).
  • Television call-in survey (self-selection).
  • Studies by college students of their dormmates.
  • We cannot trust the results of haphazard surveys.
    Why?

12
Nonrandom methods of drawing a sample (Note
These are Bad!)
  • A quota sample tries to obtain a group
    representative of the population by setting
    quotas for selecting various categories of people
    based on their proportions in the population.
  • First, divide population into categories on the
    basis of variables from census data..
  • Then, set sample selection quotas for each
    category based on its proportion in the
    population.
  • Better than haphazard surveys, but still has
    shortcomings.
  • Too much interviewer discretion.
  • Problems in getting accurate data on the
    proportion of different groups in the
    population.
  • Dewey-Truman example.

13
Example Internet Polls
  • Some internet polls ask the opinions of those who
    have logged on to
  • www.foxnews.com
  • www.uclabruins.com
  • www.peetscoffee.com
  • www.pabst.com
  • www.sfgate.com
  • www.rogaine.com
  • More professional internet polls advertise with
    banners on a variety of web sites to recruit
    people into their sample.

14
Example Internet Polls
  • Knowledge Networks is an internet-administered
    survey that recruited its sample by using random
    digit dialing.
  • To give those without an internet connection the
    chance to participate, they offered a free Web
    TV console to participants.
  • Those in the 50,000 person sample are given the
    chance to participate in polls about subjects
    like hard liquor or politics.

15
Probability Sampling
  • A probability sample is a sample of a population
    in which each person has a known chance of being
    selected.
  • Removes bias from the sample selection process
  • Probability samples are more representative than
    than haphazard or quota samples.
  • Can use probability theory to estimate the
    accuracy of the sample (warning--next time
    math!).
  • Talking about surveys, but applies to sampling
    other types of cases as well.

16
How many people do you need in your sample?
  • Depends on
  • How much accuracy do you need in the survey
    results?
  • How much confidence do you want that your results
    are actually within the specified range of
    accuracy?
  • How much variability is there in the variable?

17
What Determines the Margin of Error (accuracy)
of a Poll?
  • Margin of error tells us how close the sample
    statistic is to the population parameters.
  • If we have drawn a truly random sample
  • Sample Population Proportion
  • Proportion Random Error

18
What Determines the Margin of Error (accuracy)
of a Poll?
  • The margin of error is calculated by

19
What Determines the Margin of Error (accuracy)
of a Poll?
  • In a poll of 505 likely voters, the Field Poll
    found 55 support for the recall.

20
What Determines the Margin of Error (accuracy)
of a Poll?
  • The degree of accuracy, or margin of error, is
    usually stated in plus or minus percentage
    terms.
  • The margin of error for this poll was plus or
    minus 4.4 percentage points.
  • That means that the true percentage (the
    population parameter, or the percentage that
    would be obtained if the entire population, and
    not just the sample, was surveyed) favoring the
    recall could be anywhere between 50.6 and 59.4

21
Confidence
  • The confidence level is the probability that the
    results are outside the specified level of
    accuracy.
  • Confidence level is stated in probability terms
  • A confidence level of .01 means that there is 1
    chance out of 100 that the survey results are
    outside the specified range of accuracy.
    Researchers usually use the .05 level.
  • In the previous example, at the .05 confidence
    level, we can say that if we took many samples
    using the Field Polls methods, 95 of the
    samples would yield a statistic within plus or
    minus 4.4 percentage points of the true
    population parameter

22
Sample sizes needed for different levels of
accuracy at .05 confidence level
23
Bias and Variability
  • Sampling bias is consistent deviation of the
    sample statistic from the parameter
  • Sampling variability describes how far apart
    statistics are over many samples.

24
Random sample
  • A random sample is one in which each person in
    the population has an equal chance of being
    selected throughout the selection process.
  • Removes bias
  • Need to have an entire list of the defined
    population

25
Surveys of a Sub-Population
  • Many researchers do not want to generalize to the
    population of all Americans.
  • They begin by defining the population that they
    want to study, such as likely voters,
    Asian-Americans voters, or Lesbian, Gay,
    Bisexual, and Transgendered California adults.

26
Surveys of a Sub-Population
  • Option 1. Take a random sample of the entire
    population, ask the respondent if he or she fits
    into the category, and then continue the
    interview if you find a match
  • Option 2. Begin with a list that approximates
    the entire subpopulation (registered voters with
    Asian surnames) and then take a random sample.

27
How to Draw a Random sample
  • Draw names out of a hat, a really big hat
  • Label every case in the population with a number,
    then draw some random numbers
  • In a telephone poll, random digit dialing uses a
    random number generator to get even those with
    unlisted numbers

28
How to Draw a Random sample
  • What if the entire nation is your target
    population? (no list available)
  • Use multistage cluster sampling
  • What is this?

29
How to Draw a Random Sample
  • Select the primary sampling units (counties,
    cities, etc)
  • Take a sample of smaller units from the list of
    primary sample units (city blocks)
  • Make a list of of smaller units and take a sample
    of them
  • Select a sample of persons
  • At this point, one person would be randomly
    selected from each household. Interviewer has no
    choice about the person selected.
Write a Comment
User Comments (0)
About PowerShow.com