Top Banner
Chapter 1: The Nature Chapter 1: The Nature of Probability & of Probability & Statistics Statistics
35
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Chapter 1

Chapter 1: The Nature of Chapter 1: The Nature of Probability & StatisticsProbability & Statistics

Page 2: Chapter 1

DefinitionsDefinitions

• Statistics is the science of conducting studies to collect, organize, summarize, analyze, and draw conclusions from data.

• A variable is a characteristic or attribute that can assume different values.

• Data are the values (measurements or observations) that the variables can assume.

Page 3: Chapter 1

Two Main Branches of Two Main Branches of StatisticsStatistics

• Descriptive Statistics consists of the collection, organization, summarization, and presentation of data.

Page 4: Chapter 1

Two Main Branches of Two Main Branches of StatisticsStatistics

• A population consists of all subjects (human or otherwise) that are being studied.

• A sample is a group of subjects selected from a population.

Page 5: Chapter 1

Examples: Population vs. Examples: Population vs. SampleSample

• Determine if the described data set is a population or a sample. If it is a sample, describe the population.

• A survey of every 4th customer leaving a grocery store to determine how many times per week they shop at a grocery store.

• The time it takes each mail carrier in zip code 91210 to complete a Saturday route.

• The ages at the time of their first marriage of 25 residents in an assisted living facility that houses 100 residents.

• The number of years of employment of all registered nurses in a maternity ward at a local hospital.

Page 6: Chapter 1

ExampleExample

• A grocery store wants to estimate the weight of cabbages that they receive from on of their produce suppliers. To accomplish this, they select and weigh 36 cabbages from a shipment of 150 cabbages.

• Identify the population of interest.

• Identify the sample.

Page 7: Chapter 1

Two Main Branches of Two Main Branches of StatisticsStatistics

• Inferential statistics consists of generalizing from samples to populations, performing estimations and hypothesis tests, determining relationships among variables and making predictions.

Page 8: Chapter 1

Example:Example:

• At the beginning of 1995, nine states had recorded more than 10,000 reported cases of the AIDS disease. The states and the numbers of cases are shown below.

NY 83,197 IL 14,255

CA 78,084 PN 12,754

FL 43,978 GA 12,228

TX 30,712 MD 10,534

NJ 25,089

Page 9: Chapter 1

Variables and Types of Variables and Types of DataData

• Variables can be classified as qualitative or quantitative.

• Qualitative variables are variables that can be placed into distinct categories according to some characteristic or attribute.

• Quantitative variables are numerical and can be ordered or ranked.– Discrete variables assume values that can be

counted. – Continuous variables can assume an infinite

number of values between any two specific values. They are obtained by measuring. The often include fractions and decimals.

Page 10: Chapter 1

Examples:Examples:

• The amount of water consumed daily by a teenager.

• The number of freshmen enrolled at a university at Fall 2006.

• The type of beverage selected on a lunch pre-order.

• Birth month of students in a kindergarten class.

• Zip codes in Richland and Lexington Counties.

Page 11: Chapter 1

Levels of MeasurementLevels of Measurement

• The nominal level of measurement classifies data into mutually exclusive (nonoverlapping), exhausting categories in which no order or ranking can be imposed on the data.

Page 12: Chapter 1

Level of MeasurementLevel of Measurement

• The ordinal level of measurement classifies data into categories that can be ranked; however, precise differences between the ranks do not exist.

Page 13: Chapter 1

Level of MeasurementLevel of Measurement

• The interval level of measurement ranks data, and precise differences between units of measure do exist. However, there is no meaningful zero.

Page 14: Chapter 1

Level of MeasurementLevel of Measurement

• The ratio level of measurement possesses all the characteristics of interval measurement, and there exists a true zero. In addition, true ratios exist when the same variable is measured on two different members of the population.

Page 15: Chapter 1

Data Collection: Data Collection: Three Methods of SurveysThree Methods of Surveys

• Telephone Surveys– Advantage

• Less costly than personal interviews• People are more candid since they are not

face-to-face

– Disadvantage• Some people don’t have telephones• Some people will not be home when the

calls are made; leading to bias

Page 16: Chapter 1

Data Collection: Data Collection: Three Methods of SurveysThree Methods of Surveys

• Mailed Questionnaire Surveys – Advantage

• Covers a large geographical area than telephones (?) and personal interviews since they are less expensive to conduct

• Respondents can remain anonymous

– Disadvantage• Low response rates• Inappropriate answers to questions• Some people may not be able to read or understand

the questions

Page 17: Chapter 1

Data Collection: Data Collection: Three Methods of SurveysThree Methods of Surveys

• Personal Interview Surveys– Advantage

• Obtain in-depth responses to questions from the respondents

– Disadvantage• The interviewer needs to be trained• The interviewer may lead the respondent to

the desired answer

Page 18: Chapter 1

Other Methods of Data Other Methods of Data CollectionCollection

• Surveying records• Direct Observation

Page 19: Chapter 1

Why Use a Sample?Why Use a Sample?

• Saves time and money• The population is often too large to

observe/interview

Page 20: Chapter 1

Five Basic Methods of Five Basic Methods of SamplingSampling

• A random samplerandom sample is selected by using chance methods or random numbers. We can use computers and calculators to generate random numbers.

• A systematic samplesystematic sample is obtained by numbering each subject of the population and then selecting every kth subject.

Page 21: Chapter 1

Five Basic Methods of Five Basic Methods of SamplingSampling

• A stratified samplestratified sample is obtained by dividing the population into groups (called strata) according to some characteristic, then sampling from each group.

• A cluster samplecluster sample is obtained by using intact groups called clusters.

• A convenience sampleconvenience sample is obtained by using subjects that are readily available.

Page 22: Chapter 1

Example: Sampling Example: Sampling MethodsMethods

• For the following scenarios, determine which sampling technique was used. Also list any biases that may be present.– Average weight of newborn baby boys: Twelve

hospitals are selected at random, and the weight of each baby boy born in January is recorded.

– Percentage of 18 – 25 year-olds who used drugs during the past 30 days: At a shopping mall, people who appear to be in the proper age group are stopped and asked for their age and whether they have used drugs in the past 30 days.

Page 23: Chapter 1

Example: Sampling Example: Sampling MethodsMethods

– Average length (in days) of a sexual harassment trial: The records of a law firm are analyzed, and the lengths of all of their sexual harassment trials are recorded.

– Effectiveness of a pain reliever against migraine headaches: Patients who have a history of migraines are divided into three groups, using random numbers. The three groups are given a placebo, a half-dose and a full-dose of the medication. The patients are then asked to rate the effectiveness of the medication on a scale of 1 to 10.

Page 24: Chapter 1

Types of Statistical Types of Statistical StudiesStudies

• In an observational study, the researcher merely observes what is happening or what has happened in the past and tries to draw conclusions based on these observations.– Advantages

• Occurs in a natural setting• Can be done in situations where it would be

unethical to conduct an experiment (Tuskegee experiment)

• Can be done using variables that cannot be manipulated (gender, dominant hand, race)

Page 25: Chapter 1

Types of Statistical Types of Statistical StudiesStudies

• Observational Study– Disadvantages

• Can’t establish cause and effect since the variables are not controlled.

• Can be expensive and time consuming• Researcher may have to rely on

measurements collected or reported by others

– People forget and fudge the truth

Page 26: Chapter 1

Types of Statistical Types of Statistical StudiesStudies

• In an experimental studyexperimental study the researcher manipulates one of the variables and tries to determine how the manipulation influences other variables.– The independent variableindependent variable (explanatory

variable) in the experimental study is the one that is being manipulated.

– The dependent variabledependent variable is called the outcome variable. It is the variable that is studied to see if it has changed significantly due to the manipulation of the independent variable.

Page 27: Chapter 1

Types of Statistical Types of Statistical StudiesStudies

• Experimental Study– The treatment group receives a special

“treatment” while the control group does not.– Advantages

• Researcher can decide how to select subjects and how to assign them to groups

• The researcher controls the independent variable

– Disadvantages• Occurs in an unnatural setting (disposable baby

bottles)• Hawthorne effect – subject behavior changes

because they know that they are being observed.

Page 28: Chapter 1

Types of Statistical Types of Statistical StudiesStudies

• A case-control case-control study is an observational study that resembles an experiment because the sample naturally divides into two (or more) groups. The participants who engage in the behavior under study form the cases, like a treatment group in an experiment. The participants who do not engage in the behavior are the controls, like a control group in the experiment

Page 29: Chapter 1

Example: Types of Example: Types of Statistical StudiesStatistical Studies

• For the following questions, what type of statistical study (observational study or experiment) is most likely to lead to an answer? Why? If an an observational study, state whether it should be a case control study. If an experiment, state whether it should be single- or double-blind.

• What is the mean income of stock brokers?   

Page 30: Chapter 1

Example: Types of Example: Types of Statistical StudiesStatistical Studies

• Do seatbelts save lives?   

• Can lifting weights improve runners’ times in a 10-kilometer (10K) race?

• Does skin contact with a particular glue cause a rash?   

• Can a new herbal remedy reduce the severity of colds?  

Page 31: Chapter 1

Uses and Misuses of Uses and Misuses of StatisticsStatistics

• Some people use statistics like a drunken man uses a lamp post – for support rather than illumination.

Page 32: Chapter 1

• Seventy-two percent of Americans squeeze the toothpaste tube from the top. This and other not-so-serious findings are presented in The First Really Important Survey of American Habits. Those results are based on 7000 respondents from the 25,000 questionnaires that were mailed. What is wrong with this survey?

Page 33: Chapter 1

• The New England Chronicles reports that women who eat lobster on a regular basis during their pregnancy tend to have healthier babies.

• A report sponsored by the Florida Citrus Commission concluded that cholesterol levels could be lowered by eating citrus products. Why might the conclusion be suspect?

Page 34: Chapter 1

• Glamour magazine published this survey result: "Seventy-nine percent of those who responded to our August survey say that they believe America has become too lawsuit-happy." The survey question was published in the magazine and readers could respond by mail, fax, or e-mail (Tellus@Galamour. com). How valid is the 79% result?

Page 35: Chapter 1

• In a study on college campus crimes committed by students high on alcohol or drugs, a mail survey of 1875 students was conducted. A USA Today article noted, "Eight percent of the students responding anonymously say they've committed a campus crime. And 62% of that group say they did so under the influence of alcohol or drugs." Assuming that the number of students responding anonymously is 1875, how many actually committed a campus crime while under the influence of alcohol or drugs?