Top Banner
SAMPLING DESIGN AND PROCEDURES
56

SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Jan 04, 2016

Download

Documents

Emery Wheeler
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

SAMPLING

DESIGN AND PROCEDURES

Page 2: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Sampling Terminology

SampleA subset, or some part, of a larger population.

Population (universe)Any complete group of entities that share some common set of characteristics.

Population ElementAn individual member of a population.

CensusAn investigation of all the individual elements that make up a population.

Page 3: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Sample Survey

A survey which is carried out using a sampling method, i.e., in which a portion only, and not the whole population, is surveyed.

One of the units into which an aggregate is divided for the purposes of sampling, each unit being regarded as individual and indivisible when the selection is made. The definition of unit may be made on some natural basis, for example, households, persons etc.

Page 4: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

PARAMETER & STATISTIC

PARAMETER(S): A characteristic of a population

STATISTIC(S):A characteristic of a sample (estimation of a parameter from a statistic is the prime objective of sampling analysis).

Page 5: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

A list, map or other specification of the units which constitute the available information relating to the population designated for a particular sampling scheme. There is corresponding to each state of sampling in a multi-stage sampling scheme. The frame may or may not contain information about the size or other supplementary information of the units, but it should have enough details so that a unit, if included in the sample, may be located and taken up for inquiry.

Page 6: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

that part of the difference between a population value and an estimate thereof, derived from a random sample, which is due to the fact that only a sample of values is observed; as distinct from errors due to imperfect selection, bias in response or estimation, errors of observation and recording, etc. the totality of sampling errors in all possible samples of the same size generates the sampling distribution of the statistic which is being used to estimate the parent value.

Page 7: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Why Sample?

Budget and time constraints.Limited access to total population.

Accurate and Reliable Results Destruction of Test Units

Sampling reduces the costs of research in finite populations.

Page 8: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Sample Vs. Census

Conditions Favoring the Use of

Type of Study

Sample Census

1. Budget

Small

Large

2. Time available

Short Long

3. Population size

Large Small

4. Variance in the characteristic

Small Large

5. Cost of sampling errors

Low High

6. Cost of nonsampling errors

High Low

7. Nature of measurement

Destructive Nondestructive

8. Attention to individual cases Yes No

Page 9: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Sampling Techniques

NonprobabilitySampling Techniques

ProbabilitySampling Techniques

ConvenienceSampling

JudgmentalSampling

QuotaSampling

SnowballSampling

SystematicSampling

StratifiedSampling

ClusterSampling

Other SamplingTechniques

Simple RandomSampling

Page 10: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Convenience sampling attempts to obtain a sample of convenient elements. Often, respondents are selected because they happen to be in the right place at the right time.

• use of students, and members of social organizations

• mall intercept interviews without qualifying the respondents

• department stores using charge account lists

• “people on the street” interviews

Page 11: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

A B C D E

1 6 11 16 21

2 7 12 17 22

3 8 13 18 23

4 9 14 19 24

5 10 15 20 25

Group D happens to assemble at a

convenient time and place. So all the elements in this

Group are selected. The resulting sample consists of elements

16, 17, 18, 19 and 20. Note, no elements are

selected from group A, B, C and E.

Page 12: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Judgmental sampling is a form of convenience sampling in which the population elements are selected based on the judgment of the researcher.

test markets

purchase engineers selected in industrial marketing research

bellwether precincts selected in voting behavior research

expert witnesses used in court

Page 13: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

A B C D E

1 6 11 16 21

2 7 12 17 22

3 8 13 18 23

4 9 14 19 24

5 10 15 20 25

The researcher considers groups B, C and E to be typical and

convenient. Within each of these groups one or

two elements are selected based on

typicality and convenience. The resulting sample

consists of elements 8, 10, 11, 13, and 24. Note, no elements are selected

from groups A and D.

Page 14: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Quota sampling may be viewed as two-stage restricted judgmental sampling.

The first stage consists of developing control categories, or quotas, of population elements.

In the second stage, sample elements are selected based on convenience or judgment.

Population Samplecomposition composition

ControlCharacteristic Percentage Percentage NumberGender Male 48 48 480 Female 52 52 520

____ ____ ____100 100 1000

Page 15: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

A B C D E

1 6 11 16 21

2 7 12 17 22

3 8 13 18 23

4 9 14 19 24

5 10 15 20 25

A quota of one element from each group, A to E, is

imposed. Within each group, one element is

selected based on judgment or

convenience. The resulting sample

consists of elements 3, 6, 13, 20 and 22.

Note, one element is selected from each column or group.

Page 16: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

o Each element in the population has a known and equal probability of selection.

o Each possible sample of a given size (n) has a known and equal probability of being the sample actually selected.

o This implies that every element is selected independently of every other element.

Page 17: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

A B C D E

1 6 11 16 21

2 7 12 17 22

3 8 13 18 23

4 9 14 19 24

5 10 15 20 25

Select five random numbers from 1 to 25. The resulting sample

consists of population elements 3, 7, 9, 16,

and 24. Note, there is no element from Group

C.

Page 18: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

The sample is chosen by selecting a random starting point and then picking every ith element in succession from the sampling frame.

The sampling interval, i, is determined by dividing the population size N by the sample size n and rounding to the nearest integer.

When the ordering of the elements is related to the characteristic of interest, systematic sampling increases the representativeness of the sample.

Page 19: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

If the ordering of the elements produces a cyclical pattern, systematic sampling may decrease the representativeness of the sample.

For example, there are 100,000 elements in the population and a sample of 1,000 is desired. In this case the sampling interval, i, is 100. A random number between 1 and 100 is selected. If, for example, this number is 23, the sample consists of elements 23, 123, 223, 323, 423, 523, and so on.

Page 20: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

A B C D E

1 6 11 16 21

2 7 12 17 22

3 8 13 18 23

4 9 14 19 24

5 10 15 20 25

Select a random number between 1 to 5, say 2.The resulting sample

consists of population 2, (2+5=) 7, (2+5x2=) 12,

(2+5x3=)17, and (2+5x4=) 22. Note, all the elements are

selected from a single row.

Page 21: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

A two-step process in which the population is partitioned into subpopulations, or strata.

The strata should be mutually exclusive and collectively exhaustive in that every population element should be assigned to one and only one stratum and no population elements should be omitted.

Next, elements are selected from each stratum by a random procedure, usually SRS.

A major objective of stratified sampling is to increase precision without increasing cost.

Page 22: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

The elements within a stratum should be as homogeneous as possible, but the elements in different strata should be as heterogeneous as possible.

The stratification variables should also be closely related to the characteristic of interest.

Finally, the variables should decrease the cost of the stratification process by being easy to measure and apply.

Page 23: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

A B C D E

1 6 11 16 21

2 7 12 17 22

3 8 13 18 23

4 9 14 19 24

5 10 15 20 25

Randomly select a number from 1 to 5

for each stratum, A to E. The resulting

sample consists of population elements4, 7, 13, 19 and 21. Note, one element

is selected from each column.

Page 24: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.
Page 25: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

HYPOTHESIS …???

is formally stated expectation about how a behavior

operates.

… is a proposition that a researcher wants to verify.

A hypothesis is an assumption about the population parameter.

Page 26: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

• Formulate a Null Hypothesis (H0).

• Formulate an Alternative Hypothesis (H1)

• Select a suitable Test Statistic

• Specify a Level of Significance ()

• Define a suitable Decision Criterion based on and

Test Statistic

• Make necessary Assumptions if required

• Experiment and Calculation of Test Statistic

• Conclusion or Decision

Page 27: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

As the sample size gets large enough…the sampling distribution becomes almost normal regardless of shape of population

Central Limit Theorem

Page 28: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

The Null Hypothesis, H0

•Always contains the ‘ = ‘ sign

• It is a statement about the hypothesized value of population parameter.• States the Assumption (numerical) to be tested for possible rejection under the assumption that the null hypothesis is TRUE.

The average sale of showroom is at least 3.0 lakh (H0: μ≥ 3.0)

Page 29: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Is the opposite of the null hypothesis e.g. The average sale of a showroom is less than 3.0 (H1: μ < 3.0)

Never contains the ‘=‘ signThe Alternative Hypothesis may or may not be

acceptedIs generally the hypothesis that is believed to be true

by the researcher

The Alternative Hypothesis, H1

Page 30: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Level of Significance, a

Typical values are 0.01, 0.05, 0.10

• Defines Unlikely Values of Sample Statistic if Null Hypothesis Is True.

• If we assume that hypothesis is correct , then the significance level will indicate the percentage of sample statistics is outside certain limits.

0

Page 31: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Level of Significance, and the Rejection

RegionH0: 3

H1: < 30

0

0

H0: 3

H1: > 3

H0: 3

H1: 3

/2

Critical Value(s)

Rejection Regions

Page 32: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

One-Tailed Hypothesis Test

The term one-tailed signifies that all values that would cause to reject H0, are in just one tail of the sampling distribution

Two-Tailed Hypothesis Test

Two-tailed test is one in which values of the test statistic leading to rejectioin of the null hypothesis fall in both tails of the sampling distribution curve

Page 33: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Summary of Errors Involved in Summary of Errors Involved in Hypothesis TestingHypothesis Testing

Real State of Affairs

Inference Based on Sample Data

H0 is Accepted H0 is Rejected

H0 is True Correct decision Confidence level = 1-

Type I error Significance level=*

H0 is False

Type II error

P (Type II error) =

Correct decision

Power = 1-

*Term represents the maximum probability of

committing a Type I error

Page 34: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Reduce probability of one error and the other one goes up.

& Have an Inverse

Relationship

Page 35: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

How to choose between Type I and Type II errors

Reworking cost is low----Type I error

Reworking cost is high---Type II

Page 36: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

TOSH of means when the population Standard deviation is known

Zcalc = (X - 0)/(/ n)

H0: = < > 0 vs. HA: ≠ > < 0

0

Page 37: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

ExampleBajaj Company claims that the length of life of its electric bulb is 1000 hours with standard deviation of 30 hours. A random sample of 25 checked an average life of 960 hours. At 5 % level of significance can we conclude that the sample has come from a population with mean life of 1000 hours? Table value = 1.96

Page 38: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

t –test, Standard deviation is unknown and small sample

H0: = < > 0 vs. HA: > < 0

Testing a Hypothesis About a Mean; We Do Not Know Which Must be Estimated by S..

Calculate tcalc = (X - 0)/(s/ n )

Page 39: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Example

The weight of a canned food product is specified as 500 grm. For a sample of 8 cans the weight were observed as 480, 475, 510, 500, 505, 495, 504 and 515 grm. Test at 5% level of significance, whether on an average the weight is as per specification.Table value = 2.365

Page 40: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Two independent samples were collected. For the first sample of 42 items, the mean was 32.3 and the variance 9. The second sample of 57 items had a mean of 34 and a variance of 16. Using 0.05level of significance, test whether there is sufficient evidence to show the second population has a larger mean.

Page 41: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

H0:= < > vs. HA: ≠ > <

n1 = ______, n2=______ = _______

Testing a Hypothesis About two Mean; Process Performance Measure is Approximately Normally

Distributed; We “Know”

Therefore this is a “Z-test” - Use the Normal Distribution. Calculate test statistic (x1 - x 2) - (1 - 2 ) Zcalc = -------------------------------

12/n1 + 2

2/n2

DR: (≠ in HA) Reject H0 in favor of HA if Zcalc < -Z/2 or if Zcalc > +Z/2. Otherwise, FTR H0.

DR: (> in HA) Reject H0 in favor of HA iff Zcalc > +Z . Otherwise, FTR H0.

DR: (< in HA) Reject H0 in favor of HA iff Zcalc < -Z. Otherwise, FTR H0.

Page 42: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Z-test to test two population mean()When population standard deviation is unknown & n is large

H0:= < > vs. HA: ≠ > <

n1 = ______, n2=______ = _______

Testing a Hypothesis About two Mean; Process Performance Measure is Approximately Normally

Distributed; We “Know” SS Therefore this is a “Z-test” - Use the Normal Distribution.

Calculate test statistic (x1 - x 2) - (1 - 2 ) Zcalc = -------------------------------

S2/n1 + S2/n2

Page 43: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

H0: = < > 2 vs. HA: > < 2

n = _______ = _______• Testing a Hypothesis About a Mean;• Process Performance Measure is Approximately Normally

Distributed or We Have a “small” Samples;• We Do Not Know Which Must be Estimated by S.• Therefore this is a “t-test” - Use Student’s T Distribution.

Calculate (x1 - x2) - (1 - 2 ) t = -------------------------

s* ( 1/n1 + 1/n2 )

with d.f. = n1 + n2 - 2. In this expression, s* is the pooled standard deviation, given by s2 = [ (n1 – 1)s1

2 + (n2 – 1)s22 ] / (n1+n2-2)

= --------------------------------- n1 + n2 - 2

t-test ,To test two population mean

Page 44: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Paired Samples

The difference in these cases is examined by a paired samples t test. To compute t for paired samples, the paired difference variable, denoted by D, is formed and its mean and variance calculated. Then the t statistic is computed. The degrees of freedom are n - 1, where n is the number of pairs. The relevantformulas are: H0: D = 0

H1: D 0

tn-1 = D - DsDn

Page 45: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

The difference in these cases is examined by a paired samples t test. To compute t for paired samples, the paired difference variable, denoted by D, is formed and its mean and variance calculated. Then the t statistic is computed. The degrees of freedom are n - 1, where n is the number of pairs. The relevantformulas are:

H0: D = 0

H1: D 0

tn-1 = D - DsDn

Page 46: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Cross-Tabulations: Chi-square Test

Technique used for determining whether there is a statistically significant relationship between two categorical (nominal or ordinal) variables

Page 47: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Telecommunications Company

Marketing manager of a telecommunications company is reviewing the results of a study of potential users of a new cell phone

Random sample of 200 respondentsA cross-tabulation of data on whether target consumers would buy the phone (Yes or No) and whether the cell phone had Bluetooth wireless technology (Yes or No)

QuestionCan the marketing manager infer that an association exists between Bluetooth technology and buying the cell phone?

Page 48: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Two-Way Tabulation of Bluetooth Technology and Whether Customers Would Buy Cell Phone

Page 49: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Cross Tabulations -Hypotheses

H0: There is no association between wireless technology and buying the cell phone (the two variables are independent of each other).

Ha: There is some association between the Bluetooth feature and buying the cell phone (the two variables are not independent of each other).

Page 50: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Conducting the Test

Test involves comparing the actual, or observed, cell frequencies in the cross-tabulation with a corresponding set of expected cell frequencies (Eij)

Expected Values ninj

Eij = ----- n

Where ni and nj are the marginal frequencies, that is, the total number of sample units in category i of the row variable and category j of the column variable, respectively

Page 51: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Computing Expected Values

The expected frequency for the first-row, first-column cell is given by

100 100 E11 = ------------ = 50

200

Page 52: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Observed and Expected Cell Frequencies

Page 53: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Chi-square Test Statistic

r c (Oij - Eij)2

2 = -----------------

i=1 j=1 Eij

= 72.00

Where r and c are the number of rows and columns, respectively, in the contingency table. The number of degrees of freedom associated with this chi‑square statistic are given by the product (r - 1)(c - 1).

Page 54: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Chi-square Test Statistic in a Contingency Test

For d.f. = 1, Assuming =.05, from Appendix 2, the critical chi‑square value (2

c) = 3.84.

Decision rule is: “Reject H0 if 2 3.84.”

Computed 2 = 72.00Since the computed Chi-square value is greater than the critical value of 3.84, reject H0.

The apparent relationship between “Bluetooth technology"and "would buy the cellular phone" revealed by the sample data is unlikely to have occurred because of chance

Page 55: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

EXAMPLE In a management institute, the A+, A and B grades allocated to students

in there final examination, were as follows. Using 5% level of significance, determine whether the grading scale is independent of the specialization.Table value = 9.488

SpecializationGrade Finance Marketing Operations  A+ 20 15 05 A 25 20 15

B 10 08 07

Page 56: SAMPLING DESIGN AND PROCEDURES. Sampling Terminology Sample A subset, or some part, of a larger population. Population (universe) Any complete group of.

Univariate Hypothesis:Papa John’s restaurants are more likely to be located in a stand-alone location or in a shopping center.

Bivariate Hypothesis: Stand-alone locations are more likely to be profitable than are shopping center locations.