Top Banner

of 39

1 Basic Statistics

Mar 07, 2016

Download

Documents

Ngoc Tran

1 Basic Statistics
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Basic statistics: p value and condence interval

    Nguyen Quang Vinh

    February, 2012

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Statistics

    Objectives Statistics

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Statistics

    Objectives Statistics

    Statistics

    Statistics:

    science of data

    study of uncertainty

    Biostatistics: data from: Medicine, Biological sciences

    (business, education, psychology, agriculture, economics...)

    Modern society:

    Reading

    Writing

    Statistical thinking: to make the strongest possible conclusions

    from limited amounts of data.

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Statistics

    Objectives Statistics

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Statistics

    Objectives Statistics

    Objectives Statistics

    Objectives:

    (1) Organize & summarize data

    (2) Reach inferences: sample population

    Statistics:Descriptive statistics(1)Inferential statistics: drawing of inferences(2)Estimation (point estimate & interval estimate condenceinterval)

    Hypothesis testing reaching a decision (p value)Parametric statistics

    Non-parametric statistics

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    Why estimation?

    Two reasons:

    Innite populations: incapable of complete examination

    Finite populations: cost, time

    In addition, estimation can help not to defer a conclusion, until

    the entire population has been observed

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    Estimation of

    mean(s):

    a single population mean

    the dierence between two population means: unpaired, paired

    proportion(s):

    a single population proportion

    the dierence of two population proportions

    variance(s):

    a single population variance

    the ratio of two population variances

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    An estimation of these parameters

    An estimation of these parameters:

    Point estimate

    Interval estimate

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    A point estimate

    Estimator Parameter

    In many cases, a parameter may be estimated by more than one

    estimator.

    Example:

    Sample mean estimate population meanSample median estimate population mean

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    The criteria of good estimator (opt.)

    (1) E (x) = without systematic errorE (x) is called systematic error(2) Mean square error

    E (x)2 must be small in comparison to

    E(x)2

    must be small

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    An interval estimate

    In general, an interval estimate is obtained by the formula:

    estimator (reliability coecient) x (standard error)

    What is dierent is the source of the reliability coecient:

    In particular, when sampling is from a normal distribution with

    known variance, an interval estimate for may be expressedas: x z/2x

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    How to interpret the interval given by this expression

    In repeated sampling 100(1)% of all intervals of the formwill in the long run include the population mean, The quantity (1), is called the condence coecient &The interval x z/2x , is called the condence interval for The most frequently used values are: .90, .95, .99, which have

    associated reliability factors, respectively, of 1.645, 1.96, 2.58

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    The practical interpretation

    We are 100(1)% condent that the single computedinterval x z/2x contains the population mean, Example: ...

    E = margin error = maximum error = practical / clinical

    acceptable error:

    E = z/2x = z/2 n

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Why hypothesis testing

    Hypothesis (H.): a statement concerns about some one or

    more populations

    Testing hypothesis: to aid researcher in reaching a decision

    concerning a population by examining a sample from that

    population

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Hypothesis testing for

    mean(s)

    a single population mean

    the dierence between two population means: unpaired, paired

    proportion(s)

    a single population proportion

    unpaired: a small sample, a suciently large sample

    paired

    the dierence of two population proportions

    variance(s)

    a single population variance

    the dierence of two population variances

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Two types of hypotheses

    (1) Research Hypotheses:

    The conjecture or supposition

    It may be the results of years of observation

    Research H. leads directly to Statistical H.

    (2) Statistical Hypotheses: Hypotheses are stated in such a way

    that they may be evaluated by appropriate statistical techniques.

    H

    O

    H

    A

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Statistical Hypotheses

    H

    O

    The H

    O

    is the hypothesis that is tested

    The H

    O

    should contain either =,,(The statement concerns about some one or more population's

    parameters in term of equality or inequality)

    H

    A

    What we hope or expect to be able to conclude as a result of

    the test usually should be placed in the H

    A

    The H

    O

    & HA

    are complementary

    One-sided vs. Two-sided Hypothesis Tests (opt.)

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Notes

    Neither hypothesis testing nor statistical inference, in general,

    leads to proof a hypothesis

    It merely indicates whether the hypothesis is supported or not

    supported by the available data

    When we fail to reject the H

    O

    , we do not say that it is true,

    but that it may be true

    When we speak of accepting a H

    O

    , we have this limitation

    in mind & do not wish to convey the idea that accepting

    implies proof

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    p value

    Test statistic =p value

    General formula:

    Teststatistic = relevantstatistichypothesizedparameterS .E .oftherelevantstatistic

    Example: z = x0n

    Test statistic p valueDecision maker, since the decision to reject or not to reject the

    H

    O

    depends on the magnitude of the test statistic

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Decision rule for a rejection or not the H

    O

    = type I error = level of signicance (say, .01, .05, .10) = type II error (say, .05, .10, .20)When we reject a H

    O

    p < , risk of committing a type Ierror, rejecting a true H

    O

    When we fail to reject a H

    O

    : risk of committing a type II

    error, accepting a false H

    O

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Type I & Type II error

    Conditions under which type I & type II errors may be committed (the four possibilities)

    Actual Situation(Truth in the population)

    Ho false Ho true

    The results in the study sample Conclusion:

    RejectHo

    Correct decision

    Type I error

    Fail toreject Ho

    Type II error

    Correct decision

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Testing Hypothesis Rejected or not rejected HO

    In the testing process the H

    O

    either is rejected or is not

    rejected

    If H

    O

    is not rejected, we will say that the data on which the

    test is based do not provide sucient evidence to cause

    rejection

    If the testing process leads to rejection, we will say that the

    data at hand are not compatible with the H

    O

    , but are

    supportive of some other hypothesis & may be designated by

    H

    A

    (H

    A

    a contradiction statement of H

    O

    )

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    The Five-Step practical procedure for Hypothesis Testing

    (opt.)

    Step 1: Set up H

    O

    , H

    A

    1. Data: The nature of the data (classication)

    2. Assumptions: The normality of the population distribution,

    equality of variances, independence of samples. . .

    3. Hypotheses: H

    O

    , H

    A

    Step 2: Dene the test statistic

    4. Test statistic

    5. Distribution of the Test Statistic

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    The Five-Step practical procedure for Hypothesis Testing,

    cont. (opt.)

    Step 3: Dene a rejection region: having determined a value

    for 6. Decision rule

    Step 4:

    7. Calculate the value of the test statistic, and compare it with

    the acceptance & rejection regions that have already been

    specied.

    8. State our decision: to reject H

    O

    or to fail to reject H

    O

    Step 5:

    9. Give a conclusion: this statement should be free of

    statistical jargon & should merely summarize the results of the

    analysis.

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Outline

    1

    Introduction

    Statistics

    Objectives Statistics2

    Estimation - Condence Interval

    Estimation

    A point estimate

    An interval estimate

    Interpretation a condence interval

    3

    Hypothesis testing - p value

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    The Power of a Statistical Test (opt.)

    The probability of a type II error, b, has remained a phantom:

    we know it is there,

    but we don't know what it is

    One thing we can say is that: a wide C.I. for m means that the

    corresponding 2-tailed test of Ho versus HA has a large chance

    of failing to reject a false Ho; that is b is large.

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Determining b (opt.)

    b = P(fail to reject H

    O

    when H

    O

    is false)

    1 - b = P(rejecting HO

    when H

    O

    is false)

    1 - b represents the probability of making a correct decision in

    the event that H

    O

    is false

    Since we like b to be small, that is we prefer 1 - b to be large

    The value of 1 - b is referred to as the power of test

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Hypothesis testing

    Hypotheses

    p value

    The hypothesis testing procedure

    Power of Test

    Power of test (opt.)

    Power of test = P(Z>z1) + P(Zz1)P(Z

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Determination of sample size (opt.)

    Estimating a condence interval

    Testing a hypothesis

    Nguyen Quang Vinh Basic statistics: p value and condence interval

  • Introduction

    Estimation - Condence Interval

    Hypothesis testing - p value

    Determination of sample size

    Summary

    Summary

    1. Statistics:

    Descriptive statisticsorganize & summarize dataInferential statistics drawing of inferencesEstimation

    Hypothesis testing

    Modeling

    2. Estimation - condence interval, estimator

    3. Hypothesis testing - p value

    Nguyen Quang Vinh Basic statistics: p value and condence interval

    IntroductionStatisticsObjectives Statistics

    Estimation - Confidence IntervalEstimationA point estimateAn interval estimateInterpretation a confidence interval

    Hypothesis testing - p valueHypothesis testingHypothesesp valueThe hypothesis testing procedurePower of Test