Generally, to understand some characteristic of a population, we take a random sample and study the corresponding property of the sample. We then determine whether any conclusions we reach about the sample are representative of the population.

This is done by choosing an **estimator** function for the characteristic (of the population) we want to study and then applying this function to the sample to obtain an **estimate**. By using the appropriate statistical test we then determine whether this estimate is based solely on chance.

The hypothesis that the estimate is based solely on chance is called the **null hypothesis**. Thus, the null hypothesis is true if the observed data (in the sample) do not differ from what would be expected on the basis of chance alone. The complement of the null hypothesis is called the **alternative hypothesis**.

The null hypothesis is typically abbreviated as H_{0} and the alternative hypothesis as H_{1}. Since the two are complementary (i.e. H_{0} is true if and only if H_{1} is false), it is sufficient to define the null hypothesis.

Since our sample usually only contains a subset of the data in the population, we cannot be absolutely certain as to whether the null hypothesis is true or not. We can merely gather information (via statistical tests) to determine whether it is likely or not. We therefore speak about **rejecting** or **not rejecting** (aka **retaining**) the null hypothesis on the basis of some test, but not of **accepting** the null hypothesis or the alternative hypothesis. Often in an experiment we are actually testing the validity of the alternative hypothesis by testing whether to reject the null hypothesis.

When performing such tests, there is some chance that we will reach the wrong conclusion. There are two types of **errors**:

- **Type I** – H_{0} is rejected even though it is true (**false positive**)
- **Type II** – H_{0} is not rejected even though it is false (**false negative**)

The acceptable level of a Type I error is designated by **alpha** (*α*), while the acceptable level of a Type II error is designated **beta** (*β*).

We use the following terminology:

**Significance level** is the acceptable level of type I error, denoted *α*. Typically, a significance level of *α* = .05 is used (although sometimes other levels such as *α* = .01 may be employed). This means that we are willing to tolerate a type I error rate of up to 5%, i.e. when the null hypothesis is true, we are willing to accept that in about 1 out of every 20 samples we will reject it anyway.

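**P-value** (the **probability value**) is the probability *p*, calculated on the assumption that the null hypothesis is true, of obtaining a value of the test statistic at least as extreme as the one observed in the sample. If *p* < *α* then we reject the null hypothesis.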

**Critical region** is the part of the sample space that corresponds to the rejection of the null hypothesis, i.e. the set of possible values of the test statistic which are better explained by the alternative hypothesis. The significance level is the probability that the test statistic will fall within the critical region when the null hypothesis is assumed.

Usually the critical region is depicted as a region under a curve for continuous distributions (or a portion of a bar chart for discrete distributions).

The typical approach for testing a null hypothesis is to select a statistic based on a sample of fixed size, calculate the value of the statistic for the sample and then reject the null hypothesis if and only if the statistic falls in the critical region.
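A small simulation can illustrate the link between the significance level and the critical region described above. The sketch below is only an illustration under stated assumptions: it repeatedly draws samples from a normal population for which the null hypothesis is true, uses a z statistic with a known standard deviation, and takes the upper tail of the standard normal distribution as the critical region. The function name and all numeric values are hypothetical.

```python
import random
from math import sqrt
from statistics import NormalDist, mean


def type_one_error_rate(trials=4000, n=25, mu0=20.0, sigma=4.0,
                        alpha=0.05, seed=1):
    """Fraction of samples, drawn while H0 is true, that land in the critical region."""
    rng = random.Random(seed)                    # fixed seed so the run is repeatable
    z_crit = NormalDist().inv_cdf(1 - alpha)     # boundary of the upper-tail critical region
    rejections = 0
    for _ in range(trials):
        # Assumption: the population really has mean mu0, so H0 holds.
        sample = [rng.gauss(mu0, sigma) for _ in range(n)]
        z = (mean(sample) - mu0) / (sigma / sqrt(n))
        if z > z_crit:                           # statistic falls in the critical region
            rejections += 1
    return rejections / trials


rate = type_one_error_rate()
print(f"observed type I error rate: {rate:.3f}")
```

With enough trials the observed rejection rate should settle close to *α*, since every rejection here is a type I error.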

**One-tailed hypothesis testing** specifies a direction of the statistical test. For example, to test whether cloud seeding increases the average annual rainfall in an area which usually has an average annual rainfall of 20 cm, we define the null and alternative hypotheses as follows, where *µ* represents the average rainfall after cloud seeding.

H_{0}: *µ* ≤ 20 (i.e. average rainfall does not increase after cloud seeding)

H_{1}: *µ* > 20 (i.e. average rainfall increases after cloud seeding)

Here the experimenters are quite sure that the cloud seeding will not significantly reduce rainfall, and so a one-tailed test is used where the critical region is as in the shaded area in Figure 1. The null hypothesis is rejected only if the test statistic falls in the critical region, i.e. the test statistic has a value larger than the critical value.

The critical region here is the **right** (or **upper**) **tail**. It is quite possible to have one-sided tests where the critical region is the **left** (or **lower**) **tail**. For example, suppose the cloud seeding is expected to decrease rainfall. Then the null and alternative hypotheses could be as follows:

H_{0}: *µ* ≥ 20 (i.e. average rainfall does not decrease after cloud seeding)

H_{1}: *µ* < 20 (i.e. average rainfall decreases after cloud seeding)
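The two one-tailed set-ups above can be sketched in code. This is a minimal illustration, assuming a z-test with a known population standard deviation; the function name, the sample mean of 21.5 cm, the standard deviation of 4 cm, and the sample size of 25 are all hypothetical and do not come from the cloud seeding example itself.

```python
from math import sqrt
from statistics import NormalDist


def one_tailed_z_test(sample_mean, mu0, sigma, n, tail, alpha=0.05):
    """One-tailed z-test for a mean, assuming sigma is known.

    tail="upper" tests H1: mu > mu0; tail="lower" tests H1: mu < mu0.
    Returns (z, p_value, reject).
    """
    std_normal = NormalDist()
    z = (sample_mean - mu0) / (sigma / sqrt(n))
    if tail == "upper":
        p_value = 1 - std_normal.cdf(z)    # area to the right of z
    elif tail == "lower":
        p_value = std_normal.cdf(z)        # area to the left of z
    else:
        raise ValueError("tail must be 'upper' or 'lower'")
    return z, p_value, p_value < alpha


# Hypothetical data: 25 years after seeding, mean rainfall 21.5 cm, sigma assumed 4 cm.
z, p, reject = one_tailed_z_test(21.5, mu0=20, sigma=4, n=25, tail="upper")
print(f"upper tail: z = {z:.3f}, p = {p:.4f}, reject H0: {reject}")

z_low, p_low, reject_low = one_tailed_z_test(21.5, mu0=20, sigma=4, n=25, tail="lower")
print(f"lower tail: z = {z_low:.3f}, p = {p_low:.4f}, reject H0: {reject_low}")
```

The same statistic, z = 1.875, leads to opposite decisions depending on which tail forms the critical region, which is why the direction has to be fixed before looking at the data.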

**Two-tailed hypothesis testing** doesn’t specify a direction of the test. For the cloud seeding example, it is more common to use a two-tailed test. Here the null and alternative hypotheses are as follows.

H_{0}: *µ* = 20

H_{1}: *µ* ≠ 20

The reason for using a two-tailed test is that even though the experimenters expect cloud seeding to increase rainfall, it is possible that the reverse occurs and, in fact, a significant decrease in rainfall results. To take care of this possibility, a two-tailed test is used with the critical region consisting of both the upper and lower tails.

In this case we reject the null hypothesis if the test statistic falls in either tail of the critical region. To achieve a significance level of *α*, the critical region in each tail must have size *α*/2.
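As a rough sketch of how the *α*/2 split works in practice, the code below computes both critical values and a two-tailed p-value. It assumes a z-test with a known population standard deviation; the function name and the numeric inputs are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_tailed_z_test(sample_mean, mu0, sigma, n, alpha=0.05):
    """Two-tailed z-test for H0: mu = mu0, assuming sigma is known."""
    std_normal = NormalDist()
    z = (sample_mean - mu0) / (sigma / sqrt(n))
    upper_crit = std_normal.inv_cdf(1 - alpha / 2)   # alpha/2 in the upper tail
    lower_crit = -upper_crit                         # alpha/2 in the lower tail
    p_value = 2 * (1 - std_normal.cdf(abs(z)))       # both tails counted
    reject = z < lower_crit or z > upper_crit
    return z, (lower_crit, upper_crit), p_value, reject


# Hypothetical data: sample mean 21.5 cm from 25 years, sigma assumed 4 cm.
z, crit, p, reject = two_tailed_z_test(21.5, mu0=20, sigma=4, n=25)
print(f"z = {z:.3f}, critical values = ({crit[0]:.3f}, {crit[1]:.3f})")
print(f"p = {p:.4f}, reject H0: {reject}")
```

For these inputs z = 1.875 lies between the critical values of roughly ±1.96, so the null hypothesis is retained at *α* = .05.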

**Statistical power** is 1 – *β*. Thus power is the probability of finding an effect when one exists, i.e. the probability of correctly rejecting a false null hypothesis. While a significance level for type I error of *α* = .05 is typically used, generally the target for *β* is .20 or .10, and so .80 or .90 is used as the target value for power.
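To make the relationship between *β* and power concrete, here is a minimal sketch for an upper-tailed z-test. It assumes a known population standard deviation and a specific hypothetical true mean `mu1` under the alternative hypothesis; the function name and all numbers are illustrative assumptions.

```python
from math import sqrt
from statistics import NormalDist


def power_upper_tailed_z(mu0, mu1, sigma, n, alpha=0.05):
    """Power of an upper-tailed z-test when the true mean is mu1 (sigma assumed known)."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha)       # start of the critical region
    shift = (mu1 - mu0) / (sigma / sqrt(n))      # how far the true mean moves the z statistic
    beta = std_normal.cdf(z_crit - shift)        # probability of missing the effect (type II error)
    return 1 - beta


# Hypothetical scenario: true mean 22 cm versus mu0 = 20 cm, sigma = 4 cm, n = 25.
power = power_upper_tailed_z(mu0=20, mu1=22, sigma=4, n=25)
print(f"power = {power:.3f}")
```

Under these assumptions the power comes out at about 0.80, which matches the lower of the two usual targets. If `mu1` equals `mu0` there is no effect to find, and the function returns *α*.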

The **general procedure** for null hypothesis testing is as follows:

- State the null and alternative hypotheses
- Specify *α* and the sample size
- Select an appropriate statistical test
- Collect data (note that the previous steps should be done prior to collecting data)
- Compute the test statistic based on the sample data
- Determine the p-value associated with the statistic
- Decide whether to reject the null hypothesis by comparing the p-value to *α* (i.e. reject the null hypothesis if *p* < *α*)
- Report your results, including effect sizes (as described in Effect Size)
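The steps above can be sketched end to end as follows. This is only an illustration: the rainfall values are made up, the test is a two-tailed z-test with an assumed known standard deviation, and the standardized mean difference is used as one simple possible effect size rather than the measure any particular source prescribes.

```python
from math import sqrt
from statistics import NormalDist, mean

# Steps 1-3: hypotheses, alpha, sample size and test are fixed before any data is seen.
# H0: mu = 20, H1: mu != 20, two-tailed z-test, sigma assumed known.
MU0, SIGMA, ALPHA, N = 20.0, 4.0, 0.05, 10

# Step 4: collect data (hypothetical annual rainfall in cm).
rainfall = [24.1, 19.8, 22.5, 26.0, 21.3, 23.7, 18.9, 25.2, 22.8, 24.4]
assert len(rainfall) == N

# Step 5: compute the test statistic.
sample_mean = mean(rainfall)
z = (sample_mean - MU0) / (SIGMA / sqrt(N))

# Step 6: determine the p-value (both tails).
p_value = 2 * (1 - NormalDist().cdf(abs(z)))

# Step 7: compare the p-value to alpha.
reject = p_value < ALPHA

# Step 8: report, including a standardized mean difference as the effect size.
effect_size = (sample_mean - MU0) / SIGMA
print(f"mean = {sample_mean:.2f}, z = {z:.3f}, p = {p_value:.4f}")
print(f"reject H0: {reject}, effect size = {effect_size:.3f}")
```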

**Observation**: Suppose we perform a statistical test of the null hypothesis with *α* = .05 and obtain a p-value of *p* = .04, thereby rejecting the null hypothesis. This does not mean that there is a 4% probability of the null hypothesis being true, i.e. *P*(H_{0}) = .04. What we have shown instead is that, assuming the null hypothesis is true, the conditional probability that the sample data exhibits a test statistic at least as extreme as the one obtained is .04; i.e. *P*(D|H_{0}) = .04, where *D* = the event that the sample data exhibits a test statistic at least as extreme as the one observed.

I have a question and hope you can help me. My null hypothesis is ‘there is a negative effect of X on Y’ and my alternative hypothesis is ‘there is a positive effect of X on Y’. The significance level is set at .05.

I use linear regression for the test and find that the unstandardized coefficient between X and Y is -.503 and Sig is .828. In this case, should I reject or accept the null hypothesis, and why?

Since sig = .828 > .05 = alpha, you would retain the null hypothesis. Note that you don’t “accept” the null hypothesis, but merely state that you don’t have sufficient reasons to reject it. This null hypothesis is that the corresponding regression coefficient is zero.

Whether this null hypothesis is the same as your null hypothesis is not completely clear to me without further information.

How do you state a null hypothesis saying the effect of beta will be greater than 100.000?

What is beta? Clearly you aren’t referring to the type II error since that can’t exceed 100%.

Is it possible to test an alternative hypothesis such as 5*beta > 25000?

The point is to test whether an increase of 5 in the beta variable is larger than 25000.

Should we use the alternative or the null hypothesis with t-test analysis?

The test is for the null hypothesis, although since the alternative hypothesis is the complement of the null hypothesis it can also be viewed as a test for the alternative hypothesis.

Charles

Finally, I think I grasp the behaviour of NHST in simple terms.

The null hypothesis, H0: p = p0, should be thought of as an approximate condition. The same goes for the alternative hypothesis, H1: p = p1.

Note that the latter value p1 is introduced strictly in order to calculate the Type II error. In fact, it is entirely absent from the test formula, which is derived supposing the null hypothesis is true.

Please, I need as many examples of null and alternative hypotheses as possible.

Dear Sir, I have read somewhere that one can only test the null hypothesis and cannot test the alternative hypothesis. Is it true? Why so?

Dave,

The tests are all set up to test the null hypothesis, but keep in mind that the alternative hypothesis is true if and only if the null hypothesis is false.

Also, when you calculate the power of a test, in some sense you are testing the alternative hypothesis.

Charles

Please explain this to me. My problem is that when I face any question on hypotheses, I cannot work out how to state the null and alternative hypotheses, or which tail to choose. Please clear up this doubt for me.

Null/alternative hypothesis: The best thing to do is look at lots of examples and see how these hypotheses are defined. There are numerous examples given throughout the website. See, for example, the t test.

Tails: Generally you choose the two tailed test. Only when you are pretty sure (usually on theoretical grounds) that one of the tails won’t occur should you consider using a one-tailed test.

The terminology of type I/II errors seems a bit counter-intuitive. If a test result is wrong, then I would agree to call that a ‘false’ test result (i.e. wrong/error = false and accordingly, type I and type II are both called ‘false’). However, if the null hypothesis H0 is ‘rejected’ by the test, then I would call that a ‘negative’ test result (i.e. rejected = negative). Hence the type I error event, in which a true H0 is rejected, would be called a ‘false negative’.

What is the alternative logic of calling a Type I error a ‘false positive’?

Hi Simon,

Here, the word positive refers to the test result: rejecting the null hypothesis amounts to a positive finding, i.e. the test claims to have detected an effect. The word false is used since this conclusion is wrong, because the null hypothesis is actually true.

Charles

If we perform a one tailed test and our directional prediction was incorrect, do we fail to reject the null?

I often see:

H0: u1 = u2

Ha: u1 > u2

So let’s say the test shows we were wrong, in fact u1 < u2. But u1 does not equal u2, so it's difficult for me to want to fail to reject the null. I just did a test like this and I want to reject the null although I was directionally wrong.

Is it the case that even though the null was written as (=), in actuality, there is a tacit agreement that we really mean (= OR <)? Because I really do see this often: the null being written only as (=) and not (= or <). I am having trouble finding a consensus on whether or not to reject the null when you chose the wrong direction. Thanks.

Kyle,

You only perform a one-tailed test if you are sure that one side of the test is impossible. For the example you gave, you are assuming that u1 < u2 can't happen. If you are not sure of this, then you should perform a two-tailed test.

Charles

Why do we use the null hypothesis in statistics instead of the alternative hypothesis?

Maryam,

I believe that it is for historical reasons, but in any case the null and alternative hypotheses are flip sides of the same coin.

Charles

