Two Factor ANOVA without Replication

To help introduce the basic concepts we start with the following example.

Example 1: A new fertilizer has been developed to increase the yield on crops, and the makers of the fertilizer want to better understand which of the three formulations (blends) of this fertilizer are most effective for wheat, corn, soybeans and rice (crops). They test each of the three blends on one sample of each of the four types of crops. The crop yields for the 12 combinations are as shown in Figure 1.

Figure 1 – Data for Example 1

We interrupt the analysis of this example to give some background, after which we will resume the analysis.

Definition 1: We define the structural model as follows.

A factor is an independent variable. A k factor ANOVA addresses k factors.

A level is some aspect of a factor; these are what we called groups or treatments in the one-factor analysis discussed in Basic Concepts for ANOVA.

In Example 1 there are two factors: blends and crops. The blend factor has 3 levels and the crop factor has 4 levels.

In general, suppose we have two factors A and B. Factor A has r levels and factor B has c levels. We organize the levels for factor A as rows and the levels for factor B as columns. We use the index i for the rows (i.e. factor A) and the index j for the columns (i.e. factor B). Thus we use an r × c table where the entries in the table are

We use terms such as x̄_i (or x̄_i.) as an abbreviation for the mean of {x_ij: 1 ≤ j ≤ c}. Similarly, we use terms such as x̄_j (or x̄_.j) as an abbreviation for the mean of {x_ij: 1 ≤ i ≤ r}.

We estimate the level means from the total mean for factor A by μ_i = μ + α_i where α_i denotes the effect of the ith level for factor A (i.e. the departure of the ith level mean μ_i for factor A from the total mean μ). We have a similar estimate for the sample of x̄_i = x̄ + a_i.

Note that

Similarly, we estimate the level means from the total mean for factor B by μ_j = μ + β_j where β_j denotes the effect of the jth level for factor B (i.e. the departure of the jth level mean μ_j for factor B from the total mean μ). We have a similar estimate for the sample of x̄_j = x̄ + b_j.

As for factor A,

The two-way ANOVA will either test for the main effects of factor A or factor B, namely

H₀: μ_1. = μ_2. =⋯= μ_r. (Factor A)

H₀: μ.₁ = μ.₂ =⋯= μ._c (Factor B)

If testing for factor A, the null hypothesis is equivalent to

H₀: α_i = 0 for all i

If testing for factor B, the null hypothesis is equivalent to

H₀: β_j = 0 for all j

Finally, we can represent each element in the sample as x_ij = μ + α_i + β_j + ε_ij where ε_ij denotes the error (or unexplained amount). As before we have the sample version x_ij = x̄ + a_i + b_j + e_ij where e_ij is the counterpart to ε_ij in the sample.

Observation: Since

It follows that

Similarly,

It is easy to show that

Definition 2: Using the terminology of Definition 1, define

Correction: The term x_ij in the formula for SS_E in the above table should not have a bar over it.

Property 1:

Proof: Clearly

If we square both sides of the equation, sum over i, j and then simplify (with various terms equal to zero as in the proof of Property 2 of Basic Concepts for ANOVA), we get the first result. For the second,

Property 2: If a sample is made as described in Definition 1, with the x_ij independently and normally distributed and with all $\sigma^2_{i.}$ (or $\sigma^2_{.j}$ ) equal, then

Proof: The proof is similar to that of Property 1 of Basic Concepts for ANOVA.

Theorem 1: Suppose a sample is made as described in Definitions 1 and 2, with the x_ij independently and normally distributed.

If all μ_i are equal and all $\sigma^2_{i}$ are equal then

If all μ_j are equal and all $\sigma^2_{j}$ are equal then

Proof: The result follows from Property 2 and Theorem 1 of F Distribution.

Property 3:

Observation: We use the following tests:

Recall that the assumptions for using these tests are:

All samples are drawn from normally distributed populations
All populations have a common variance
All samples were drawn independently from each other
Within each sample, the observations were sampled randomly and independently of each other

We now return to Example 1 and show how to conduct the required analysis using Excel’s Anova: Two-factor Without Replication data analysis tool.

Example 1 (continued): The output from the data analysis tool is shown in Figure 2.

Figure 2 – Two factor ANOVA w/o replication data analysis tool

There are two null hypotheses: one for the rows and the other for the columns. Let’s look first at the rows:

H₀: there is no significant difference in yield between the (population) means of the blends

Since the p-value for the rows = .0068 < .05 = α (or F = 12.83 > 5.14 = F-crit) we reject the null hypothesis, and so at the 95% level of confidence we conclude there is significant difference in the yields produced by the three blends.

The null hypothesis for the columns is

H₀: there is no significant difference in yield between the (population) means for the crop types

Since the p-value for the columns = .1446 > .05 = α (or F = 2.63 < 4.76 = F-crit) we can’t reject the null hypothesis, and so at 95% level of confidence we conclude there is no significant difference in the yields for the four crops studied.

Observation: Although the analysis in Figure 2 was produced automatically by Excel’s data analysis tool, the same result can be produced using Excel formulas, just as we were able to do in Basic Concepts of ANOVA for one-way ANOVA. The most interesting cells are the ones corresponding to the four sum squares. We show how to calculate the values for each of those cells in Figure 3.

Figure 3 – Key formulas for analysis from Figure 2

The formulas for calculating SS_Row and SS_Col in Definition 2 involve taking squared deviations of the group means. E.g. SS_Row can be calculated via the formula =DEVSQ(I6:I8)/H6. Alternatively, we can take squared deviations from the sums of each group, as is done in Figure 3.

Real Statistics Excel Capabilities: The Real Statistics Resource Pack contains a number of functions and the Two Factor ANOVA data analysis tool that support Two Factor ANOVA without Replication. You can get more information about these in Two Factor ANOVA with Replication.

171 thoughts on “Two Factor ANOVA without Replication”

Faye

July 26, 2023 at 12:42 pm

Hi Charles,

I don’t know how I can do a two-way ANOVA with the data that I have, but my supervisor tells me it’s possible. I have three different types of flowers (sunflower, tulip and lilies). Each group is getting infested with 40 different insects. My dependant variable is the number of a certain insect found in the flower for Day X. Basically, for every day (Day 1, 2, 3, 4, etc.), we look at only one type of flower over time and we count how many insects there are.

For example, for Day 4, the sunflower type will have 4 ladybugs, 0 caterpillars, 10 beetles, and so on for 37 more insects. Then, for Day 4, the tulip type will have the same kind of insects but different count, and so is the lily. Then we do this again for the other days.

I am mostly interested in the variation of insects between the 3 flower types. Is it possible to do a two-way ANOVA with replication ? I would consider my 40 types of insects as my 40 replicates, but the only issue is, that theoritically, it’s not really “replicates” since it’s not the same insect. Or should I do without replication since I only have one replicate for each Day/insect type (e.g. 4 ladybugs for Day 4 in Tulip).
Reply
Chingho WANG

November 16, 2022 at 4:19 am

Dear Charles,
Thank you for such a helpful website which I learned a lot from. Regarding interaction of two factors without repeated measurements, can I use Tukey’s one df test for nonadditivity? I used R to perform that test and got p = 0.28, which means no interaction between blend and crop.
Ching-Ho WANG, National Taiwan University
Reply
- Charles
  
  November 19, 2022 at 11:38 pm
  
  Dear Chingho Wang,
  I am pleased that you have found the Real Statistics website to be helpful.
  Yes, you can use Tukey’s Additivity Test in this case. p = .28 would indicate no interaction.
  The Real Statistics website doesn’t explain how to perform this test. I plan to add this to the website in the next couple of days.
  Thank you for your question, which has prompted me to add the test to the website.
  Charles
  Reply
  - Rafael
    
    March 12, 2023 at 1:41 am
    
    sir i am Rafael may i kindly ask sir
    1) what is ANOVA two way WITH replication, i need a long definition
    where and when should be used
    
    2) what is ANOVA two way WITH replication long definition too
    
    3) what is the difference between the two and where and when should they be used
    Reply
    - Charles
      
      March 13, 2023 at 8:36 pm
      
      Hi Rafael,
      1) Two-way ANOVA means that there are two factors. Each factor can have multiple levels. With replication means that for every combination of levels from the two factors, there are more than one sample element. See the following webpage for more details:
      https://real-statistics.com/two-factor-anova-with-replication/
      2) 1) Two-way ANOVA means that there are two factors. Each factor can have multiple levels. Without replication means that for every combination of levels from the two factors, there is exactly one sample element.
      3) The with replications version is the more commonly used version. The without replications version is used when there are no replications. It can also be used for Repeated Measures ANOVA and Randomized Complete Block Design.
      Charles
      Reply
      - Rafael
        
        March 13, 2023 at 10:36 pm
        
        sir what kind of element in without replication is?
      - Charles
        
        March 16, 2023 at 7:51 am
        
        Sorry, but I don’t understand your question.
        Charles
      - Rafael
        
        March 18, 2023 at 11:15 am
        
        sir thank you for answering my mistake on another replication, it is without replication,
        
        anyway can i know what is the formula for (with) replication and can i have the definition per sign same with (without) replication formula?
- Charles
  
  November 20, 2022 at 11:23 am
  
  I have now added an explanation of Tukey’s Additivity Test to the website. See
  Tukey’s Additivity Test
  I plan to add support for this test to the Real Statistics software in the next release.
  Charles
  Reply
  - Rafael
    
    March 21, 2023 at 8:28 am
    
    i mean what is the formula of two way anova with replication and without replication sir?
    Reply
Qirom

November 6, 2022 at 4:03 pm

Very good article, however, I would like to ask you about how we can measure normality and homogeneity tests for ANOVA two factors without replication. Should we measure them one-by-one factor? Thank you
Reply
- Charles
  
  November 7, 2022 at 11:37 am
  
  Yes, that is correct.
  Charles
  Reply
Severin

June 3, 2022 at 1:57 pm

Hello Charles.

Why didnt you include the interaction between levels of different factors into the systematic variability? Isnt the interaction part of the general variability of x,ij?

Thank you,
Severin
Reply
- Charles
  
  June 4, 2022 at 11:28 am
  
  Hello Severin,
  Since we are considering the case where there is no replication, the interaction between levels of the two factors consists of one element. In this case, there is no variability (e.g. VAR.S(R1) is undefined when R1 consists of one cell).
  There is interaction variability when there is replication. This case is discussed at
  Two Factor ANOVA with Replication.
  Charles
  Reply
Mark Colpus

April 20, 2022 at 2:04 pm

Why is this not chi-square?
Reply
- Charles
  
  April 21, 2022 at 6:53 pm
  
  Hello Mark,
  The data for the chi-square test of independence consists of counts (i.e. non-negative integers such as 30). The data for Two Factor ANOVA without Replications uses decimal values (such as 30.65 or -3.4). The null-hypothesis is also different.
  Charles
  Reply
  - Mark Colpus
    
    April 21, 2022 at 10:33 pm
    
    Dear Charles,
    
    Thank you for the quick and clear clarification. And thank you for such a helpful website?
    
    Best wishes,
    Mark
    Reply
Anonymous

November 3, 2021 at 5:34 pm

Hi. So I am evaluating, if the checks done by my colleagues are are significantly different for a specific equipment. I have 5 lab mates and each did 3 trials. So would it be one way anova?
Reply
- Charles
  
  November 4, 2021 at 2:30 pm
  
  It sure sounds like a fit for ANOVA, but, as they say, “the devil is in the details”.
  Charles
  Reply
Riana

April 20, 2021 at 9:52 pm

Hello Charles,
I have two independent variables (time and temperature) then one dependent variable (starch content). There are 3 levels of time and 2 levels of temperature. I have only done 2 trials for the six combinations. What type of ANOVA should I use? And Can I use Tukey’s Method afterwards?

Thank you!
Reply
- Charles
  
  April 21, 2021 at 8:37 am
  
  Hello Riana,
  It depends on what hypotheses you want to test. You might use two-factor ANOVA, although one factor ANOVA or a t-test may be applicable.
  Charles
  Reply
Fransiska

December 9, 2020 at 4:19 am

Hi Charles, I did my two way ANOVA without replication and I am wondering whether I can do post hoc analysis on it? If yes, which post hoc analysis should I use?
Reply
- Charles
  
  December 9, 2020 at 3:45 pm
  
  Hello Fransiska,
  See https://www.real-statistics.com/two-way-anova/follow-up-analyses-for-two-factor-anova-w-o-replications/
  Which post hoc test to use depends on what hypothesis or hypotheses you want to test.
  Charles
  Reply
Fiza

December 9, 2020 at 12:03 am

Hi Charles.

Thank you for the example. I have a questions. My data is about STEM students between girls and boys in 4 fields which are science, engineering, technology and mathematics. So, do I have to perform the test year by year?
Reply
- Charles
  
  December 9, 2020 at 3:35 pm
  
  Fiza,
  I need additional information to be able to address your question. What hypothesis (or hypotheses) are you trying to test?
  Charles What sort of data have you gathered? (or are you preparing to gather?)
  Reply
werner bartmann

November 23, 2020 at 2:30 pm

Dear Charles,
I found negative values in my ss for interaction in the two way Anova. Is this possible? Is something wrong with my data?

Best regards

Werner
Reply
- Charles
  
  November 23, 2020 at 2:58 pm
  
  Werner,
  SS should never be negative. I can think of two reasons for getting a negative value: (1) it is a negative value near zero, in which case you should interpret it as zero and (2) you have an unbalanced model and need to use the Regression option. Perhaps it could also happen if there are no interactions.
  If you email me an Excel file with your data and test results, I will try to figure out what is going wrong.
  Charles
  Reply
Afnan

September 14, 2020 at 2:23 am

Sir i want to test the effect of three priming techniques each with two levels on four maize hybrids which anova should i use kindly tell me.
Reply
- Charles
  
  September 14, 2020 at 8:27 am
  
  You need to supply additional information. E.g. (1) what hypothesis or hypotheses are you trying to test? (2) Are the priming techniques applied to the same maize hybrids or different hybrids? (3) What are the levels? E.g. they are different dosages of the priming techniques.
  Charles
  Reply
Neetu

August 28, 2020 at 6:05 pm

I have nine treatments of different types of manure and have to test all these treatments on one crop . Which test I will use for my experiment?
Reply
- Charles
  
  August 29, 2020 at 9:35 am
  
  Neetu,
  What hypothesis or hypotheses are you trying to test? Do you have any constraints on your data?
  Charles
  Reply
Margaret S Ferguson

July 25, 2020 at 3:17 am

Hi Charles,
Is there a way to perform a post hoc on a two factor ANOVA without replicates? if so what post hoc is best? I had been trying to use a post hoc t test because i didnt think that i needed anything much more complicated than that, however, at times i found that the ANOVA would come back saying that there was significance when the t test said that there was not. Just looking for some help as this is my first time doing two factor ANOVAs.
Reply
- Charles
  
  July 26, 2020 at 11:18 am
  
  Hello Margaret,
  See the following webpage:
  
  Charles
  Reply
Daniel

November 12, 2019 at 4:40 am

Dear Charles,
I have 3 batches of the same product, 9 samples were withdrawn from each batch during the manufacturing process. I am looking to assess to inter & Intra batch variability. Can you please advise if Anova Two factors w/o replications will be suitable for this case?
Regards Daniel
Reply
- Charles
  
  November 12, 2019 at 2:06 pm
  
  Hello Daniel,
  I am not familiar with the term “inter & Intra batch variability”. The following is an article that you might find helpful.
  https://www.tandfonline.com/doi/pdf/10.1080/17453670710014004
  Charles
  Reply
Sander

October 15, 2019 at 2:05 pm

Dear Charles,
First thank you for this magnificent side.
In the example you concluded a significant difference between the yields produced by the three blends, but if I were to find out if they all differed or only two of them did. What kind of post-test should I use?

Best Regards Sander
Reply
- Charles
  
  October 15, 2019 at 11:50 pm
  
  See https://real-statistics.com/two-way-anova/follow-up-analyses-for-two-factor-anova-w-o-replications/
  Charles
  Reply

171 thoughts on “Two Factor ANOVA without Replication”

Leave a Reply to Charles Cancel reply