No menu items!

Chauvenets Criterion

-

Chauvenets Criterion is a statistical test that can be used to find the outliers in the given data. The Chauvenets test assumes that the data is normally distributed. The Chauvenets criterion provides an upper bound such that, if the data point lies beyond the bound it is classified as an outlier.

Procedure to conduct the Chauvenets criterion test:

Step 1: Identify the data point that you want to test for being an outlier

Step 2: Calculate the mean and standard deviation of the data.

Step 3: Calculate the value of the Chauvenets test statistic (essentially the Z score) using the formula,

Test statistic formula for Chavenets criterion

Step 4: Find the critical Chauvenet value at a 5% level of significance for given sample size from the table below,

Critical value table for chauvenets criterion depending on sample size

Step 5: If the test statistic exceeds the upper bound obtained from the table, then we conclude that that data point is an outlier.

Example: Identify the outlier values in the following set of data:

5, 14, 16, 19, 20, 22

Solution: Here n=6. Let us check whether the data point 5 is an outlier or not.

Step 1: We calculate the mean and the standard deviation of the data and obtain,

Mean= 16

Standard deviation= 5.57

Step 2: The value of the Chauvenets test statistic for the data point 5 is,

Test statistic = |5-16|/ 5.57 = 11/5.57 = 1.97

Step 3: From the above table the critical value for n=6 is,

Critical value = 1.732

Step 4:  Since the test statistic exceeds the table value, we conclude that the data point ‘5’ is a spurious outlier. We can similarly test the other data points individually to determine whether they are outliers or not,

Logic behind the Chauvenets criterion:

The logic behind the Chauvenets test is that if the population is normally distributed then the data has a higher chance of lying within certain standard deviations of the mean. The data has a much lesser probability of lying below that bound. So if it does lie beyond the bound it is reasonable to conclude that that data point is a spurious outlier.

Other methods to identify outliers:

Some other methods that can be used to identify outliers are:

  1. The 1.5 IQR rule
  2. Dixons Q test.
  3. Grubbs test.
  4. Visually identifying outliers by drawing scatterplots.

Hey 👋

I have always been passionate about statistics and mathematics education.

I created this website to explain mathematical and statistical concepts in the simplest possible manner.

If you've found value from reading my content, feel free to support me in even the smallest way you can.



Share this article

Recent posts

Popular categories

Recent comments