Inferential Statistics - Degrees Of Freedom

Degrees Of Freedom:

Degrees of freedom refers to the maximum number of logically independent values in a data sample which have the freedom to vary within.

Example:

If there is a sample of 3 values {5, x, 15} and the mean of all the values is 10.

Now it is easy to say that the value of x would be 10 as the mean of these 3 values is 10.

But if 2 values from this sample are not known, say {5, x, y} with same mean 10, then we are now cannot be sure about the exact values of x & y.

It could be any values from (10, 15), (15, 10), (5, 20), (20, 5) or even (1, 24).

So we cannot determine the exact value of these data x & y.

These 2 values has a freedom to vary.

But the third value do not have the freedom to change as it has to be some value so that the mean will not change. So this value depends upon all the other values.

So the degrees of freedom of this sample data of size 3 is 2.

Not only with size 3 sample, a sample with any size we can determine only one value if it is unknown as it depends on all the other values in the sample.

So the degrees of freedom is always the sample size minus 1.

Formula:

V= n – 1

V = Degrees of freedom

n = Sample size

In the above example, there is only one constraint placed in the set that the “mean is 10”.

Therefore the constraint placed on the freedom is one and degrees of freedom is two.

If we mention number of constraints as k, then

V = n – k

As the restrictions increase, the freedom is reduced.

Handed- ness Sex	Right handed	Left handed	Total
Male	43	9	52
Female	44	4	48
Total	87	13	100

In the above matrix of 2 X 2, the degrees of freedom of Gender and Handedness, each having 1 constraints in it (Total) and size is 2, then the degrees of freedom is as given below:

V (nu) = (c - 1) (r - 1)

= (2 – 1) (2 – 1)

= 1

P.S. Contingency Table: In statistics, a contingency table (also known as a cross tabulation or crosstab) is a type of table in a matrix format that displays the (multivariate) frequency distribution of the variables.

Conclusion:

Degrees of freedom of a data set is n – k, whereas n – Size of data set & k – Number of constraints placed.

As the constraints increase, the freedom is reduced.

If there are more than one variable are combined into a matrix, then the entire degrees of freedom is the product of degrees if freedom of each variable.

Datalogy Today

Search This Blog

Inferential Statistics - Degrees Of Freedom

Comments

Post a Comment

Popular posts from this blog

Inferential Statistics - Hypothesis Testing Part #1

Inferential Statistics - An Introduction