Index of qualitative variation

Expressed as a ratio from 0 to 1.00.   Observed in a distribution of scores to the maximum variation that could exist in a particular distribution.

 0.00 indicates no variation.  1.00 represents maximum variation.

 The most common usage for this is for nominal variables and it can be used with any variable when scores have been grouped into a frequency distribution.

For example, given the following tables representing a city's marital status in the hundreds, we may want to know which of the two cities is more diverse (or heterogeneous).

 



Marital Status


Frequency


Married


10


Living Together


2


single


6


Separated


1


Widowed


1


Divorced


0


 


 


 


 


Marital Status


Frequency


Married


5


Living Together


8


single


4


Separated


2


Widowed


0


Divorced


1

 

At a glance, it is difficult to discern which one of the two is more diverse.  So we need to compute the index of qualitative variation (IQV). To compute the (IQV) we apply the following formula. 

k(N2 - sum(f2)) / N2(k - 1)

where:

k = number of categories

N = number of cases (total the frequency counts)

sum of f2 = the sum of the squared frequencies (that is, square the frequencies first, then add them up)

 

The result should be something like this:

 



Marital Status


Frequency


Frequency Squared


 


 


Married


10


100


 


 


Living Together


2


4


 


 


single


6


36


 


 


Separated


1


1


 


 


Widowed


1


1


 


IQV


Divorced


0


0


 


0.774


 


 


 


 


 


 


 


 


 


 


Marital Status


Frequency


Frequency Squared


 


 


Married


5


25


 


 


Living Together


8


64


 


 


single


4


16


 


 


Separated


2


4


 


 


Widowed


0


0


 


IQV


Divorced


1


1


 


0.87

 

In the attached excel sheet, you will be able to audit the formula for IQV if you want excel to do your calculations.

 

AttachmentSize
index of qualitative variation.xls14.5 KB