**Hoover index** is one of the simplest inequality indices to measure the deviation from an ideal equal distribution. It can be interpreted as the maximum vertical deviation of the Lorenz curve from the 45 degree line.

**Theil index** is an inequality measure related to the Shannon entropy. It is often used to measure economic inequality.

Like the Shannon entropy, Theil index can be decomposed in two independent components, for example to descbribe inequality “within” and “in between” subgroups. Low Theil or Hoover index means low inequality, high values stand for a high deviation from an equal distribution.

With

*E*_{i} – Effect in group *i*, *i* = 1 to *N*

E_{t} – Total sum of effects in all N groups

*A*_{i} – Number of items in class *i*

A_{t} – Total number of items in all *N* groups

**Theil Index**:

Eq. 1a *T*_{T} = ln (*A*_{t}/*E*_{t}) – ∑[ *E*_{i}/*E*_{t} ln (*A*_{i}/*E*_{i})]

Eq. 1b * T*_{L} = ln (*A*_{t}/*E*_{t}) – ∑[ *A*_{i}/*A*_{t} ln (*E*_{i}/*A*_{i})]

Taking relative (proportional) variables

*p*_{i} = *E*_{i}/*E*_{t}

*w*_{i} = *A*_{i}/*A*_{t} we get

Eq. 2a * T*_{T} = ∑[ *p*_{i} ln (*p*_{i}/*w*_{i})]

Eq. 2b *T*_{L} = ∑[ *w*_{i} ln (*w*_{i}/*p*_{i})]

The **symmetric Theil index **T_{s} = ½ ( *T*_{T} + *T*_{L}) can be expressed as:

Eq. 3 * T*_{s} = ½ ∑[ (*p*_{i} –*w*_{i}) ln (*p*_{i}/*w*_{i})]

Comparing the symmetric Theil index with the

### Hoover index

Eq. 4 * Hv* = ½ ∑ |*p*_{i} – *w*_{i}|

we see that for the symmetric Theil index the difference (*pi* – *w*_{i}) is weighted with the logarithm of *p*_{i}/*w*_{i}.

The normalized Theil index ranges from 0 to 1:

Eq. 5 T_{norm} = 1 – e^{–T}

### How does the Theil index relate to Shannon entropy?

For *w*_{i} = 1/*N *(same number of items in all groups) we get with Shannon entropy

*H* = – ∑ *p*_{i} ln *p*_{i} and true diversity *D* = exp (*H*):

Eq. 6a * T*_{T} = ln (*N*) – *H*

Eq. 6b * T*_{Tnorm} = 1 – *D*/*N*

and with

MLD = (1/*N*) ∑ ln (1/*p*_{i})

(MLD = mean logarithmic deviation)

Eq. 7 * T*_{L} = MLD – ln (*N*)

For the symmetric Theil index:

Eq. 8 *T*_{s} = ½ (MLD – *H*)

The symmetric Theil index is simply half of the difference between mean log deviation and Shannon entropy.

### Decomposition

The Theil index can be decomposed to find “within group” (w) and “between group” (b) components:

Eq. 9 *T* = *T*_{w} + *T*_{b}

For *j* subgroups (*j* = 1 to *K*) with individual Theil index *T*_{j}

Eq. 10a *T*_{T} = ∑ *s*_{j} T_{Tj} + ∑ *s*_{j} ln (*s*_{j}/*w*_{j})

Eq. 10b *T*_{L} = ∑ *w*_{j} T_{Lj} + ∑ *w*_{j} ln (*w*_{j}/*s*_{j})

*s*_{j} is the share of *E* in group *j* (*E*_{j}/*E*_{tot}); *w*_{j} the relative number of items in subgroup* j* (*N*_{j}/*N*_{tot}). The first term in (10) gives the “within group” component, the second the “between group” component.