Professional Documents
Culture Documents
1. Introduction
This paper sets out the indirect standardisation methodology and statistical methods
used in the construction of the clinical indicators. The paper is aimed at information
staff, but a knowledge of statistics is required to understand the method of dealing
with over-dispersion.
Standardisation methodology
The standardisation methodology is illustrated using the worked example below. The
example considers data for emergency readmissions following treatment for hip
fracture for trust X, but is equally applicable to the other acute and mental health
clinical indicators.
Expected hip fracture readmissions are calculated for each sex and age group using
the formula:
xs ( j )
Expected number of readmissions = n ( j ) *
ns ( j )
where
n(j) = number of spells relevant to the indicator in age group j in trust X
xs(j) = number of readmissions in age group j in the England data
ns(j) = number of spells relevant to the indicator in age group j in the England data
So, referring to the data in table 1, for age 80-84, the expected number of male
readmissions
=
15 * (155/1510) = 1.54
40 * (588/6837) = 3.44
j
0
1-4
5-9
10-14
15-19
20-24
25-29
30-34
35-39
40-44
45-49
50-54
55-59
60-64
65-69
70-74
75-79
80-84
85+
Total
England
Numerator
Denominator
Numerator
Denominator
Denominator
spells where
patient is
readmitted <28
days, x(j)
Male
Female
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
1
0
0
0
1
0
3
2
4
1
3
3
9
7
21
Spells with
emergency
admission for hip
fracture, n(j)
Denominator spells
where patient is
readmitted <28 days,
xs(j)
Male
0
1
0
1
0
1
0
0
0
1
0
1
0
1
4
6
7
15
9
47
Male
0
3
1
2
2
1
1
6
7
7
11
28
28
34
48
95
164
155
239
832
Male
5
27
23
74
51
35
60
117
112
140
155
270
332
399
580
969
1453
1510
1964
8276
Female
0
0
0
0
0
0
0
0
0
2
0
4
3
2
6
14
29
40
73
173
Female
1
0
0
1
0
0
0
1
1
2
7
13
22
51
89
194
409
588
1124
2503
Female
10
20
17
31
14
13
12
19
38
49
103
269
403
682
1261
2597
5036
6837
12394
29805
x (j )
e (j )
where
x(j) = number of observed readmissions in age group j for PCT / Trust X
e(j) = number of expected readmissions in age group j for PCT / Trust X
So in the example,
SR =
(7 + 21) / (4.76 + 14.47) = 1.46
Expected readmissions
e(j)
Female
Male
0.00
0.00
0.11
0.00
0.00
0.00
0.03
0.00
0.00
0.00
0.03
0.00
0.00
0.00
0.00
0.00
0.00
0.00
0.05
0.08
0.00
0.00
0.10
0.19
0.00
0.16
0.09
0.15
0.33
0.42
0.59
1.05
0.79
2.36
1.54
3.44
1.10
6.62
4.76
14.47
The final indicator value is produced by multiplying through by the England crude rate
per 100,000, i.e.
Indicator value = SR * England crude rate
So in the example,
England crude rate = ((832 + 2503) / (8276 + 29805)) * 100000 = 8757.65
Hence, the indicator value per 100,000 = 1.46 * 8757.65 = 12786.17
This crude rate multiplication is used in both the Scottish and Welsh clinical
indicators and is done in order to produce a more useable and interpretable final
value.
Where data for more than one calendar year are pooled for the calculation of the
indicator, the data are standardised using the England averages for the pooled data.
Where data for a single year are used for the calculation of the indicator, data for the
most recent three calendar years are analysed, with the data for the middle of the
three years being used to provide England averages for standardisation. Data for the
most recent calendar year is used for calculating the published indicator values.
Hypothesis testing
Tests of statistical significance are used to compare the observed rates of a trust with
a standard such as the national average rate. For example, P<0.001 indicates that
there is less than a one in 1000 chance of getting such a high trust result by chance
alone, were the rate of that trust truly at the national standard level. Five bands can
be used to grade trust values i.e. A1, A5 - trust values above the national average
with P<0.001 or P<0.025 respectively; B1, B5 - trust values below the national
average with P<0.001 or P<0.025 respectively; and W trust values not considered
to be statistically significantly different from the national average. P<0.001 and
P<0.025 will generally, but not always exactly, correspond to whether the respective
99.8% and 95% confidence intervals overlap with those of the average.
Over-dispersion
Indicators that are based on large numbers of cases have a precision that can result
in statistically significant differences that are not of practical significance. For the
clinical indicators it is reasonable to accept as inevitable a degree of between trust
variability in performance and seek to identify trusts that deviate from this distribution
of performance, rather than deviating from a single standard. This underlying
distribution is estimated using techniques that avoid undue influence of outlying
trusts, in which a proportion of the top and bottom values are Winsorised (i.e. shrunk
in). The significance of observed deviations then takes into account both the
precision with which the indicator is measured within each trust (i.e. the sample size),
and the estimated between-trust variability. This statistical technique is used for the
clinical indicators to reduce the possibility of inappropriately classifying trusts as
abnormal.
Interpretation
Both the 95% and 99% confidence intervals are calculated for each trust, and can be
used by trusts to compare their performance with the national figure. The upper and
lower confidence limits are provided to give a plausible range for the true underlying
rate. If trust and national confidence intervals do not overlap, this generally indicates
that differences in rates are statistically significant. This approach has been used for
star ratings purposes in the past and is straight-forward for use by trusts. However,
the star ratings model itself has evolved to use formal statistical tests using
probability (P) values rather than overlapping confidence intervals. Trust performance
is indicated by one of five bands A1, A5, W, B5, B1 (defined above) assigned using
these statistical tests.
zi = (yi - o)
si0
(1)
where si0 = standard error of yi given the trust is on target: hence si0 = Var(yi
o,i). zi
is referred to as the unadjusted Z score in the published data. It is important to note
that s0i may not necessarily be the same as the reported si, and hence some care is
required in calculating the z-scores. For example, if yi is an observed proportion
between 0 and 1, then
si = (yi(1- yi)/ni)
where ni is the effective sample size. si0 can then be estimated to be
Winsorising Z-scores
Winsorising consists of shrinking in the extreme Z-scores to some selected
percentile, using the following method.
Rank cases according to their naive Z-scores.
Identify Zq and Z1-q, the 100q% most extreme top and bottom naive Z-scores,
where q might, for example, be 0.1.
Set the lowest 100q% of Z-scores to Zq, and the highest 100q% of Z-scores to Z1q. These are the Winsorised statistics.
This retains the same number of Z-scores but discounts the influence of outliers.
Estimation of over-dispersion
Following the standard approach of generalised linear modelling (McCullagh and
Nelder, 1989) an over-dispersion factor is introduced that will inflate the null
variance, so that
Var(Y , , ) Var (Y , )
0
Suppose we have a sample of I units that we shall assume (for the present) all to be
on-standard. may be estimated as follows:
1
2
z i
I i
(2)
This assumes that Exp(Yi) = i, and that for on-standard trusts i is distributed with
mean 0 and standard deviation . can be estimated using a standard method of
moments (DerSimonian and Laird, 1986).
I * ( I 1)
(3)
w w w
2
I *
I *
is
set to 0 and complete homogeneity is assumed. Otherwise the adjusted z-scores are
given by
ziD = yi - 0
(si2 + 2)
% readmission
10
99.8 % limits
95 % limits
8
6
4
2
0
0
20000
40000
60000
80000
100000
120000
100000
120000
% readmission
10
8
6
4
2
Random effects SD 0.086 based on 10 % winsorised
0
0
20000
40000
60000
80000
Volume of cases
References
Breslow NE & Day NE, Statistical Methods in Cancer Research: Volume II- The
Design and Analysis of Cohort Studies, IARC, Lyon, 1987: 91-95
Breslow NE & Day NE, Statistical Methods in Cancer Research: Volume II- The
Design and Analysis of Cohort Studies, IARC, Lyon, 1987: 107-108
Breslow NE & Day NE, Statistical Methods in Cancer Research: Volume II- The
Design and Analysis of Cohort Studies, IARC, Lyon, 1987: 64
DerSimonian R and Laird N Meta-analysis in clinical trials. Controlled Clinical Trials
1986: 7, 177-188.
McCullagh P and Nelder J. Generalised Linear Models, 2nd edition. Chapman and
Hall, London 1989.
Spiegelhalter DJ. Funnel plots for institutional comparisons. Statistics in Medicine
2004: (to appear)