You are on page 1of 15

INTERNATIONAL JOURNAL OF CLIMATOLOGY

Int. J. Climatol. 27: 1–15 (2007)


Published online 7 July 2006 in Wiley InterScience
(www.interscience.wiley.com) DOI: 10.1002/joc.1375

Pattern hunting in climate: a new method for finding trends


in gridded climate data
A. Hannachi*
Department of Meteorology, University of Reading, Reading RG6 6BB, UK

Abstract:
Trends are very important in climate research and are ubiquitous in the climate system. Trends are usually estimated using
simple linear regression. Given the complexity of the system, trends are expected to have various features such as global
and local characters. It is therefore important to develop methods that permit a systematic decomposition of climate data
into different trend patterns and remaining no-trend patterns. Empirical orthogonal functions and closely related methods,
widely used in atmospheric science, are unable in general to capture trends because they are not devised for that purpose.
The present paper presents a novel method capable of systematically capturing trend patterns from gridded data. The method
is based on an eigenanalysis of the covariance/correlation matrix obtained using correlations between time positions of the
sorted data, and trends are associated with the leading nondegenerate eigenvalues. Application to simple low-dimensional
time series models and reanalyses data are presented and discussed. Copyright  2006 Royal Meteorological Society
KEY WORDS empirical orthogonal functions; inverse-rank correlation; trend empirical orthogonal functions; trend patterns;
North Atlantic Oscillation; Siberian high
Received 21 December 2005; Revised 3 May 2005; Accepted 16 May 2005

1. INTRODUCTION in recent global surface temperature, which has become


known as global warming. Given the complexity of the
In the last several decades, atmospheric scientists have
system, it is expected that trend characteristics will vary
contributed major efforts in climate data analysis to
from one location to another and this may lead to trend
extract patterns from simulated and observed large grid-
structures that cannot be explained by a single pattern.
ded data sets. Various methods have been developed to
This can happen, e.g., when the data contain two spatially
find patterns of variability from the high-dimensional
weather/climate system. The main objectives of these orthogonal patterns, each reflecting a specific trend.
methods are twofold: (1) reduce the dimensionality of Various trend detection methods exist in the literature.
the system by retaining a much smaller set of dom- These methods are mainly devised for univariate time
inant patterns (e.g. Hannachi and O’Neill, 2001), and series. There is, however, no general method for gridded
(2) obtain a few leading and most prominent patterns data, although EOFs and a few related methods have been
of variability that are physically relevant (e.g. Wallace used for that purpose.
and Thompson, 2002). Among these methods, empirical EOFs are not precisely designed for trend detection.
orthogonal functions (EOFs) have been the most widely EOF analysis finds linear combination of the different
used in atmospheric science since the late 1940s (e.g. variables of a gridded spatio-temporal field that suc-
Obukhov (1947, 1960); Fukuoka (1951); Lorenz (1956); cessively maximises variance subject to orthogonality
see also Craddock (1973) for a discussion of eigenanal- constraints. The obtained linear combinations are the
ysis in meteorology). (uncorrelated) principal components (PCs) and the spatial
Climate is a highly complex system involving many (orthogonal) EOF patterns are the loadings of the PCs.
interacting physical and dynamical processes leading to There is a wealth of literature on EOFs: see for example,
a wide spectrum of variations including trends. A trend is Kutzbach (1967); Preisendorfer (1988); Wilks (1995);
generally taken to mean a (non-random) smooth function von Storch and Zwiers (1999); and Jolliffe (2002). The
with a systematic, generally monotonic, variation in time geometric constraints of EOFs/PCs are very useful, but
series. The phenomenon of trend is ubiquitous in climate they may render difficult the possibility of physical inter-
data. The simplest such phenomenon is a linear trend pretation (Horel, 1981; Richman, 1981, 1986; Hannachi
observed in many atmospheric time series, such as that et al., 2006).
Given the importance of trends and because EOFs
and closely related methods are unable, in general, to
* Correspondence to: A. Hannachi, Department of Meteorology, The
University of Reading, Earley Gate, Reading RG6 6BB, UK. capture trends, it is of great benefit to develop methods
E-mail: A.Hannachi@reading.ac.uk that are capable of capturing trends from spatio-temporal

Copyright  2006 Royal Meteorological Society


2 A. HANNACHI

gridded climate data by decomposing the data into a trend detection is to look for linear combinations that max-
part and a remaining no-trend part. The present paper imise autocorrelation (Box and Tiao, 1977; Solow, 1994).
attempts precisely to achieve this. The paper is organised Solow (1994) finds the smallest eigenvalue, and associ-
as follows. Section 2 presents a brief background on trend ated eigenvector, of the matrix S−1 S∇ , where S and S∇
detection methods in univariate and gridded data, and sets are respectively the covariance matrices of the data and
the motivation of the paper. The method of trend EOFs their first difference.
(TEOFs) is then presented in Section 3. Application Now, the original context of EOFs (e.g. Fukuoka,
to simple low-dimensional time series is presented in 1951) was to achieve a decomposition of a continuous
Section 4, and application to sea level pressure (SLP) and space-time field using an optimal set of basis functions
geopotential height reanalyses from the National Center of space and expansion functions of time. EOFs, how-
for Environmental Prediction and the National Center ever, are unable in general to find trends (see Section 4.)
of Atmospheric Research (NCEP/NCAR) is presented in The EOF method finds a new set of variables that cap-
Section 5. Summary and conclusions are presented in the ture most of the observed variance from the data through
last section. linear combinations of the original variables. EOFs are in
general easy to compute using singular value decompo-
sition (SVD) (Golub and van Loan, 1996). Other useful
2. BACKGROUND AND MOTIVATION features of EOFs include spatial and temporal orthogo-
nality of the EOF patterns and their PCs respectively,
Trend detection in weather and climate is an important and various optimality criteria (Jolliffe, 2002; Magnus
issue in many respects not least for understanding climate and Neudecker, 1995). However, besides not being able
processes and testing for anthropogenic climate change to capture trends, EOFs have also a number of other dif-
signature. Various methods have been developed to ficulties, which are listed below for the sake of a better
deal with trend detection and estimation. Most of these understanding of the method presented in the next sec-
methods deal with univariate time series, the multivariate tion.
case being more difficult. Trend hypotheses can be in
general of two types, namely, step trend, formulated for • Physical interpretability: This constitutes the major
example as a change point problem (Lazante, 1996), or difficulty encountered with EOFs resulting from their
monotonic trend. Trend detection and estimation also rely geometric properties (Horel, 1981; Richman, 1986;
on the method used: parametric or nonparametric. Dommenget and Latif, 2002; Jolliffe, 2003, see also
In climate research, a simple strategy is to study lin- Hannachi et al., 2006).
ear trends using a parametric regression model based • Truncation order: This difficulty, due to the ‘smooth’
on the correlation coefficient between data and time nature of the spectrum, emerges particularly when
(Karl et al., 1993; von Storch and Zwiers, 1999). In EOFs are used for dimension reduction. The distribu-
this case, the significance of the trend is tested using tion of ‘power’ as a function of (spatial) scale makes
a T-statistic. The best known nonparametric approaches truncation ambiguous. Ideally, one would like to have
are based on the Mann–Kendall test (Mann, 1945; a discontinuous (or broken) spectrum with a number
Kendall, 1975; Lettenmaier, 1988) and Spearman’s rho of eigenvalues clearly distinct from and above a noise
test (Yue et al., 2002). Nonparametric tests for step floor.
trends based on a change point algorithm use the • Linearity and second/higher order statistics: Covari-
Mann–Whitney–Wilcoxon rank sum test (Bradley, 1968) ances or correlations used in EOFs represent linear
and the associated Hodges–Lehman estimator of trend measures of association, and hence there is a tendency
magnitude (Hodges and Lehmann, 1963, see also Hirsch to overlook nonlinearity, a main characteristic feature
and Gilroy, 1985). More recent nonparametric methods of climate. In the same vein, because EOFs are based
for trend analysis include wavelets (Antoniadis et al., on second-order statistics, higher-order moments are
1994; Gilbert, 1999; Craigmile et al., 2004) and other systematically excluded from the analysis. For exam-
smoothing methods, such as locally weighted scatterplot ple, the main centres of maximum variance of a given
smoothing LOWESS (Champely and Doledec, 1997). field will heavily reflect on the leading EOFs of that
These methods have been developed to deal with the field.
univariate case. Their extension to gridded data has var- • Ellipticity assumption: The restriction to second-order
ious difficulties not least the fact that climate variables moments readily translates into an implicit normal-
are highly correlated. In addition, the previous methods ity assumption, where the EOFs can be interpreted
cannot tell us whether the data contain one or more trend as the directions of the principal axes of the distribu-
patterns, i.e. they do not provide explicitly trend patterns. tion (Chatfield and Collins, 1989; Jolliffe, 2002). This
Hence detecting and estimating trends from gridded cli- interpretation extends to a large class of probability dis-
mate data can be challenging. tributions, namely, elliptically contoured distributions
To deal with spatial correlation, several authors used (Fang and Zhang, 1990).
EOFs/PCs to find trends. EOFs are also used for other • Correlation versus covariance: This difficulty is related
purposes, such as smoothing, weather forecasting, and to the dilemma of choice between covariance or corre-
dimensionality reduction. Another approach for trend lation when doing EOFs (Wilks, 1995; Jolliffe, 2002).

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
A NEW METHOD FOR FINDING TRENDS IN GRIDDED CLIMATE DATA 3

• Noncorrelation versus independence: This issue arises The obtained correlation is then given by:
often in practice in many cases where noncorrelation is
taken for independence. This is in fact true only when ρr (X, Y ) = cor(FX (X), FY (Y )) = ρ(FX (X), FY (Y )) (3)
the random variables involved are jointly Gaussian.
In fact for two random variables X and Y to be A sample estimate of Equation (3) is precisely the (Spear-
independent, the correlation ρg(X),h(Y ) between g(X) man’s) rank correlation (Appendix A). Rank correlation,
and h(Y ) should satisfy, for almost all functions, g() which measures closeness to curved line, is known to
and h(), the equality: be resistant (Lazante, 1996) and robust (DeGroot and
Schervish, 2002). It does not, however, capture trends
corr(g(X), h(Y )) = ρg(X),h(Y ) = 0 (1) since it is an approximation to ordinary correlation, which
measures closeness to a straight line.
(For precisely measurables functions, see e.g. Papoulis The second multivariate approach based on a gener-
and Pillai (2002).) alised eigenvalue problem (Solow, 1994) has also its own
Independence is indeed much stronger and requires limitations. Maximising autocorrelation (see, e.g. Del-
factorisation of the joint probability density function Sole, 2001) does not in general reflect trend features, and
(pdf) f (x, y) of X and Y (see, e.g. DeGroot and yields results that are in general comparable to those from
Schervish, 2002) as: EOFs/PCs. Box and Tiao (1977) argue that differencing is
not in general recommended because it can lead to vari-
ous complications. A serious problem also emerges when
f (x, y) = fX (x)fY (y) (2) S or S∇ is singular or near-singular. Note that this is in
general the case when dealing with observed large-scale
where fX () and fY () are respectively the marginal pdfs climate data. Furthermore, as for EOFs, the spectrum of
X and Y , yielding therefore Equation (1). S−1 S∇ is in general smooth, and the smallest eigenval-
• Monotonic transformation: The measure of association ues will be degenerate making separation of eigenvalues
based on covariance or correlation is characterised by a difficult.
noninvariance to nonlinear monotonic transformation. Trend patterns are the most natural patterns that are
It is important to have a measure of association that is relevant and, unlike EOFs, can be in general physi-
invariant under monotonic transformation. One would cally interpreted. Local or global trend with complex
expect monotonic transformations not to alter robust spatial patterns cannot be obtained using the previous
associations between variables. approaches. The motivation of this paper is to propose a
method that is easy to use to identify trends. The expecta-
Some of the previous difficulties can be addressed tion is that, unlike EOFs, a multivariate trend–PC method
using particular nonlinear transformations so that all will yield only a small number of nondegenerate eigen-
higher-order moments can be taken into account, e.g. in values above a floor of eigenvalues not associated with
Equation (1). The simplest such transformation should: trends. The patterns associated with the nondegenerate
(1) help remove the effect of the marginal distributions part of the spectrum may be more easily interpreted than
as in independent component analysis (Hyvärinen, 1999; those associated with the degenerate part, and hence the
Hyvärinen et al., 2001) and nonlinear regime studies potential of interpretability of trend patterns. The method,
(Hsu and Zwiers, 2001; Stephenson et al., 2004); and which is of exploratory nature, provides explicitly the
(2) behave like a transfer function (e.g. Hannachi et al., possible number of trend patterns present in the gridded
2003, 2004), i.e. they are precisely cumulative distribu- data. Investigation of sampling properties of the method
tion functions (cdfs). (Functions such as those used in is left for future work.
neural networks known as sigmoids or squashing func-
tions, are well known to enjoy these nice properties (see
e.g. Bishop, 1995). A well known principle used in neural 3. TREND EOFS AND TREND PCS
networks and applied often in blind separation and blind
deconvolution (Bell and Sejnowski, 1995) is based on the 3.1. Method description
fact that the inputs should be weighted so that the dif- Let x1 , x2 , . . . , xp , where xk = (x1k , . . . , xnk ), k =
ferent response levels are used with the same frequency 1, . . . , p, denote the p variables forming our spatio-
(Laughlin, 1981). This principle is rooted in information temporal field. These variables form the columns of a
theory (Shannon and Weaver, 1962) and guarantees, via n × p data matrix X = (xij ). The data matrix X is sup-
entropy maximisation, an optimum information transfer. posed to represent anomalies or departure from the cli-
Following this principle, transmitted information is max- matology. To resolve the issue of trend, the concept of
imised when the sloping part of the sigmoid is optimally monotonicity has to be included. The solution is obtained
lined up with the most probable parts of the input distri- by looking for a new transformation that gives a measure
bution.) A natural choice for these functions corresponds of monotonicity. To achieve this, the original data are
to the cdfs FX () and FY () of X and Y respectively. (This first sorted (e.g. in decreasing order.) Then the position in
choice satisfies in fact Laughlin (1981) principle of opti- time of each datum from the sorted series is taken. These
mality.) (time) positions (from the sorted data) constitute our new

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
4 A. HANNACHI

data. So the newly transformed data q1 , q2 , . . . , qp are (a) Ranks


composed of p time series, each of which is some per- p1
mutation of {1, 2, . . . , n}, i.e.
p3 p4
qk = (q1k , q2k , . . . , qnk ) = (Qk (1), Qk (2), . . . , Qk (n)) (4) p2 pn

t
1 2 3 4 n
for some permutation Qk () of {1, 2, . . . , n}.
As an illustration, let us consider a simple time series
(b) Time-sorting
with only five elements: x = (1.5, 0, −1, 1.2, 3). The new
transformed time series is obtained by sorting first x to
yield (−1, 0, 1.2, 1.5, 3). Then q consists of the (time)
position of these sorted elements as given in the original
time series x to yield q = (3, 2, 4, 1, 5), which is a
permutation Q1 () of {1, 2, 3, 4, 5}. q1 q2 q3 q4 qn
Now by looking for maximum correlation between
time positions of the sorted data one is attempting to find Figure 1. Illustration of a time series using ranks (a) and positions of
times when the different time series are increasing (or sorted data (b). The horizontal axis in (a) refers to the time axis with
p1 , . . . , pn representing the ranks of data, and q1 , . . . , qn represent the
decreasing) together. The leading modes based on this positions of the ordered data.
new correlation (or covariance) provide the answer to
trends as is explained next. Figure 1(a) illustrates a time
series x1 , . . . , xn , (n = 7), where the ranks p1 , . . . , pn of for k, l = 1, 2, . . . , p. TEOFs and trend PCs (TPCs) are
the data are shown. The sequence p1 , . . . , pn reflects the then provided respectively by the ‘EOFs’ and the ‘PCs’
variability in the time series since it is highly correlated using the newly obtained covariance (or correlation)
with the original series. Now, the correlation of the posi- matrix
tions q1 , q2 , . . . , qn with the sequence 1, 2, . . . , n reflects 1 T T
the total monotonicity of the sequence, and hence pro- T = (ρT (xk , xl )) = Q H HQ, (7)
n
vides a robust measure of the strength of the trend. Mono-
tonicity is indeed imprinted in the sequence q1 , . . . , qn  
where H = In − n 1 1 1T is the centring operator, I is
since xq1 ≤ xq2 · · · , ≤ xqn (Figure 1(b)). For illustration n n n

(see Appendix B for details), consider two time series the n × n identity matrix, and 1n = (1, 1, . . . , 1)T is a
xt = at + εt and yt = bt + ηt , t = 1, 2, . . . , n, composed column vector of length n containing only ones.
each of a linear trend and a white noise. Let also qx = In the application presented in the next sections, we use
y y y SVD of the new data matrix HQ to obtain the new trend
(q1x , q2x , . . . , qnx )T and qy = (q1 , q2 , . . . , qn )T denote the
positions of the associated time series. If a = 0 or EOFs and TPCs. To capture trends the method requires
b = 0, then one of the sequences q1x , q2x , . . . , qnx or (see above) that at least two variables be trendy (see also
y y y Appendix B). If only one single variable is trendy, the
q1 , q2 , . . . , qn , respectively, will be ‘random’ (i.e. pat-
ternless), and consequently the sequences qx and qy will method will not capture the trend. This, however, does
be uncorrelated. Now, as a and b increase, the two not happen in reality because of the cross-correlation
sequences become more and more non-random and cor- structure observed in weather/climate. The trends are then
related. Hence, in the presence of a (detectable) trend associated with the leading nondegenerate eigenvalues
the sequences will always not only have a pattern but of Equation (7). The first trend PC will maximise the
this pattern will also follow the trend. This will be the total monotonicity, the second one will also maximise
case even for a relatively weak trend because the cor- the same monotonicity and be uncorrelated with the first
relation based on time-sorting, like rank correlation, is trend PC, etc. Hence the method also captures local
robust and resistant. The variance of a linear combina- trends, when they exist.
tion of two position sequences gives a measure of the
total (and common) monotonicity. Maximising this mea- 3.2. A few remarks
sure provides therefore the largest monotonicity, hence The permutations Qk (), k = 1, . . . , p that define the
the largest trend (see Appendix B). newly obtained data matrix Q are in fact related to the
In summary our new data matrix of positions of sorted ranks of the data. As before, let us designate by pk =
data is now given by: (p1k , . . . , pnk ) the ranks of the corresponding k’th time
series xk = (x1k , . . . , xnk ), k = 1, . . . , p. These ranks are
Q = (qT1 , qT2 , . . . , qpT ) = (qij ) (5) also given by:

and we require the following correlations (or covari- pk = (p1k , p2k , . . . , pnk ) = (Pk (1), Pk (2), . . . , Pk (n)) (8)
ances):
where Pk () is a permutation of {1, 2, . . . , n}. In Appendix
ρT (xk , xl ) = cov (qk , ql ) (6) A it is shown that Qk () is precisely the reciprocal of the

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
A NEW METHOD FOR FINDING TRENDS IN GRIDDED CLIMATE DATA 5

rank permutation Pk (), i.e. an autocorrelated noise, given by

Qk = Pk−1 = Pk(n−2) 
(9) xt = αt + 1.6εt1
yt = αt + 2.4εt2 (10)
where Pk(m) = Pk oPk o . . . , oPk is the m’th iteration of zt = αt + 1.5εt3
the permutation Pk (). As an illustration, consider again
the previous five-element time series. The vector of ranks where α = 0.004, εt1 , εt2 , and εt3 are three first-order
of this time series is p = (4, 2, 1, 3, 5), and is a permu-
autoregressive, AR(1), models with respective lag-1
tation P1 () of {1, 2, 3, 4, 5}. It can be easily verified that
autocorrelations 0.5, 0.6, and 0.3. Figure 2(a), (b), (c)
P1 () is the inverse of Q1 (). The useful properties of
shows an example of this three-variable time series
rank correlation are also inherited by the new inverse-
model. Covariance-based EOFs/PCs of these time series
ranks method, e.g. resistance, robustness, invariance to
(Figure 2(a), (b), (c)) have been computed. Figure 2(d),
monotonic transformation, in addition to the fact that the
(e), (f) shows the three obtained PCs in decreasing order
covariance and correlation matrices are equivalent. The
of their explained variances. It can be seen that the trend
spectrum of the inverse-ranks correlation matrix is, how-
is observed in all three PCs, and slightly stronger in PC2
ever, different from that of the rank correlation because
(Figure 2(e)). This is because the PCs maximise variance
only very few patterns will reflect the trend (see e.g.
plus the fact that the trend is small, and hence the failure
Appendix B and the application sections).
of covariance-based EOFs to capture trends.
Next the EOFs and PCs of the newly obtained correla-
4. APPLICATION TO SIMPLE LOW-DIMENSIONAL tion/covariance matrix (Equations (6)–(7)) are computed
SYSTEMS using the SVD of the centred data matrix of inverted
ranks (Equation (5)). Figure 2(g), (h), (i) shows the ‘PCs’
In this section, we illustrate the above formulation with associated with the newly obtained data matrix (Equa-
two sets of simple examples, and compare the results tion (5)), i.e. time positions in the sorted data. It can
with conventional EOFs using covariance and correlation be seen now that the leading PC (Figure 2(g)) captures
matrices. The application to reanalyses is presented in the entirely the trend. Furthermore, no trend is left in the
next section. remaining PCs (Figure 2(h), (i)). Note that the eigenvec-
tor loadings in this case are three-component vectors and,
4.1. Example 1: Covariance-based EOFs unlike TPCs, they are not very informative to show. Their
In this example we consider a set of three-variable usefulness will be made clear with the reanalyses data in
time series containing a linear trend contaminated with Section 5.

(a) Time series xt (b) Time series yt (c) Time series zt

5 5 5
x1

x2

x3

0 0 0

-5 -5 -5

0 200 400 0 200 400 0 200 400

(d) Ordinary PC1 (e) Ordinary PC2 (f) Ordinary PC3


4 4 4
2 2 2
PC1

PC2

PC3

0 0 0
-2 -2 -2
-4 -4 -4
0 200 400 0 200 400 0 200 400

(g) Trend PC1 (h) Trend PC2 (i) Trend PC3


4 4 4
2 2 2
TPC1

TPC2

TPC3

0 0 0
-2 -2 -2
-4 -4 -4
0 200 400 0 200 400 0 200 400

Figure 2. Example of time series of xt , yt , and zt obtained using Equation (10), (1st row), the PCs of these time series using covariance-based
EOFs (2nd row), and the trend PCs obtained using the new correlation/covariance-based EOFs (3d row).

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
6 A. HANNACHI

The significance of these results are examined using of four, but the conclusion is unaltered. Figure 4(a), (b),
a Monte Carlo test as follows. A set of 100 time (c) shows an example for the first three time series wt ,
series given by Equation (10) is generated. For each xt , and yt . The trend as well as the periodic signal can
time series covariance-based EOFs and newly obtained be noted from these time series.
EOFs, using Equation (6), are computed. Correlations As in Example 1, the second row of Figure 4 shows
between the corresponding PCs and the actual trend PC1, PC2, and PC4 obtained from the correlation-
(see Equation (10)) are then calculated. Figure 3 shows based EOF analysis of the time series generated using
the histograms of the correlation coefficients for the Equation (11). Again the trend is seen to be cap-
conventional PCs (Figure 3(a), (b), (c)) and the new PCs tured by most PCs (Figure 4(d), (e), (f)). In addition,
(Figure 3(d), (e), (f)). Note how the trend is absorbed the periodic signal is captured by PC1 (Figure 4(d)).
by all conventional PCs (Figure 3(a), (b), (c)) across all Note that PC4 is shown instead of PC3 because it
simulations, whereas it is only captured by the first newly has a larger trend, and the purpose is to show that
obtained PC (Figure 3(d)), which we call trend PC. The the trend gets spread over most PCs, including the
approach of Solow (1994) has also been applied, and last one (PC4), i.e. with the least variance! Simi-
the result (not shown) is similar to that obtained using larly, the third row of Figure 4 shows the leading
EOFs. three PCs obtained using the new data matrix (Equa-
tion (6)). The trend is clearly captured by the first
(trend) PC (Figure 4(g)). The fourth trend PC (TPC4,
4.2. Example 2: Correlation-based EOFs
not shown) looks also like TPC2 and TPC3, i.e. without
Here, we consider a 4-variable time series containing trend.
a sine/cosine wave and a quadratic trend, given by: As in Example 1 above, the statistical properties of
 the correlations between the quadratic trend and the
 w = 1.8at + 2βb(t) + 1.6εt1 various PC time series are investigated using a Monte
 t
xt = 1.8at + 1.8βb(t) + 2.4εt2 Carlo approach. The first row of Figure 5 shows the his-
(11)
 yt = 0.5at + 1.7βb(t) + 1.5ε4t
 3
tograms of these correlations for the (correlation-based)
zt = 0.5at + 1.5βb(t) + 1.7εt PCs like those shown in Figure 4(d), (e), (f). Again
the trend has been absorbed by all PCs (Figure 5(a),
where at is a quadratic trend proportional to t 2 , at ∝ t 2 , (b), (c)). Furthermore, the trend is stronger with PC2
β = 2, and b(t) = cos 4t 5 + sin 5 . The noises εt , k =
t k
(Figure 5(b)) than with the remaining PCs. The spiky
1, . . . , 4, are again AR(1) with respective lag-1 autocorre- feature of the histogram for PC1 (Figure 5(a)) is simply
lations 0.5, 0.6, 0.3, and 0.35. To conform with Figure 2, due to the existence of the periodic component, which
and for clarity purposes we only show three panels out projects strongly onto the leading correlation-based PC.

(a) Histogram (b) Histogram (c) Histogram


15 15 15
Frequency

Frequency

10 10 10
Frequency

5 5 5

0 0 0
0 0.5 1 0 0.5 1 0 0.5 1
Correlation Correlation Correlation

(d) Histogram (e) Histogram (f) Histogram


15 15 15
Frequency

Frequency

10 10 10
Frequency

5 5 5

0 0 0
0 0.5 1 0 0.5 1 0 0.5 1
Correlation Correlation Correlation

Figure 3. Histograms of the correlation coefficients between the linear trend (Equation (10)) and the PCs obtained from covariance-based EOFs
(1st row), and between the same linear trend and the trend PCs (2nd row).

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
A NEW METHOD FOR FINDING TRENDS IN GRIDDED CLIMATE DATA 7

(a) Time series wt (b) Time series xt (c) Time series yt


4 4 4
2 2 2

x1

x2

x3
0 0 0
-2 -2 -2
-4 -4 -4
0 200 400 0 200 400 0 200 400

(d) PC1 (e) PC2 (f) PC4


4 4 4
2 2 2
PC1

PC2

PC4
0 0 0
-2 -2 -2
-4 -4 -4
0 200 400 0 200 400 0 200 400

(g) Trend PC1 (h) Trend PC2 (i) Trend PC3


4 4 4
2 2 2
TPC1

TPC2

TPC3
0 0 0
-2 -2 -2
-4 -4 -4
0 200 400 0 200 400 0 200 400

Figure 4. As in Figure 2, but using the four-variable model Equation (11), and showing wt , xt , and yt (a), (b), (c), and correlation-based PCs
1, 2, and 4 (d), (e, (f), and the three leading trend PCs (g,h,i).

(a) Histogram (b) Histogram (c) Histogram


25 25 25

20 20 20
Frequency
Frequency

Frequency

15 15 15

10 10 10

5 5 5

0 0 0
0 0.5 1 0 0.5 1 0 0.5 1
Correlation Correlation Correlation

(d) Histogram (e) Histogram (f) Histogram


25 25 25

20 20 20
Frequency

Frequency

Frequency

15 15 15

10 10 10

5 5 5

0 0 0
0 0.5 1 0 0.5 1 0 0.5 1
Correlation Correlation Correlation

Figure 5. Histograms of the correlation coefficients between the quadratic trend (Equation (11)) and the correlation-based PCs 1, 2, and 4 (a),
(b), (c), and between the same trend and the trend PCs 1, 2, and 3 (d), (e), (f).

Unlike the previous histograms, the histograms of the approach is unable to capture the trend. The two
correlations using the newly obtained PCs (Figure 5(d), time series associated with the smallest eigenvalues
(e), (f)) clearly indicate that the leading PC (Figure 5(d)) of S−1 S∇ (not shown) are similar to PC1 and PC2
entirely captures the trend. Note that the Solow (1994) (Figure 4(d), (e)).

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
8 A. HANNACHI

5. APPLICATION TO REANALYSES DATA Eigenvalue spectrum (SLP)


101
5.1. Data description
Two NCEP/NCAR data sets are used here. The first

Eigenvalue (%)
set consists of winter monthly mean SLP over the
Northern Hemisphere north of 20° N and spans the period
1948–2000. The winter season is defined by the months
Dec-Jan-Feb (DJF). The second set consists of winter
monthly geopotential heights also over the same latitudes 100
and for the same period as SLP. Anomalies are computed
as departures from the mean annual cycle and are 0 10 20 30
considered in this analysis. Rank

5.2. Application to winter SLP Figure 6. The leading 30 eigenvalues, expressed in %, of the new
covariance/correlation matrix Equation (7) of winter monthly SLP. The
The leading (conventional) EOF (not shown) yields vertical bars show hand-waving uncertainty estimates based on the
the familiar Annular Oscillation (e.g. Thompson and North et al. (1982) rule of thumb.
Wallace, 1998, 2000; Wallace, 2000; Rogers and van
Loon, 1982), a tripolar structure with its centres localised (a) Trend EOF1 (SLP)
respectively over the North Atlantic, the polar region, 6
and the North Pacific. The second EOF yields the Pacific 5
pattern (not shown), a monopolar structure sitting over 4
the North Pacific. The correlation-based EOFs have also
3
been computed. The spectrum of the correlation matrix
indicates a smooth decay with degenerate eigenvalues 2
clustered into groups. EOF1 (not shown) has a polar 1
high and a low over the subtropics with particular high 0
values over North Africa and the Mediterranean. EOF2
-1
(not shown) has a high over the Asian subtropics and two
low centres over the North Pacific and West Atlantic near -2
Florida. Rank correlation-based EOFs (not shown) are -3
virtually identical to (ordinary) correlation-based EOFs. -4
In both the cases, no trend was obtained with a particular
(b) Trend EOF2 (SLP)
PC. Moreover, when the data were detrended prior to 5
the analysis by removing a linear trend, the obtained 4
EOFs were very similar to those obtained from the 3
nondetrended data. This leaves the impression that the 2
data do not contain any trend. In fact what happened is 1
that the trend has been dissipated into most PCs, rather 0
like the above examples. -1
The SLP data matrix has been transformed using Equa- -2
tion (4) to yield the new data matrix Q (Equation (5)) -3
of inverted ranks. To take into account the converging -4
-5
longitudes poleward, this data matrix is weighted using
-6
the square root of the cosine of the latitude at each
-7
corresponding grid point. This weighting is important
-8
to remove the singularity at the pole. The trend EOFs
and TPCs of the centred and area-weighted data matrix Figure 7. Trend EOFs of the new SLP covariance/correlation matrix
of inverted ranks are then computed using SVD. The Equation (7) associated with the leading two eigenvalues (Figure 6).
results are strikingly different from conventional EOFs.
Figure 6 shows the corresponding spectrum associated Figure 7 shows the trend EOFs of the newly obtained
with the leading 30 Trend EOFs (TEOFs). The error bars data matrix (Equation (5)) associated with the leading
shown in Figure 6 are based on the North et al. (1982) two eigenvalues. Interestingly, the third TEOF corre-
rule of thumb. Clearly the spectrum has now an outstand- sponding to the third eigenvalue, already in the no-trend
ing feature. It is not continuous (or smooth) and has two floor (Figure 6), is shown in Figure 8, and clearly reveals
well-separated eigenvalues and a no-trend floor, which no structure, i.e. it looks like noise, but does not in
looks like a noise floor: after all this is what we are look- reality correspond to a noise as explained below. This
ing for. In conventional EOFs or in the Solow (1994) result is once more consistent with the obtained spec-
method, for example, the spectrum is generally ‘smooth’ trum (Figure 6). Again, this is not in general the case
or slowly decaying rather like a red-noise spectrum. with conventional methods (e.g. EOFs) where the patterns

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
A NEW METHOD FOR FINDING TRENDS IN GRIDDED CLIMATE DATA 9

associated with the lower part of the spectrum do not Trend EOF3 (SLP)
in general show pure noise. Those (low ranked) EOFs 7
usually have some structure. The obtained leading two 6
5
TEOFs (Figure 7(a), (b)) show large-scale structures. For
4
example Figure 7(a) shows a positive center over North
3
Africa, the Mediterranean, most of the Indian subconti- 2
nent, and parts of North Atlantic, and a negative centre 1
over most of Eurasia, north east of the north Pacific, and 0
Greenland/north east Canada. For the second TEOF, one -1
can also see some structure over the East Asian continent -2
and the Mediterranean. -3
Before interpreting these patterns, we first show the -4
time series associated with these leading TEOFs. Figure 9 -5
-6
shows the two leading TPCs obtained as the left singular
-7
vectors of the new area-weighted data matrix HQ (Equa-
tion (5)) corresponding to the leading two eigenvalues. Figure 8. As in Figure 7, but for the third eigenvalue.
Figure 9(a) shows a clear increasing trend as expected.
The second ‘TPC’ (Figure 9(b)) also shows an initial trend EOFs (Figure 7). The obtained time series are
decrease and then an increasing trend starting from the then regressed back onto the SLP anomalies to yield
middle of the record. Note in particular the steep increas- the required patterns that we label from now on trend
ing trend towards the end of the record, indicating that patterns or trend modes. Hence if v is a trend EOF, i.e.
the method captures local trends. It is worth noting here a right eigenvector of HQ, see Equation (5), then one
that the part that does not project onto the (two) leading first constructs the time series w = HXv, which is then
trend EOFs, referred to as no-trend, which is similar to scaled and regressed back onto the original field to yield
noise, is not pure noise but simply does not have trend, the trend mode. So the trend mode is composed of the
and it is possible to do regular PCs on this part. regression coefficients between the time series w and the
The leading two TEOFs (Figure 7(a), (b)) are not to time series anomalies at each grid point. Alternatively,
be physically interpreted straightaway because they are the trend PC z, i.e. the left singular vector of HQ, can be
not in the same state space of the data because of the regressed back onto the original filed to yield the trend
nonlinear transformation (Equation (4)). To obtain the pattern. So the trend pattern is composed of the regression
corresponding patterns in the original (physical) state coefficients between the trend PC and the anomaly time
space, we first project the SLP anomaly field onto the series of the original field at each grid point.

(a) Trend PC1 (DJF SLP)


0.15
0.1
0.05
TPC1

0
-0.05
-0.1
-0.15
Jan50 Jan60 Jan70 Jan80 Jan90 Jan00
Time

(b) Trend PC2 (DJF SLP)


0.3

0.2
TPC2

0.1

-0.1
Jan50 Jan60 Jan70 Jan80 Jan90 Jan00
Time

Figure 9. The two leading SLP trend PCs, obtained as the first two left singular vectors of the new data matrix anomaly HQ, see Equation (5).

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
10 A. HANNACHI

Figure 10 shows the leading two trend modes associ- trend towards the last quarter of the century, a charac-
ated with the leading two TEOFs (Figure 7). The first teristic feature of the SiH. The SiH pattern is usually
trend mode (Figure 10(a)) is familiar and clearly shows identified in winter SLP as an extensive semi-permanent
the NAO pattern. The NAO is a well known large-scale anticyclone sitting over Asia, and can be seen also in
prominent mode of natural climate variability over the hemispheric winter-mean SLP. The real mechanisms of
North Atlantic, Europe, and the Mediterranean. It bears the SiH is still not very well understood, but it is likely
a meridional see-saw feature in the atmospheric mass to be associated with the coldest and densest air masses
between subtropical high and polar low. It is fair to of the hemisphere (Lydolf, 1977), mostly dominant in the
say, however, that there is no universal way to define boreal winter. Being mainly thermally induced, the SiH
its spatial structure or its time index. Approaches for its has a very shallow cold-core structure, confined mainly
definition include point correlation, EOFs, and rotated to the planetary boundary layer. The SiH has witnessed
EOFs. A well-known NAO time index is obtained as the a dramatic decline particularly in the last quarter of the
difference of normalised SLP anomalies between Lisbon century (see Panagiotopoulos et al. (2005) for a detailed
(Portugal) and Stykkisholmur/Reykjavik (Iceland) (see, description of the pattern and its time evolution). The
e.g. Hurrell et al. (2003) for details). pattern (not shown) associated with TEOF3 (Figure 8)
The second trend mode (Figure 10(b)), not as famil- indicates a large-scale pattern (not noise): it simply does
iar as the NAO, shows a major low over the Asian not contain a trend. In other words, the trend patterns are
continent centred over Siberia and a smaller positive captured only by the first two nondegenerate TEOFs.
centre over West Alaska. This pattern represents the
Siberian High (SiH). Figure 11 shows the time series 5.3. Application to geopotential heights
corresponding to the trend modes of Figure 10 obtained
To find out differences between vertical levels we have
by projecting the SLP anomalies onto the trend modes
analysed the geopotential heights at various vertical pres-
(Figure 10). Figure 9 can be regarded as a smoothed
sure levels, 500, 925, and 1000 mb. At each level, the
version of Figure 11. Note, in particular, the increasing
data are processed as in the previous section. The spectra
and the EOFs of the centred and area-weighted inverse-
(a) Trend mode 1 (SLP) rank data matrix are calculated. Figure 12 shows the
4 spectrum of the obtained covariance matrix at 500 mb
3 (Figure 12(a)), 925 mb (Figure 12(b)), and 1000 mb
2
(Figure 12(c)). At 500 mb, there is clearly only one single
eigenvalue above a no-trend floor. Figure 13(a) shows the
1
trend mode, i.e. after projection back onto the data space,
0 associated with this leading eigenvalue. This trend mode
-1 is reminiscent of the cold ocean warm land (COWL) pat-
tern (Wallace et al., 1995; Broccoli et al., 1998) with
-2
low centres over the North Pacific and North Atlantic,
-3 and high centres over the Mediterranean/Western Europe,
-4 north west of North America, and central Asia. In total
agreement with the spectrum (Figure 12(a)), the trend
-5
EOF, i.e. the eigenvector of T in (7), associated with
-6 the second eigenvalue (already in the no-trend floor),
(b) Trend mode 2 (SLP) shown in Figure 13(b) is structureless, like noise, and
3 represents no trend. Therefore there is only one single
pattern, COWL, associated with the trend in the mid-
2
troposphere heights. The time series associated with this
pattern is shown in Figure 13(c), and is obtained by
1
projecting the data anomaly matrix HX onto the trend
0 pattern (Figure 13(a)). The eigenspectrum correspond-
ing to the 925 mb pressure level (Figure 12(b)) shows
-1 again one eigenvalue singled out of the no-trend floor
like Figure 12(a), and is associated with the NAO pat-
-2 tern as the trend-mode (not shown). Note, however, that
the spectrum is slightly different from that at 500 mb, in
-3 that the second eigenvalue is slightly raised off the trend-
noise floor but not entirely. The TEOF associated with
-4 this eigenvalue is no longer structureless (not shown),
Figure 10. The two leading trend modes obtained by projecting the and its trend mode shows the SiH pattern (not shown).
DJF sea level pressure anomalies onto the TEOFs (Figure 7) and then The spectrum at 1000 mb (Figure 12(c)) is now differ-
regressing back onto the same anomalies. Contour interval 1 mb. ent from the previous two. While the leading eigenvalue

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
A NEW METHOD FOR FINDING TRENDS IN GRIDDED CLIMATE DATA 11

Time series of trend mode 1 (SLP)


2

Index 1
0

-1

-2

Jan50 Jan60 Jan70 Jan80 Jan90 Jan00


Time

Time series of trend mode 2 (SLP)


2

1
Index 2

-1

-2
Jan50 Jan60 Jan70 Jan80 Jan90 Jan00
Time

Figure 11. The time series associated with the two leading SLP trend modes (Figure 10(a), (b)). Note that every consecutive Dec-Jan-Feb months
are plotted next to each other.

is still well above the no-trend floor as in Figure 12, matrix of large-scale climate data, and the degeneracy of
the second eigenvalue has now completely risen off this the generalised eigenvalue spectrum involved.
no-trend floor. Figure 14(a) shows the associated trend The new method presented provides a systematic
mode, i.e. after projection back onto the data space, and decomposition of the data into a small number of trend
represents again the NAO. The second trend mode is patterns and a remaining no-trend when at least two vari-
shown in Figure 14(b), which can be compared to the ables are trendy. The method is based on an eigenanalysis
second trend-mode associated with SLP (Figure 10(b)) of the covariance/correlation matrix of time positions
and shows again the Siberian High pattern. This is once from sorted data, i.e. inverse ranks, yielding trend EOFs
more consistent with the fact that the SiH is highly con- and associated TPCs. By sorting the data and looking
fined to the low-level atmosphere near the ground, i.e. the for patterns that maximise correlation/covariance between
planetary boundary layer. The SiH was not captured pre- inverse ranks one is looking for linear combinations max-
viously using various forms of conventional EOFs mainly imising total monotonicity, hence capturing trends with
because it is a pattern associated with a local trend, and their local slow variation feature. The leading nondegen-
partly because it is confined to the boundary layer. This erate eigenvalues are associated with trends, whereas no
pattern, associated with a local trend, is also different trend is associated with the remaining spectrum floor.
from the NAO pattern, which can be explained by global The method is adapted to the problem of trend analysis
linear trend. because the only spatially coherent signal coming out of a
rank-type reordering of space-time values is a signal con-
taining fluctuations at the time scale of the length of the
6. SUMMARY AND CONCLUSIONS data. The first TPC maximises the total monotonicity, the
A new method capable of capturing trend patterns second TPC also maximises the total monotonicity sub-
from gridded climate data was presented. EOFs and ject to being uncorrelated with the first TPC, etc. Hence
a related method based on maximising autocorrelation this makes the method capable of capturing also local
are often used to detect trends. These methods are trends.
in general unable to capture trend patterns because The method has first been applied to two simple low-
they are not devised for that purpose. Besides, these order time series models. The first one consists of a
approaches have a number of difficulties such as physical three-variable time series containing a linear trend plus an
interpretability, smooth spectrum yielding difficulties in autocorrelated AR(1) noise, and the second example is a
truncation, and limitation to second-order moments. The four-variable time series containing a quadratic trend plus
method based on maximising autocorrelation also suffers a periodic signal and a AR(1) noise. Covariance-based
serious difficulties such as singularity of the covariance EOFs of the first example and correlation-based EOFs of

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
12 A. HANNACHI

(a) Eigenvalue spectrum (Z500 mb) (a) Trend mode 1 (Z500)


101 40
30
20
Eigenvalue (%)

10
0
-10

100 -20
-30
0 10 20 30 -40
Rank
-50
(b) Eigenvalue spectrum (Z925 mb) -60
101
(b) Trend EOF 2 (Z500)
8
7
Eigenvalue (%)

6
5
4
3
2
100 1
0
-1
0 10 20 30 -2
Rank -3
-4
(c) Eigenvalue spectrum (Z1000 mb) -5
101
-6
-7
Time series of trend mode 1 (Z500)
Eigenvalue (%)

2
1
0
x

-1
-2
100
-3
Jan50 Jan60 Jan70 Jan80 Jan90 Jan00
0 10 20 30 Time
Rank
Figure 13. (a) The trend mode, i.e. after projection back onto the data
Figure 12. As in Figure 6, but for the winter monthly geopotential space, for DJF 500 mb heights associated with the leading eigenvalue
heights at 500 mb (a), 925 mb (b), and 1000 mb (c). of Figure 12(a); (b) The trend EOF, i.e. eigenvector of Equation (7),
associated with the second eigenvalue; and (c) the standardised time
series of the trend mode obtained by projecting the data anomalies onto
the second example failed to capture the trend. Maximum the trend mode shown in (a). Contour interval 1 m in (a).
autocorrelation method also produces results similar to
EOFs. The new method, however, correctly captures the in the no-trend floor, is structureless like noise, i.e. it
trend as its leading pattern. contains no trend.
The method is then applied to northern hemispheric The application to geopotential heights reveals some
NCEP/NCAR reanalyses using DJF SLP and geopotential differences between levels. At 500 mb, there is only one
heights at various levels. The eigenspectrum of the (area- single eigenvalue above the no-trend floor. The associated
weighted) inverse-rank correlation/covariance matrix of trend mode shows the cold ocean-warm land pattern.
SLP reveals different feature from usual spectra, with As we move to lower levels the spectrum starts to
two separated leading eigenvalues well above a no-trend show slight differences. At 925 mb, e.g., the leading
floor. The first trend EOF is associated with the familiar eigenvalue is still unchanged, i.e. well above the no-
NAO, whereas the second trend EOF is associated with trend floor; however, the second eigenvalue is slightly
the SiH. The TPCs clearly show the increasing trend raised off the no-trend floor. As one reaches 1000 mb, the
for the NAO since the beginning of the record, and an second eigenvalue has risen off the no-trend floor. The
increasing trend during the last quarter of the century for trend modes associated with these leading eigenvalues
the SiH. The third trend EOF, with eigenvalue already represent respectively the NAO and the SiH patterns.

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
A NEW METHOD FOR FINDING TRENDS IN GRIDDED CLIMATE DATA 13

(a) Trend mode 1 (Z1000) conceptually difficult to think of a field of data where
32 one single grid point has a trend and all other grid points
24 contain no trend. This is because nearby grid points
tend to be highly correlated. The other weakness of the
16
method is the back-transformation of the patterns to the
8 physical or data space. This is because of the nonlin-
0 ear transformation involved. However, this is a generic
-8
problem in all nonlinear methods, e.g. nonlinear PCs.
Here, we projected the data onto the trend EOFs to
-16 obtain the corresponding time series. The trend modes
-24 are then obtained by regressing these time series (scaled
-32 to have unit variance) back onto the data. The trend
modes could also be obtained by regressing the TPCs
-40
onto the anomaly field; but the results are found to be
-48 similar.
(b) Trend mode 2 (Z1000)
24
ACKNOWLEDGEMENTS
18
This work was funded by the NCAS Centre for Global
12 Atmospheric Modelling (CGAM), the University of
Reading. The author wishes to thank Prof. Ian Jolliffe
6 and two anonymous reviewers for their constructive com-
ments that helped improve the manuscript.
0

-6
APPENDIX A
-12
On rank correlation and the inverse of ranks
-18 Let us be given two time series x1 , x2 , . . . , xn , and
-24 y1 , y2 , . . . , yn , supposed to be realisations of two random
variables X and Y with respective marginal cdfs FX () and
Figure 14. As in Figure 10, but for the leading two trend modes, FY (). We also suppose that the first time series is sorted
i.e. after projection back onto the data space, corresponding to DJF into x[1] ≤ x[2] ≤ · · · , ≤ x[n] , and similarly for the second
geopotential heights at 1000 mb, Z1000 mb. Contour interval is 8 m
one. A familiar estimate of the cdf FX () is given by the
in (a) and 6 m in (b).
empirical distribution function (edf) defined by:

The method clearly captures the modes associated 0 if u < x[1]
with trends in SLP and geopotential heights. The out- F̂X (u) = nk if x[k] ≤ u < x[k+1] (A1)
standing feature of the method is its ability to sepa- 1 if u > x[n]
rate patterns associated with trends, albeit small, from and similarly for FY (). Now the sample x1 , x2 , . . . , xn is
patterns not associated with trends. This is achieved mapped, using the edf F̂X (), onto pn1 , pn2 , . . . , pnn , where
simply by examining the spectrum and the associated p1 , p2 , . . . , pn are the ranks of the first time series. This
trend EOFs. The NAO trend mode is well captured yields the Spearman’s rank correlation as the sample
from both fields. In the middle atmosphere, the trend estimate of Equation (3).
mode becomes the COWL pattern. The SiH is cap- Consider now the first time series. The elements of
tured in SLP and the very low-level height fields. This the sequence p1 , p2 , . . . , pn of ranks are given by pk =
is entirely consistent with the fact that the SiH is P (k), k = 1, . . . , n, where P () is some permutation
confined to the low levels and boundary layer of the of {1, 2, . . . , n}. For example, x1 is the p1 ’th smallest
atmosphere. element of the series. Associated with the sorted data
The method is simple to use, and yields a few pat- x[1] ≤ x[2] ≤ · · · , ≤ x[n] is the sequence q1 , q2 , . . . , qn of
terns whose structures can be interpreted in general. The the (original) time positions of the data, where qk =
method can also be used effectively to remove vari- Q(k), k = 1, . . . , n, and Q() is a another permutation of
ous trends from climate data and to yield detrended {1, 2, . . . , n}. Now for any 1 ≤ k ≤ n, because xk is the
data that can be investigated further. Like any other pk ’th smallest element yields xk = x[pk ] . Similarly, x[k] ,
approach, the method has advantages and disadvantages. is the k’th smallest element in the sorted data, and its time
However, the advantages seem to outweigh the disad- position in the original time series is qk , that is x[k] = xqk .
vantages. The method requires at least two variables These two equalities yield: xk = x[P (k)] = xQ(P (k)) , and
to be trendy in order to capture a trend. Clearly in hence the permutations P () and Q() are inverses of each
weather/climate this is in general the case because it is other.

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
14 A. HANNACHI

APPENDIX B approximately to a leading eigenvector of . In other


Trend and the covariance matrix of positions/inverse words, the maximum of cov (u1 qx + u2 qy , n), with u =
ranks (u1 , u2 )T a unit vector, is obtained with u = a.
Let us now compare this with the ordinary covariance
Here we provide proof for a simple case to find trend
matrix S of the original time series. It is clear from the
using the covariance matrix of the time positions of
eigenvalues
the sorted data. A simple bivariate case consisting of a
linear trend and a random noise is presented. The case of


2 

a 2
+ b 2 σ 2
− σ 2
σ − σ 2
σ 2 + ση2
λ± = σT2 1 ± 1 + ε η ε η
+ 2(a 2 − b2 )  + ε (B3)
2 σT2 (a 2 + b2 )2 σT2 2

more than two variables is similar in principle. Let xt = of S that when σε = ση both PCs will contain parts of
at + εt , and yt = bt + ηt , t = 1, . . . , n be two time series the trend variance. For the particular case where σε = ση ,
with independent and identically distributed (iid) noises the leading PC will contain both the trend and the noise
εt and ηt with variances σε2 and ση2 , and we let a and variances. Note that this happens even if one of the time
b to be nonnegative. Let qx = (q1x , . . . , qnx )T and qy = series does not contain a trend, e.g. a = 0. In this case the
y y
(q1 , . . . , qn )T be the time positions of the sorted time leading PC, with λ+ = b2 σT2 + ση2 , will contain both the
series. Also let xq x = a1 (xq1x , . . . , xqnx )T , and similarly for trend and the noise variance. Interestingly, however, there
εq x , yq y , and ηq y . So we have qx = a1 xq x − a1 εq x , and are values of b such that the eigenvalues are degenerate
qy = b1 yq y − b1 ηq y . The eigenvalues of the covariance and they do not contain the trend variance at all.
matrix  of (qx , qy ) are:
REFERENCES
λ± = σT2 ± cov(q , q )
x y
(B1) Antoniadis AG, Gregoire G, McKeague I. 1994. Wavelet methods for
curve estimation. Journal of the American Statistical Association 89:
1340–1353.
where σT2 = n 12−1 , (or 1 ± corr(qx , qy ) when the cor-
2
Bell AJ, Sejnowski IT. 1995. An information-maximization approach
relation matrix is used) with leading eigenvector a = to blind separation and blind deconvolution. Neural Computation 7:
√1 (1, 1)T . Now when the trend is detectable (e.g. σε and 1129–1159.
2 Bishop CM. 1995. Neural Networks for Pattern Recognition. Clarendon
ση are smaller than σT ), then both qx and qy will be Press: Oxford.
highly correlated with the sequence n = (1, 2, . . . , n)T Box GEP, Tiao GC. 1977. A canonical analysis of multiple time series.
and then cov(qx , qy ) will be close to σT2 (this also means Biometrika 64: 355–365.
Bradley JV. 1968. Distribution-Free Statistical Tests. Prentice-Hall:
that εq x and ηq y remain nearly iid). The result is that λ− Englewood Cliffs, NJ.
will be significantly smaller than λ+ , and most of the Broccoli AJ, Lau N-C, Nath MJ. 1998. The cold ocean-warm land
variance will be explained by λ+ (which will be close pattern: Model simulation and relevance to climate change detection.
Journal of Climate 11: 2743–2763.
to 2σT2 ), that is by the trend. Note also that the correla- Champely S, Doledec S. 1997. How to separate long-term trends from
tions of the leading trend PC with qx and qy will be even periodic variations in water quality monitoring. Water Resources 11:
2849–2857.
larger than corr(qx , qy ). If, on the other hand, the trend Chatfield C, Collins AJ. 1989. Introduction to Multivariate Analysis.
is not detectable (e.g. aσT or bσT is very small compared Chapman and Hall: London.
to σε and ση ), then qx and qy will be (nearly) uncorre- Craddock JM. 1973. Problems and prospects for eigenvector analysis
in meteorology. Statistician 22: 133–145.
lated, and the spectrum Equation (B1) will be degenerate. Craigmile PF, Guttorp P, Percival DB. 2004. Trend assessment in
This also happens when only one time series at most con- a long memory dependence model using the discrete wavelet
tains a trend, e.g. a > 0 and b = 0. When more than two transform. Environmetrics 15: 313–335.
DeGroot MH, Schervish MJ. 2002. Probability and Statistics. Addison
dimensions are considered, the method will capture trends Wesley: Boston, MA.
when at least two variables are trendy. This is because DelSole T. 2001. Optimally persistent patterns in time varying fields.
the restriction of  to these variables will contain at least Journal of the Atmospheric Sciences 58: 1341–1356.
Dommenget D, Latif M. 2002. A cautionary note on the interpretation
one nondegenerate eigenvalue, see Equation (B1). Note, of EOFs. Journal of Climate 15: 216–225.
in particular, how the noise variances are not explicitly Fang F-T, Zhang Y-T. 1990. Generalized Multivariate Analysis.
involved in Equation (B1). They get involved only in Springer-Verlag: Berlin.
Fukuoka A. 1951. A study of 10-day forecast. The Geophysical
the degeneracy of Equation (B1). Now suppose that the Magazine XXII: 177–218.
trend is detectable (i.e. λ+ is significantly larger than λ− ). Gilbert SD. 1999. Testing for the onset of trend using wavelets. Journal
The vector coefficients (α, β) of the trend is obtained by of Time Series Analysis 20: 513–526.
Golub GH, van Loan CF. 1996. Matrix Computation. John Hopkins
regressing qx and qy onto n as: University Press: Baltimore, MD.
Hannachi A, O’Neill A. 2001. Atmospheric multiple equilibria and
(α, β) = (cov(qx , n), cov(qy , n)) (B2) non-Gaussian behaviour in model simulations. Quarterly Journal of
the Royal Meteorological Society 127: 939–958.
Hannachi A, Stephenson DB, Sperber KR. 2003. Probability-based
and since qx is highly correlated with qy , and that methods for quantifying nonlinearity in the ENSO. Climate
cov(qx , n) ≈ cov(qy , n), the vector (α, β) corresponds Dynamics 20: 241–256.

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc
A NEW METHOD FOR FINDING TRENDS IN GRIDDED CLIMATE DATA 15

Hannachi A, Stephenson DB, Sperber KR. 2004. Erratum: Probability- North GR, Bell TL, Cahalan RF, Moeng FJ. 1982. Sampling errors in
based methods for quantifying nonlinearity in the ENSO. Climate the estimation of empirical orthogonal functions. Monthly Weather
Dynamics 22: 69–70. Review 110: 699–706.
Hannachi A, Jolliffe IT, Stephenson DB, Trendafilov N. 2006. In Obukhov AM. 1947. Statistically homogeneous fields on a sphere.
search of simple structures in climate: Simplifying EOFs. Uspekhi Matematicheskikh Nauk 2: 196–198.
International Journal of Climatology 26: 7–28. Obukhov AM. 1960. The statistically orthogonal expansion of
Hirsch RM, Gilroy EJ. 1985. Detectability of step trends in the rate empirical functions. Bulletin of the Academy of Sciences of the USSR.
of atmospheric deposition of sulfate. Water Resources Bulletin 21: Geophysics Series (English Transl.) 1: 288–291.
773–784. Panagiotopoulos F, Shahgedanova M, Hannachi A, Stephenson DB.
Hodges JL Jr, Lehmann EL. 1963. Estimates of location based on rank 2005. Observed trends and teleconnections of the Siberian high:
tests. Annals of Mathematical Statistics 34: 598–611. a recently declining center of action. Journal of Climate 18:
Horel JD. 1981. A rotated principal component analysis of the 1411–1422.
interannual variability of the Northern Hemisphere 500 mb height Papoulis A, Pillai SU. 2002. Probability, Random Variables and
field. Monthly Weather Review 109: 2080–2902. Stochastic Processes, 4th edn. McGraw Hill.
Hsu CJ, Zwiers F. 2001. Climate change in recurrent regimes and Preisendorfer RW. 1988. Principal Component Analysis in Meteorology
modes of Northern Hemisphere atmospheric variability. Journal of and Oceanography, Mobley CD (ed). Elsevier: Amsterdam.
Geophysical Research 106: 20145–20159. Richman MB. 1981. Obliquely rotated principal components: an
Hurrell JW, Kushnir Y, Ottersen G, Visbeck M. 2003. An overview of improved meteorological map typing technique. Journal of Applied
the North Atlantic Oscillation. In The North Atlantic Oscillation, Meteorology 20: 1145–1159.
Climate Significance and Environmental Impact, Geophysical Richman MB. 1986. Rotation of principal components. Journal of
Monograph 134 , Hurrell JW, Kushnir Y, Ottersen G, Visbeck M Climatology 6: 293–335.
(eds). American Geophysical Union: Washington, WA; 1–35. Rogers JC, van Loon H. 1982. Spatial variability of sea level pressure
Hyvärinen A. 1999. Survey on independent component analysis. and 500-mb height anomalies over the Southern Hemisphere.
Neural Computing Surveys 2: 94–128. Monthly Weather Review 110: 1375–1392.
Hyvärinen A, Karhunen J, Oja E. 2001. Independent Component Shannon CE, Weaver W. 1962. The Mathematical Theory of
Analysis. Wiley: New York. Communication. University of Illinois Press: Urbana, IL.
Jolliffe IT. 2002. Principal Component Analysis, 2nd edn. Springer: Solow AR. 1994. Detecting change in the composition of a
New York. multispecies community. Biometrics 50: 556–565.
Jolliffe IT. 2003. A cautionary note on artificial examples of EOFs. Stephenson DB, Hannachi A, O’Neill A. 2004. On the existence
Journal of Climate 16: 1084–1086. of multiple climate regimes. Quarterly Journal of the Royal
Karl TR, Coauthors. 1993. Asymmetric trends of daily maximum Meteorological Society 130: 583–605.
and minimum temperature. Bulletin of the American Meteorological Thompson DWJ, Wallace JM. 1998. The Arctic oscillation signature in
Society 74: 1007–1023. wintertime geopotential height and temperature fields. Geophysical
Kendall MG. 1975. Rank Correlation Methods, 4th edn. Charles Research Letters 25: 1297–1300.
Griffin: London. Thompson DWJ, Wallace JM. 2000. Annular modes in the extratropi-
Kutzbach JE. 1967. Empirical eigenvectors of sea-level pressure, cal circulation. Part I: month-to-month variability. Journal of Climate
surface temperature and precipitation complexes over North 13: 1000–1016.
America. Journal of Applied Meteorology 6: 791–802. von Storch H, Zwiers FW. 1999. Statistical Analysis in Climate
Laughlin S. 1981. A simple coding procedure enhances a neuron’s Research. Cambridge University Press: Cambridge.
information capacity. Zeitschrift Fur Naturforschung 36c: 910–912. Wallace JM. 2000. North Atlantic oscillation/annular mode: two
Lazante JR. 1996. Resistant, robust and non-parametric techniques paradigms–one phenomenon. Quarterly Journal of the Royal
for the analysis of climate data: theory and examples, including Meteorological Society 126: 791–805.
applications to historical radiosonde station data. International Wallace JM, Thompson DWJ. 2002. The Pacific Center of Action of
Journal of Climatology 16: 1197–1226. the Northern Hemisphere annular mode: real or artifact? Journal of
Lettenmaier DP. 1988. Multivariate nonparametric tests for trend in Climate 15: 1987–1991.
water quality. Water Resources Bulletin 24: 505–512. Wallace JM, Zhang Y, Renwick JA. 1995. Dynamic contribution to
Lorenz EN. 1956. Empirical orthogonal functions and statistical hemispheric mean temperature trends. Science 270: 780–783.
weather prediction. Technical report, Statistical Forecast Project Wilks DS. 1995. Statistical Methods in the Atmospheric Sciences.
Report 1. Department of Meteorology, MIT: 49. Academic Press: San Diego, CA.
Lydolf PE. 1977. Climates of the Soviet Union. Elsevier: Amsterdam. Yue S, Pilon P, Caradias G. 2002. Power of the Mann-Kendall and
Magnus JR, Neudecker H. 1995. Matrix Differential Calculus with Spearman’s rho tests for detecting monotonic trends in hydrological
Applications in Statistics and Econometrics. John Wiley and Sons: series. Journal of Hydrology 259: 254–271.
Chichester.
Mann HB. 1945. Nonparametric tests against trend. Econometrica 13:
245–259.

Copyright  2006 Royal Meteorological Society Int. J. Climatol. 27: 1–15 (2007)
DOI: 10.1002/joc

You might also like