SOMETCUBA Bulletin

Volume  6  Number 1

January 2000

Previous paperPapers

THE CHANGE-POINT INSTABILITY OF CLIMATOLOGICAL TIME-SERIES AS ALTERNATIVE TO RANDOMNESS. THE EXAMPLE OF ANNUAL TEMPERATURE AVERAGES 1908 - 1995 AT CASABLANCA (CUBA).

The statistical characterisation of time series

Testing randomness. The use of ranks

Transposed from sample to time-series, the definition of randomness assumes an identical and independent distribution of the elements of the series (Sneyers 1975). In particular, for a series of n elements xi, with i = 1, 2,.. , n, having the common continuous distribution function F(x), identity and independence are ensured if, for the joint distribution Fn of xi, we have

    Fn(x1, x2,.. , xn) = F(x1).F(x2).. .F(xn).                                                (1)

It follows that, if x(i) = i are the corresponding ranks of the elements xi ranged in increasing order, we have the mean values (Gumbel 1958)

    E[F(x(i))] = i/(n+1).                                                                             (2)

Replacing in the time series xi by x(i) = Xi, keeps then unchanged all the inequalities between any couple of elements xi and xj. Moreover, with (1), the n! permutations of the elements of the series have all the same probability with, for whatever i and j,

    Prob (Xi > Xj) = p = Prob (Xi < Xj) = q, with p = q = 1/2.                 (3)

Thus, for n trials, we have the binomial distribution of f(m)

    f(m) = Prob [nb (Xi > Xj) = m] = Cmnpmqn-m; with S 0,n f(m) = (p + q)n = 1. (4)

Moreover, for the set of (Xi, Xj), we have the relations

    var Xi = (n2 - 1)/12,

    E(Xi - Xj) = 0, E[cov (Xi, Xj)] = 0, var (Xi - Xj) = n(n+1)/6,             (5)

while, for the correlation coefficient r = r (Xi, Xj) of the series Xi with i + j = 1, 2,.. , n, with relations (5) we have (Kendall and Stuart, 1967)

    E(r) = 0 and var r = 1/(n - 1).                                                              (6)

Continuous distribution functions excluding ties as xi = xj for different i and j, when such ties appear in a time-series of averages of continuous variables, these averages should be recomputed with a sufficient number of significant digits eliminating these ties. Without this precaution, a weakening of the significance level for the tests statistics has to be expected.

Furthermore, the definition of randomness being independent of the type of the underlying distribution of the elements of the time series and randomness being needed for testing the fit of any particular type of distribution, applying parametric tests for testing randomness is not only begging the question, but involves the risk of biased conclusions.

The alternatives to randomness

Testing stability of F(Xi) against trend

The trend statistic tn (Mann 1945) is defined as

    tn = nb (Xi >Xj) for all couples i > j.                                              (7)

With ni = nb (Xj < Xi) for j < i, we have also

    tn = S i ni.                                                                                        (8)

With ni = 0, 1,.. , i, having for a random series, var ni = (i2 - 1)/12, we have also with (8)

    E(tn) = n(n - 1)/4 and var tn = S i var ni = n(n - 1)(2n + 5)/72,       (9)

large or small values of tn being significant for a grouping of either the largest or of the smallest values towards the end of the series and thus of an increasing or of a decreasing trend.

The statistic tn has an approximate normal distribution giving acceptable significance levels when n > 10. With the correction for continuity made in increasing |tn - E(tn)| by 1/2 (Sneyers, 1975), as to be expected, this accuracy is improved and found to be already valid for n > 4.

 

Testing independence against serial correlation

In this case, replacing j respectively by i+1 in the statistic (6), with (Xn+1 = X1), the statistic r becomes

    rs = cov (Xi, Xi+1) / var Xi,                                                             (10)

which, under the hypothesis of randomness, has also 0 mean and variance 1/(n - 1), positive values of rd being significant for persistence and negative ones, for alternance.

The change-point search in time-series analysis


Bulletin author: Alejandro Bezanilla
Copyright © 2000 Cuban Metorogical Society 
Last modified: March 08, 2000
Homepage Institute of Meteorology

About this Bulletin
Contact us

Cuban Meteorogical Society
Previous paperPapers