CLIMAK: a Stochastic Model
For Weather data generation
Francesco
Danuso and Giuliano Castelli
Dipartimento
di Produzione Vegetale e Tecnologie Agrarie, University of Udine
Via
delle Scienze 208, 33100 Udine(Italy)
Fax +39 432 558603
E-mail:
danuso@dpvta.uniud.it
Background. Many human activities and
ecological processes are affected by climatic conditions. Despite the
difficulties in long term meteorological forecasting, a statistical description
of climate is possible and used for planning purposes and strategic decisions.
With this aim, stochastic models for the generation of daily weather data have
been developed (Climak). The generated meteorological data can be used: a) to
perform Monte Carlo simulations with deterministic models (e.g., crop growth
and ecological models, models for climate risk assessment); b) to better
classify the climates; c) to assess environmental scenarios for the effects of
climatic changes by "what if" procedures; d) to spatially interpolate
the climate parameters, so obtaining data for location not covered by
meteorological stations. These data can be used as input for agro-ecological
models, mainly when a probabilistic evaluation of climatic uncertainty and risk
is of interest. Climak has been developed to directly take into account the
among-years variability.
Methods.Climak generates, as first, the occurrence of rain and the rainfall amount for rainy days. After rainfall generation, minimum and maximum air temperatures are generated, with different parameters for rainy and dry days. Solar radiation is obtained from the astronomical photoperiod and from the daily thermal excursion. Evapotranspiration (reference or potential) is generated from solar radiation, if available; if not, it is obtained from photoperiod and daily maximum temperature.
Results. The model was evaluated using
meteorological data coming from different locations of northern Italy and
comparing its capability to reproduce the climatic properties with that of
another weather generator (Wxgen). The behaviour of Climak was quite
satisfactorily even if some minor problems have been highlighted. The minimum
number of years of data to correctly estimate the climatic parameters was also
determined.
Conclusions. Climak was shown to be
satisfactorily accurate in generating meteorological data representing the
climate of a site, even when compared with a well known weather generator. At
present the model still has some limitations and it has not been tested in
climates other then the temperate ones. These drawbacks will be improved in the
next versions of Climak.
Keywords: climate models,
weather generators, software, meteorological data, planning
RIASSUNTO
Scopo. Molte attività umane che
coinvolgono decisioni strategiche e di progettazione dipendono dalle condizioni
climatiche. I modelli stocastici del clima consentono la generazione di dati
meteorologici per esegue simulazioni Monte Carlo con modelli di deterministici
(ad esempio, modelli colturali), classificare i climi; valutare gli scenari
ambientali dovuti ai cambiamenti climatici con procedure "what if" e
spazializzare i parametri climatici per aree lontane dalle stazioni
meteorologiche.
Viene presentato un modello stocastico
del clima (Climak), sviluppato per generare sequenze giornaliere di dati meteorologici
da impiegare come input per i modelli agro-ecologici, e, in particolare, per le
analisi probabilistiche e di rischio.
Metodo. Climak ha una struttura simile ad
altri “weather generators” ma tiene conto direttamente della variabilità fra
gli anni. Climak genera per primo l'evento piovoso (giorno secco o piovoso) e
la quantità di pioggia per i giorni piovosi. Successivamente vengono generate
la temperatura minima e massima dell’aria, con parametri diversi per i giorni
piovosi o secchi. La radiazione solare è ottenuta dal fotoperiodo astronomico e
dalla escursione termica giornaliera. L’evapotraspirazione (di riferimento o
potenziale) è generata dalla radiazione solare se disponibile; se no, viene
ottenuta dal fotoperiodo e dalla temperatura massima giornaliera.
Risultati. Il modello è stato
valutato sui dati meteorologici ottenuti in alcune località del nord Italia, in
confronto con il modello Wxgen, per la capacità di riprodurre le proprietà
climatiche. Il comportamento di Climak è risultato soddisfacente. E’ stata
eseguita anche una valutazione del numero minimo di annate meteorologiche
necessarie a stimare correttamente i parametri climatici.
Conclusioni. Climak è risultato
accurato nella generazione di dati meteorologici di una determinata località,
anche se confrontato con un generatore climatico ben noto ed usato da tempo. Al
momento il modello presenta alcune limitazioni: per esempio, non è stato
sottoposto a test in climi diversi da quelli temperati. Altri aspetti saranno
migliorati nelle prossime versioni del programma.
Parole chiave: clima, modelli
stocastici, generatore climatico, software, dati meteorologici.
The climate of a site is the resultant of
variations in some relevant climatic factors such as air temperature, precipitation,
solar radiation and evaporative demand of the atmosphere. Over time, the
weather factors show periodical trends (daily, annual, multi-annual) and
stochastic variability. The randomness level of climatic variables can be very
different; for example, the rain event can be considered as a purely stochastic
process while the daylight duration (photoperiod) can be considered as a
completely systematic factor. The weather factors also show correlation and
dependence on the previous values of the same and other variables.
Many human activities and ecological
processes are controlled by the climatic conditions, and so great efforts have
been devoted to weather forecasting. Intrinsic limits prevent the long term
forecasting but a statistical description of the climate is possible. This
knowledge allows climatic stochastic models for the generation of weather
data to be developed (Jones et
al., 1970; Richardson, 1981; Larsen and Pense, 1982; Wgen, Richardson and
Wright, 1984; Shu Geng et al., 1985;
Wxgen, Richardson and Nicks, 1990; USClimate, Hanson et al., 1994; MTCLIM, Thornton et
al., 1997; ClimGen, Stockle et al.,
1998; Skills and Richardson, 1998; Parlange
and Katz, 2000; GEM, Johnson et al.,
2001). The generated meteorological data can be used to perform Monte Carlo
simulations with deterministic models (e.g., models of crop growth and of
ecological processes, models for the climate risk assessment). The climatic
models also permit the classification of different climates, the generation of
meteorological data for sites not covered by meteorological stations and the
study of the effects of climatic
changes by "what if" methods (Wilks, 1992).
A model (Climak; Danuso and Della Mea,
1994) for the generation of daily
values for rainfall (Prec, mm/d), minimum
air temperature (Tn, °C), maximum air
temperature (Tx, °C), integral of
solar radiation (Rad, MJ/m²/d) and
evapotranspirative demand of the atmosphere (ETr, mm/d) is described. The weather generation procedure involves
the estimation of climatic parameters from the historical meteorological
recordings and the weather generation from deterministic models and residuals
sampled from probability distributions. The scheme of the whole generation
procedure is reported in figure 1.
Climak was applied to the regional scale
(Danuso et al., 1997) by creating a
grid of climatic parameters for the plains of Friuli-Venezia Giulia to be used
to perform Monte Carlo simulations with agro-ecological models.
Rain is generated first because it is
required for the generation of the other variables. Two aspects are considered:
the occurrence of the rain event (if a day is rainy or not) and, for the rainy
days, the amount of rainfall.
a) Rain
occurrence
Model. The rain event
is assumed to be a stochastic process, represented by a first order Markov
chain (Larsen and Pense, 1982;
Richardson, 1985). The state of each day (rainy or dry) is obtained from the
state of the previous day and from the dry to dry (Pdd) and rainy to dry (Prd)
transition probabilities. The complementary
dry to rainy (Pdr) and rainy
to rainy (Prr) probabilities are
computed as:
Pdr = 1 - Pdd and Prr = 1 - Prd
The transition probabilities are considered
on a monthly base and then the model requires 24 parameters for the rain event
generation (12 for Pdd and 12 for Prd).
Parameter estimation. The transition
probabilities are calculated on a monthly base and are estimated, for a
location, on all the available recordings in the data set as:
Pdd=Ndd/Nd and Prd=Nrd/Nr
where:
Ndd number of dry days in the month, preceded by a dry day;
Nd total number of dry days in the data set, for the month;
Nrd number of dry days in the month preceded by a rainy day;
Nr total number of rainy days in the data set, for the month;
b) Rainfall
amount
Model: The rainfall
amount distribution typically follows a negative exponential-like distribution,
with parameters varying during the course of the year. All these distributions
are limited to positive values and allow also the generation of rare intense precipitation.
The most used probability density function (pdf) for the rainfall amount
generation is the Gamma distribution (Jones et
al., 1970; Larsen and Pense, 1982) but exponential and mixed-exponential
distributions are also used.
In Climak the precipitation amount is
generated by sampling from a two parameters Gamma pdf:
if x>0 and Ag>0 and Bg>0;
otherwise Gamma(x)=0
where G(Ag) is the gamma function. Ag
and Bg are specific parameters for
each month. Moreover, because the rain gages cannot detect rainfall below a
sensitivity threshold (Sthr, about
0.2 mm), the rainfall data are corrected for this threshold.
The total number of parameters needed to
describe the rainfall amount is 25 (Sthr;
Ag and Bg for each month).
Parameter estimation. Before
estimating the parameters of the Gamma distribution, the sensitivity threshold
of the instrument (Sthr) is
subtracted from rainfall data, in order to obtain a distribution starting from
0. Then the Ag and Bg parameters are estimated, on a monthly
base, by the method of the moments (Ag=M²/V;
Bg=V/M, where M is the mean and V the variance of the daily rainfall amounts). Often, for the
estimation of the Gamma pdf parameters, the polynomial approximation of maximum
likelihood from Greenwood and Durand (1960) has been used (Larsen and Pense,
1982; Shu et al, 1985). However, the
method of the moments was here adopted because, for our data, it showed
slightly better in reproducing the original climate statistics.
Model: Air temperature
has a strong dependence on the rainfall status of the day and then it is
normally generated with different parameters for rainy and dry days. Moreover
the yearly trend of minimum and maximum temperatures show, in high latitudes, a
quite regular sinusoidal shape. The approach to calculate daily minimum and
maximum temperatures uses the following steps: 1) minimum and maximum air
temperature, both in rainy and dry days, are considered separately; 2)
depending on the rain status of the day, the mean value of minimum temperature
for that day, is calculated from the proper trend; 3) a random residual is
added to the value of 2); this residual
is sampled from a bivariate normal distribution, autocorrelated with the minimum
temperature residuals of the previous day (first-order autoregressive process);
4) depending on the rain status of the day, the mean value of maximum
temperature for that day is calculated, from the proper trend;
5) a random residual is added to the value
of 4); this residual is sampled from a
bivariate normal distribution of minimum and maximum temperature residuals,
taking into account the correlation between the residuals of minimum and
maximum temperature for the same day.
The minimum (Tn) and maximum (Tx) air
temperatures are obtained as:
Tn=Tnd+Rn and Tx=Txd+Rx if
the day is dry,
Tn=Tnr+Rn and Tx=Txr+Rx if
the day is rainy,
where Tnd
is the average daily minimum temperature for the dry days, obtained as a
function of the date, by interpolating:
Tnd=a+b·sin(Date·2p/365)-c·cos(Date·2p/365)+d·sin(Date·4p/365)-e·cos(Date·4p/365)
This expression
(Carlini, 2003; SIPEAA 2003) is equal to the second order Fourier series
of the type:
Tnd=A+B·sin[(Date-C)·2p/365] + D·sin[(Date-E)·2p/182.5]
by putting:
a=A A=a
b=B·cos(2pC/365) B=(b2+c2)1/2
c=-B·sin(2pC/365) D=(d2+e2)1/2
d=D·cos(4pE/365) C=365/(2p)·acos(b/B)
e=-D·sin(4pE/365) E=365/(4p)·acos(d/D)
where:
A annual average minimum temperature of
the dry days (°C);
B semi-amplitude of the first term (°C);
C phase shift for the first term (days),
assumed as a constant for a location;
D semi-amplitude of the second term (°C);
E phase shift for the second term
(days), assumed as a constant for a location;
Date date as day of the year (from 1 to 365).
A, B and D are obtained by sampling, at the beginning of each year, from the
normal distributions N(MA,SA), N(MB,SB), N(MD,SD),
so determining the specific trend for each year. Since statistics of measured data do not show correlations among
parameters, independent samplings are used.
Analogous calculations, with the specific
parameters, are made for the trends of minimum temperatures for rainy days (Tnr), the maximum temperatures for dry
days (Txd) and maximum temperature
for rainy days (Txr).
Rn is a random residual sampled from a
bivariate normal distribution N2(0,n) with
0
vector of means and n covariance matrix of residuals from the minimum
temperature
of the previous day and the same of the present day. The dispersion
matrix
corresponds to:

where:
SRn standard
deviation of the residuals from the trends of the minimum
temperature, both in dry and rainy days;
RRnn autocorrelation
coefficient for minimum temperature residuals, with
time lag of 1 day.
Rx is a random residual sampled from a
bivariate normal distribution N2(0,x),
where
the dispersion matrix x is:

where:
SRx standard
deviation of the residuals from the trends of the maximum
temperature,
both in dry and rainy days;
RRnx correlation
coefficient between minimum temperature residuals and
maximum
temperature residuals of the same day.
The model requires 80 parameters to describe
the air temperature. Of these, 32 parameters are used for the Fourier series (C, E,
MA, MB, MD, SA, SB
and SD of the four different
temperature trends Tnd, Tnr, Txd,
Txr), while 48 parameters are used to
characterise the monthly distributions of the residuals (SRn, SRx, RRnn, RRnx).
Parameter estimation. The yearly
trends of the four combinations of temperature are obtained by the following
procedure:
1) The values of a,
b, c, d and e are estimated using linear regression, C
and E are calculated and are assumed to be constant among years, for a
specific site.
2) Year by year, the values of A, B
and D are estimated using linear
regression and assuming C and E as constants.
3) The parameters for the normal
distributions of A, B and D are then calculated. The mean values for A, B and D are obtained as:
MA=Aj/n, MB=Bj/n and MD=Dj/n
where: j= index of the year: j=1,..., n
n= number of years in the data set
The standard deviations for A (SA),
B (SB) and D (SD) are obtained as:
,
and ![]()
4) By using the values of A, B
and D from linear regression on the
data set of each year, the trends of the temperatures are obtained and then the
residuals from minimum (Rn) and
maximum temperature (Rx) are
calculated. While four temperature trends are calculated (depending on the rain
status of the day) for the residual distributions no distinction is made
between rainy and dry days .
5) Subsequently the residuals are evaluated
on a monthly base. It is assumed that there is a normal distribution of
residuals and no difference between the residual distributions of dry and rainy
days. For each month, the procedure computes the standard deviation of Rn and Rx (SRn, SRx), the autocorrelation of Rn residuals with those of the previous
day (RRnn) and the correlation
between Rn and Rx (RRnx). This specific
correlation, instead of others (e.g.,
autocorrelation of Tx or
cross-correlation) was selected because it showed the highest values.
Given that the residuals are obtained from
the specific trend of each year, the among year variability of the Rn and Rx distributions has not been considered. The variability of the
trends is taken into account by calculating the variability among years of A, B
and D.
Model. The maximum
(potential) daily integral of solar radiation (Rmax) strictly depends on photoperiod (Ph), while the actual radiation at soil level (Rad) also depends on the cloudiness of the day. Cloud data are not,
in general, available; to evaluate the atmosphere transmittance (related to
cloudiness) the effect of cloudiness on daily thermal excursion (Es=Tx-Tn)
is considered. Normally, on cloudy days the maximum temperature is low because
sun radiation is strongly intercepted and the minimum temperature is high
because the long wave irradiation from the soil during the night is reduced by
the moisture of the atmosphere, so reducing Es.
Radiation is modelled in two steps: the first
calculates the annual trend of the maximum daily radiation (Rmax). This was found to be well
linearly related to the duration of the photoperiod (figure 2); this relation
is considered to be the same for all the years of each location. The second
step evaluates the distribution of the ratio between the daily radiation and Rmax. This ratio (Rr=Rad/Rmax), ranges from 0 to 1 and is
linearly related to the daily thermal excursion (Es). Moreover, the distribution of Rr values for the different Es
classes was found to be a Beta pdf (figure 3). The equation for this distribution
is:
if 0£x£1 and Ab>0 and Bb>0;
otherwise Beta(x)=0
The Beta pdf, being defined for real values
ranging from 0 to 1, fits well the different shapes of the Rr distributions at varying Es.
Ab and Bb are parameters, calculated respectively
with an exponential and an hyperbolic regression above the specific values for
each of the twenty Es classes adopted.
The model needs
six parameters for the generation of the solar radiation: two for the
calculation of Rmax from the photoperiod, two for the best exponential
fit on the twenty values of Ab, and two for the best hyperbolic fit on the
twenty values of Bb.
Parameter estimation A preliminary
step for the estimation of the radiation parameters is the calculation of the
daily astronomical photoperiod depending on latitude and the day of the year.
This is performed with the method described in Keisling (1982). The parameters
of the linear relation between Rmax
and Ph are obtained by selecting only
the maximum values of the solar radiation
in ten-day periods of the year: Rmax
= b1·Ph + b0
Subsequently the program calculates the ratio
between the daily radiation Rad and
the estimated function for Rmax (Rr). The radiation data are then divided
into twenty classes, depending on twenty classes of daily air temperature
excursion. For each class and from the ratio Rr the two parameters of the Beta distribution are evaluated, using
the moments:
Ab=M²·(1-M)/V - M
Bb=Ab·(1-M)/M
where M
and V are the mean and variance of Rr, for each excursion class.
Then an exponential fit is adapted on the
twenty Ab values and two best fit parameters are obtained. In a similar way a
hyperbolic fit is adapted on the twenty Bb values and two other best fit
parameters are obtained.
Model. The reference
evapotranspiration always shows a good linear relation with the radiation
(Doorembos and Pruitt, 1977); the dependence on maximum air temperature and
photoperiod is less good. Given that radiation data are often not available in
the historical meteorological series, two different approaches are adopted,
depending on availability of radiation data:
a) if solar radiation data are available, the
daily reference evapotranspiration is obtained as a linear function of the
daily radiation, plus a residual from a normal distribution (unique for all the
months) with standard deviation Setr: ETr=a1·Rad + a0 + N(0,Setr)
b) if radiation data are not available, ETr is modelled as a function of maximum
air temperature and photoperiod. The following relation was found to give a
good fit:
ETr= c0
+ c1·Tx·Ph² /1000+N(0,Setp)
where Setp
is the standard deviation of the residuals, related to the photoperiod by a
linear function: Setp=d1·Ph + d0. In
this case the standard deviation of the residuals is not considered as a
constant, but linearly depending on the photoperiod.
Three parameters are then needed for the
generation of the reference evapotranspiration if radiation data are available
and 4 in the other case.
Parameter estimation: the estimation
of the parameters a1 and a0 is simply made by linear
regression of ETr vs. Rad. Then, the standard deviation of the
residuals (Setr), obtained
subtracting the predicted data from the measured ones, is evaluated.
With no available radiation data, c1 and c0 come from a linear regression of ETr vs. Tx·Ph². The
residuals of this regression show a distribution with a variance increasing
with the photoperiod. Then the standard deviation of residuals is calculated
separately for each of 12 photoperiod classes. These values are then regressed
against photoperiod to give the d0
and d1 parameters.
It should be noted that the procedure for
evapotranspiration can be applied to simulate every variable representing the
evapotranspirative demand of the atmosphere (e.g., without limitation due to
water shortage). So, data of “reference evapotranspiration”, “potential
evapotranspiration”, “evapotranspiration from class A pan” and others can be
generated, depending on the data set used to estimate the parameters.
Climak generates first the occurrence of
rainy or dry day and the rainfall amount if the day is rainy (figure 1). After
rainfall generation, minimum and maximum air temperature are generated,
separately, for rainy and dry days. Solar radiation is obtained from the
astronomical photoperiod and from the daily thermal excursion. If radiation
data are available, the evapotranspiration is generated from solar radiation;
if not, it is obtained from photoperiod and maximum temperature.
The rain
event is generated by sampling, for each day to be generated, a random
value from the uniform distribution U(0,1).
If the current day is dry, the Pdd
probability is used; if the sampled value U(0,1)
is less than Pdd the following day is
set to "dry", otherwise it is set to "rainy". The same
procedure is adopted with the Prd
transition probability if the present day is rainy. The considered probability
is that specific for each month.
On the rainy days, the rainfall amount (Prec) is
obtained by sampling a value from a G(Ag,Bg)
pdf and adding the threshold value for the instrumental sensitivity:
Prec=G(Ag,Bg)+Sthr
where the parameters Ag and Bg are specific
for each month.
The sampling routine for the Gamma
distribution was implemented merging two algorithms: the first allows the
generation from a Gamma pdf when the parameter Ag is less than 1 (RGS algorithm; Ahrens et al., 1972; Tadikamalla, 1981; Bratley et al. 1984); the second is used when Ag is greater than or equal to 1 (G3A algorithm; Fishman, 1978).
In general, the rainfall amount distribution
has the form of a Gamma pdf with Ag<1
(negative exponential-like); the complete routine has been implemented because
it is used also for the generation of the solar radiation.
The generating method for the air temperature values is strictly
related to the procedure for the estimation of the parameters. After rainfall
generation, the minimum and maximum temperatures are generated separately,
depending on the status of the day (rainy or dry).
At the beginning of a new year the program
samples a value for the parameters A,
B and D from the normal probability distributions N(MA,SA), N(MB,SB)
and N(MD,SD), for each
temperature combination, in order to obtain the annual trends of temperatures
by the Fourier series. C and E are considered to be constant for all
the years while means and standard deviations of A, B and D are different in relation to the year
and for the minimum/maximum and rainy/dry combinations.
Using these parameters, the annual trend of
minimum and maximum air temperature on dry and rainy days is generated, for
each year; then, month by month, each temperature is obtained by adding the
residuals to the trends. The residuals are obtained as follows:
- The minimum temperature residual (Rn) is sampled from the autocorrelated
normal distribution with 0 mean, SRn
standard deviation and RRnn
autocorrelation:
![]()
where:
Rn minimum temperature residual of the
current day;
R1n residual of minimum temperature of the
previous day, already generated;
N(0,1) value sampled from a normal distribution with
0 mean and 1 standard deviation;
- The maximum temperature residual (Rx) is sampled from the bivariate normal
distribution of parameters 0 mean, SRx
standard deviation and RRnx
correlation coefficient, depending on the value Rn:
![]()
The generation of radiation data is based on air temperature excursion and so a
previous generation of air temperature is required. As in the estimation phase,
Climak calculates first the daily photoperiod (Ph). From Ph, the maximum
daily radiation Rmax is obtained as Rmax=b1·Ph+b0, for each day of the year.
After that, for each day, a value of Rr is generated by sampling from a Beta
probability distribution: Rr=B(Ab,Bb),
where Ab and Bb are parameters of the Beta pdf, chosen in relation to the
temperature excursion of the day. For the Beta generation, a well-known
property of this distribution is used:
B(Ab,Bb)=G(Ab,1)/(G(Ab,1)+G(Bb,1))
where B(Ab,Bb) is a value extracted from the
Beta pdf and G(Ab,1) and G(Bb,1) are values from the Gamma
distribution. The generation of a Beta pdf is made by using the algorithm for
the Gamma generation twice and setting Ab
or Bb to 1.
Finally, the daily solar radiation is
calculated as Rad = Rmax × Rr.
If radiation parameters are available, the
daily reference evapotranspiration is
obtained, after the generation of radiation, by generating a value from the
normal distribution N(0,Setr) as:
ETr= a0
+ a1·Rad +N(0,Setr)
When the radiation parameters are missing,
Climak calculates ETr from maximum
temperature and photoperiod. The standard deviation of the ETr residuals (Setp) is,
in this case, obtained as a function of the photoperiod as: Setp
= d1·Ph+d0
The residuals are then generated from a
normal distribution N(0,Setp) and the daily reference
evapotranspiration is obtained from maximum air temperature, photoperiod and
residuals as:
ETr= c0
+ c1·Tx·Ph²/1000 + N(0,Setp)
The current version of Climak (2.0, 4
November 2003) is a GUI (Graphical Using Interface) program that is able to
estimate parameters (text files with cmk
name extension) from historical data (text files with met extension) and to generate meteorological data (text files with
gen extension) from the parameter
file (figure 4).
The generation of meteorological data is
performed by using the climate parameters contained in the parameter files.
Only the variables for which the related parameters are in the parameter file
are generated: that is, if some values are missing, the corresponding climate
variables are not generated. As a minimum, Climak can generates only the
rainfall. The generation of temperature requires rainfall; radiation requires
temperature and evapotranspiration requires radiation or temperature (figure
1).
The meteorological data files of Climak are
text files with name extension met (recorded
data) and gen (generated data). All
the required columns have to be present, even if data are missing. Missing
values have to be indicated by a full stop (.). The meteorological data files
have seven columns (variables) which must follow the order: year code, day of
the year, precipitation, minimum air temperature, maximum air temperature,
solar radiation, reference evapotranspiration. If a variable is not available,
its column has to be filled with full stops. The column of year and day of the
year cannot contain missing values, while the other variables can.
The minimum set of variables required for the
estimation of a parameter set is formed by the year code, the day of the year
and the rainfall; in this case only the rainfall parameters can be estimated.
To estimate also the temperature parameters, rainfall and both minimum and
maximum temperatures have to be supplied. The estimation of the radiation
parameters needs the temperature variables. The estimation of the
evapotranspiration parameters requires the radiation variable or the
temperature variables; when the radiation variable is present, by default, the
parameters of ETr are obtained from
radiation.
The generation of meteorological data is
performed by using the climatic parameters contained in the parameter files.
Climatic parameters are estimated by the Climak program from recorded
meteorological data.
The parameter files are text files with cmk extension and 148 rows of numeric
data. Climak recognises as comment each line starting with an asterisk.
The file of estimated parameters has one
column of 148 elements. Each parameter has a definite position in the column.
Parameters that cannot be estimated are indicated by a full stop (missing
value).
A complete set of parameters is of 148 items.
Of these, 6 are used for the general site description, 49 for the rainfall
generation, 80 for the temperatures, 6 for the radiation and 3 or 4 for the
reference evapotranspiration (depending on the generation method).
Only the variables for which the related
parameters are in the parameter file are generated: that is, if some parameters
are missing, the corresponding climate variable is not generated.
Latitude (as degrees and decimals) is used
for the photoperiod calculations and then its value has an effect on the
generation of radiation and on evapotranspiration. Longitude and altitude serve
only for documentation or when a spatial interpolation has to be performed.
The structure of the parameter file is
reported in tables 1, 2, 3 and 4.
The goodness of the generated weather data
depends on the model but also on the quality of the estimate of parameters.
This depends, in turn, on the estimation method and on the number of year of
data available for the estimation.
An evaluation of the minimum sample size to
estimate the Climak parameters was carried out. If the meteorological data
comes with too short a time period, the estimated values may not correctly
represent the statistical properties of the climate of the site and so the
generated meteorological data should be used with caution.
The minimum number of years needed to give a
correct estimate of the parameters depends on the degree of variability of the
climate itself and on the specific parameters. For example, rainfall values are
known to present higher variability than temperature values.
Using the historical meteorological data
series of Udine (from 1960 to 1989) the estimation of parameters has been
performed, with the specific Climak routine, using a sample of years increasing
size from 5 to 30 years. In figure 5 the estimated values for Bg (Gamma parameter for the generation
of rainfall amount), for each month, of different series length are reported.
For the transition probability which controls the rain events, Pdd requires, in the average of months,
less year data to obtain a stable estimate than Prd (15 and 20 years, respectively). The Pdd and Prd values are
more easily estimated for the winter and spring months (about 10 years are
enough). More years are progressively required for the summer and fall months
(up to 20 years).
The estimation of the annual temperature
trend parameters (MA, MB, MD,
C, E) were good with just 10-15 years, for all the Tn-Tx
and dry-rainy combinations.
The aim of the weather generators is to
generate meteorological sequences having the same statistical properties as the
actual ones. Thus, there are two ways to evaluate the goodness of the generated
data: the first involves the statistical comparison of parameters of the
observed and generated data sets (centrality and dispersion parameters,
distributions). Another, pragmatic, way considers the generated data as
acceptable when a dynamic simulation model, requiring meteorological data as
input, gives the same results with both recorded and generated data.
Climak has already been evaluated and
compared with other weather generators also by other authors. Acutis et al. (1998) evaluated
Climak as compared with ClimGen (Stockle et
al., 1998) and with Wxgen-Vxparm (Berndt and Williams, 1991) using
long-period series from different localities: Akron (USA), Haarveg (NL), Modena
(IT), Tolouse (FR) and Udine (IT). The behaviour of the models was judged
satisfactorily.
In the present paper, a comparison between
recorded and generated data has been made. The behaviour of Climak seems quite
satisfactory, particularly for the generation of temperature and radiation.
Table 5 reports a comparison between annual statistics derived from historical
data recorded at Padua (North-East Italy) and those generated by Climak and by
another weather stochastic generator (Wxgen, Richardson and Nicks, 1990). The
climatic parameters for Wxgen have been estimated by the program Wxparm (Berndt
and Williams, 1991). Climak gives quite satisfactory results, particularly for
the temperature and radiation generation, even considering the monthly
statistics of generated values. Both the models give very similar annual
statistics that are not statistically different from those of the original data
(figures 6, 7, 8 and 9).
While the normal rainfalls are well
reproduced by both generators (figure 9), they are not able (Climak less than
Wxgen) to reproduce the extreme values of observed rainfalls. So, other
generation methods should be applied to generate those values (for example,
sampling from a Poisson pdf).
Climak was found
to be sufficiently accurate in generating meteorological data that represents
the climate of a site, even when compared with a well known weather generator.
The minimum
number of years needed to give a correct estimate of the parameters depends on
the degree of variability of the climate itself and on the specific parameters.
For example, rainfall required more years of data than temperature in order to
be correctly estimated.
In a previous evaluation (Acutis et al., 1999) it was found that Climak
did not reproduce the climate as well as Climgen, when using short series to
estimate parameters. This is due to the characteristics of Climak, as it was
especially designed to cope with the year-to-year variability. In effect, when
using long series to estimate the parameters, Climak is better at reproducing
the climate itself (Acutis et al.,
1998).
The goodness of a weather model basically
depends on its model structure, on methods and algorithms for parameter
estimation and on algorithms for data generation (sampling from pdf). The
algorithms implementation in the computer program plays a major role in model
quality, particularly for the sampling phase. So the weather generator was
evaluated for the real performance of the software. At present the model has
some limitations. It has not been tested in climates other then the temperate
ones. So care should be taken when using the weather generator in such
situations.
The model is “unit-free” for rainfall,
radiation and evapotranspiration but temperature must be in Celsius degrees.
This is because absolute values of thermal excursion are used in radiation
calculation. Instead, the units or rainfall, radiation and evapotranspiration
can be other than those used in this study. This aspect will be fixed in the
next versions. Spatial co-ordinates (latitude and longitude) are also required
as degrees and decimals because latitude is used for the photoperiod
calculation.
In general, the behaviour of the model is
considered satisfactory but some aspects could be improved and inserted into
the next versions of Climak: a) generation of other variables such as dewpoint
temperature, wind speed, etc.; b) the variables (especially, temperature and
rainfall) at a higher time resolution (hourly values, within-storm rainfall
intensity, etc.); c) account for spatial correlation in order to generate
realistic data on a regional or watershed bases; d) developing methods to
spatialise the climatic parameters in hill or mountain conditions; e) a link to
global circulation models (GCM) which can supply parameters for the weather
generator derived from climate scenario forecasting; f) the use in the model of
multi annual trends, related, for example, to the solar spot cycle or to the El
Niño Southern Oscillation (ENSO, Woolhiser et
al., 1993) to obtain the climatic parameters, thus providing also a sort of
climate forecasting; g) the creation of regional georeferenced database of
climate parameters of Climak, particularly for hill conditions.
Work supported by the Italian Ministry of the
University and of the Scientific and Technological Research (MURST - 60%). The
authors wishes to thank Vincenzo Della Mea for his help in developing and
programming the first version of Climak, and Laura Carlini for the temperature
function linearization. The Istituto di Agronomia e Coltivazioni erbacee of the
University of Padua and the Ufficio Idrografico del Magistrato alle Acque are
also gratefully acknowledged for supplying the meteorological data used for the
model evaluation.
Acutis M., Donatelli M.,
Stockle C., 1998. Comparing the performance of three weather generators.
Proceedings of the 5th ESA Congress, Nitra, Slovak Republic, 117-118.
Acutis
M., Donatelli M. e Stockle C., 1999. Performances of two weather generators as a
function of the number of available years of measured climatic data. Proc. Int.
Symposium "Modelling cropping systems", , M. Donatelli, C.O.Stockle,
J.M.Villar and F.Villalobos Eds., ESA, Lleida, 21-23 June, 1999, Catalonia,
Spain.
Ahrens
J.H and U. Dieter, 1972. Computer methods for sampling from gamma, beta,
poisson and binomial distributions. Computing, Vol. 12, 223-246.
Berndt H.D. and Williams,
1991. USDA/ARS Grassland Soil & Water research Lab. Temple,
TX, USA, 74 p.
Bratley
P., Bennet L. F., Schrage L.E., 1983. A guide to simulation. Springer-Verlag, New
York.
Carlini L., personal
communication, 2003.
Coe R. and Stern R.D., 1982.
Fitting models to daily rainfall data. J. Applied Meteorology. 21, 1024-1031.
Danuso
F., Della Mea V., 1994. Development and evaluation of a stochastic model for
the generation of weather data. III ESA Congress, Abano, Padova
(Italy), 18-22 September 1994.
Danuso F., Ceschia M., Bortolussi S., Della Mea V. and R. Giovanardi, 1997. A database of climatic parameters for stochastic weather generation in the region of Friuli-Venezia Giulia. Proc. 14th Int. Congress of Biometeorology, 1-8 September 1996. Ljubljana, Slovenia, Int Soc. Biometeorology, Biometeorology, Part 2 (vol. 3), 361-375.
Doorembos J. and Pruitt
W.O., 1977. Guidelines for predicting crop water requirement. FAO Irrigation
and Drainage Paper n. 24. Rome.
Fishman G.S., 1978.
Principles of discrete event simulation. J. Wiley & Sons, New York, pag.
420.
Greenwood, J.A. and Durand,
D., 1960. Aids for fitting the gamma distribution by maximum likelihood.
Technometrics, 2: 55-65.
Hanson C.L., Cumming K.A.,
Woolhiser D.A. and Richardson C.W., 1994. Microcomputer program for daily
weather simulation. US Dept. Agric., Agric.Res. Svc. Pub. No. ARS-114, 38 pp.
Johnson
G.L. et al., 2001. ARS-NRCS Weather simulation team, USDA-NRCS, 101 SW Main St., Suite
1600, Portland, OR 97204-324 Email: gjohnson@wcc.nrcs.usda.gov.
Jones, J.W., Colwick R.F.,
Threadgill, E.D., 1970. A simulated environmental model of temperature,
evaporation, rainfall and soil moisture. ASAE Paper n. 70-404.
Jones, P.G. and Thornton
P.K., 1993. A rainfall generator for agricultural applications in the tropics.
Agric. and Forest Meteorology, 63, 1-19.
Keisling T.C., 1982.
Calculation of the length of day. Agron. J., 77:500-505.
Larsen G.A. and Pense R.B.,
1982. Stochastic simulation of daily climatic data for agronomic models.
Agronomy Journal, 74, 510-514.
Parlange M.B., Katz R.W.,
2000. An extended version of the Richardson model for simulating daily weather
variables. Journal of Applied Meeorology, 39, 610-622.
Richardson C.W. and Nicks,
A.D., 1990. Weather generator descriptor. In "EPIC - Erosion /productivity
Impact Calculator. 1. Model documentation. USDA, Tec. Bull. 1768, Temple TX.
Richardson, C.W. and D.A.
Wright, 1984. WGEN: a model for generating daily weather variables. US Dept.
Agric., Agric.Res. Svc. Pub. No. ARS-8, 83 pp.
Richardson, C.W., 1981.
Stochastic simulation of daily precipitation, temperature and solar radiation.
Water Resour. Res., 17(1): 182-190.
Richardson, C.W.,1985.
Weather simulation for crop management models. Transactions of the ASAE. 1985,
28: 5, 1602-1606.
Shu Geng, Penning de Vries
F.W.T. and Supit I., 1985. Analysis and simulation of weather variables-part
II: temperature and solar radiation. Simulation Reports CABO-TT n. 5.
SIPEAA,
2003. CLIMA Component, http://www.sipeaa.it
.
Stata Corporation, 1993.
Stata Reference Manual: Release 3.1. 6th ed. College Station, TX, USA.
Stockle C.O. et al., 1998.
ClimGen Manual. BSE, WSU, Pullman, WA 99164-6120.USA. 28 pp.
Tadikamalla, P.R. and M.E. Johnson, 1981. A complete guide to gamma variate generation. Amer. J. of math. and man. sci., vol 1, 213-236.
Thornton P.E., Running S.W., White M.A., 1997. Generating surfaces of daily meteorological variables over large regions of complex terrain. J. of Hydrology 190, 214-251.
Wilks, D.S., 1992. Adapting
stochastic weather generation algorithms for climate change studies. Climatic
Change, 22:1, 67-84.
Woolhiser D.A, Keefer T.O.,
Redmond K.T., 1993. Southern oscillation effects on daily precipitation in the
southwestern U.S. Water Resources Research, 29, 1287-1295.
Table 1. Structure of the climatic parameter file of
Climak. The parameter values estimated on Padua 1981-1988 meteorological data
are reported. Each parameter is located in a specific row of the climatic
parameter file.
|
Rows |
symbol |
values |
parameter description |
unit |
|
General
parameters |
||||
|
1 |
Location |
Padova |
|
|
|
2 |
Latitude |
45 |
latitude |
degrees |
|
3 |
Longitude |
12 |
longitude |
degrees |
|
4 |
Altitude |
20 |
altitude above sea level |
m |
|
5 |
Slope |
-999.0 |
slope |
? |
|
6 |
Aspect |
-999.0 |
aspect |
? |
|
Rainfall
parameters |
||||
|
7 |
Sthr |
0.2 |
Rainfall sensitivity threshold |
mm/d |
|
8-19 |
Pdd |
(1) |
transition probability dry to dry day |
- |
|
20-31 |
Prd
|
(1) |
transition probability rainy to dry day |
- |
|
32-43 |
Ag |
(1) |
gamma distribution: 1st parameter |
- |
|
44-55 |
Bg |
(1) |
gamma distribution: 2nd parameter |
- |
|
Temperature
parameters |
||||
|
56-59 |
MA |
(2) |
mean of the A parameter of Fourier series |
°C |
|
60-63 |
MB |
(2) |
mean of the B parameter of Fourier series |
°C |
|
64-67 |
MD |
(2) |
mean of the D parameter of Fourier series |
°C |
|
68-71 |
C |
(2) |
mean of the C parameter of Fourier series |
date |
|
72-75 |
E |
(2) |
mean of the E parameter of Fourier series |
date |
|
76-79 |
SA |
(2) |
standard deviation of A |
°C |
|
80-83 |
SB |
(2) |
standard deviation of B |
°C |
|
84-87 |
SD |
(2) |
standard deviation of D |
°C |
|
88-99 |
SRn |
(1) |
standard deviation of Tn residuals (Rn) |
°C |
|
100-111 |
SRx |
(1) |
standard deviation of Tx residuals (Rx) |
°C |
|
112-123 |
RRnn
|
(1) |
autocorrelation coefficients of Tn residuals |
- |
|
124-135 |
RRnx
|
(1) |
correlation coefficients between Tn and Tx residuals |
- |
|
Radiation
parameters |
||||
|
136 |
b1
|
2.878 |
slope of Rmax = f(Ph) |
- |
|
137 |
b0
|
-19.733 |
intercept of Rmax = f(Ph) |
- |
|
138-139 |
Ab |
(3) |
beta distribution: 1st parameter |
- |
|
140-141 |
Bb |
(4) |
beta distribution: 2nd parameter |
- |
|
Evapotranspiration parameters |
||||
|
142 |
a1 |
0.314 |
slope for ETr=f(Rad) |
- |
|
143 |
a0 |
-0.394 |
constant for ETr=f(Rad) |
mm/d |
|
144 |
Setr |
1.011 |
standard deviation of ETr residuals |
- |
|
145 |
c1 |
0.001 |
slope for ETr=f(Tx·Ph²) |
- |
|
146 |
c0 |
-0.072 |
const. for ETr=f(Tx·Ph²) |
mm/d |
|
147 |
d1 |
0.094 |
slope Setp=f(Ph) |
- |
|
148 |
d0 |
-0.463 |
constant for Setp=f(Ph) |
mm/d |
(1) 12 parameters; one for each month. See table 2.
(2) 4 parameters; one for each combination: min. temperature-dry day, min. temperature-rainy
day, max. temperature-dry day, max. temperature-rainy day. See table 3.
(3) 2 parameters from an exponential best fit
(4) 2 parameters from a hyperbolic best fit
Table 2. Values for some climatic parameters estimated
using the 1981-1988 meteorological data recorded at Legnaro (Padua, Italy) (45°
Lat N).
|
month |
Pdd dry to dry transition probability |
Prd rainy to dry transition probability |
Ag Gamma pdf parameter |
Bg Gamma pdf parameter |
SRn Std.dev. of Tn residuals |
SRx Std.dev. of Tx residuals |
RRnn autocorre-lation of Tn residuals |
RRnx Correlation Tn-Tx residuals |
|
JAN |
0.898 |
0.481 |
0.600 |
8.097 |
2.964 |
2.881 |
0.639 |
0.537 |
|
FEB |
0.921 |
0.513 |
0.602 |
16.261 |
2.161 |
2.963 |
0.477 |
0.471 |
|
MAR |
0.840 |
0.521 |
0.549 |
13.938 |
2.679 |
2.657 |
0.604 |
0.332 |
|
APR |
0.785 |
0.484 |
0.976 |
6.133 |
2.901 |
3.103 |
0.742 |
0.441 |
|
MAY |
0.697 |
0.385 |
0.882 |
8.040 |
2.424 |
2.909 |
0.667 |
0.573 |
|
JUN |
0.790 |
0.645 |
0.787 |
11.592 |
2.209 |
3.092 |
0.602 |
0.547 |
|
JUL |
0.855 |
0.792 |
1.310 |
6.225 |
2.194 |
2.422 |
0.572 |
0.656 |
|
AUG |
0.846 |
0.702 |
1.152 |
8.644 |
2.154 |
2.469 |
0.594 |
0.610 |
|
SEP |
0.881 |
0.500 |
0.693 |
10.326 |
2.352 |
2.590 |
0.532 |
0.453 |
|
OCT |
0.864 |
0.712 |
0.993 |
10.953 |
3.161 |
2.512 |
0.687 |
0.376 |
|
NOV |
0.888 |
0.510 |
0.724 |
12.663 |
3.012 |
2.909 |
0.679 |
0.549 |
|
DEC |
0.893 |
0.448 |
1.163 |
7.720 |
2.829 |
2.671 |
0.610 |
0.643 |
Table 3. Values for the annual
temperature trend parameters estimated using the 1981-1988 meteorological data
recorded at Legnaro (Padua, Italy) (45° Lat N).
|
temperature type |
A |
B |
C |
D |
E |
|||
|
MA |
SA |
MB |
SB |
MD |
SD |
|||
|
Tn-dry |
6.83 |
0.795 |
10.08 |
0.595 |
111.8 |
-0.545 |
0.693 |
130.91 |
|
Tn-rainy |
8.59 |
0.559 |
8.14 |
0.907 |
114.6 |
-0.376 |
1.335 |
108.83 |
|
Tx-dry |
16.71 |
0.521 |
12.07 |
0.461 |
110.9 |
-1.817 |
0.744 |
128.49 |
|
Tx-rainy |
15.67 |
0.328 |
10.47 |
0.642 |
113.4 |
-0.244 |
0.649 |
107.58 |
Table 4. Values of the Beta pdf best fit exponential
(for Ab) and hyperbolic (for Bb) parameters. The parameters were estimated
using the 1981-1988 meteorological data recorded at Legnaro (Padua, Italy).
|
Exponential best fit parameters for Ab |
-0.121 |
0.170 |
|
Hyperbolic best fit parameters for Bb |
0.656 |
-1.372 |
Table 5. Comparison of annual values (totals, averages
and related standard deviations) of meteorological variables recorded, from 1981
to 1988, at the Istituto di Agronomia of Padua (North-East Italy) and those
generated (30 years) by Wxgen (Richardson and Nicks, 1990) and Climak The
significance of the t-test for the differences between the recorded and
generated means and total values are also given.
|
meteorological variable |
unit |
recorded data |
Wxgen generated |
Climak generated |
||
|
total/mean-sd |
P t-test |
total/mean-sd |
P t-test |
|||
|
Rainfall amount |
mm/year |
750±97 |
727±119 |
0.62 |
718±117 |
0.48 |
|
Rainy days |
n/year |
85±11.6 |
86.2±10.8 |
0.78 |
81.7±9.8 |
0.42 |
|
Minimum temperature |
°C |
7.67±0.58 |
7.57±0.32 |
0.51 |
7.42±0.63 |
0.32 |
|
Maximum temperature |
°C |
16.83±0.56 |
16.76±0.40 |
0.68 |
16.81±0.67 |
0.93 |
|
Solar radiation |
MJ/m²/d |
10.18±0.32 |
10.17±0.28 |
0.93 |
10.27±0.44 |
0.60 |

Figure 1. Structure and generation procedure of Climak.

Figure 2. Distribution over the year
of daily integrals of solar radiation (Rad)
recorded at Padua from 1981 to 1988 (represented as dots). The astronomical
calculated photoperiod (Ph) is also
reported (solid line); it was rescaled in order to show its adequacy in
representing the maximum solar radiation for each day of the year (Rmax).

Figure 3. Frequency
distribution of the ratio Rr=Rad/Rmax
for each of four classes of daily temperature excursion (0-6, 6-10, 10-14 and
14-18 °C). The distributions can be adequately represented by a Beta pdf. Data
were recorded at Padua from 1981 to 1988.

Figure 4. Procedure for the parameter
estimation and weather data generation.

Figure 5. Estimates of the rainfall
parameters Bg, for each month, using
different sample sizes, on meteorological data recorded at Udine from 1960 to
1989.

Figure 6. Comparison among and monthly
rainfall amount (mm) and monthly number of rainy days and the related standard
deviation (SD), as recorded at Udine
from 1960 to 1989 (circle), generated
by Climak (triangle) and by Wxgen (square) after estimating the specific
climatic parameter on Udine meteo data.

Figure 7. Comparison among minimum and
maximum temperature and the related standard deviations (SD), as recorded at Udine from 1960 to 1989 (circle), generated by
Climak (triangle) and by Wxgen (square) after estimating the specific climatic
parameter on Udine meteo data.
|
|
Figure 8. Comparison among daily solar
radiation and the related standard deviations (SD), as recorded at Padua from
1981 to 1988 (circle), generated by Climak (triangle) and by Wxgen (square)
after estimating the specific climatic parameter.
|
|
Figure 9. Rainfall amount distribution
as recorded at Padua from 1981 to 1988, generated by Climak and Wxgen for the
same site.