Predicting the spatio-temporal distribution of Culicoides imicola in Sardinia using a discrete-time population model

  • Thibaud Rigot1, 2Email author,

    Affiliated with

    • Annamaria Conte3,

      Affiliated with

      • Maria Goffredo3,

        Affiliated with

        • Els Ducheyne4,

          Affiliated with

          • Guy Hendrickx4 and

            Affiliated with

            • Marius Gilbert1, 5

              Affiliated with

              Parasites & Vectors20125:270

              DOI: 10.1186/1756-3305-5-270

              Received: 8 June 2011

              Accepted: 9 November 2012

              Published: 22 November 2012

              Abstract

              Background

              Culicoides imicola KIEFFER, 1913 (Diptera: Ceratopogonidae) is the principal vector of Bluetongue disease in the Mediterranean basin, Africa and Asia. Previous studies have identified a range of eco-climatic variables associated with the distribution of C. imicola, and these relationships have been used to predict the large-scale distribution of the vector. However, these studies are not temporally-explicit and can not be used to predict the seasonality in C. imicola abundances. Between 2001 and 2006, longitudinal entomological surveillance was carried out throughout Italy, and provided a comprehensive spatio-temporal dataset of C. imicola catches in Onderstepoort-type black-light traps, in particular in Sardinia where the species is considered endemic.

              Methods

              We built a dynamic model that allows describing the effect of eco-climatic indicators on the monthly abundances of C. imicola in Sardinia. Model precision and accuracy were evaluated according to the influence of process and observation errors.

              Results

              A first-order autoregressive cofactor, a digital elevation model and MODIS Land Surface Temperature (LST)/or temperatures acquired from weather stations explained ~77% of the variability encountered in the samplings carried out in 9 sites during 6 years. Incorporating Normalized Difference Vegetation Index (NDVI) or rainfall did not increase the model's predictive capacity. On average, dynamics simulations showed good accuracy (predicted vs. observed r corr = 0.9). Although the model did not always reproduce the absolute levels of monthly abundances peaks, it succeeded in reproducing the seasonality in population level and allowed identifying the periods of low abundances and with no apparent activity. On that basis, we mapped C. imicola monthly distribution over the entire Sardinian region.

              Conclusions

              This study demonstrated prospects for modelling data arising from Culicoides longitudinal entomological surveillance. The framework explicitly incorporates the influence of eco-climatic factors on population growth rates and accounts for observation and process errors. Upon validation, such a model could be used to predict monthly population abundances on the basis of environmental conditions, and hence can potentially reduce the amount of entomological surveillance.

              Keywords

              Spatial ecology Infectious disease Remote-sensing Dynamic model Longitudinal entomological surveillance network Mediterranean basin

              Background

              Culicoides imicola, one of the major Bluetongue disease (BT) and African horse sickness (AHS) vectors in the old world [13] has probably been the most investigated Culicoides species over the last decades. A good knowledge of its ecology helps in quantifying potential disease transmission, and assessing the risk of resurgence or expansion of BT/AHS. Therefore, a large number of studies have investigated the spatio-temporal distribution of C. imicola in the old world [410]. Furthermore, since the establishment of BT in southern Europe in 2000 [11], systematic entomological samplings were carried out in many countries in order to monitor the vector populations. An important number of descriptive studies followed [1219]. Long-term and wide-scale surveys also provided adequate datasets that have supported predicting the vector distribution at large scale as a function of a variety of environmental predictors ([2032], summed up in Table 1). However, the interpretation of the results across those studies is not straightforward (Table 1). First, because these studies used various modelling approaches, scale and set of environmental predictors. Second, because the correspondence between the timing of the catches and timing of the observation of eco-climatic variables was variable. As a consequence, even if the accuracy of the distribution models was found to be very good, the lack of consistency across study predictions in some regions, and the difficulty in interpreting the relationships that were found, preventing the use of these models for predicting the species distribution outside of the spatio-temporal range of the training data.
              Table 1

              Summary of the risk studies on Culicoides imicola distribution in the Mediterranean basin since 1998

              References

              Extent

              Resolution

              Dependent variable

              Statistical model

              Cofactors selected through the analysis

              External evaluation

              [21, 22]

              Morocco,

              Morocco/

              Iberia

              Not given

              Abundance

              Linear regression

              NDVI (min), windspeed

              No.

              [23]

              Iberia

              Size of sampled sites

              Pres./abs.

              Logistic regression

              Mean monthly T° (min & max), number of months in the year with mean T° exceeding 12.5 °C

              No.

              [24]

              Iberia,

              Morocco

              8 km x 8 km

              Abundance

              Discriminant analysis

              8-variables model with DEM and Fourier-transformed NDVI, MIR, VPD and LST (cf. Table2)

              No.

              [25]

              Italy,

              Calabria

              Size of sampled sites

              Pres./abs.

              Logistic regression

              T° (min & max)

              No.

              [26]

              Portugal

              1 km x 1 km

              Both

              Discriminant analysis

              DEM and Fourier-transformed LST, NDVI, MIR and TAIR including seasonal cycles

              No.

              [27]

              Italy

              10 km x 10 km

              Pres./abs.

              Multiple logistic regression

              Mean altitude and mean annual daily min T° and relative humidity

              No.

              [28]

              Sicily

              1 km x 1 km

              Pres./abs.

              Stepwise discriminant analysis

              10-variables model with Fourier-transformed LST, NDVI, MIR and TAIR (cf. p93)

              No.

              [29]

              Italy

              Cell size = 0.0387°

              Pres./abs.

              Additive model

              Elevation, slope, aridity index, landuse, animal density, soil type, T° and NDVI for the 4 seasons

              No.

              [30]

              Italy

              250 m x 250 m

              Both

              Discriminant analysis

              Min. T°, aridity index, altitude, slope, NDVI and forest cover

              No.

              [31]

              Spain

              1 km x 1 km (sometimes 8 km x8km)

              Pres./abs.

              Logistic regression

              Mean NDVI, sun index, interpolated precipitations and T° and their seasonality

              Yes.

              [32]

              Spain

              UTM 10 km x 10 km

              Abundance

              GLM neg. bin. & variation partitioning

              21-variables model with spatial location, topo-climatic variable, domestic and wild hosts, soil, NDVI and seasonality

              No.

              DEM, Digital Elevation Model; MIR, Middle Infrared Radiance; VPD, Vapour Pressure Deficit; TAIR, Air temperature. NB: A review for some studies that occurred until 2004 is also provided in Baylis et al. [20].

              In this study, the set of eco-climatic predictors used to predict C. imicola abundances was restricted to a few variables measured either by weather stations (temperature, rainfall) or by remote sensing (Land Surface Temperature (LST), Normalized Difference Vegetation Index (NDVI)). In contrast to previous studies, these indices were tested against C. imicola population using a spatially and temporally explicit model, i.e. each catch was statistically tested against the eco-climatic conditions that were measured in the matching trapping site, and previous month. The aim of the study was therefore (i) to describe the influence of these eco-climatic indicators on the monthly abundances of C. imicola measured in Sardinia from 2001 to 2006 and (ii) to predict the observed spatio-temporal dynamics on that basis. The applied perspective of this research consisted in introducing a simple, but yet robust method to analyse data collected through longitudinal networks of entomological surveillance and to map their abundances both in space and time.

              Methods

              Data

              Between 2001 and 2007, more than 200 permanent Onderstepoort-type black-light traps were operated weekly in Italy in accordance with standardized surveillance procedures of the National Reference Centre for Exotic Diseases [16]. Culicoides imicola is not present everywhere in Italy, and can be considered to be endemic in Sardinia region [13, 33]. Since the present study aimed to better understand factors affecting C. imicola seasonality, 9 sites from Sardinia were selected (Figure 1, black dots). In order to filter the high variability in C. imicola trap catches, data were aggregated for each site by month and maximum abundances were retained, as in previous studies [34]. In order to fill existing gaps, linear regulations implemented in the R library PASTECS [35] were applied, and 49 observations were filled that way. In complement, eleven sites distributed in Lazio (1 site) and Tuscany (10 sites) (Figure 1, open dots) were used to evaluate the extrapolation capacity of a selected set of models.
              http://static-content.springer.com/image/art%3A10.1186%2F1756-3305-5-270/MediaObjects/13071_2011_Article_804_Fig1_HTML.jpg
              Figure 1

              Study area. C. imicola population dynamics were studied on the basis of weekly samplings carried out in Sardinia (bottom left region) at the level of 9 permanent light-traps from 2001 to 2006 (black dots). During the same period, meteorological data were collected at the level of 31 weather stations (grey crosses). Data gathered outside the study area (mainly Tuscany, upper right region on the top) were used to evaluate the model built outside its range of training (open circles).

              The model aimed to compare two sets of predictors in their potential to predict C. imicola seasonality. The first set was made of data collected in weather stations (WS). These data give a good measure of the conditions actually observed on the ground over time, but their relatively limited number does not allow the quantification of spatial variations and heterogeneity. The second set was made of satellite remotely sensed variables. These measures can be affected by various processes taking place between the ground and the sensor, but they have the advantage of better capture of the spatial distribution of the variable of interest, and in doing so, have a good chance to provide an accurate estimate at the locations of the traps.

              Meteorological data were obtained from the Italian Air Force Meteorological Service. They were collected between 2000 and 2007 in 105 weather stations distributed across Italy. Eleven of them are found in Sardinia and 21 cover Tuscany and Lazio (Figure 1). Daily observations include temperature (min, max,), rainfall (cumulated value in the first half (12 hours) of the day, cumulated value in the second half (12 hours) of the day), relative humidity (RH), wind direction and intensity. After an aggregation by month, mean temperatures, mean and cumulated rainfalls were used in our analyses. These data were interpolated using ordinary kriging [36]. The empirical semivariogram was modelled using a circular model for temperature and a spherical model for rainfall with initial sill, range and nugget fixed to 8.5, 100 000 and 0.1 respectively. Goodness-of-fit of the semivariogram model was evaluated after leave-one-out cross validation [36]. Output resolution was set as 0.01 decimal degree in accordance with the resolution provided by RS products.

              Remotely-sensed eco-climatic variables included the Land Surface Temperature and Normalized Difference Vegetation Index. Day-time and night-time LST products (MODIS Terra MOD11A2) composited at an 8-day interval and the NDVI (MODIS Terra MOD13A2) composited at an 16-day interval were downloaded from the Land Processes Distributed Active Archive Centre. Additionally, the surface reflectance data (MODIS Terra MOD9A1) composited at a 16-day interval were also downloaded. These data were mosaicked and resampled from the original metric sinusoidal projection system to decimal degrees (WGS84) using a nearest-neighbour resampling and with an output resolution of 0.01 decimal degrees. The images were then subjected to a spline interpolation to remove the missing gap [37].

              These data were aggregated by month, keeping the minimum, maximum and mean values.

              Model

              The discrete-time population model was built in three steps.

              The first step consisted in building an autoregressive model where the abundance of populations was predicted using a linear combination of (i) the population in the previous time step (autoregressive term) and (ii) eco-climatic variables measured the month before (equation 1).
              N s,t = N μ o b s , σ obs http://static-content.springer.com/image/art%3A10.1186%2F1756-3305-5-270/MediaObjects/13071_2011_Article_804_Equ1_HTML.gif
              (1)
              μ~aN s,t - 1 +b Var 1 +c Var 2 +…+zVar n + I n t e r c e p t http://static-content.springer.com/image/art%3A10.1186%2F1756-3305-5-270/MediaObjects/13071_2011_Article_804_Equ2_HTML.gif
              (2)

              where Ns,t and Ns,t-1 are respectively the population sampled in site s at time t and t-1; Var1, Var2, …, Varn are the eco-climatic predictors; a, b, …, z and Intercept are the model parameters.

              In the second step, we assumed that the measured population abundances can be quantified from equation 2.
              N t =N t - 1 * exp r * Δ t http://static-content.springer.com/image/art%3A10.1186%2F1756-3305-5-270/MediaObjects/13071_2011_Article_804_Equ3_HTML.gif
              (3)
              After the log-transformation of both sides of equation 2, assuming a constant time interval (in our case a month) between 2 samples and making the relationship spatially explicit (time t, site s), equation 2 become:
              L n N s , t = L n N s , t 1 + r s , Δ t http://static-content.springer.com/image/art%3A10.1186%2F1756-3305-5-270/MediaObjects/13071_2011_Article_804_Equ4_HTML.gif
              (4)

              One can note that this relationship is very similar to equation 1. Combining the two approaches, we can assume that the linear relationship found between eco-climatic variables and the log-transformed abundances found at time t in site s, corresponds to growth rater found in equation 3. The right-hand part of equation 1, i.e. the intercept and additional cofactors differing from the autoregressive cofactor, will therefore be considered as proxies for predicting the monthly increase in population. This approach allows more straightforward biological interpretation of the inclusion of cofactors in the right-side hand of the linear model. The parameters were estimated using a Generalized Linear Model (GLM) and accounting for a normally distributed error term. This model was initially built to estimate model coefficients, i.e. the relationship that exists between population abundances and co-factors, their range of confidence and the dispersal in residuals.

              Finally, we aimed at evaluating how process and observation error could influence the precision and accuracy of the model parameters and predictions [38, 39]. Because no prior information on the magnitude of the different sources of observations error in our datasets was available, we evaluated the influence of several levels of observation error ( = 1/2, 1 or 2 times the total variance found in the whole dataset) on the estimates of model parameters, the dependence between population abundances at time t and in the previous month (coefficient of the autoregressive cofactor) and the dispersal in residuals ( = process error).

              Based on the parameters identified by the GLM statistical model, the space-time dynamics of C. imicola populations was simulated by seeding 13 individuals at t0 (here, March 2001). Population in month t1 was predicted by applying the GLM model coefficients to the initial number of individuals seeded and eco-climatic variables measured in the previous month t0. Population at time 1 was then used to predict population at time 2, and the predictions were iterated through the time series until December 2006. At each time step, both process and observation error values were added to the predicted abundance of population. These value were both sampled from a normal distribution with a zero-mean and a standard deviation equal to that found in the model residuals for the process error, and equal to that of the overall log-transformed catches for the observation error. The seed value of 13 individuals was chosen because it was the mean abundance encountered in the months of March from 2001 to 2006 in Sardinia.The predictions were repeated over 100 simulations, and averaged.

              Finally, using a similar approach with a seed of 2 individuals in March 2005, simulations were carried out over the entire spatial domain of Sardinia (extent: 8 to 10°E; 38.8 to 41.4°N) until December 2006. The approach allowed production of monthly maps of C. imicola relative abundance at a spatial resolution of 0.01 decimal degree. The maps presented in the results section were produced on the basis of 250 bootstraps. For each of the bootstraps, the coefficients of all terms (constant, autoregressive co-factor and covariates) were sampled from a normal distribution of coefficients with mean and standard deviation estimated by the GLM model. The GLM model was finally applied to Tuscany to test its predictions against the observed catches.

              All analyses were implemented under R [35], using the packages MASS [40], raster [41], rgdal, PASTECS and gstat [36].

              Results

              The GLM coefficients used in the spatio-temporal dynamic model are presented in Table 2. Most of the explained variability predicted by the GLM resulted from the inclusion of an autoregressive term (Table 2; multiple r-squared r2 = 0.65, p < 0.001), then from temperature (LST, r2 = 0.25, p < 0.001; or T from WS, r2 = 0.36, p < 0.001). When combined, the autoregressive cofactor and LST or WS temperature predicted respectively 75.6% or 75.9% of the total variability. Other co-variables such as altitude (r2 = 0.16, p < 0.001), rainfall (r2 = 0.01, p < 0.01) or NDVI (r2 = 0.25, p < 0.001; r2 = 0.70, when tested with the autoregressive cofactor) had a much lower predictive power. The correlation coefficients between observations and GLM predictions based on remote-sensing or weather stations data were very similar (r corr = 0.87 and 0.88, respectively). Predictions made in Tuscany resulted in overestimating the populations in most sites where it was applied (Additional file 1: Figure S1).
              Table 2

              Coefficient estimates (with 95% CI) from GLM carried out with RS or WS data respectively

              Coefficients

              Estimates

              Pr(>|t|)

              95%CI

              AIC

              Multiple R2

              RS

                    

               Intercept

              −1.73

              ***

              −2.62

              −0.85

              2165.7

              0.766

               Autoreg

              0.70

              ***

              0.65

              0.74

                

               LST

              2.25

              ***

              0.18

              0.32

                

               LST2

              −0.0038

              ***

              −0.0058

              −0.0018

                

               NDVI

              −0.13

              ns

              −1.22

              0.95

                

               Altitude

              −0.0011

              **

              −0.0018

              −0.00044

                

              WS

                    

               Intercept

              −1.66

              ***

              −2.42

              −0.90

              2160.2

              0.768

               Autoreg

              0.64

              ***

              0.59

              0.68

                

               T

              0.22

              ***

              0.12

              0.31

                

               T2

              −0.0016

              ns

              −0.0047

              0.0012

                

               Rainfall

              0.046

              p = 0.06

              −0.0023

              0.095

                

               Altitude

              −0.0015

              ***

              −0.00219

              −0.00084

                
              The main results of the simulation models are presented in Figure 2. Simulations had lower predictive power than the GLM’s (Table 3) and were not able to reproduce the level of all peaks of measured populations (Figures 2 and 3). Nevertheless, they succeeded in reproducing the seasonality of the populations (Figure 3). The predictions of the model also allowed reproducing the marked difference between C. imicola catches made at different sites and the extinction that could be observed in high elevation sites more specifically in Austis, for example (Additional file 2: Figure S2 and Additional file 3: Figure S3).
              http://static-content.springer.com/image/art%3A10.1186%2F1756-3305-5-270/MediaObjects/13071_2011_Article_804_Fig2_HTML.jpg
              Figure 2

              Evaluation of the dynamic models' accuracy and precision. Comparisons among observed C. imicola log-abundances and the predictions carried out with the population dynamic model (99 simulations in grey and the average is given by black dots), using RS data (A) or WS data (B).

              Table 3

              Goodness of fit found for the GLM and the dynamic model run with RS or WS data

               

              Training model

              Dynamic model (99 stochastic realizations)

              Whole extent (site level)

              Mean model (regional level)

              Data

              RS

              WS

              RS

              WS

              RS

              WS

              Rcorr

              0.88

              0.88

              0.71

              0.75

              0.9

              0.9

              RMSE

              1.361

              1.355

              1.97

              1.86

              0.84

              0.83

              Indicators of goodness of fits for the dynamic model are also presented on Figure 2.

              http://static-content.springer.com/image/art%3A10.1186%2F1756-3305-5-270/MediaObjects/13071_2011_Article_804_Fig3_HTML.jpg
              Figure 3

              Seasonal activity of C. imicola during 6 years (9 sites). Grey lines present the result of 99 stochastic simulations with RS data and accounting for process and observational errors. 95% confidence intervals are presented in black (RS data model) and in blue (WS data model). Red lines and dots represent respectively mean monthly abundances found in all sites grouped and monthly maximal abundances for each sites.

              Increasing the observational error increased the lack of precision on the estimate of model coefficients (Additionial file 4: Figure S4) and on process error (Figure 4). In particular, the average estimate of the autoregressive cofactor coefficient was most influenced by observation error, and tended towards zero when observation error increases (Additionial file 4: Figure S4). As expected, increasing observational error also affected process error estimates and the link that exists between process error and the autoregressive term (Figure 4).
              http://static-content.springer.com/image/art%3A10.1186%2F1756-3305-5-270/MediaObjects/13071_2011_Article_804_Fig4_HTML.jpg
              Figure 4

              The effect of unknown observational error on process error estimates, variance and the autoregressive cofactor.

              We present two distribution maps of C. imicola predicted abundances for the month of June 2006 (Figure 5). There are no apparent important differences in terms of relative abundances between the predictions made with RS data in comparison to those built with the WS data. Both highlighted that abundances are low at high altitudes, and also allowed differentiation of the different abundance classes within areas of constant elevation. Finally, a comparison of predicted distribution over time highlights the months when the highest activity occurred and the specific areas where this highest activity could happen.
              http://static-content.springer.com/image/art%3A10.1186%2F1756-3305-5-270/MediaObjects/13071_2011_Article_804_Fig5_HTML.jpg
              Figure 5

              Distribution maps of the relative mean abundances of C. imicola found for June 2006. The mean abundances presented on these maps were realized after 250 bootstraps of the dynamic model that account respectively for RS data (A) and WS data (B). These simulations were realized after seeding 2 individuals in each pixel in March in 2005. A mask for altitudes lower than 0 meter was applied.

              Discussion

              As expected (e.g. see [6], but also Table 1), temperature -and its quadratic value (for LST)- influenced C. imicola population abundance. Veronesi et al. showed experimentally that temperature could influence the duration and survival of sub-adult stages in C. imicola [42]. In our models, temperature from WS, when tested as a single predictor, explained 36% of the variability measured in the dataset investigated, which is ~10% higher than MODIS LST variance explanation. This range (from 27 to 36%) is lower than the variability in C. imicola catches explained on the basis of WS temperature and LST in previous studies, where values ranging between 34 to 40% were found [9]. Combining temperature with a first-order autoregressive cofactor increased significantly the explanatory power of the model.

              Whilst the strong effects of temperature and autoregressive cofactors were somewhat anticipated, we were surprised to find that including NDVI did not substantially improve the predictive power of the model. NDVI could indeed be believed to improve spatial predictions by highlighting areas where moisture conditions are more favourable to C. imicola larval stages [4346]. In addition, it was previously a variable improving C. imicola distribution models [20]. One possible explanation is the role played by artificial breeding sites in the direct vicinity of the traps. Indeed, all traps are placed in farms where a variety of artificial breeding sites can be found: mud surrounding local provision of livestock drinking water, local small streams of cleaning water, or even small-scale irrigated pastures. All these could provide C. imicola populations suitable habitats surrounding the trap, even in the absence of surrounding vegetation that could be detected at larger-scale by the remotely sensed NDVI signal. In addition, compared to other studies, NDVI may not be such a limiting factor in Sardinia where catches appear to be very abundant over the entire region. A similar approach developed at the fringe of the C. imicola distribution range may hence highlight a relatively stronger contribution of factors other than temperature in modelling C. imicola population dynamics. The fact that the model overestimated the populations in Tuscany suggests that the model would need further adjustments to account for a different range or value in the environmental conditions than those encountered in Sardinia alone.

              It is obvious that the model simulations had lower predictive power than the initial statistical approach built to find estimates of cofactors (lower correlation coefficient and higher RMSE, Table 3). Indeed the reduction in predictive power can be explained by the fact that the simulations only use a constant initial population at t0 (here 13 individuals in March 2001) to feed the predictions over the entire time series whereas the predictions of the statistical model are estimated with the observed abundances measured at each previous time step. In other words, predictions from the model simulations are not made based on the observed population at the previous time step, but are based on the modelled population at the previous time step, hence the reduction in predictability. However, the comparative advantage is that the simulation demonstrated moderate to good predictability over space and time simply based on the spatio-temporal distribution of the predictors, and do not require field samplings of C. imicola to make the predictions.

              Those simulations succeeded fairly well in reproducing the seasonality of the populations, the maintenance of C. imicola activity during winter, even at very low population levels, and the likely outcome of extinction at high elevation ( > 500 m). Even if the simulations were not able to fully quantify the level of the peaks of maximum abundances, they described very well the increase in population activity that occurs at the beginning of each season (Figure 3). The applied perspective of such a characteristic could be found in the development of a surveillance system that could predict seasonal vector abundances on the basis of the current temperature and could help to alert on periods of high risk of bluetongue disease transmission. The model predicted extinctions at high elevation (Additional file 2: Figure S2 and Additional file 3: Figure S3), and the maintenance of the activity of vectors in these areas would require renewed introductions. A further development of the model could account for external introduction of novel specimens. For example, it could be coupled to broad-scale wind density models such as presented in [47, 48], transport and trade networks [49], or to local-scale leptokurtic models, which describe the decrease in Culicoides spp. abundances as a function of the distance to the farm [50]. Other factors, not accounted for in this study, are likely to influence C. imicola populations. One such factor may be the local density of livestock (horses, cattle, sheep and goats), which provides both hosts for blood feeding females, and breeding sites through the manure. Sardinia hosts approximately 3.9 million sheep and goats, the highest density in all of Italy. Although this number does not show strong seasonal fluctuation, grazing patterns are strongly seasonal in the most elevated part of the island, with sheep flocks free-grazing in the pastures. In contrast, most sheep are grazing in pastures in the direct vicinity of farms in the low-elevation parts of Sardinia. These factors may also influence the spatio-temporal pattern of C. imicola populations, but quantification of these effects is difficult due to a lack of high-resolution data on hosts and grazing patterns. In addition, Onderstepoort-type black-light traps catches do not accurately reflect host-seeking behaviour by biting midges in comparison to host-baited traps catches [51], hence limiting the use of our dataset to test for the local effect of hosts distribution on C. imicola populations abundance.

              Average correlation coefficients between observations and predictions were very similar between the RS model and the WS model (Figure 2). The model with RS predictors has nevertheless the advantage that predictions can be made over all pixels without interpolation of observations such as is needed in the case of weather station data. Furthermore, interpolation tends to produce very continuous surface that do not fully reflect the local heterogeneities in temperatures (Figure 5A and B). This comparative advantage could not be quantified in our study, probably because the observation error in catches could be higher than the difference in predictions due to the differences in the type of temperature data. Another possible explanation is that local temperature at the level of the trap, influenced by local conditions (shelter and shading, local topography) could be as different from the RS data as it is from the WS data. This could only be evaluated thoroughly using local temperature measurements with data loggers. Overall, the distribution that we predicted at the seasonal peak is fairly similar to that observed by previous studies, (e.g. [13, 33]), with low-level population areas located at high altitudes.

              Our modelling approach included at least three sources of uncertainties. The first one appears when an autoregressive cofactor is included and results from the interplay between process and observation error [38]. We tried to take it into account by (1) quantifying the effect of observation error on coefficient estimates, (2) trying to quantify the influence of observation error on process error, and (3) including using a stochastic component in the modelling framework.

              As expected, introducing different levels of observation error influenced both GLM coefficient estimates [39], and the extent of process error (Figure 4). Since we have little prior information on the magnitude of observational error, we decided to run our simulations with a fairly high level of observational error, introduced in the form of an error term sampled from a normal distribution with 0-mean and standard deviation equal to half the variance found in the whole dataset. This introduces a lot of variability in the predictions, and one way to reduce uncertainties could only be gained by experimental quantification of observation error. Other methods to include explicitly observation and process error are available (e.g. state-space models such as in [52] or else, [38] and [39] for implementations using the Bayesian techniques; mixed models, such as highlighted in [53]; hybrid models developed in [54]). Our approach was somewhat simpler, but allows quantification of the impact of observation error to be made under various modelling frameworks, such as for examples BRT [55], GLMM [56] or autologistic models [57].

              Conclusion

              Spatially and temporally explicit models have considerable prospects for modelling data arising from longitudinal entomological surveillance because they allow the incorporation of seasonality explicitly in the model and facilitate interpretation of the results by identifying eco-climatic factors that influence population growth rate in space and time (see also [58]). Once validated, these models could be used to predict population levels on the basis of observed environmental conditions, hence potentially reduce the amount of entomological surveillance. Together with the recent advances in methods for the identification of biting midges [59, 60] and blood meal sources [61, 62], these models should help to strengthen ecological studies on biting midges, a field by far underexplored. A further improvement of these models would be gained by a better quantification and integration of observation error.

              Declarations

              Acknowledgements

              The authors would like to thank the farmers, veterinarians and laboratory technicians involved in the sampling and identification of Culicoides species throughout Italy between 2000 and 2007. We are also grateful to two anonymous referees for the important comments provided on the initial manuscript, which warrants further interesting analytical perspectives. This study was funded by the Belgian Science Policy, research programme for earth observation Stereo II, research project EPISTIS (contract N° SR/00/102).

              Authors’ Affiliations

              (1)
              Biological control and spatial ecology (LUBIES), Université Libre de Bruxelles
              (2)
              INRA, UMR 1202, Biodiversity Genes and Communities
              (3)
              Istituto Zooprofilattico Sperimentale dell’Abruzzo e del Molise ‘G. Caporale’
              (4)
              Avia-GIS
              (5)
              Fonds National de la Recherche Scientifique

              References

              1. Meiswinkel R, Nevill EM, Venter GJ: Vectors: Culicoides spp. Infectious Diseases of Livestock with special reference to Southern Africa. Volume 1. Edited by: Coetzer JAW. 1994, Cape Town: Oxford University Press, 69-89.
              2. Mellor PS, Boorman J: The transmission and geographical spread of African horse sickness and bluetongue viruses. Ann Trop Med Parasitol. 1995, 89: 1-15.PubMed
              3. Mellor PS, Boorman J, Baylis M: Culicoides biting midges: their role as arbovirus vectors. Annu Rev Entomol. 2000, 45: 307-340. 10.1146/annurev.ento.45.1.307.View ArticlePubMed
              4. Venter GJ, Meiswinkel R, Nevill EM, Edwardes M: Culicoides (diptera: ceratopogonidae) associated with livestock in the onderstepoort area, Gauteng, south Africa as determined by light-trap collections. Onderstepoort J Vet. 1996, 63: 315-325.
              5. Venter GJ, Nevill EM, Van der Linde TC: Seasonal abundance and parity of stock-associated Culicoides species (diptera: ceratopogonidae) in different climatic regions in southern Africa in relation to their viral vector potential. Onderstepoort J Vet. 1997, 64: 259-271.
              6. Ortega M-D: Seasonal and geographical distribution of Culicodies imicola kieffer in southwestern Spain. J Am Mosq Control Assoc. 1997, 13: 227-232.PubMed
              7. Rawlings P, Pro M-J, Pena I, Ortega M-D, Capela R: Spatial and seasonal distribution of Culicoides imicola in Iberia in relation to the transmission of African horse sickness virus. Med Vet Entomol. 1997, 11: 49-57. 10.1111/j.1365-2915.1997.tb00289.x.View ArticlePubMed
              8. Rawlings P, Capela R, Pro MJ, Ortega M-D, Pena I, Rubio C, Gasca A, Mellor PS: The relationship between climate and the distribution of Culicoides imicola in Iberia. Arch Virol Suppl. 1998, 14: 95-PubMed
              9. Baylis M, Meiswinkel R, Venter GJ: A preliminary attempt to use climate data and satellite imagery to model the abundance and distribution of Culicoides imicola (Diptera: Ceratopogonidae) in southern Africa. J S Afr Vet Assoc. 1999, 70: 80-89.PubMed
              10. Rawlings P, Meiswinkel R, Labuschange K, Welton N, Baylis M, Mellor PS: The distribution and species characteristics of the Culicoides biting midge fauna of South Africa. Ecol Entomol. 2003, 28: 559-566. 10.1046/j.1365-2311.2003.00545.x.View Article
              11. Mellor PS, Wittmann EJ: Bluetongue virus in the Mediterranean basin 1998–2001. Vet J. 2002, 164: 20-37. 10.1053/tvjl.2002.0713.View ArticlePubMed
              12. De Liberato C, Purse BV, Goffredo M, Scholl F, Scaramozzino P: Geographical and seasonal distribution of the bluetongue virus vector, Culicoides imicola, in central Italy. Med Vet Entomol. 2003, 17: 388-394. 10.1111/j.1365-2915.2003.00456.x.View ArticlePubMed
              13. Goffredo M, Conte AM, Cocciolito R, Meiswinkel R: The distribution and abundance of Culicoides imicola in Italy. Vet Ital. 2003, 39: 22-32.
              14. Miranda MA, Borras D, Rincon C, Alemany A: Presence in the Balearic Islands (Spain) of the midges Culicoides imicola and Culicoides obsoletus group. Med Vet Entomol. 2003, 17: 52-54. 10.1046/j.1365-2915.2003.00405.x.View ArticlePubMed
              15. Goffredo M, Conte A, Meiswinkel R: Distribution and abundance of Culicoides imicola, Obsoletus complex and Pulicaris complex (diptera: ceratopogonidae) in Italy. Vet Ital. 2004, 40: 270-273.PubMed
              16. Goffredo M, Meiswinkel R: Entomological surveillance of bluetongue in Italy: methods of capture, catch analysis and identification of Culicoides biting midges. Vet Ital. 2004, 40: 260-265.PubMed
              17. Torina A, Caracappa S, Mellor PS, Baylis M, Purse BV: Spatial distribution of bluetongue virus and its Culicoides vectors in Sicily. Med Vet Entomol. 2004, 18: 81-89. 10.1111/j.0269-283X.2004.00493.x.View ArticlePubMed
              18. Patakakis MJ: Culicoides imicola in Greece. Vet Ital. 2004, 40: 232-234.PubMed
              19. Patakakis MJ, Papazahariadou M, Wilson A, Mellor PS, Frydas S, Papadopoulos O: Distribution of Culicoides in Greece. J Vector Ecol. 2009, 34: 243-251.View ArticlePubMed
              20. Baylis M, O’Connell L, Purse BV: Modelling the distribution of bluetongue vectors. Vet Ital. 2004, 40: 176-181.PubMed
              21. Baylis M, Bouayoune H, Touti J, Hasnaoui EH: Use of climatic data and satellite imagery to model the abundance of Culicoides imicola, the vector of African horse sickness virus, in morocco. Med Vet Entomol. 1998, 12: 255-266. 10.1046/j.1365-2915.1998.00109.x.View ArticlePubMed
              22. Baylis M, Rawlings P: Modelling the distribution and abundance of Culicoides imicola in Morocco and Iberia using climatic data and satellite imagery. Arch Virol Suppl. 1998, 14: 137-153.PubMed
              23. Wittmann EJ, Mellor PS, Baylis M: Using climate data to map the potential distribution of Culicoides imicola (Diptera: Ceratopogonidae) in Europe. Rev sci tech Off int Epiz. 2001, 20: 731-736.
              24. Baylis M, Mellor PS, Wittmann EJ, Rogers DJ: Prediction of areas around the Mediterranean at risk of bluetongue by modelling the distribution of its vector using satellite imaging. Vet Rec. 2001, 149: 639-643. 10.1136/vr.149.21.639.View ArticlePubMed
              25. Calistri P, Goffredo M, Caporale V, Meiswinkel R: The distribution of Culicoides imicola in Italy: application and evaluation of current mediterranean models based on climate. J Vet Med B. 2003, 50: 132-138.View Article
              26. Tatem AJ, Baylis M, Mellor PS, Purse BV, Capela R, Pena I, Rogers DJ: Prediction of bluetongue vector distribution in Europe and north Africa using satellite imagery. Vet Microbiol. 2003, 97: 13-29. 10.1016/j.vetmic.2003.08.009.View ArticlePubMed
              27. Conte A, Giovannini A, Savini L, Goffredo M, Calistri P, Meiswinkel R: The effect of climate on the presence of Culicoides imicola in Italy. J Vet Med B. 2003, 50: 139-147. 10.1046/j.1439-0450.2003.00632.x.View Article
              28. Purse BV, Tatem AJ, Caracappa S, Rogers DJ, Mellor PS, Baylis M, Torina A: Modelling the distributions of Culicoides bluetongue virus vectors in Sicily in relation to satellite-derived climate variables. Med Vet Entomol. 2004, 18: 90-101. 10.1111/j.0269-283X.2004.00492.x.View ArticlePubMed
              29. Conte A, Ippoliti C, Calistri P, Pelini S, Savini L, Salini R, Goffredo M, Meiswinkel R: Towards the identification of potential infectious sites for bluetongue in Italy: a spatial analysis approach based on the distribution of Culicoides imicola. Vet Ital. 2004, 40: 311-315.PubMed
              30. Conte A, Goffredo M, Ippoliti C, Meiswinkel R: Influence of biotic and abiotic factors on the distribution and abundance of Culicoides imicola and the Obsoletus Complex in Italy. Vet Parasitol. 2007, 150: 333-344. 10.1016/j.vetpar.2007.09.021.View ArticlePubMed
              31. Calvete C, Estrada R, Miranda MA, BorráS D, Calvo JH, Lucientes J: Modelling the distributions and spatial coincidence of bluetongue vectors Culicoides imicola and the Culicoides obsoletus group throughout the Iberian peninsula. Med Vet Entomol. 2008, 22: 124-134. 10.1111/j.1365-2915.2008.00728.x.View ArticlePubMed
              32. Acevedo P, Ruiz-Fons F, Estrada R, Márquez AL, Miranda MA, Gortázar C, Lucientes J: A broad assessment of factors determining Culicoides imicola abundance: modelling the present and forecasting its future in climate change scenarios. PLoS One. 2010, 5: e14236-10.1371/journal.pone.0014236.PubMed CentralView ArticlePubMed
              33. Conte A, Gilbert M, Goffredo M: Eight years of entomological surveillance in Italy show no evidence of Culicoides imicola geographical range expansion. J Appl Ecol. 2009, 46: 1332-1339.
              34. Baylis M, Hasnaoui EH, Bouayoune H, Touti J: The spatial and seasonal distribution of African horse sickness and its potential Culicoides vectors in morocco. Med Vet Entomol. 1997, 11: 203-212. 10.1111/j.1365-2915.1997.tb00397.x.View ArticlePubMed
              35. R Development Core Team: Vienna, Austria. R: a language and environment for statistical computing. 2011
              36. Pebesma EJ: Multivariable geostatistics in S: the gstat package. Comput Geosci. 2004, 30: 683-691. 10.1016/j.cageo.2004.03.012.View Article
              37. Scharlemann JP, Benz D, Hay SI, Purse BV, Tatem AJ, Wint GR, Rogers DJ: Global data for ecology and epidemiology: a novel algorithm for temporal Fourier processing MODIS data. PLoS One. 2008, 3: e1408-10.1371/journal.pone.0001408.PubMed CentralView ArticlePubMed
              38. de Valpine P, Hastings A: Fitting population models incorporating process noise and observation error. Ecol Monogr. 2002, 72: 57-76. 10.1890/0012-9615(2002)072[0057:FPMIPN]2.0.CO;2.View Article
              39. Calder C, Lavine M, Müller P, Clark JS: Incorporating multiple sources of stochasticity into dynamic population models. Ecology. 2003, 84: 1395-1402. 10.1890/0012-9658(2003)084[1395:IMSOSI]2.0.CO;2.View Article
              40. Venables WN, Ripley BD: Modern Applied Statistics with S. 2002, New York: SpringerView Article
              41. Hijmans RJ, van Etten J: Package ‘raster’. 2011,http://​r-forge.​r-project.​org/​projects/​raster,
              42. Veronesi E, Venter GJ, Labuschagne K, Mellor PS, Carpenter S: Life-history parameters of Culicoides (Avaritia) imicola Kieffer in the laboratory at different rearing temperatures. Vet Parasitol. 2009, 163: 370-373. 10.1016/j.vetpar.2009.04.031.View ArticlePubMed
              43. Walker AR, Davies FG: A preliminary survey of the epidemiology of bluetongue in Kenya. J Hyg. 1971, 69: 47-60. 10.1017/S0022172400021239.PubMed CentralView ArticlePubMed
              44. Lubega R, Khamala CPM: Larval habitats of common Culicoides Latreille (Diptera, Ceratopogonidae) in Kenya. B Entomol Res. 1976, 66: 421-425. 10.1017/S0007485300006829.View Article
              45. Walker AR: Seasonal Fluctuations of Culicoides Species (Diptera: Ceratopogonidae) in Kenya. B Entomol Res. 1977, 67: 217-233. 10.1017/S0007485300011032.View Article
              46. Braverman Y, Galun R, Ziv M: Breeding sites of some Culicoides species (Diptera, Ceratopogonidae) in Israel. Mosq News. 1974, 34: 303-308.
              47. Hendrickx G, Gilbert M, Staubach C, Elbers A, Mintiens K, Gerbier G, Ducheyne E: A wind density model to quantify the airborne spread of Culicoides species during north-western Europe bluetongue epidemic. Prev Vet Med. 2008, 87: 162-181. 10.1016/j.prevetmed.2008.06.009.View ArticlePubMed
              48. Sanders CJ, Shortall CR, Gubbins S, Burgin L, Gloster J, Harrington R, Reynolds DR, Mellor PS, Carpenter S: Influence of season and meteorological parameters on flight activity of Culicoides biting midges. J Appl Ecol. 2011, 48: 1355-1364. 10.1111/j.1365-2664.2011.02051.x.View Article
              49. Napp S, García-Bocanegra L, Pagès N, Allepuz A, Alba A, Casal J: Assessment of the risk of a bluetongue outbreak in Europe caused by Culicoides midges introduced through intracontinental transport and trade networks. Med Vet Entomol. 2012, 10.1111/j.1365-2915.2012.01016.x.
              50. Rigot T, Vercauteren Drubbel M, Delécolle J-C, Gilbert M: Farms, pastures and woodlands: the fine-scale distribution of Palearctic Culicoides Spp. Biting midges along an agro-ecological gradient. Med Vet Entomol. 2012, 10.1111/j.1365-2915.2012.01032.x.
              51. Viennet E, Garros C, Lancelot R, Allène X, Gardès L, Rakotoarivony I, Crochet D, Delécolle J-C, Moulia C, Baldet T, Balenghien T: Assessment of vector/host contact: comparison of animal-baited traps and UV-light/suction trap for collecting Culicoides biting midges (diptera: ceratopogonidae). vectors of orbiviruses. Parasite Vector. 2011, 4: 119-10.1186/1756-3305-4-119.View Article
              52. Holmes EE, Ward EJ, Wills K: MARSS: Multivariate autoregressive state-space models for analyzing time-series data. R Journal. 2012, 4: 11-19.
              53. Fitzpatrick MC, Preisser EL, Porter A, Elkinton J, Ellison AM: Modeling range dynamics in heterogeneous landscapes: invasion of the hemlock woolly adelgid in eastern North America. Ecol Appl. 2011, 22: 472-486.View Article
              54. Gallien L, Münkemüller T, Albert CH, Boulangeat I, Thuiller W: Predicting potential distributions of invasive species: where to go from here?. Divers Distrib. 2010, 16: 331-342. 10.1111/j.1472-4642.2010.00652.x.View Article
              55. Elith J, Leathwick JR, Hastie T: A working guide to boosted regression trees. J Anim Ecol. 2008, 77: 802-813. 10.1111/j.1365-2656.2008.01390.x.View ArticlePubMed
              56. Zuur AF, Ieno EN, Walker NJ, Saveliev AA, Smith GM: Mixed effects models and extensions in ecology R. 2009, New York: Springer VerlagView Article
              57. Dormann CF, McPherson JM, Araújo MB, Bivand R, Bolliger J, Carl G, Davies RG: Methods to account for spatial autocorrelation in the analysis of species distributional data: a review. Ecography. 2007, 30: 609-628. 10.1111/j.2007.0906-7590.05171.x.View Article
              58. Searle KR, Blackwell A, Falconer D, Sullivan M, Butler A, Purse BV: Identifying environmental drivers of insect phenology across space and time: Culicoides in Scotland as a case study. B Entomol Res. 2012, 10.1017/S0007485312000466.
              59. Kaufmann C, Steinmann IC, Hegglin D, Schaffner F, Mathis A: Spatio-temporal occurrence of Culicoides biting midges in the climatic regions of Switzerland, along with large scale species identification by MALDI-TOF mass spectrometry. Parasite Vector. 2012, 5: 246-10.1186/1756-3305-5-246.View Article
              60. Lehmann K, Werner D, Hoffmann B, Kampen H: PCR identification of Culicoid biting midges (Diptera, Ceratopogonidae) of the Obsoletus complex including putative vectors of bluetongue and Schmallenberg viruses. Parasite Vector. 2012, 5: 213-10.1186/1756-3305-5-213.View Article
              61. Lassen SB, Nielsen SA, Kristensen M: Identity and diversity of Blood meal hosts of biting midges (Diptera: Ceratopogonidae: Culicoides Latreille) in Denmark. Parasite Vector. 2012, 5: 143-10.1186/1756-3305-5-143.View Article
              62. Martínez-de la Puente J, Martínez J, Ferraguti M, Morales-de la Nuez A, Castro N, Figuerola J: Genetic characterization and molecular identification of the bloodmeal sources of the potential bluetongue vector Culicoides obsoletus in the Canary islands, Spain. Parasite Vector. 2012, 5: 147-10.1186/1756-3305-5-147.View Article

              Copyright

              © Rigot et al.; licensee BioMed Central Ltd. 2012

              This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://​creativecommons.​org/​licenses/​by/​2.​0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

              Advertisement