Skip to main content

Modeling the distribution of the West Nile and Rift Valley Fever vector Culex pipiens in arid and semi-arid regions of the Middle East and North Africa



The Middle East North Africa (MENA) region is under continuous threat of the re-emergence of West Nile virus (WNV) and Rift Valley Fever virus (RVF), two pathogens transmitted by the vector species Culex pipiens. Predicting areas at high risk for disease transmission requires an accurate model of vector distribution, however, most Cx. pipiens distribution modeling has been confined to temperate, forested habitats. Modeling species distributions across a heterogeneous landscape structure requires a flexible modeling method to capture variation in mosquito response to predictors as well as occurrence data points taken from a sufficient range of habitat types.


We used presence-only data from Egypt and Lebanon to model the population distribution of Cx. pipiens across a portion of the MENA that also encompasses Jordan, Syria, and Israel. Models were created with a set of environmental predictors including bioclimatic data, human population density, hydrological data, and vegetation indices, and built using maximum entropy (Maxent) and boosted regression tree (BRT) methods. Models were created with and without the inclusion of human population density.


Predictions of Maxent and BRT models were strongly correlated in habitats with high probability of occurrence (Pearson’s r = 0.774, r = 0.734), and more moderately correlated when predicting into regions that exceeded the range of the training data (r = 0.666,r = 0.558). All models agreed in predicting high probability of occupancy around major urban areas, along the banks of the Nile, the valleys of Israel, Lebanon, and Jordan, and southwestern Saudi Arabia. The most powerful predictors of Cx. pipiens habitat were human population density (60.6% Maxent models, 34.9% BRT models) and the seasonality of the enhanced vegetation index (EVI) (44.7% Maxent, 16.3% BRT). Maxent models tended to be dominated by a single predictor. Areas of high probability corresponded with sites of independent surveys or previous disease outbreaks.


Cx. pipiens occurrence was positively associated with areas of high human population density and consistent vegetation cover, but was not significantly driven by temperature and rainfall, suggesting human-induced habitat change such as irrigation and urban infrastructure has a greater influence on vector distribution in this region than in temperate zones.


With rapidly expanding urban populations, recent refugee movements, internal displacement, and widespread civil strife, parts of the Middle East are becoming increasingly vulnerable to vector-borne diseases (VBDs). Urban growth, in particular, has expanded faster than the supply of available housing and supporting infrastructure, resulting in a large proportion of the population living in unsanitary conditions, which may increase human exposure to bites of infected vectors that proliferate in urban environments [14].

Prior to the recent period of civil conflict, the region has experienced the emergence and resurgences of two deadly diseases. Rift Valley Fever Virus caused an epidemic in Egypt in 1977, 1993 and 2003, and hit Saudi Arabia and Yemen in 2000 and 2001 [57]. In addition, West Nile virus, a nearly “forgotten” disease re-emerged in a severe country-wide epidemic in Israel in 2000, with a fatality rate of 8.4% [8]. Between 2010 and 2012, 200 West Nile cases were reported in Israel [9]. While the causes for the recent resurgence of neglected tropical diseases like West Nile Virus, Rift Valley Fever Virus, and Leishmaniasis are not fully understood, it is cause for renewed interest and increased attention to disease dynamics in the region [2, 10, 11].

Molecular and epidemiological studies have shown that Cx. pipiens is a vector of both Rift Valley Fever virus and West Nile Virus [6, 1216]. It has a preference for living near areas of high human population density, and breeding in the artificial containers abundant in close proximity to human settlements [17, 18] or polluted pools of water associated with human activities [1922]. The species also feeds opportunistically from a wide variety of blood hosts [23, 24], and is active nearly year round [21]. This vast ecological plasticity makes it a potentially significant vector. As such, understanding the distribution of Cx. pipiens throughout the region is a fundamental requirement to understanding transmission dynamics.

Habitat suitability models for Cx. pipiens in general have identified rainfall, temperature, and vegetation to be major drivers of population distribution [25, 26]. Density of hosts has also been found to be significantly correlated with habitat suitability in related species [27]. However, these studies were largely conducted in different climatic regions than those dominant in the MENA region, and therefore the identified predictors would not necessarily explain vector distributions in arid and semi-arid regions. Environmental predictors will be most informative at scales and across regions where they exhibit sufficient heterogeneity to accurately discern suitable from unsuitable habitat. The relative information values of predictors shift as underlying climatic conditions change, and so does the nature and identity of the most significant vector-habitat relationship in the model. In Egypt, climatic variables such as ambient temperature, relative humidity, and wind speed did not significantly distinguish sites with high and low filariasis transmission vectored by Cx. pipiens, which was positively correlated with temperature and negatively correlated with rainfall [28]. In Australia, the critical climate factors for predicting outbreaks of Ross River virus vary significantly between different environmental regions within a single state [29]. Simulation models of Cx. quinquefasciatus across the southern United States revealed significantly different sensitivities of mosquito populations to temperature and precipitation in arid and humid habitats [30].

The aim of the present work is to investigate the relationship between the distribution of Cx. pipiens and the environment in the Middle East and subsequently derive vector distribution surfaces. Accordingly, we used data collected from a wide range of habitats in the region and two different but robust presence-only species distribution models (SDMs), Maxent and boosted regression trees (BRT). The models examined the role of human populations, climatic, hydrological, and vegetation parameters in predicting habitat suitability.


Selecting modeling approaches

In order to strengthen confidence in predicting Cx. pipiens distributions, we incorporated two modeling approaches, Maximum entropy implemented in Maxent software, and boosted regression trees. Regression models are powerful tools for selecting relevant predictors and modeling complex interactions, while boosting avoids misclassification problems inherent in a single tree model. Thus our combined models would enable powerful selection of relevant predictors and accurate modeling of complex interactions. The previously discussed non-uniform relationships between other Culex species and critical climate factors [30] suggest such interactions may possibly contribute significantly to Cx. pipiens distributions as well. Thus, the use of model intercomparisons leads to greater confidence in the predictions of the distribution. Using Maxent as our alternative method allows for comparisons with the predictions of one of the most familiar and frequently applied presence only modeling techniques [3134].

Modeling species distribution using maximum entropy

Maximum entropy is a machine learning algorithm that produces predictions of habitat suitability by comparing the conditional density of predictors at presence sites with the marginal density of predictors across the study area. The raw output of Maxent is a probability of habitat suitability [35, 36]. To make model output more interpretable, Maxent converts the exponential values of its raw output estimate of habitat suitability into a logistic output that represents an estimate of probability of presence [37].

Modeling species distribution using boosted regression trees

Boosted regression trees predict the value of a target variable based on the value of several input variables. Boosting uses a machine learning algorithm to produce a final prediction model that is an ensemble of weak prediction models, in this case, an ensemble of regression trees. The model is built in a stage wise fashion; a regression tree is fit to the original data, then the residuals of that model become the new data values, to which a second tree is fit, and so forth. By fitting each subsequent tree in the model to the residuals of the previous tree, the data become re-weighted in each iteration. Points that were misclassified by the previous model will now have more weight than values that were classified correctly. As a result, the subsequent tree will focus on fitting these misclassified points. The process is conditioned by the learning rate, which controls the contribution of each tree to the final model.

The final model can consist of thousands of trees, however, over-fit models will exaggerate minor fluctuations in the data, making them poor predictors. Boosted regression uses cross validation to minimize over-fitting by determining when adding additional trees no longer improves predictive performance, and selecting that optimum number of trees.

Predictive error is measured as the Bernoulli residual deviance between the predicted values of the model and the observed values of the test data. All BRT models are fitted in R using the ‘gbm’ and ‘dismo’ libraries [3840].

Target species and occurrence data

Our study area of interest (Figure 1) focuses on an approximately 5 million square kilometer region of the MENA that encompasses Egypt, Israel, Lebanon, Syria, Jordan as well as substantial portions of Turkey, Saudi Arabia, and Iraq.

Figure 1
figure 1

Sampling locations of Culex pipiens .

The Cx. pipiens complex are the most widely distributed mosquito species in the world and two biotypes of Cx. pipiens L. are present in our study area, the anautogenous and autogenous biotypes. These mosquito biotypes breed in overlapping niches and readily hybridize in areas where they coexist [41]. In Egypt, both autogenous and anautogenous Cx. pipiens individuals were encountered in the progeny of autogenous or anautogenous female parents and more than 75% of field caught females produced mixed progenies [42].

Cx. pipiens is the most abundant mosquito in Lebanon, collected both indoors and outdoors. Active and abundant year round, it is anthropophilic, endophagic, and endophillic [21]. The species is highly behaviorally plastic, while Cx. pipiens in the Al Sharqyia governorate of Egypt’s Nile delta also displays endophagic and primarily anthropophiliic behavior [24], mosquitoes in the neighboring governorate have been observed to switch from indoor to outdoor feeding with seasonal shifts in temperature and wind speed [43]. Females will feed from a diverse range of hosts, including horses, cows, sheep, dogs, cats, humans and rats [23].

Cx. pipiens breeds in water with high organic content [41]. In Lebanon breeding sites for Cx. pipiens include containers which range in size from small cans filled with water to garden pools and irrigation ditches [21]. In the urban habitat of Cairo, the common breeding habitats for Cx. pipiens are cesspits, drainage canals, springs, cesspools, and irrigation ditches [22].

Training data from both Egypt and Lebanon were included to maximize sampling of the species’ environmental range, from the hot and hyperarid to cool and subhumid. This range of samples maximizes the combinations of environmental factors used in calibrating the models, increasing their predictive power [44, 45].

Occurrence records for Cx. pipiens were obtained from previously unpublished vector surveys in Egypt and Lebanon (Figure 1). Adult samples from Lebanon were collected in the summer of 2009. Methods included CDC light traps with a dry ice attractant, and BG-sentinel traps with pheromone attractant. Traps were placed in outdoor and indoor habitats at 5 pm and collected the following morning. In addition to adult sampling, larval stages of mosquitoes were sampled in the summers of 2002 and 2003. Samples were collected by the classical dipping method using a white tray and/or a ladle from both artificial and natural breeding sites including swamps, ponds, riverbanks, irrigation tanks, and water storage tanks. Coordinates for sampling locations were obtained from Google Earth and GPS. Adult specimens were pinned and 4th instar larval specimens were mounted on slides and identified morphologically using local and regional identification keys [46]. Occurrence records from Egypt (Gad, unpublished data) were collected using similar methods as part of a nation-wide survey.

Environmental layers

We considered twenty-four environmental variables as potential predictors of Cx. pipiens habitat (Table 1). Temperature and precipitation were represented by 19 variables of the WorldClim dataset [47]. Quality of vegetative cover was described by the standard deviation, mean, and maximum values of the enhanced vegetation index (EVI), as derived from Moderate Resolution Image Spectroradiometer (MODIS) imagery from the Terra Satellite for 2001. The EVI provides a measure of photosynthetic activity or landscape greenness and hence captures vegetation features such as leaf area, canopy cover, sugar resources that may provide resting sites and alternate food sources for this vector species [4850]. As available soil water limits primary productivity in the study region, mean and maximum EVI are generally correlated with annual rainfall, whereas the standard deviation of EVI provides a measure of vegetation seasonality, which is controlled by a variety of factors including temperature, day length, insolation, irrigation, and biotic factors [51]. An additional covariate related to potential vector breeding habitat was derived from topographic data, the topographical wetness index (TWI), which describes the tendency of water to collect in areas of topographic minima [52]. Because information regarding the distribution of hosts greatly improves models of mosquito distributions [53] and Cx. pipiens is adapted to breeding and feeding in human-altered landscapes, we also included human population density as a predictor using data from Landscan™ 2010. Environmental layers were gridded to a spatial resolution of approximately 1 km (926.63 m), a scale that captured as much environmental heterogeneity as possible within the limits of computer processing capability.

Table 1 Data sources of environmental predictors used in species distribution models

Data processing

Presence Data: All point locations recorded as positive for Cx. pipiens from Egypt (n = 239) and Lebanon (n = 83) surveys were included as presence locations in all models (Figure 1).

Background Data: Both boosted regression and maximum entropy methods predict areas with a high probability of presence of conditions suitable for the target species by comparing the values of environmental predictors at presence locations to the values of the same predictors at a set of random “background” locations, which represent the range of available environmental values [54]. Because our data, like many presence-only data sets [55, 56], showed a strong bias in sampling effort, we selected background points with the same geographical bias as the occurrence data (Additional file 1).

An independent data set of 79 Cx. pipiens locations, 23 from Israel and 56 from Egypt [57], was used to assess accuracy of model predictions (Additional file 2: Figure S1). These data were used in all Maxent test data calculations.

Assessing the contribution of human population density to Cx. pipiens distribution models

Human population density has the potential to be a strong predictor of Cx. pipiens habitat [53, 58]. However, because the survey locations of the original studies were all in close proximity to populated areas, population density is also a strong predictor of sampling effort. To control for this we ran the entire modeling process twice; once on the complete set of environmental variables (N = 24), and once using all environmental variables except population density (N = 23). Including population density allows us to examine the relative influence of an important socioeconomic environmental factor, while excluding human populations allows us to examine the bio-physical environmental factors on their own.

To increase the interpretability of the models both predictor sets were reduced using the ‘gbm.simplify’ function, which creates BRT models with every possible combination of the initial predictors, and based on minimizing predictive error, ranks predictors from most to least influential, and identifies the optimal predictor set [59].

Creating species distribution models

BRT models using the reduced predictor sets were fitted using a learning rate of 0.05, a tree complexity of 5, a bag fraction of 0.75, 10-fold cross validation, and a Bernoulli loss function. All other parameters were left at default settings. Predictions in geographic space were made in R using the “raster” package [60].

Maxent models using the reduced predictor sets were fit using Maxent v 3.3.3 k [61]. Spatial predictions of the model were "clamped", a Maxent feature which reduces the value of the predictions when extrapolating into areas of parameter space that exceeded the range of values present in the training data. The final model was the average logistic output of 10 repetitions of the modeling process. Additional specifics on program specifications can be found in Additional file 1.

Evaluating and comparing model predictions

Maxent models were evaluated using the area under the receiver operating characteristic curve (AUC), the corrected Aikaike information criterion (AICc), and the omission rate. BRT models were evaluated using AUC, point biserial correlation (COR), and the Bernoulli deviance. Because the two techniques use different test points for the intrinsic measures of model performance, the point biserial correlation was also calculated for all models at a common set of 158 points made of the 79 test presence locations and 79 random background points (Additional file 2: Figure S1). Pearson’s correlation coefficient was used to measure how closely predictions agreed between the two methods. Agreement was compared in the “training region”, using the common 158 points, as well as an “expanded region” of 158 points drawn from the entire model extent (Additional file 2: Figure S1).


Cx. pipiens habitat suitability

The predicted maps of suitable Cx. pipiens habitat are presented in Figures 2, 3, 4 and 5. Overall, maximum entropy models predicted a broader distribution of suitable habitat than boosted regression methods, which produced much more conservative predictions extent of suitable habitat and a lower probability of presence. However, relative suitability of habitats was consistent between models: areas of highest suitability identified by BRT were also the areas of highest suitability selected by maximum entropy.

Figure 2
figure 2

Maxent Pop. Probability of Culex pipiens presence based on a species distribution model generated using maximum entropy and a set of environmental predictors (N = 9) including human population density. Habitat suitability has been converted to probability of presence, assuming a prevalence of 0.5. Bottom: Enlarged to show population centers.

Figure 3
figure 3

Maxent No Pop. Probability of Culex pipiens presence based on a species distribution model generated using maximum entropy and a set of environmental predictors (N = 14) that do not include human population density. Habitat suitability has been converted to probability of presence, assuming a prevalence of 0.5. Bottom: Enlarged to show population centers.

Figure 4
figure 4

BRT Pop. Probability of Culex pipiens presence based on a species distribution model generated using boosted regression trees and a set of environmental predictors (N = 9) that includes include human population density. Bottom: Enlarged to show population centers.

Figure 5
figure 5

BRT No Pop: Probability of Culex pipiens presence based on a species distribution model generated using boosted regression trees and a set of environmental predictors (N = 14) that does not include human population density. Bottom: Enlarged to show population centers.

All four models agree in predicting a moderate to high probability of Cx. pipiens presence in several distinct habitats: along the banks of the Nile and throughout the Nile Delta, throughout the coastal plains of Israel, Lebanon, and Syria, as well as the valleys of east Lebanon and along Israel’s Jordanian border (Figures 2, 3, 4, and 5). In models built using human population data (Figures 2 and 4), habitats with the highest probability of presence were found near large population centers. Both BRTPop and MaxentPop models also indicate moderate probability of presence along the banks of the Tigris and Euphrates rivers. Models built without population data (Figures 3 and 5) predicted greater probability of presence in the semi-arid Central Anatolia region of Turkey, the Jazirah plain in North East Syria and Northern Iraq, and the marshes of southern Iraq than the population-inclusive models. However, much of Iraq and Eastern Turkey occupy a region of environmental space that falls outside the range of the training data (Additional file 2: Figures S3 and S4), so predictions in these areas should be interpreted cautiously.

Model accuracy

Evaluation metrics for Maxent and BRT models are presented in Tables 2 and 3. Among Maxent models, both sets of predictors performed well, with AUC values significantly greater than the null model. The test AUC values for both predictor sets were smaller than the training AUC values, which indicates slight over-fitting of the models. This difference was smaller for the model created without population data. The “NoPop” model also had a higher test AUC value, greater entropy, and a smaller omission rate than the model that included population. The population-inclusive model had a lower AICc than the “NoPop” model.

Table 2 Evaluation parameters for species distribution models built using human population data
Table 3 Evaluation parameters for species distribution models excluding human population data

The boosted regression models also performed well, and also displayed a degree of over-fitting similar to the Maxent models. Among the BRT models, population-inclusive models performed slightly better than the “NoPop” models on test data- with higher cross-validated AUC values, higher cross-validated COR values, and lower cross validated deviance scores.

When evaluated over a standard set of points, the Maxent “NoPop” model had the strongest correlation (COR = 0.534) between model predictions and presence/pseudo-absence, the BRT NoPop and Maxent population-inclusive model performed equally well, and the BRT population-inclusive model had the weakest correlation (COR = 0.413) of the four models evaluated.

Model agreement

When evaluated over the training region, the predictions of Maxent and BRT models were very strongly correlated, with models including population data more closely correlated than models which excluded it (Figure 6). When evaluated over the entire modeled region, the correlation between models was weaker.

Figure 6
figure 6

Correlation between predictions of Culex pipiens distribution models created using boosted regression trees (BRT) and maximum entropy (Maxent) algorithms. “Training region” consists of 158 points taken from within a similar geographical area to the samples used building the model, 79 background points and 78 independent occurrence points. “Expanded Region” measures model agreement throughout the entire modeled extent, and compares values at 158 random points taken with equal probability from the entire model extent. Left panels describe models built using human population density as a parameter (N = 9). Panels on the right describe models excluding human population density (N = 14). COR = Pearson correlation coefficient.

Contribution of environmental predictors to Cx. pipiens distribution models

Parameter reduction of the full predictor set (n = 24) resulted in a simplified set of 9 significant predictors. (Table 4). Parameter reduction of the full predictor set excluding human population density (n = 23) resulted in a simplified set of 14 significant predictors (Table 5).

Table 4 Contribution of environmental parameters to SDMs including human population data
Table 5 Contribution of environmental parameters to SDMs excluding human population data

Both modeling techniques selected similar variables as the most important environmental factors driving species distribution. Population was the largest single contributor to model predictions, ranked as most influential under both BRT (34.9%) and Maxent (60.6%).

In models created without human population density, the variable with the single largest contribution was the standard deviation of the enhanced vegetation index (EVI). Mean EVI was also highly ranked in all models and all data sets.


Model agreement and predictions

Our models showed strong agreement between boosted regression and maximum entropy methods in selection of habitat with the highest probability of occurrence of Cx. pipiens. High AUC values for all four models indicated that occupancy sites were highly likely to be assigned a higher probability of presence than background sites regardless of method. The strong correlation between BRT and Maxent outputs in areas with a high proportion of known Cx. pipiens habitat indicates that the models strongly agree on areas of potentially high risk. These areas included populated areas within the Nile delta, the valleys of Israel, Lebanon, and Jordan, and southwestern Saudi Arabia.

Our model predictions are supported by agreement between predicted areas of highest probability of Cx. pipiens occurrence, and recent positive vector surveys, or outbreaks of vector-associated diseases. The regions of central Israel, predicted by our models to be areas of high probability of Cx. pipiens presence, corresponded to the areas of highest incidence of West Nile virus during the 2000 outbreak [8]. Likewise, the region of highest probability of occupancy in Saudi Arabia corresponds with recent collections of Cx. pipiens[6264] and the incidence of infection during the 2000 Rift Valley Fever virus outbreak [65]. The distribution of suitable Cx. pipiens habitat in Saudi Arabia is of special interest because of the high potential for repeated import of diseases into the country via large numbers of pilgrims travelling through for the annual Hajj [66]. The city of Jeddah has twice been hit by epidemics of dengue directly following the Hajj [67]. Our results indicate that this same area is a suitable habitat for supporting Cx. pipiens, and potentially capable of transmitting West Nile virus, should it be re-introduced.

Agreement between model predictions was less correlated when evaluated over the full region, including areas where models were required to extrapolate. This is a result of the different behavior of each algorithm in extreme environments [68]. Maxent, when clamped, extrapolates in a horizontal line from the fit at the most extreme value in the training data. BRT, which does not use clamping, extrapolates at a constant value from the last known site. This difference in extrapolation can also be seen in the spatial predictions of the population-exclusive models when predicting into the most dissimilar areas of the modeling region. BRT assigns a higher relative probability of occupancy to habitat in northern Iraq and southern Syria, classifying the area with a probability of occupancy similar to that of the river valley along the Israel-Jordan border. The Maxent model, which constrains its predictions more severely, predicts a much more moderate probability of occupancy. The same relative over-estimation of the BRT algorithm is not as apparent in the population-inclusive model, because the environmental predictor that most exceeds its training values in this region is temperature seasonality, which was not as influential a predictor in the population-inclusive models as it was in the population-exclusive models. The effect of clamping can still be seen in the population-inclusive Maxent model. An advantage of using two different modeling methods is our ability to detect regions of parameter space where choice of the underlying modeling algorithm has the greatest influence on strength of predictions. Results in these areas need to be interpreted more cautiously than areas where both models are in agreement.

Presence-only modeling methods, such as those used in this study, make the assumption that a target species is equally detectable across all habitats [35]. However, if sampling detection probability varies with one or more environmental predictors, our model will not distinguish between habitat with a higher probability of occupancy and a habitat with greater detectability. Absolute values of the predictions should be interpreted with caution with this limitation in mind. Predictions should also be evaluated with the consideration that presence-only models treat densely and sparsely occupied sites the same, as the input data are binary.

In order to efficiently predict species distributions across the study area, we were required to coarsen the resolution of environmental predictors to 1 km. It is important to keep in mind that this may be a larger than optimal scale for examining the relationship between vectors and certain predictors with very fine scale heterogeneity [25, 69].

Relationship between Cx. pipiens and environmental predictors

Our study is unique in its aim of identifying the underlying environmental predictors driving Cx. pipiens distribution across a climate gradient that encompasses hyper arid to sub humid habitats. The novel aspects of our findings are the importance of human population density and seasonality of vegetation indices as powerful predictors of Cx. pipiens occurrence.

Human population density was identified by both modeling methods as the highest contributing predictor to habitat suitability. Interpreting this result is complex, because areas of dense human habitation may be favorable to mosquitoes for several reasons. Humans are a host species, their dwellings also provide a sheltered resting area, and in rural areas human activities such as irrigation and agriculture may also provide favorable breeding habitat and non-human hosts. Breeding conditions for mosquitoes are also favorable in slums, where high human population density is combined with poor infrastructure and inadequate services [70]. Characterization of mosquito breeding habitat in urban Cairo found that 93.5% of breeding sites were found in slum areas, characterized by incomplete construction, disorganized roads, and crowded, dense conditions [71]. Socioeconomic conditions were also significantly related to Cx. pipiens breeding sites in Washington DC, although in the northern temperate region the relationship was reversed, breeding sites were positively associated with presence of functional containers, like flower pots and garbage pails, which were found primarily in areas of higher socioeconomic status [27].

Our results indicate that quality and seasonality of vegetation is a powerful predictor of Cx. pipiens in arid and semi-arid habitats. The relationship between Cx. pipiens occurrence and vegetation quality was strongly positively associated with the maximum value of the enhanced vegetation index (EVI), and negatively associated with the mean and standard deviation (seasonality) of the EVI. This suggests that in our study region, high quality Cx. pipiens habitat consists of areas of high primary productivity, which maintain that quality of habitat with very little change throughout the year. In rural areas of Egypt, year round irrigation maintains precisely this kind of habitat. Vegetation indices are not always the most informative predictors in the region, in a study examining another Cx. pipiens vectored infection, filariasis, the Normalized Difference Vegetation Index did not distinguish between Egyptian villages at high and low risk of infection [72]. The positive relationship between Cx. pipiens and stable areas of high primary productivity suggested by our results is the opposite of the relationship seen in the forested northeastern United States, where the most accurate model of Cx. pipiens abundance was negatively correlated with forest cover and did not include the vegetative index at all [25].

Topographic wetness index has been a significant predictor of anopheline abundance and malaria risk among households in Thailand [73] and Kenya [74, 75]. Despite this, in our models it was a significant predictor only in the BRT "NoPop" model. It is possible that in arid climates with low humidity and little rainfall, the potential for pooling water, as predicted by TWI, does not translate into pools of water with enough frequency to make TWI a stronger indicator of vector breeding habitat. It is also possible that, even if TWI does accurately predict the distribution of pools of clear groundwater in the region, those pools of clear groundwater may not be the breeding habitat most favored by Cx. pipiens. In habitats where mosquitos chose to oviposit more frequently in artificial or natural containers, rather than ground pools, this will not be reflected in the TWI. It is also notable that none of our models found rainfall to be a very powerful predictor. In this context, the vegetation indices may provide better information on available soil moisture than hydrology or rainfall parameters.


Our study provides insight into the drivers of Cx. pipiens distribution in an understudied region at growing risk from the re-emergence of several arboviruses. The most dominant predictors in our species distribution model, human population density and the seasonality of the enhanced vegetation index, are also both environmental factors with potential correlations with urbanization and agricultural practices. By understanding the relationship between these predictors and disease vector distributions, we gain a better understanding of how changes in land use or shifting patterns of human settlement might influence disease transmission. Given the resurgence of vector borne diseases like West Nile and Rift Valley Fever [10] understanding how choices may influence disease ecology through agriculture, urbanization, and population growth, is a topic of vital importance.



Species distribution model


Boosted regression tree


Middle East and North Africa


Moderate Resolution Image Spectroradiometer


Enhanced Vegetative Index


Topographic Wetness Index


Vector borne diseases.


  1. Cohen B: Urban growth in developing countries: a review of current trends and a caution regarding existing forecasts. World Dev. 2004, 32: 23-51. 10.1016/j.worlddev.2003.04.008.

    Article  Google Scholar 

  2. Hotez PJ, Savioli L, Fenwick A: Neglected tropical diseases of the Middle East and North Africa: review of their prevalence, distribution, and opportunities for control. PLoS Negl Trop Dis. 2012, 6: e1475. 10.1371/journal.pntd.0001475.

    Article  PubMed Central  PubMed  Google Scholar 

  3. Roudi FN: Population Trends and Challenges in the Middle East and North Africa. 2001, Washington, DC: Population Reference Bureau

    Google Scholar 

  4. Reiter P: Climate change and mosquito-borne disease. Environ Health Persp. 2001, 109 (Suppl 1): 141-161.

    Article  Google Scholar 

  5. Meegan JM: The Rift Valley fever epizootic in Egypt 1977-78. 1. Description of the epizzotic and virological studies. Trans R Soc Trop Med Hyg. 1979, 73: 618-623. 10.1016/0035-9203(79)90004-X.

    Article  CAS  PubMed  Google Scholar 

  6. Hoogstraal H, Meegan JM, Khalil GM, Adham FK: The Rift Valley fever epizootic in Egypt 1977-78. 2. Ecological and entomological studies. Trans R Soc Trop Med Hyg. 1979, 73: 624-629. 10.1016/0035-9203(79)90005-1.

    Article  CAS  PubMed  Google Scholar 

  7. Shoemaker T, Boulianne C, Vincent MJ, Pezzanite L, Al-Qahtani MM, Al-Mazrou Y, Khan AS, Rollin PE, Swanepoel R, Ksiazek TG, Nichol ST: Genetic analysis of viruses associated with emergence of Rift Valley fever in Saudi Arabia and Yemen, 2000-01. Emerg Infect Dis. 2002, 8: 1415-1420. 10.3201/eid0812.020195.

    Article  PubMed Central  PubMed  Google Scholar 

  8. Weinberger M, Pitlik SD, Gandacu D, Lang R, Nassar F, David DB, Rubinstein E, Izthaki A, Mishal J, Kitzes R: West Nile fever outbreak, Israel, 2000: epidemiologic aspects. Emerg Infect Dis. 2001, 7: 686. 10.3201/eid0704.017416.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  9. Paz S, Semenza JC: Environmental drivers of West Nile fever epidemiology in Europe and Western Asia–a review. IJERPH. 2013, 10: 3543-3562. 10.3390/ijerph10083543.

    Article  PubMed Central  PubMed  Google Scholar 

  10. Harrus S, Baneth G: Drivers for the emergence and re-emergence of vector-borne protozoal and bacterial diseases. Int J Parasitol. 2005, 35: 1309-1318. 10.1016/j.ijpara.2005.06.005.

    Article  CAS  PubMed  Google Scholar 

  11. Cosner C, Beier JC, Cantrell RS, Impoinvil D, Kapitanski L, Potts MD, Troyo A, Ruan S: The effects of human movement on the persistence of vector-borne diseases. J Theor Biol. 2009, 258: 550-560. 10.1016/j.jtbi.2009.02.016.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  12. Apperson CS, Hassan HK, Harrison BA, Savage HM, Aspen SE, Farajollahi A, Crans W, Daniels TJ, Falco RC, Benedict M, Anderson M, McMillen L, Unnasch TR: Host feeding patterns of established and potential mosquito vectors of West Nile virus in the eastern United States. Vector-Borne Zoonot Dis. 2004, 4: 71-82. 10.1089/153036604773083013.

    Article  Google Scholar 

  13. Taylor B, Work TH, Hurlbut HS, Rizk F: A study of the ecology of West Nile virus in Egypt. Am J Trop Med Hyg. 1956, 5: 579-620.

    PubMed  Google Scholar 

  14. Turell MJ, Presley SM, Gad AM, Cope SE, Dohm DJ, Morrill JC, Arthur RR: Vector competence of Egyptian mosquitoes for Rift Valley fever virus. Am J Trop Med Hyg. 1996, 54: 136-139.

    CAS  PubMed  Google Scholar 

  15. Kilpatrick AM, Kramer LD, Campbell SR, Alleyne EO, Dobson AP, Daszak P: West Nile virus risk assessment and the bridge vector paradigm. Emerg Infect Dis. 2005, 11: 425-429. 10.3201/eid1103.040364.

    Article  PubMed Central  PubMed  Google Scholar 

  16. Amraoui F, Krida G, Bouattour A, Rhim A, Daaboub J, Harrat Z, Boubidi S-C, Tijane M, Sarih M, Failloux A-B: Culex pipiens, an experimental efficient vector of West Nile and Rift Valley fever viruses in the Maghreb region. PLoS One. 2012, 7: e36757. 10.1371/journal.pone.0036757.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  17. Tran A, Ippoliti C, Balenghien T, Conte A, Gely M, Calistri P, Goffredo M, Baldet T, Chevalier V: A geographical information system-based multicriteria evaluation to map areas at risk for Rift Valley fever vector-borne transmission in Italy. Transbound Emerg Dis. 2013, 60: 14-23.

    Article  PubMed  Google Scholar 

  18. Kalluri S, Gilruth P, Rogers D, Szczur M: Surveillance of arthropod vector-borne infectious diseases using remote sensing techniques: a review. PLoS Pathog. 2007, 3: e116. 10.1371/journal.ppat.0030116.

    Article  PubMed Central  Google Scholar 

  19. Amr ZS, Al-Khalili Y, Arbaji A: Larval mosquitoes collected from northern Jordan and the Jordan Valley. J Am Mosquito Contr. 1997, 13: 375-378.

    CAS  Google Scholar 

  20. Al-Khalili YH, Katbeh-Bader A, Mohsen ZH: Siphon index of Culex pipiens larvae collected from different biogeographical provinces in Jordan. Zool Middle East. 1999, 17: 71-76. 10.1080/09397140.1999.10637770.

    Article  Google Scholar 

  21. Knio KM, Markarian N, Kassis A, Nuwayri-Salti N: A two-year survey on mosquitoes of Lebanon. Parasite. 2005, 12: 229-235. 10.1051/parasite/2005123229.

    Article  CAS  PubMed  Google Scholar 

  22. Ammar SE, Kenawy MA, Abdel-Rahman HA, Gad AM, Hamed AF: Ecology of the mosquito larvae in urban environments of Cairo Governorate, Egypt. J Egypt Soc Parasitol. 2012, 42: 191-202.

    Article  PubMed  Google Scholar 

  23. Bhagat IM: Host-feeding patterns of Culex pipiens (Diptera-Culicidae) in El-Abtal village, Ismailia Governorate, Egypt. Egypt J Biol. 2004, 2: 139-143.

    Google Scholar 

  24. Gad AM, Riad IB, Farid HA: Host-feeding patterns of Culex pipiens and Cx. antennatus (Diptera: Culicidae) from a village in Sharqiya Governorate, Egypt. J Med Entomol. 1995, 32: 573-577.

    Article  CAS  PubMed  Google Scholar 

  25. Diuk-Wasser M, Brown H, Andreadis T, Fish D: Modeling the spatial distribution of mosquito vectors for West Nile virus in Connecticut, USA. Vector-Borne Zoonot. 2006, 6: 283-295. 10.1089/vbz.2006.6.283.

    Article  Google Scholar 

  26. Mweya CN, Kimera SI, Kija JB, Mboera LEG: Predicting distribution of Aedes aegypti and Culex pipiens complex, potential vectors of Rift Valley fever virus in relation to disease epidemics in East Africa. Infect Ecol Epidemiol. 2013, 3: 109-

    Google Scholar 

  27. Dowling Z, Ladeau SL, Armbruster P, Biehler D, Leisnham PT: Socioeconomic status affects mosquito (Diptera: Culicidae) larval habitat type availability and infestation level. J Med Entomol. 2013, 50: 764-772. 10.1603/ME12250.

    Article  PubMed  Google Scholar 

  28. Farid HA, Morsy ZS, Hassan AN, Hammad RE, Faris R, Kandil AM, Ahmed ES, Weil GJ: The impact of environmental and entomological factors on intervillage filarial focality in the Nile Delta. J Egypt Soc Parasitol. 2000, 30: 469-485.

    CAS  PubMed  Google Scholar 

  29. Gatton ML, Kay BH, Ryan PA: Environmental predictors of Ross River virus disease outbreaks in Queensland, Australia. Am J Trop Med Hygiene. 2005, 72: 792-799.

    Google Scholar 

  30. Morin CW, Comrie AC: Regional and seasonal response of a West Nile virus vector to climate change. Proc Natl Acad Sci USA. 2013, 110: 15620-15625. 10.1073/pnas.1307135110.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  31. Pearson RG, Raxworthy CJ, Nakamura M, Townsend Peterson A: Predicting species distributions from small numbers of occurrence records: a test case using cryptic geckos in Madagascar. J Biogeogr. 2007, 34: 102-117.

    Article  Google Scholar 

  32. Kumar S, Stohlgren TJ: Maxent modeling for predicting suitable habitat for threatened and endangered tree Canacomyrica monticola in New Caledonia. J Ecol Nat Environ. 2009, 1: 94-98.

    Google Scholar 

  33. Khatchikian C, Sangermano F, Kendell D, Livdahl T: Evaluation of species distribution model algorithms for fine-scale container-breeding mosquito risk prediction. Med Vet Entomol. 2010, 25: 268-275.

    Article  PubMed Central  PubMed  Google Scholar 

  34. Elith J, Phillips SJ, Hastie T, Dudík M, Chee YE, Yates CJ: A statistical explanation of MaxEnt for ecologists. Divers Distrib. 2010, 17: 43-57.

    Article  Google Scholar 

  35. Yackulic CB, Chandler R, Zipkin EF, Royle JA, Nichols JD, Campbell Grant EH, Veran S: Presence-only modelling using MAXENT: when can we trust the inferences?. Methods Ecol Evol. 2012, 4: 236-243.

    Article  Google Scholar 

  36. Fitzpatrick MC, Gotelli NJ, Ellison AM: MaxEnt versus MaxLike: empirical comparisons with ant species distributions. Ecosphere. 2013, 4: 1-15.

    Article  Google Scholar 

  37. Phillips SJ, Dudík M: Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation. Ecography. 2008, 31: 161-175. 10.1111/j.0906-7590.2008.5203.x.

    Article  Google Scholar 

  38. R Development Core Team: R: A language and environment for statistical computing. 2013

    Google Scholar 

  39. Hijmans RJ, Phillips S, Leathwick J, Elith J: Dismo: Species Distribution Modeling. 2013, R package version 0.9-1

    Google Scholar 

  40. Ridgeway G: Gbm: Generalized Boosted Regression Models. 2013, R package version 2.1

    Google Scholar 

  41. Farajollahi A, Fonseca DM, Kramer LD, Kilpatrick AM: “Bird biting” mosquitoes and human disease: a review of Culex pipiens complex mosquitoes in epidemiology. Infect Genet Evol. 2011, 11: 1577-1585. 10.1016/j.meegid.2011.08.013.

    Article  PubMed Central  PubMed  Google Scholar 

  42. Gad AM, Abdel Kader M, Farid HA, Hassan AN: Absence of mating barriers between autogenous and anautogenous Culex pipiens L. in Egypt. J Egypt Soc Parasitol. 1995, 25: 63-71.

    CAS  PubMed  Google Scholar 

  43. Gad AM, Feinsod FM, Soliman BA, el Said S: Survival estimates for adult Culex pipiens in the Nile Delta. Acta Trop. 1989, 46: 173-179. 10.1016/0001-706X(89)90034-X.

    Article  CAS  PubMed  Google Scholar 

  44. Thuiller W, Brotons L, Araújo MB, Lavorel S: Effects of restricting environmental range of data to project current and future species distributions. Ecography. 2004, 27: 165-172. 10.1111/j.0906-7590.2004.03673.x.

    Article  Google Scholar 

  45. Pearson RG, Dawson TP: Predicting the impacts of climate change on the distribution of species: are bioclimate envelope models useful?. Global Ecol Biogeogr. 2003, 12: 361-371. 10.1046/j.1466-822X.2003.00042.x.

    Article  Google Scholar 

  46. Harbach RE: The mosquitoes of the subgenus Culex in southwestern Asia and Egypt (Diptera: Culicidae). Contrib Am Entomol Inst. 1988, 24: 1-140.

    Google Scholar 

  47. Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A: Very high resolution interpolated climate surfaces for global land areas. Int J Climatol. 2005, 25: 1965-1978. 10.1002/joc.1276.

    Article  Google Scholar 

  48. Gu W, Müller G, Schlein Y, Novak RJ, Beier JC: Natural plant sugar sources of Anopheles mosquitoes strongly impact malaria transmission potential. PLoS One. 2011, 6: e15996. 10.1371/journal.pone.0015996.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  49. Andersson IH, Jaenson TGT: Nectar feeding by mosquitoes in Sweden, with special reference to Culex pipiens and Cx torrentium. Med Vet Entomol. 1987, 1: 59-64. 10.1111/j.1365-2915.1987.tb00323.x.

    Article  CAS  PubMed  Google Scholar 

  50. Foster WA: Mosquito sugar feeding and reproductive energetics. Annu Rev Entomol. 1995, 40: 443-474. 10.1146/annurev.en.40.010195.002303.

    Article  CAS  PubMed  Google Scholar 

  51. Richardson AD, Keenan TF, Migliavacca M, Ryu Y, Sonnentag O, Toomey M: Climate change, phenology, and phenological control of vegetation feedbacks to the climate system. Agric For Meteorol. 2013, 169: 156-173.

    Article  Google Scholar 

  52. Beven KJ, Kirkby MJ: A physically based, variable contributing area model of basin hydrology/Un modèle à base physique de zone d“appel variable de l”hydrologie du bassin versant. Hydrolog Sci J. 1979, 24: 43-69.

    Google Scholar 

  53. Burkett-Cadena ND, McClure CJW, Estep LK, Eubanks MD: Hosts or habitats: What drives the spatial distribution of mosquitoes?. Ecosphere. 2013, 4: 1-16.

    Article  Google Scholar 

  54. Leathwick JR, Elith J, Francis MP, Hastie T, Taylor P: Variation in demersal fish species richness in the oceans surrounding New Zealand: an analysis using boosted regression trees. Mar Ecol Prog Ser. 2006, 321: 267-281.

    Article  Google Scholar 

  55. Syfert MM, Smith MJ, Coomes DA: The effects of sampling bias and model complexity on the predictive performance of MaxEnt species distribution models. PLoS One. 2013, 8: e55158. 10.1371/journal.pone.0055158.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  56. Kadmon R, Farber O, Danin A: Effect of roadside bias on the accuracy of predictive maps produced by bioclimatic models. Ecol Appl. 2004, 14: 401-413. 10.1890/02-5364.

    Article  Google Scholar 

  57. Walter Reed Biosystematics Unit: Mosquito Occurrence Dataset. Smithsonian Institution. Accessed via on 2013-07-26

  58. Pecoraro H, Day H, Reineke R, Stevens N: Climatic and landscape correlates for potential West Nile virus mosquito vectors in the Seattle region. J Vector Ecol. 2007, 32: 22-28. 10.3376/1081-1710(2007)32[22:CALCFP]2.0.CO;2.

    Article  PubMed  Google Scholar 

  59. Elith J, Leathwick JR, Hastie T: A working guide to boosted regression trees. J Anim Ecol. 2008, 77: 802-813. 10.1111/j.1365-2656.2008.01390.x.

    Article  CAS  PubMed  Google Scholar 

  60. Hijmans RJ: Raster: Geographic Data Analysis and Modeling. 2012, R package version 2.1-49

    Google Scholar 

  61. Phillips SJ, Anderson RP, Schapire RE: Maximum entropy modeling of species geographic distributions. Ecol Model. 2006, 190: 231-259. 10.1016/j.ecolmodel.2005.03.026.

    Article  Google Scholar 

  62. Ahmed Al AM, Badjah-Hadj-Ahmed A-Y, Othman Al ZA, Sallam MF: Identification of wild collected mosquito vectors of diseases using gas chromatography-mass spectrometry in Jazan Province, Saudi Arabia. J Mass Spectrom. 2013, 48: 1170-1177. 10.1002/jms.3282.

    Article  Google Scholar 

  63. Ahmed AM, Shaalan EA, Aboul-Soud MAM, Tripet F, Al-Khedhairy AA: Mosquito vectors survey in the AL-Ahsaa district of eastern Saudi Arabia. J Insect Sci. 2011, 11: 176-

    Article  PubMed Central  PubMed  Google Scholar 

  64. Khater EI, Sowilem MM, Sallam MF, Alahmed AM: Ecology and habitat characterization of mosquitoes in Saudi Arabia. Trop Biomed. 2013, 3: 409-427.

    Google Scholar 

  65. Arishi H, Ageel A, Rahman MA, Hazmi AA, Arishi AR, Ayoola B, Menon C, Ashraf J, Frogusin O, Sawwan F, Al-Hazmi M, As-Sharif A, Al-Sayed M, Ageel AR, Alrajhi ARA, Al-Hedaithy MA, Fatani A, Sahaly A, Ghelani A, Al-Basam T, Turkistani A, Al-Hamadan N, Mishkas A, Al-Jeffri MH, Al-Mazroa YY, Alamri MMAEA: Outbreak of Rift Valley fever - Saudi Arabia, August-October, 2000. Morb Mortal Wkly Rep. 2000, 49: 905-908.

    Google Scholar 

  66. Davies FG: Risk of a rift valley fever epidemic at the haj in Mecca, Saudi Arabia. Rev Off Int Epizoot. 2006, 25: 137-147.

    CAS  PubMed  Google Scholar 

  67. Zaki A, Perera D, Jahan SS, Cardosa MJ: Phylogeny of dengue viruses circulating in Jeddah, Saudi Arabia: 1994 to 2006. Trop Med Int Health. 2008, 13: 584-592. 10.1111/j.1365-3156.2008.02037.x.

    Article  PubMed  Google Scholar 

  68. Elith J, Graham CH: Do they? How do they? Why do they differ? On finding reasons for differing performances of species distribution models. Ecography. 2009, 32: 66-77. 10.1111/j.1600-0587.2008.05505.x.

    Article  Google Scholar 

  69. Diuk-Wasser MA, Touré MB, Dolo G, Bagayoko M, Sogoba N, Sissoko I, Traoré SF, Taylor CE: Effect of rice cultivation patterns on malaria vector abundance in rice-growing villages in Mali. Am J Trop Med Hyg. 2007, 76: 869-874.

    PubMed Central  PubMed  Google Scholar 

  70. Knudsen AB, Slooff R: Vector-borne disease problems in rapid urbanization: new approaches to vector control. B World Health Organ. 1992, 70: 1-6.

    CAS  Google Scholar 

  71. Hassan AN, Nogoumy NE, Kassem HA: Characterization of landscape features associated with mosquito breeding in urban Cairo using remote sensing. Egypt J Remote Sens Space Sci. 2013, 16: 63-69.

    Google Scholar 

  72. Hassan AN, Beck LR, Dister S: Prediction of villages at risk for filariasis transmission in the Nile Delta using remote sensing and geographic information system technologies. J Egypt Soc Parasitol. 1998, 28: 75-87.

    CAS  PubMed  Google Scholar 

  73. Sithiprasasna R, Linthicum KJ, Liu G-J, Jones JW, Singhasivanon P: Use of GIS-based spatial modeling approach to characterize the spatial patterns of malaria mosquito vector breeding habitats in northwestern Thailand. Southeast Asian J Trop Med Public Health. 2003, 34: 517-528.

    PubMed  Google Scholar 

  74. Cohen JM, Ernst KC, Lindblade KA, Vulule JM, John CC, Wilson ML: Local topographic wetness indices predict household malaria risk better than land-use andland-cover in the western Kenya highlands. Malar J. 2010, 9: 328. 10.1186/1475-2875-9-328.

    Article  PubMed Central  PubMed  Google Scholar 

  75. Moss WJ, Hamapumbu H, Kobayashi T, Shields T, Kamanga A, Clennon J, Mharakurwa S, Thuma PE, Glass G: A dynamic model of some malaria-transmitting anopheline mosquitoes of the Afrotropical region. II. Validation of species distribution and seasonal variations. Malar J. 2011, 10: 163. 10.1186/1475-2875-10-163.

    Article  PubMed Central  PubMed  Google Scholar 

Download references


Research reported in this publication was supported by the National Institute of General Medical Sciences of the National Institutes of Health under award number R01GM093345. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Health.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Amy K Conley.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

AC designed the study, developed the species distribution models, performed the statistical analyses and drafted the manuscript. DO conceived of the study, participated in its design, acquired remotely sensed data and extracted summary data from MODIS imagery, and helped to draft the manuscript. NH conducted vector surveys (Lebanon) and helped draft the manuscript. AG conducted vector surveys (Egypt). AH organized vector data (Egypt) and provided significant revisions to the manuscript. JB conceived of the study, participated in its design and coordination, and helped draft the manuscript. All authors read and approved the final manuscript.

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Conley, A.K., Fuller, D.O., Haddad, N. et al. Modeling the distribution of the West Nile and Rift Valley Fever vector Culex pipiens in arid and semi-arid regions of the Middle East and North Africa. Parasites Vectors 7, 289 (2014).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: