- Open Access
Aedes aegypti in the Black Sea: recent introduction or ancient remnant?
Parasites & Vectorsvolume 11, Article number: 396 (2018)
The yellow fever mosquito Aedes aegypti transmits viral diseases that have plagued humans for centuries. Its ancestral home are forests of Africa and ~400–600 years ago it invaded the New World and later Europe and Asia, causing some of the largest epidemics in human history. The species was rarely detected in countries surrounding the Mediterranean Sea after the 1950s, but during the last 16 years it re-appeared in Madeira, Russia and in the eastern coast of the Black Sea. We genotyped Ae. aegypti populations from the Black Sea region to investigate whether this is a recent invasion (and if so, where it came from) or a remnant of pre-eradication populations that extended across the Mediterranean. We also use the Black Sea populations together with a world reference panel of populations to shed more light into the phylogeographical history of this species.
Microsatellites and ~19,000 genome-wide single nucleotide polymorphisms (SNPs) support the monophyletic origin of all populations outside Africa, with the New World as the site of first colonization. Considering the phylogenetic relationships, the Black Sea populations are basal to all Asian populations sampled. Bayesian analyses combined with multivariate analyses on both types of markers suggest that the Black Sea population is a remnant of an older population. Approximate Bayesian Computation Analysis indicates with equal probability, that the origin of Black Sea populations was Asia or New World and assignment tests favor the New World.
Our results confirmed that Ae. aegypti left Africa and arrived in New World ~500 years ago. The lineage that returned to the Old World and gave rise to present day Asia and the Black Sea populations split from the New World approximately 100–150 years ago. Globally, the Black Sea population is genetically closer to Asia, but still highly differentiated from both New World and Asian populations. This evidence, combined with bottleneck signatures and divergence time estimates, support the hypothesis of present day Black Sea populations being remnants of older populations, likely the now extinct Mediterranean populations that, consistent with the historic epidemiological record, likely represent the original return of Ae. aegypti to the Old World.
Aedes aegypti bears the official common name of “the yellow fever mosquito”, although today it is more feared as the major vector of viruses causing dengue, chikungunya and Zika fevers. There is little doubt that the origin of Ae. aegypti is Africa. This is consistent with ecology, biogeography, genetics and historical records (for a review see  and references within). The ancestral form of Ae. aegypti still breeds in tropical forests in sub-Saharan Africa, where its larvae are found inside tree holes and females prefer non-human blood meals [2, 3]. At some point during its evolutionary history, this forest-breeding mosquito became adapted to human habitats, with larvae breeding in man-made containers and adult females favoring human blood meals. This “domesticated” form of the mosquito spread with man around the world to tropical and subtropical regions [1, 4]. These two forms have been given subspecific names, Ae. aegypti formosus (Aaf) for the ancestral forest form and Ae. aegypti aegypti (Aaa) for the domestic form.
The distribution of Ae. aegypti outside Africa has been dynamic, as expected of an invasive human commensal . After its global geographical expansion, one of the major changes was its disappearance from the Mediterranean basin, where it was once widely established and responsible for outbreaks of yellow fever and dengue in the 19th and 20th centuries [6, 7]. Aedes aegypti has rarely been detected in countries surrounding the Mediterranean Sea after the 1950s, with sporadic reports coming from Italy, Israel and Turkey (for details see  and references within). This disappearance has been attributed to a combination of DDT use aimed at malaria-transmitting Anopheles, vector control measures in response to the Greek dengue epidemic, colder winters, and perhaps more importantly, improvements in sanitation, indoor plumbing in particular [8, 9].
After an apparent 50 year absence from Europe, Ae. aegypti was detected in 2001 in the area of Sochi, Russia and then in 2005 on the Portuguese island of Madeira . By 2008, it had re-populated the eastern coast of the Black Sea, with populations reported in Georgia, Russia [11, 12] and later (2015) from Turkey .
The current presence of Ae. aegypti in the Black Sea area may represent a new introduction from Ae. aegypti endemic areas (the Americas, Africa or Asia), or a cryptic population related to the Ae. aegypti that once populated the Mediterranean basin.
Here we present genetic data (microsatellites and SNPs) on populations of Ae. aegypti from Turkey and Georgia. We examine the genetic diversity and differentiation patterns among these populations and compare them with the known established populations of Ae. aegypti worldwide, with the goals of (i) determining whether present-day Black Sea populations are recent invasions or re-emergence of older cryptic populations; (ii) reconstructing the most probable biogeographical scenario for the invasion to the Black Sea region; and (iii) revealing the evolutionary relationships of the Black Sea populations with the known established populations in the New World, Asia and Africa.
Mosquito collections, DNA extraction and genotyping
Two different datasets of genetic markers were used in this study: (i) twelve previously published microsatellite loci [A1, B2, B3, A9 (tri-nucleotide repeats), and AC2, CT2, AG2, AC4, AC1, AC5, AG1, and AG4 (di-nucleotide repeats)] [14, 15]; and (ii) a panel of ~25,000 SNPs . The complete microsatellite dataset consisted of 1795 individuals from 56 populations (Fig. 1), while the SNP dataset consisted of 306 individuals from 30 populations (for details see Additional file 1: Table S1).
The Black Sea region is represented by samples from Turkey and Georgia. Two locations were sampled from Turkey (~50 km apart) and three locations from Georgia (inland: 2 locations; 15 individuals in total; and coast: 23 individuals). Since the sample size of the two inland Georgia localities was small (6 and 9 individuals), the preliminary analysis indicated that they form one homogenous group with low differentiation among them, and due to their geographical proximity (within ~10 km), these samples were pooled and treated as a single population sample of 15 individuals labeled as Georgia-inland.
For both datasets, total DNA was extracted using the DNeasy Blood and Tissue kit (Qiagen, Hilden, Germany) according to the manufacturer instructions, with an additional RNAse A (Qiagen) step. DNA samples were stored at -20 °C until further analysis. For the microsatellites dataset, individual mosquitoes were genotyped as described in . Microsatellite alleles were scored using GENEMAPPER version 4.0 (Applied Biosystems). The SNPs dataset included the 25,589 SNPs as they genotyped in the Ae. aegypti SNP-Chip  of Affymetrix. Briefly, genomic DNA (~200 ng per sample) was sent to the Functional Genomics Core at the University of North Carolina, Chapel Hill, for hybridization and production of genotypes. The Affymetrix Genotyping Console and the R package SNPolisher v1.4 (both from Affymetrix, Inc., Santa Clara, CA, USA) were used to generate and process genotype calls.
Raw data are available from VectorBase.org, Project ID VBP0000269 (microsatellite and SNPs).
Genetic diversity and differentiation
Within-population deviations from Hardy-Weinberg equilibrium (HWE) were estimated using the exact HWE test in GENEPOP v.4.5.1 [17, 18]. The test was run with 10,000 dememorizations, 1,000 batches and 10,000 iterations per batch. Bonferroni correction (α = 0.05) was applied to the resulting matrix of HWE. Number of alleles, allelic frequencies, and average observed (Ho) and expected (He) heterozygosities were estimated using GenAlEx . Allelic richness (AR) and private allelic richness (Np) were calculated in HPRARE [20, 21] using rarefaction and assuming 30 genes. The non-parametric Kruskal Wallis test was used to test for significant differences on Ho and allelic richness between different groups of populations.
Pairwise genetic distances (FST) between all pairs of populations and their significance were calculated in the SNP dataset with Arlequin v18.104.22.168 , using 1000 permutations.
Many of the populations included in the microsatellite dataset have been studied previously [14, 23,24,25,26] and here are used as a worldwide reference panel to accurately identify the origin(s) of the Black Sea populations and their genetic affinities with the other Ae. aegypti populations throughout the world.
Geographical population structure was evaluated using the Bayesian clustering method implemented in the software STRUCTURE v.2.3  for the microsatellite dataset, and the fastSTRUCTURE  for the SNP dataset. Prior to the implementation of the fastSTRUCTURE analysis, the SNP dataset was filtered to exclude highly linked SNPs, using the --indep-pairwise 50 10 0.3 parameter in PLINK .
Both methods identify genetic clusters and assign individuals to clusters with no a priori information of sampling location. For the STRUCTURE analysis the most likely number of clusters (K), was determined by conducting 20 independent runs for each K = 1 to the maximum number of populations included in the analysis. Each run assumed an admixture model and independent allele frequencies (λ set to 1), using a burn-in value of 100,000 iterations followed by 500,000 repetitions. The optimal number of K clusters was determined using the ΔK method of Evanno et al. , using the online version of STRUCTURE HARVESTER v.0.6.94 .
Since there is increasing evidence [31, 32] that STRUCTURE analysis is severely affected by uneven sampling, which could lead to wrong inferences on hierarchical structure, each population was represented by a random subsample of 34 individuals (or less if not available) in the microsatellite dataset and by 12 random individuals (or less if not available) in the SNP dataset, to match the sizes of the Black Sea samples (Additional file 1: Table S1). Following the same reasoning and given that the Black Sea region is underrepresented (four populations) compared with the other geographical groups, we performed the genetic structure analyses of both the microsatellite and the SNPs datasets twice: (i) using all the populations available; and (ii) by randomly selecting (multiple times) four populations each from Africa, New World and Asia-Pacific regions. Results from the analyses of all datasets were summarized and plotted using the online version of CLUMPAK .
To complement the genetic structure analyses, we performed Principal Components Analysis (PCA) and Discriminant Analysis of Principal Components (DAPC), using the R packages ade4 , LEA  and ADEGENET  in R v.3.4.4  on both datasets.
To test the degree of assignment of any individual mosquito to a specific population of origin or geographical group, we used the program GeneClass2 v2.0 . Assignment tests were performed on both datasets using the Rannala & Mountain criterion  and the Monte Carlo resampling algorithm of Paetkau et al.  (n = 1000) for level of significance 0.05. Due to limitations on the number of loci that can be analyzed by the GeneClass2 software, five independent analyses were run using 4000 randomly selected SNPs. Self-assignment tests on the reference population panel resulted in 97.7% of the individuals correctly assigned to their population of origin.
We tested for evidence of population bottleneck events using two methods, as implemented in BOTTLENECK . In the first method, the distribution of the expected heterozygosity from the observed number of alleles is calculated for each population and locus, under the assumption of mutation-drift equilibrium; the second method is based on allele frequency distributions. The allele frequency distribution is expected to be approximately L-shaped under mutation-drift equilibrium, while a shift in this distribution is indicative of a recent bottleneck . Although the program provides results under three possible mutation models; the Infinite Allele Model (IAM), the Stepwise Mutation Model (SMM) and the two-phase mutation model (TPM), we took into account only the TPM and the SMM model, which perform better for microsatellites datasets [43,44,45]. Simulation of heterozygosity at mutation-drift equilibrium distributions for the TPM model assumed 70% single-step mutations and 30% of multiple-step mutations. Significance was assessed using Wilcoxon’s signed rank test, as recommended for less than 20 markers.
The bottleneck analysis can only detect extreme reductions in population sizes that have occurred during the last 0.2–4.0 Ne generations . Based on the average Ne estimations for Ae. aegypti populations worldwide [26, 47, 48] the program cannot detect bottlenecks that occurred more than ~1200 generations ago.
To infer the evolutionary relationships among populations we used Maximum Likelihood (ML) analysis, as implemented in RAxML , using 1000 bootstraps and the GTR model of evolution along with the CAT model of rate heterogeneity. For the runs we used the string “ASC” to apply an ascertainment bias correction to the likelihood calculations, and the standard correction by Lewis  when only variant sites are included in the data set, as directed by the software’s manual. The phylogenetic analysis was performed using the SNP dataset and randomly choosing two individuals per population (Fig. 1). The final dataset consisted of 62 individuals (including two Aedes mascarensis individuals that served as outgroup and were genotyped for the same SNPs using the Ae. aegypti SNP-chip). Since a major assumption in the phylogenetic analyses is the neutrality of the loci under study, we identified outlier SNPs (q-values < 0.05) using the pcadapt package  in R, which is based on Principal Components Analysis (PCA) and it does not require grouping individuals into populations. The SNPs identified as outliers were then filtered out to exclude SNPs that might be under selection. In total, 134 SNPs were excluded from the phylogenetic analysis as possibly being under selection. The final dataset after filtering to include only variant sites for this specific pruned dataset of 62 samples resulted in 19,092 SNPs that were used for the ML analysis.
Inferring population history
Bayesian computation methods (ABC)  as implemented by DIYABC v.2.0.4  were used to infer the population history of Ae. aegypti in the Black Sea. The ABC analysis was performed on the microsatellite dataset. We tested five scenarios to identify the origin of the Black Sea populations (Scenario 1: Black Sea originated from an admixed event between New World and Asia; Scenarios 2 and 5: New World origin; Scenario 3: Asian origin; and Scenario 4: African origin).
The best-fit scenario and confidence on the model of choice were evaluated using the DIYABC . Divergence times were estimated in generations and the priors setting (for details see Additional file 1: Table S2) was based on the historical record information from previous studies [1, 7, 23, 26]. Given that the number of generations per year for Ae. aegypti is affected by the climatic conditions [54, 55], we collected climatic data (www.worldclim.com) for our Black Sea sampling localities and compared them with tropical and subtropical regions where Aaa is established (see Additional file 1: Table S3). The transformation of divergence time from generations to years for Black Sea region is challenging since studies estimating the number of generations per year have been mainly conducted for tropical populations and it is usually assumed an average of 10 generations per year (for details see [23, 55, 56]). However, considering that at least one quarter of the year the mean temperature in the Black Sea region is much lower than in other localities that established Aaa populations occurred (see Additional file 1: Table S3), it is possible that in Black Sea less than 10 generations per year exist. Thus, for the time estimates regarding the Black Sea region we provide a range assuming 6–8 generations per year. A mutation rate ranging from 9 × 10-6 to 1 × 10-5 was used based on rates reported in the literature for other Diptera species [57, 58]. Details on the effective population size and split time between regions used as priors for the ABC analysis are provided in Additional file 1: Table S2.
We did not run a DiyABC analysis on the SNP dataset due to the possible SNP ascertainment bias introduced during the SNP-chip design (the chip was designed to capture representative variation across different regions of the world ) that could affect the inference of demographic history through its effect on the Allele Frequency Spectrum-AFS [59,60,61,62]. Alternatively, we used the output of the ML analysis (see phylogenetic analysis section) to perform a Bayesian Binary MCMC (BBM) analysis on RASP  to reconstruct the ancestral area distribution of the Black Sea populations. For the MCMC analysis we ran 10 chains of 50,000 cycles, we set the maximum number of areas to four and we did not take the distribution area of outgroup into account for the analysis.
Genetic diversity and differentiation
Sixty-five Fis values out of 685 (9.49%) population-by-locus comparisons deviated significantly from Hardy-Weinberg equilibrium (HWE) after sequential Bonferroni correction (α = 0.05), a level common for microsatellites and most often due to rare null alleles [14, 23].
Population genetic statistics for each population for the microsatellite dataset are provided in Additional file 1: Table S4. Populations in the range of Aaa that includes the Black Sea samples have significantly lower levels of allelic richness (AR) and observed heterozygosity (Ho) compared with Aaf (Fig. 2; Kruskal-Wallis H-test: χ2 = 16.522, df = 4, P = 0.0024 and χ2 = 14.035, df = 4, P = 0.0072). Focusing only on Aaa, the Black Sea region shows similar levels of Ho when compared with the other regions, but lower levels of AR compared to Asia (Fig. 2). Private allelic richness (Np) for the full dataset (56 populations) was low (Np ≤ 0.09) in all cases with the exception of some African and Argentinean populations. Analysis of the Black Sea populations’ dataset (only Turkey and Georgia) revealed that the Np richness range between 0.09–0.17, but no private alleles were found in the Black Sea populations, relative to the populations of the New World or Asia. However, if we consider the region level instead of the population level, Black Sea has, on average, a small proportion of private alleles (Africa 2.36, New World 0.50, Asia 0.64, Pacific 0.11 and Black Sea 0.13).
The mean genetic differentiation between the geographical regions, FST based on SNP chip data are summarized in Fig. 3. All pairwise FST estimations using the SNP dataset were significant (P < 0.05, using 1000 permutations). This figure is comparable to Fig. 2b presented in  that uses RAD sequencing data, providing evidence that the patterns are robust and not dependent on type of genetic data analyzed.
Structure analyses on both datasets showed that the Black Sea populations are genetically distinct (Evanno method; K = 2) from sub-Saharan Aaf, clustering within Aaa populations (Figs. 4a, 5a). Subsequently, within Aaa, Black Sea clusters first with Asia (Figs. 4b, 5b) and then as a distinct group (Figs. 4c, 5c). Structure analyses on the smaller datasets, normalizing the number of populations per region with those of the Black Sea region, confirmed that the Black Sea is of the Aaa type (Additional file 1: Figure S1) and according to the Evanno et al. method , forms a distinct group within this range (Additional file 1: Figure S2).
PCA analysis on both datasets also confirmed the distinction between Aaf and Aaa populations (Fig. 6a, Additional file 1: Figure S3a). Focusing on Aaa, Black Sea samples again form a group distinct from both Asia and New World (Fig. 6b, Additional file 1: Figure S3b).
DAPC on the worldwide microsatellite and SNP chip datasets indicated (by the Bayesian Information Criterion-BIC) 20 DAPC-groups, with the majority (microsatellites dataset; 68 out of the 91) or all (SNP chip dataset) of the Black Sea individuals forming a single distinct group (Additional file 1: Figure S4). Focusing on the Black Sea, BIC supported three groups, (Additional file 1: Figure S5) with no clear distinction between Georgia and Turkey populations.
Self-assignment tests on the microsatellite dataset correctly assigned 80.9% individuals to their population of origin and 99.9% were correctly assigned to their original continent (America, Asia). Self-assignment tests on the SNP dataset correctly assigned 97.7% of the individuals to their original population with mis-assignments usually occurring within the same country, with the exception of Costa Rica, which was usually mis-assigned to Florida.
When using the worldwide reference panel, individuals from the Black Sea were assigned primarily to North American populations, with scores of 50% or higher, based on microsatellite data (Additional file 1: Figure S6). A large number of individuals from the Turkish populations were assigned with the highest average assignment probability to New Orleans (America), closely followed by Ho Chi Minh (Asia) and Miami (America). New Orleans and Miami were also assigned a large number of individuals from the coast population in Georgia, based on highest average assignment probability, while individuals from the inland-Georgia population were assigned with high probabilities to Pakistan, Thailand, Brazil and Mexico (Additional file 1: Figure S6).
For the SNP dataset, we ran the analysis multiple times in panels of 4000 SNPs. The Black Sea individuals were assigned to Mexico, Vietnam, New Orleans, Puerto Rico, and Pakistan, with frequencies as shown in Additional file 1: Figure S6. The majority of individuals were assigned with scores of 70% or higher to Puerto Rico (Caribbean), followed by Ho Chi Minh (Asia).
Bottleneck analysis on the Black Sea samples provided evidence of a recent bottleneck in all four populations. Specifically, both tests (Wilcoxon and mode shift) and both models (TPM, SMM) showed significant (Wilcoxon test one tail for H excess P = 0.002 and P = 0.046) demographic changes for the Turkey-Hopa population, while for the other three populations (Turkey-Pazar, Georgia-inland, Georgia-coast) only Wilcoxon test and only the TPM model showed marginally significant (Wilcoxon test one tail for H excess 0.04 < P < 0.05) signs of bottleneck.
The rooted ML phylogenetic tree constructed from the SNP dataset is presented in Fig. 7. All Aaa populations form a monophyletic group distinct from all Aaf populations. Within Aaa, Black Sea populations form a monophyletic group that is basal to the Asian clade and the sister clade to the South America-Caribbean clade.
Inferring population history
Figure 8 shows the scenarios tested for the diaspora of Ae. aegypti from Africa. The analysis indicated two scenarios with the highest Posterior Probability (PP); Scenario 3 (PP = 0.56) supporting that the species arrived in New World from Africa ~500 years ago, then from the New World invaded Asia ~150 years ago and later the Black Sea from Asia ~150–110 years ago (assuming 6–8 generations per year); and Scenario 2 (PP = 0.41) indicating that Asia and Black Sea may have been two independent invasions from the New World. Detailed information on the results of the demographic analysis is provided in Additional file 1: Table S2.
As mentioned above, Asia-Black Sea and South America-Caribbean are sister clades. RASP analysis (Fig. 7) supported that the ancestral area of these two major clades is likely North America (probability 0.74). However, focusing on the ancestral area of the Asia-Black Sea region, there is a degree of uncertainty. The probability of Asia being the ancestral area is 0.53, compared to 0.34 for the Black Sea ancestral [the remaining 0.13 is either to be unknown (black proportion in the pie in Fig. 7) or a continuous region (brown proportion in the pie) covering both Black Sea and Asia]. Thus the most likely scenario according to RASP (probability of 0.51), suggests that mosquitoes arrived in Asia from New World and then went to the Black Sea region through dispersal from Asia.
Our population genetics analyses on the Black Sea populations using two types of genetic markers (12 microsatellites and ~19,000 SNPs) revealed relatively low differentiation between the populations within this region and subtle genetic structure (Additional file 1: Figure S5), which is expected considering the intense road traffic between Georgia and Turkey. Thus, we consider the Ae. aegypti populations of Turkey and Georgia as parts of a single Black Sea genetic unit that we will refer to as BS. Placing this genetic unit into the worldwide reference panel of Ae. aegypti populations, all analyses concluded that BS populations clearly belong to the subspecies Aaa and are distantly related to African Aaf. Within Aaa, BS, Asia and New World are all equally genetically differentiated from each other, FST ~0.19–0.25 (Fig. 3).
The high levels of genetic differentiation (see Figs. 3, 4c, 5c, 6b, Additional file 1: Figures S2-S4) combined with the fact that BS was first reported in 2008  raise questions on the origin(s) and the approximate age of BS. It is well documented that Ae. aegypti, although present in Europe during 1800s and 1900s, has been rarely recorded since the 1950s (see  and references therein) due to a combination of factors (e.g. insecticides, improvements in sanitation). However, in recent years the species re-appeared in the periphery of Europe, with established populations reported in Madeira in 2004  and in the Black Sea region (Russia and Georgia) in 2001 . The Madeira population is very likely a recent re-invasion, as suggested by previous studies [10, 65]. The Madeira population shares mitochondrial alleles with South American populations , clusters together with South America and Asia in the microsatellites structure analysis , and falls within the South America-Caribbean and Asia clouds in our DAPC and PCA analyses (see Additional file 1: Figures S3 and S4). In contrast, BS is quite different in being equally and strongly genetically differentiated from both Asia and the New World (Fig. 3).
If the Black Sea population is a recent invasion [occurring in 2001 (first record in Sochi area) or shortly before] it should have close genetic affinities with some other population(s) in the reference panel, as observed in several other cases of recent re-invasions around world [65,66,67]. As noted in the Results section, individual assignment tests of random individuals in our data set are re-assigned to their correct population of origin > 98% of the time with SNP chip data. While BS shows genetic affinity with Asia (Figs. 4b, 5b, 7), this relationship is in the context of broader biogeographical patterns and the evidence is not consistent with a recent invasion that occurred during the last 10–15 years from Asia (Figs. 3, 6b, Additional file 1: Figures S2-S4). On the contrary, the high genetic differentiation combined with assignment tests, and estimated divergence times (Additional file 1: Table S2; on average ~950 generations back) support the hypothesis that the Black Sea is a remnant population that has existed in the region despite the apparent elimination of this species in the Mediterranean, and that remained cryptic until its detection in 2001. Bottleneck signatures found in this population are also consistent with the remnant hypothesis. Importantly, during the years after presumed eradication, Ae. aegypti was sporadically reported in Turkey (in 1961, 1984, 1992, 1993 and 2001) , suggesting that small populations of the species were maintained there after the 1950s, when it was reported eliminated from the Mediterranean. The relatively high genetic diversity (Fig. 2) in the Black Sea is consistent with this being an older population. However, the direct invasion source of the BS population is difficult to be found because (a) the gene flow between BS and modern Asian-North American populations may obscure the true pattern and (b) the population of origin could be a population not in our reference dataset (e.g. old Mediterranean populations or Russian population).
The most likely phylogeographical scenario, supported by two independent analyses (ABC and RASP) is that Ae. aegypti from the New World arrived in Asia ~150 years ago, coinciding with previous studies and the historic record [1, 23]. However, it is unclear if the species invaded the Black Sea region from Asia (Scenario 3; Fig. 8, Structure plots in Figs. 4, 5) or BS originated directly from the New World (Scenario 2; Fig. 8 and assignment tests) since both ABC and RASP analyses retain a degree of uncertainty regarding the relationship of New World-Black Sea-Asia (see Figs. 7, 8). Specifically, in ABC analysis both scenarios (Scenario 2 and 3) are almost equally probable and in the RASP analysis there is a probability of 0.3 (compared with 0.5) for BS being ancestral to all Asian populations. Inferring the exact biogeographical relationships between these regions is challenging for two reasons: (i) the lack of the now extinct Mediterranean populations from our analyses in order to test if BS is a remnant of this population; and (ii) the difficulty in estimating the actual age of this population. As mentioned above (see Methods section) the transformation from generations to years depends on the number of generations per year, but we lack such estimates for the BS region or regions with similar climatic conditions. However, the BS is considerably cooler than tropical Africa and Americas, so the number of generations/year must be lower. This, combined with the fact that the estimated divergence time (in generations) indicated by the analysis is relatively broad (see Additional file 1: Table S2), prevents us from precise estimates. Here, we chose to present the mean time divergence estimates for BS as we did for all the Ae. aegypti populations studied, however, we note that given the limitations presented above, we cannot reject that the actual age of the BS population could exceed the ~100–150 years, which would allow the estimate to be consistent with historical epidemiological data.
Given the appearance of yellow fever in the New World no later than the 17th Century and perhaps as early as the 15th , the estimated time of founding of the New World is about 500 years ago (see also Additional file 1: Table S2), shortly after the Europeans arrived in the New World. The first appearance of established yellow fever and dengue in Europe were in Spain in 1801–1804 (while cases of yellow fever are reported in European ports prior to this time, annually repeated epidemics are documented for first time in early 1800s, indicating establishment of year-round breeding Ae. aegypti). Urban dengue and chikungunya first appeared in Asia in 1879–1890. The opening of the Suez Canal in 1869 has been proposed as the path that facilitated this dispersal . The historic epidemiological record and the estimated time of Ae. aegypti arrival to Asia ~150 years ago reinforce this hypothesis. Although we included a similar scenario in our ABC analysis (Fig. 8; Scenario 5), the absence of data on Mediterranean populations prior to 1950 likely accounts for the lack of support for the hypothesis.
The use of two types of markers and several population genetics analyses revealed high genetic differentiation between the Black Sea population and an extensive reference panel of Ae. aegypti populations distributed worldwide. This high genetic differentiation combined with the mean approximate age estimated for this population of > 100 years, signs of a bottleneck, and the sporadic reports regarding the presence of the species in Turkey after 1950s, support the hypothesis that the present populations in the Black Sea are recently expanded populations of small remnants that existed in the area long before its discovery in 2008. We could not definitively determine whether the Black Sea population is a remnant of old Mediterranean populations that has been hypothesized to be the invasion source of Asian populations, or is an independent introduction from Asia.
- Aaa :
Aedes aegypti aegypti
- Aaf :
Aedes aegypti formosus
Approximate Bayesian Computation methods
Allele Frequency Spectrum
Binary Bayesian MCMC method
Bayesian Information Criterion
Discriminant Analysis of Principal Components
- F ST :
Generalized Stepwise Mutation Model
General Time Reversible
Infinite Allele Model
Markov chain Monte Carlo
Effective population size
Private allelic richness
Principal Components Analysis
Reconstruct Ancestral State in Phylogenies
Randomized Axelerated Maximum Likelihood
Stepwise Mutation Model
Single nucleotide polymorphisms
Two-phase Mutation Model
Powell JR, Tabachnick WJ. History of domestication and spread of Aedes aegypti - a review. Mem Inst Oswaldo Cruz. 2013;108(Suppl. 1):11–7.
McBride CS, Baier F, Omondi AB, Spitzer SA, Lutomiah J, Sang R, et al. Evolution of mosquito preference for humans linked to an odorant receptor. Nature. 2014;515:222–7.
Lounibos LP. Habitat segregation among African treehole mosquitoes. Ecol Entomol. 1981;6:129–54.
Tabachnick WJ. Evolutionary genetics and arthropod-borne disease: the yellow fever mosquito. Am Entomol. 1991;37:14–26.
Powell JR. Mosquitoes on the move. Science. 2016;354:971–2.
Copanaris P. La dengue en Grece. Bull Soc Path Exot. 1928;22:272–92.
Schaffner F, Mathis A. Dengue and dengue vectors in the WHO European region: past, present, and scenarios for the future. Lancet Infect Dis. 2014;14:1271–80.
Holstein M. Dynamics of Aedes aegypti distribution, density and prevalence in the Mediteranean area. Bull World Health Org. 1967;23:541–3.
Curtin TJ. Status of Aedes aegypti in the eastern Mediterranean. J Am Mosq Control Assoc. 1967;4:48–50.
Margarita Y, Grácio A, Lencastre I, Silva A, Novo M, Sousa C. First record of Aedes (Stegomyia) aegypti (Linnaeus, 1762) (Diptera, Culicidae) in Madeira Island - Portugal. Acta Parasitol. 2006;13:59–61.
Yunicheva Y, Ryabova T, Markovich N. First data on the presence of breeding populations of the Aedes aegypti L. mosquito in Greater Sochi and various cities of Abkhazia. Med Parazitol (Mosk). 2008;3:40–3.
Ganushkina LA, Patraman IV, Rezza G, Migliorini L, Litvinov SK, Sergiev VP. Detection of Aedes aegypti, Aedes albopictus, and Aedes koreicus in the area of Sochi, Russia. Vector Borne Zoonotic Dis. 2016;16:58–60.
Akiner MM, Demirci B, Babuadze G, Robert V, Schaffner F. Spread of the invasive mosquitoes Aedes aegypti and Aedes albopictus in the Black Sea region increases risk of chikungunya, dengue, and Zika outbreaks in Europe. PLoS Negl Trop Dis. 2016;10:e0004664.
Brown JE, McBride CS, Johnson P, Ritchie S, Paupy C, Bossin H, et al. Worldwide patterns of genetic differentiation imply multiple ‘domestications’ of Aedes aegypti, a major vector of human diseases. Proc R Soc B. 2011;278:2446–54.
Slotman MA, Kelly NB, Harrington LC, Kitthawee S, Jones JW, Scott TW, et al. Polymorphic microsatellite markers for studies of Aedes aegypti (Diptera: Culicidae), the vector of dengue and yellow fever. Mol Ecol Notes. 2007;7:168–71.
Evans BR, Gloria-Soria A, Hou L, McBride C, Bonizzoni M, Zhao H, et al. A multipurpose, high-throughput single-nucleotide polymorphism chip for the dengue and yellow fever mosquito, Aedes aegypti. G3 (Bethesda). 2015;5:711–8.
Rousset F. genepop’007: a complete re-implementation of the genepop software for Windows and Linux. Mol Ecol Resour. 2008;8:103–6.
Raymond M, Rousset F. GENEPOP (Version 1.2): Population genetics software for exact tests and ecumenicism. J Hered. 1995;86:248–9.
Peakall R, Smouse PE. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research - an update. Bioinformatics. 2012;28:2537–9.
Kalinowski S. HP-Rare: a computer program for performing rarefaction on measures of allelic diversity. Mol Ecol Notes. 2005;5:187–9.
Kalinowski ST. Counting alleles with rarefaction: private alleles and hierarchical sampling designs. Conserv Genet. 2004;5:539–43.
Excoffier L, Lischer HEL. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10:564–7.
Gloria-Soria A, Ayala D, Bheecarry A, Calderon-Arguedas O, Chadee DD, Chiappero M, et al. Global genetic diversity of Aedes aegypti. Mol Ecol. 2016;25:5377–95.
Brown JE, Evans BR, Zheng W, Obas V, Barrera-Martinez L, Egizi A, et al. Human impacts have shaped historical and recent evolution in Aedes aegypti, the dengue and yellow fever mosquito. Evolution. 2014;68:514–25.
Kotsakiozi P, Gloria-Soria A, Caccone A, Evans B, Schama R, Martins AJ, et al. Tracking the return of Aedes aegypti to Brazil, the major vector of the dengue, chikungunya and Zika viruses. PLoS Negl Trop Dis. 2017;11:e0005653.
Saarman NP, Gloria-Soria A, Anderson EC, Evans BR, Pless E, Cosme LV, et al. Effective population sizes of a major vector of human diseases, Aedes aegypti. Evol Appl. 2017;10:1031–9.
Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–59.
Raj A, Stephens M, Pritchard JK. fastSTRUCTURE: variational inference of population structure in large SNP datasets. Genet. 2014;197:573–89.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75.
Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14:2611–20.
Puechmaille SJ. The program structure does not reliably recover the correct population structure when sampling is uneven: subsampling and new estimators alleviate the problem. Mol Ecol Resour. 2016;16:608–27.
Wang J. The computer program structure for assigning individuals to populations: easy to use but easier to misuse. Mol Ecol Resour. 2017;17:981–90.
Kopelman NM, Mayzel J, Jakobsson M, Rosenberg NA, Mayrose I. Clumpak: a program for identifying clustering modes and packaging population structure inferences across K. Mol Ecol Resour. 2015;15:1179–91.
Dray S, Dufour AB. The ade4 package: implementing the duality diagram for ecologists. J Stat Softw. 2007;22:1–20.
Frichot E, Francois O. LEA: an R package for landscape and ecological association studies. Methods Ecol Evol. 2015;6:925–9.
Jombart T. Adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24:1403–5.
R core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2018. https://www.R-project.org/.
Piry S, Alapetite A, Cornuet J-M, Paetkau D, Baudouin L, Estoup A. GENECLASS2: a software for genetic assignment and first-generation migrant detection. J Hered. 2004;95:536–9.
Rannala B, Mountain JL. Detecting immigration by using multilocus genotypes. Proc Natl Acad Sci USA. 1997;94:9197–201.
Paetkau D, Slade R, Burden M, Estoup A. Genetic assignment methods for the direct, real-time estimation of migration rate: a simulation-based exploration of accuracy and power. Mol Ecol. 2004;13:55–65.
Piry S, Luikart G, Cornuet J-M. Computer note. BOTTLENECK: a computer program for detecting recent reductions in the effective size using allele frequency data. J Hered. 1999;90:502–3.
Luikart G, Allendorf F, Cornuet J-M, Sherwin W. Distortion of allele frequency distributions provides a test for recent population bottlenecks. J Hered. 1998;89:238–47.
Slatkin M. A measure of population subdivision based on microsatellite allele frequencies. Genetics. 1995;139:457–62.
Chakraborty R, Jin L. Heterozygote deficiency, population substructure and their implications in DNA fingerprinting. Hum Genet. 1992;88:267–72.
Di Rienzo A, Peterson AC, Garza JC, Valdes AM, Slatkin M, Freimer NB. Mutational processes of simple-sequence repeat loci in human populations. Proc Natl Acad Sci USA. 1994;91:3166–70.
Luikart G, Cornuet J-M. Empirical evaluation of a test for identifying recently bottlenecked populations from allele frequency data. Conserv Biol. 1998;12:228–37.
Rasic G, Endersby-Harshman N, Tantowijoyo W, Goundar A, White V, Yang Q, et al. Aedes aegypti has spatially structured and seasonally stable populations in Yogyakarta, Indonesia. Parasit Vectors. 2015;8:610.
Olanratmanee P, Kittayapong P, Chansang C, Hoffmann AA, Weeks AR, Endersby NM. Population genetic structure of Aedes (Stegomyia) aegypti (L.) at a micro-spatial scale in Thailand: implications for a dengue suppression strategy. PLoS Negl Trop Dis. 2013;7:e1913.
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–1313.
Lewis PO. A likelihood approach to estimating phylogeny from discrete morphological character data. Syst Biol. 2001;50:913–25.
Luu K, Bazin E, Blum MGB. Molecular Ecology Resources. 2017;17(1):67–77.
Beaumont MA, Zhang W, Balding DJ. Approximate Bayesian computation in population genetics. Genetics. 2002;162:2025–35.
Cornuet J-M, Pudlo P, Veyssier J, Dehne-Garcia A, Gautier M, Leblois R, et al. DIYABC v2.0: a software to make approximate Bayesian computation inferences about population history using single nucleotide polymorphism, DNA sequence and microsatellite data. Bioinformatics. 2014;30:1187–9.
Beserra EB, Fernandes CRM, Silva SAO, LAd S, JWd S. Efeitos da temperatura no ciclo de vida, exigências térmicas e estimativas do número de gerações anuais de Aedes aegypti (Diptera, Culicidae). Iheringia, Sér Zool. 2009;99:142–8.
Marinho RA, Beserra EB, Bezerra-Gusmao MA, Porto Vde S, Olinda RA, Dos Santos CA. Effects of temperature on the life cycle, expansion, and dispersion of Aedes aegypti (Diptera: Culicidae) in three cities in Paraiba. Brazil. J Vector Ecol. 2016;41:1–10.
Beserra EB, FPd C Jr, JWd S, Santos TS, Fernandes CRM. Biologia e exigências térmicas de Aedes aegypti (L.) (Diptera: Culicidae) provenientes de quatro regiões bioclimáticas da Paraíba. Neotrop Entomol. 2006;35:853–60.
Pfeiler E, Flores-Lopez CA, Mada-Velez JG, Escalante-Verdugo J, Markow TA. Genetic diversity and population genetics of mosquitoes (Diptera: Culicidae: Culex spp.) from the Sonoran Desert of North America. Sci World J. 2013;2013:11.
Schug MD, Mackay TF, Aquadro CF. Low mutation rates of microsatellite loci in Drosophila melanogaster. Nat Genet. 1997;15:99–102.
Helyar SJ, Hemmer-Hansen J, Bekkevold D, Taylor MI, Ogden R, Limborg MT, et al. Application of SNPs for population genetics of nonmodel organisms: new opportunities and challenges. Mol Ecol Resour. 2011;11(Suppl. 1):123–36.
Guillot G, Foll M. Correcting for ascertainment bias in the inference of population structure. Bioinformatics. 2009;25:552–4.
Albrechtsen A, Nielsen FC, Nielsen R. Ascertainment biases in snp chips affect measures of population divergence. Mol Biol Evol. 2010;27:2534–47.
Lachance J, Tishkoff SA. SNP ascertainment bias in population genetic analyses: why it is important, and how to correct it. BioEssays. 2013;35:780–6.
Yu Y, Harris AJ, Blair C, He X. RASP (Reconstruct Ancestral State in Phylogenies): a tool for historical biogeography. Mol Phylogenet Evol. 2015;87:46–9.
Almeida AP, Goncalves YM, Novo MT, Sousa CA, Melim M, Gracio AJ. Vector monitoring of Aedes aegypti in the Autonomous Region of Madeira. Portugal. Euro Surveill. 2007;12:E071115.071116.
Seixas G, Salgueiro P, Silva AC, Campos M, Spenassatto C, Reyes-Lugo M, et al. Aedes aegypti on Madeira Island (Portugal): genetic variation of a recently introduced dengue vector. Mem Inst Oswaldo Cruz. 2013;108(Suppl. 1):3–10.
Brown JE, Scholte E-J, Dik M, Den Hartog W, Beeuwkes J, Powell JR. Aedes aegypti mosquitoes imported into the Netherlands, 2010. Emerg Infect Dis. 2011;17:2335–7.
Gloria-Soria A, Lima A, Lovin D, Cunningham J, Severson D, Powell J. Origin of a high latitude population of Aedes aegypti in Washington, DC. Am J Trop Med Hyg. 2018;98:445–52.
Cloudsley-Thompson JL. Insects and history. New York: Littlehampton Book Services; 1976.
We thank Mustafa Akiner, Berna Demirci and Giorgi Babuadze for fieldworks in Turkey and Georgia.
Financial support was provided by an NIH NIAID grant RO1 AI101112 to JRP. PK was supported by Bodossaki Foundation (Greece). Mosquito specimens from Turkey and Georgia were collected thanks to work carried out under the VectorNet framework contract OC/EFSA/AHAW/ 2013/02-FWC1 funded by the European Food Safety Authority (EFSA) and the European Centre for Disease prevention and Control (ECDC). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Availability of data and materials
Ethics approval and consent to participate
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Population information for the Ae. aegypti samples used in this study. Populations represented in both microsatellite and SNP datasets are indicated by bold characters. Table S2. Priors and posteriors for the ABC analysis testing scenarios on the origin of Black Sea populations. Table S3. Climatic data for Black Sea sampling localities (indicated by bold characters) and indicative worldwide sampling localities where Ae. aegypti is established as downloaded from www.worldclim.com. Table S4. Genetic diversity. Summary of the population genetic diversity statistics for the 56 populations of Ae. aegypti consisting the microsatellite dataset. Figure S1. STRUCTURE bar plots based on the microsatellite dataset, including equal number of populations from each region. Population names are reported on their X axes. For each STRUCTURE run only the number of genetic clusters supported by the Evanno method is presented. For details see legend of Fig. 4. Figure S2. STRUCTURE bar plots based on the microsatellite dataset, including equal number of Ae. ae. aegypti populations from each region. For each STRUCTURE run only the number of genetic clusters supported by the Evanno method is presented. For details see legend of Fig. 4. Figure S3. Principal Components Analysis (PCA) on the global (a) and the Ae. ae. aegypti (b) microsatellite dataset as implemented and plotted using the ade4 package in R. Figure S4. Discriminant Analysis of Principal Components (DAPC) for the Ae. ae. aegypti populations based on the microsatellite dataset. Figure S5. Discriminant Analysis of Principal Components (DAPC) for the Ae. aegypti populations collected from Turkey and Georgia based on the microsatellite dataset. Figure S6. Assignment test as implemented in Geneclass2 for the Black Sea populations and using the remaining worldwide populations as reference panel. (PDF 8863 kb)