Skip to main content

Barcoding of Italian mosquitoes (BITMO): generation and validation of DNA barcoding reference libraries for native and alien species of Culicidae

Abstract

Background

Mosquitoes (Culicidae), as disease vectors, represent a risk for human health worldwide. Repeated introductions of alien mosquito species and the spread of invasive species have been recorded in different countries. Traditionally, identification of mosquitoes relies on morphological observation. However, morphology-based identification is associated with a number of potential disadvantages, such as the high level of specialisation of the operator and its limited applicability to damaged samples. In these cases, species identification is achieved through molecular methods based on DNA amplification. Molecular-based taxonomy has also enabled the development of techniques for the study of environmental DNA (eDNA). Previous studies indicated the 16S mitochondrial ribosomal RNA (rRNA) gene as a promising target for this application; however, 16S rRNA sequences are available for only a limited number of mosquito species. In addition, although primers for the 16S rRNA gene were designed years ago, they are based on limited numbers of mosquito sequences. Thus, the aims of this study were to: (i) design pan-mosquito 16S rRNA gene primers; (ii) using these primers, generate a 16S rRNA gene mosquito reference library (with a focus on mosquitoes present in Italy); and (iii) compare the discriminatory power of the 16S rRNA gene with two widely used molecular markers, cytochrome c oxidase subunit 1 mitochondrial gene (COI) and internal transcribed spacer 2 (ITS2).

Methods

A total of six mosquito genera (28 mosquito species) were included in this study: Aedes (n = 16 species), Anopheles (5 species), Coquillettidia (1 species), Culex (3 species), Culiseta (2 species) and Uranotaenia (1 species). DNA was extracted from the whole mosquito body, and more than one specimen for each species was included in the analysis. Sanger sequencing was used to generate DNA sequences that were then analysed through the Barcode of Life Data Systems (BOLD). Phylogenetic analyses were also performed.

Results

Novel 16S rDNA gene, COI and ITS2 sequences were generated. The 16S rRNA gene was shown to possess sufficient informativeness for the identification of mosquito species, with a discriminatory power equivalent to that of COI.

Conclusions

This study contributes to the generation of DNA barcode libraries, focussed on Italian mosquitoes, with a significant increase in the number of 16S rRNA gene sequences. We hope that these novel sequences will provide a resource for studies on the biodiversity, monitoring and metabarcoding of mosquitoes, including eDNA-based approaches.

Graphical abstract

Background

Mosquitoes (Diptera: Culicidae) are recognised as the most impactful arthropod disease vectors worldwide. They inhabit diverse habitats, ranging from the tropics to the Arctic Circle. The taxonomic diversity of Culicidae is extensive: they are divided into two subfamilies (Anophelinae and Culicinae), with a total of 112 genera and 3718 species officially recognised as of June 2023 [1, 2]. Of these, 88 species are known as vectors of human pathogens, with an additional 243 species suspected to be potential vectors [3]

The Italian mosquito fauna is composed of 65 mosquito species belonging to eight genera from both subfamilies, including the invasive and now established tiger mosquito Aedes albopictus; of these, 65 species, 34 are competent to transmit human and animal pathogens [4]. This list of Italian mosquitoes also includes species that have not been detected in Italy in the last 50 years, among which is Aedes aegypti, whose last recording dates back to 1971. In recent years, the number of Italian mosquitoes has increased due to the introduction of alien species characterised by invasive capability, mostly originating from Asia. In addition to the already well-established Ae. albopictus, Aedes koreicus and Aedes japonicus japonicus are currently spreading quickly across the Italian Peninsula [5, 6].

Invasive mosquito species (IMS) represent a threat to public health because they can act as vectors for a variety of disease agents that are normally restricted to other geographical areas, in particular the tropical regions [7, 8]. For example, autochthonous cases of chikungunya and dengue fevers have recently been described in southern, central and northern areas of Italy in 2007, 2017, 2020 and 2023 [9,10,11,12,13].

Accurate entomological monitoring is pivotal for the early detection and control of IMS [14]. Traditional monitoring methods are based on the morphological identification of mosquito species at different life stages, using dichotomous keys [7, 15, 16]. These methods are associated with a number of disadvantages: they are time-consuming, require a high level of taxonomic expertise and present issues in the case of damaged samples or life stages whose morphology is poorly differentiated [1, 15]. Moreover, certain mosquito species that have distinct vectorial capacities can appear morphologically indistinguishable [16], making the development of alternative monitoring strategies urgent. Approaches based on molecular markers could offer a helpful alternative for morphological identifications of adult and immature stages. These strategies require less time from the operator, especially in the case of large sets of specimens that may be processed in parallel, do not need a high level of expertise and allow identification at all the life stages or of damaged samples. Furthermore, they discriminate between cryptic species or species belonging to complexes [15, 16]. In recent years, high-throughput sequencing has been applied in DNA barcoding, allowing the concurrent identification of multiple species from single environmental samples. This approach is known as metabarcoding [17]. Until now the biggest sequencing effort for mosquitoes has been focussed on the mitochondrial cytochrome c oxidase I (COI) gene, but other markers have also been proposed for mosquito barcoding [1]: the mitochondrial gene coding for 16S ribosomal RNA (rRNA; hereafter 16S rDNA) and the nuclear marker internal transcribed spacer 2 (ITS2).

ITS2 is a non-coding region that forms a spacer region between the 5.8S and 28S rRNA genes. It is a highly variable region and thus a useful marker to reveal even minor genetic discontinuities. However, several studies [16, 18, 19], including one on Italian cryptic species [20], suggested that this marker might not be optimal to identify mosquito species. Indeed, the insect nuclear genome usually contains a high number of rRNA gene copies, repeated in tandem, with ITS2 presenting intra-individual variation between the different copies of this spacer. Furthermore, it frequently presents high levels of inter-individual polymorphisms. Thus, it may be infeasible/impractical to employ this marker for the identification of single individuals [1, 21].

The COI gene is a widely used marker for animal identification and is considered a sort of gold standard in DNA barcoding for several reasons: (i) it is conserved in almost all eukaryotic organisms; (ii) it is present in a high copy number (being associated with mitochondria); and (iii) it has a higher substitution rate compared to the nuclear genes, allowing a higher level of intra- and inter-specific discrimination [22]. COI is maternally inherited in mosquitoes, with almost no recombination [1]. Barcoding studies have been conducted in mosquitoes by several authors and these have generally been based on COI; these investigations have been performed in different countries, including India, Belgium, Pakistan, Portugal, Sri Lanka, French Guiana, China, England, Canada, Mexico, Croatia, Estonia and Thailand [16, 23,24,25,26,27,28,29,30,31,32,33,34,35]. Notably, a comprehensive barcoding study on mosquito COI is still lacking in Italy. In most of the quoted studies, results based on morphological observation and COI barcoding were highly congruent.

In 2024, Maurício Moraes Zenker and colleagues assessed the availability of mosquito reference sequences for COI and ITS2 and found low coverage for both markers (28.4–30.11% for COI in the Barcode of Life Data Systems [BOLD] + GenBank, and 12.32% for ITS2 in GenBank), with countries hosting the higher biodiversity having the lower reference sequence coverage [36].

Despite COI being regarded as a standard in barcoding applications [37, 38], doubts have been expressed on whether it is the best marker in all possible contexts [39]. The main issue is that possible primer binding sites in this gene are not well conserved, especially due to synonymous substitutions, possibly leading to biases in amplification and, thus, in under- or over-representation of taxa in bulk samples. Moreover, variations at less constrained sites become saturated between distantly related taxa as a result of homoplasy, [40, 41]. DNA barcoding and metabarcoding have important applications in environmental DNA (eDNA) studies. eDNA-based investigations rely on detectable DNA traces that organisms release in the environment and on their persistence, which allows for the detection of rare or elusive species [42]. COI has widely been used in eDNA studies, even though this marker is affected by the limits reported above, and by the length of PCR amplicons [1]. To try to overcome this problem, short but still informative molecular markers have been investigated in the last 20 years [42, 43]. The design of primers targeting markers with these characteristics can be challenging, to the point that some authors referred to this goal as the “search for the Holy Grail” [44, 45].

Another mitochondrial marker, 16S rDNA, has been shown to be effective in insect barcoding and metabarcoding [44, 46, 47]. 16S rDNA evolves slower than COI and is characterised by conserved sequence stretches flanked by highly variable regions due to its stem-loop structures [48]. As a result, barcoding studies targeting this region often provide broader taxonomic coverage than markers based on COI [44, 46]. However, when all available insect reference databases are considered, there are threefold more reference sequences for the COI marker than for 16S rDNA [46]. Thus, 16S rDNA can be regarded as a good candidate to be developed as a barcode for mosquito identification and eDNA studies. In all cases, the identification and testing of primers appropriate for DNA metabarcoding requires the availability of exhaustive reference databases, over which the taxonomic coverage and resolution of potential primers can be tested [44, 49,50,51,52]. It is thus necessary to build a more complete reference library for mosquitoes for this mitochondrial gene. The primary aim of this study was thus to generate a reference barcode 16S rDNA library for Italian mosquitoes. In addition, to determine the actual efficacy of this marker in mosquito barcoding, we also generated libraries for COI and ITS2, starting from the same individuals, and compared the discriminatory power of the three markers.

Methods

Mosquito collection and morphological identification

A total of 28 mosquito species were included in this study. Mosquitoes were collected as larvae and adults by various collaborators during a monitoring activity conducted between 2021 and 2023 in Italy. The collected mosquitoes were raised under standard laboratory conditions [53], and at the pupal stage they were transferred into mosquito cages to facilitate adult-stage emergence. Field-collected adults were captured using CDC-CO2 traps or BG Sentinel traps (Biogents AG, Regensburg, Germany).

All mosquito adults were morphologically identified using previously described morphological features under a stereomicroscope (model IC90 E; Leica, Wetzlar, Germany) [54,55,56,57]. When possible, more than one specimen for each species was included in the analysis. The complete list of mosquito species and the numbers of individuals collected for this study are summarised in Table 1. After morphological identification, all of the specimens were individually placed in a 1.5-ml tube with 70% ethanol and stored at − 20 °C until DNA extraction and amplification.

Table 1 Specific details on the mosquito samples used in this study

Mitochondrial 16S rDNA primers design

Novel primers targeting the mosquito mitochondrial 16S rDNA were designed using available 16S rDNA sequences or complete mitochondrial genomes of mosquitoes present in GenBank as of October 2022 (n = 967) (Additional file 3: Table S3). The sequences were aligned together using MUSCLE, a multiple sequence alignment method, and the alignment was visualised using the ARB software package to identify the most conserved regions for manual primer design. As a result, five possible primers were designed: two forward and three reverse (Additional file 4: Figure S1). All six of these combinations were tested in silico with the online Multiple Primer Analyzer tool by Thermo Fisher Scientific (https://www.thermofisher.com/it/en/home/brands/thermo-scientific/molecular-biology/molecular-biology-learning-center/molecular-biology-resource-library/thermo-scientific-web-tools/multiple-primer-analyzer.html) and in vitro PCR on known mosquito samples (data not shown).

The selected pair of primers (Culic_m16S_F91 and Culic_m16S_R555) was preferred due to the longer product and thus higher resolution, while an alternative pair (Culic_m16S_F91 and Culic_m16S_R503) of primers was applied in few cases when amplification with the first combination was unsuccessful.

DNA extraction and PCR amplification

DNA was extracted from the whole mosquito body using the DNeasy Blood and Tissue Kit (Qiagen, Valencia, CA, USA), following the manufacturer’s protocol with only minor adjustments. After DNA extraction, three different markers were amplified for each sample: COI, ITS2 and 16S rDNA (see Table 2 for specific information). For the amplification of COI, the conventional primers LCO1490/HCO2198 were used [59], and the PCR cycling parameters were: a first denaturation step of 94 °C for 5 min, followed by 40 cycles of 30 s at 94 °C, 45 s at 49 °C 45 s and 1 min at 72 °C, with a final extension step of 72 °C 10 min.

Table 2 Existing and newly designed primers used to amplify three target gene regions with the amplification size

For 16S rDNA, the following PCR cycling parameters were applied: An initial denaturation for 5 min at 95 °C, followed by 35 cycles of 30 s at 95 °C, 30 s at 54 °C and 1 min at 72°, with a final extension step of 5 min at 72 °C.

The ITS2 nuclear marker was amplified using the 5.8S and 28S primers using the protocol suggested by Collins and Paskewitz [60] with some minimal adaptations.

After checking the quality of the amplifications by electrophoresis in agarose gels, PCR products were sent to Eurofins Genomics to be sequenced through Sanger-based technology (Eurofins Genomics, Ebersberg, Germany).

Phylogenetic analyses

The obtained sequences were checked for quality and edited using the DNA sequencing software Chromas (Technelysium Pty Ltd, Brisbane, Australia). For phylogenetic analyses we analysed the dataset for each marker (COI, 16S rDNA and ITS2) separately. In order to confirm our species identification, we added at least one published sequence per species to the COI dataset. For this purpose, whenever present, Italian sequences deposited in the BOLD were preferred; in the absence of Italian sequences, the preferred order of sequences was sequences from BOLD from other European countries, sequences from BOLD from any location and finally sequences in NCBI GenBank with the same order of geographical preference. BOLD was preferred over GenBank due to the higher taxonomic curation of the former. Such an approach allowed us to obtain a reliable and comprehensive comparative framework as reference for all of the following analyses. Specifically, the COI phylogeny allowed us to “anchor” all analyses on the same specimens with the other markers, in particular the 16S rDNA, for which reference sequences are limited or completely lacking for most analysed species in both GeneBank and BOLD databases.

For each dataset, all sequences were aligned using the multiple sequence alignment method MUSCLE [61] and trimmed using Gblocks [62] integrated in the Seaview 5.0 software application [63] with default settings. The best substitution model for each dataset was determined using jModelTest 2.1.7 [64]. Then, maximum likelihood phylogenetic trees were inferred with phyML [65] using 100 bootstrap replicates. Considering that multiple studies (e.g. [66]) consistently reported monophyly of both Culicinae and Anophelinae, and thus the separation of these two subfamilies, we rooted our trees on the branch separating the representatives of the two families (i.e. between genus Anopheles and the others).

Molecular species delimitation

All of the newly obtained sequences were uploaded in the BOLD database. The COI sequences were automatically assigned Barcode Index Numbers (BINs) using the REfin Single Linkage clustering (RESL) approach based on operational taxonomic units (OTUs) [35, 67]. A ‘BIN discordance analysis’ was carried out to verify the concordance between such BIN assignments and morphological taxonomic designation.

Molecular species delimitation analyses were performed on each of the newly obtained COI and 16S rDNA sequence datasets using the tools available in the BOLD workbench. The ITS2 was excluded from such analyses due to its insufficient informativeness (see Results).

The tools employed included ‘Distance Summary’ and 'Barcode Gap Analysis’. The former was applied to investigate the sequence divergence among barcode sequences at the conspecific, congeneric and confamilial levels. The latter, i.e. Barcode Gap Analysis, was meant to assess the distance to the nearest neighbour for each of the species considered. For these analyses, all sequences shorter than 100 bp were discarded, as well as, in the respective analysis, singletons, i.e. species represented by a single sequence or genera represented by a single species.

The Pairwise Kimura distances were calculated after aligning with the MUSCLE aligner and applying pairwise deletion of ambiguous bases/gaps. Then, following Madeira et al., a ‘distance summary’ was obtained for the distances within species, genus and the whole Culicidae family [16]. Moreover, a ‘Barcode Gap Analysis’ was performed by comparing the distance of each sequence to its furthest conspecific (the maximum intra-specific genetic distance) and to its nearest non-conspecific (the minimum interspecific genetic distance) [68, 69].

Data availability

All newly obtained sequences were deposited into BOLD [68] into a novel dedicated project named BITMO (Barcoding of ITalian MOsquitoes), as well as into the NCBI GenBank (accession numbers: COI, PP941650—PP941722; 16s rDNA, PP835598—PP835668; ITS2, PP835545—PP835597). Accession numbers to each individual sequence, for all the examined individual mosquitoes, are reported at the terminal nodes of the trees presented in Figs. 1 and 3, and in Additional file 5: Figure S2. No datasets were generated or analysed during the current study.

Results

After the mosquitoes had been identified and the DNA extraction from the samples, sequences were obtained for a total of 74 individuals representing 28 mosquito species present in Italy, belonging to six genera: Aedes (16 species), Anopheles (5), Coquillettidia (1), Culex (3), Culiseta (2) and Uranotaenia (1). Most of the species were represented by at least two individuals; for three species (Aedes sticticus, Aedes cantans and Anopheles labranchiae), only single individuals were examined. The results obtained for each marker are presented separately in the following sections.

Cytochrome c oxidase 1 gene

A successful amplification of the COI gene from all 74 mosquito specimens was obtained for the 28 included species. The length of the obtained sequences ranged from 464 to 686 bp, with the overwhelming majority of sequences (68/74) being longer than 600 bp. The large-scale COI phylogeny of Culicidae was consistent with expectations, with a highly supported separation between the subfamilies Culicinae and Anophelinae (Fig. 1). In contrast, deep evolutionary relationships among and within genera of Culicinae were poorly supported and although sequences from congeneric species largely clustered together, not all genera are monophyletic for this gene. In particular, species from the genera Culiseta, Aedes and Culex had been placed into polyphyletic or paraphyletic clusters. However, phylogenetically misplaced species generally had poor bootstrap support in the reconstructed COI-based positioning (< 50). The limits of COI for the reconstruction of the overall mosquito phylogeny have already been reported [30].

Fig. 1
figure 1

Maximum likelihood phylogenetic tree with the GTR+I+G model of the COI sequences of 28 Italian mosquito species, including newly sequenced ones (shown in bold font) and the sequences downloaded from BOLD and GenBank (regular font). Sequence names are accompanied by the respective BOLD identifier and NCBI accession number (in brackets). Bootstrap values (with 100 replicates) > 50 are reported on the branches. The different mosquito genera are indicated by different colours: Anopheles, (blue), Aedes (yellow), Culiseta (pink), Coquilletidia (green), Culex (orange), Uranotaenia (purple). BOLD Barcode of Life Data Systems; COI, cytochrome c oxidase 1; NCBI, National Center for Biotechnology Information

All species, including the sequences available in the databases and the newly obtained ones (i.e. 74 + 65 available sequences), formed monophyletic clusters, mostly with high support values (21/28 with > 80 bootstrap support). Therefore, the COI tree confirmed morphological assignment, at least for the newly generated sequences. As indicated by branch lengths, some species show some intra-specific variability, which may be attributed to the different geographic origin of the different samples, in particular when sequences retrieved from the databases and the newly obtained ones are considered, such as in the cases of Aedes vexans and Ae. aegypti.

From the 28 species, the automatic BOLD assignment identified 28 BINs among the novel 74 sequences. This result was largely consistent with the morphological identification, with two exceptions. In one case, BIN discordance consisted in the clustering of the two sister species Aedes zammitii and Aedes caspius as a single BIN (AAB7911). In the other case, individuals of Aedes geniculatus were assigned to two different BINs (ADZ3180; AEG2154).

For the ‘Distance summary’ and “Barcode Gap Analysis,” we used 72 sequences out of the 74 novel sequences. Two specimens for which the 16S rDNA sequences had not been obtained (see below) were discarded in order to have fully comparable datasets.

As expected, mean genetic distance sharply increased from lower to higher taxonomic levels, being 0.30% within species, 11.30%, between species within genera and 14.56% between species belonging to different genera within the same families (Table 3). However, distance ranges within genera and families were quite broad, resulting in partial overlaps of the within-genus range with the within-species and within-family ranges (Table 3). Nevertheless, the distribution of distances within different groups was overall quite distinct, with the overlaps being due to a small number of “outlier” distances (Fig. 2a).

Table 3 Summary of genetic distance for the cytochrome c oxidase 1 gene in mosquitoes at different taxonomic levels
Fig. 2
figure 2

a Distance summary analysis for COI, showing the within-species normalised distribution for the divergence within species (red) and within genus (blue). b Scatterplot showing the maximum intra-specific distances vs the inter-specific distance for each species in the COI. The red diagonal line represents the equal values for the two distances, as a threshold for adequate species delimitation (barcoding gap). COI, Cytochrome c oxidase 1

The “Barcode Gap Analysis” indicated a perfect separation at the species level, since for all species the distance to the closest non-conspecific (i.e. nearest neighbour [NN]) was higher than the respective maximum intra-specific distance (Fig. 2b). Specifically, 26 of the 28 species showed a low intra-specific diversity (< 2%), with the two exceptions being Aedes pulcritarsis (2.16%) and Ae. geniculatus (2.91%) (Additional file 1: Table S1), consistent with the relatively longer branch lengths (Fig. 1). In addition, the three singletons (8 species) showed no variability (intra-specific divergence = 0%). Furthermore, all interspecific distances to the respective NNs were always > 2%, except between the closely related Ae. caspius and Ae. zammitii (1.78%).

16S rDNA mitochondrial gene

A total of 72 sequences, derived from the included 28 mosquito species, were obtained for the 16S rDNA mitochondrial marker, with at least one specimen for each investigated species. The length of the sequences obtained ranged from 247 to 438 bp, with the vast majority (66/72) being longer than 350 bp. Due to the almost complete lack of reference sequences in Genbank and BOLD, only the newly obtained sequences were considered for the phylogenetic analysis. The reliability of species identification was guaranteed by our accurate morphological observation and the use of COI as an additional marker for species identification.

The 16S rDNA-based phylogeny was overall consistent with the one based on COI (Fig. 3). Specifically, there was a sharp and fully supported separation between Anophelinae and Culicinae. Interestingly, the consistency of molecular phylogeny with the taxonomic assignment at the species level was even higher than that observed in the COI tree. Indeed, the only non-monophyletic genus was Aedes, and also in this case the nodes that would indicate its paraphyly were poorly supported (< 50). Conversely, phylogenetic relationships at the species level were slightly less resolved as compared with the COI phylogeny. Indeed, 26 of the 28 species were monophyletic (only 8 with > 80 support), with the exceptions being Ae. zammitii and Culex mimeticus, although in both of these cases the non-monophyly was poorly supported (< 50).

Fig. 3
figure 3

Maximum likelihood phylogenetic tree with the GTR+I+G model of the newly obtained 16S rDNA sequences from the 28 Italian mosquito species. Sequence names are accompanied by the respective BOLD identifier and NCBI accession number (in brackets). Bootstrap values (with 100 replicates) > 50 are reported on the branches. The different mosquito genera are indicated by different colours: Anopheles, (blue), Aedes (yellow), Culiseta (pink), Coquilletidia (green), Culex (orange), Uranotaenia (purple). BOLD, Barcode of Life Data Systems; COI, cytochrome c oxidase 1; NCBI, National Center for Biotechnology Information

As with the COI analysis, the mean genetic distances increased from lower to higher taxonomic levels, but at lower values than those observed for the COI analysis, being 0.03% within species, 2.66% within genera, and 8.33% within families (Table 4). In contrast to the COI analyses, there was no overlap between the distance ranges within species (maximum 0.35%) and within genera (minimum 0.51%), while there was still an overlap, though smaller, of the distance ranges between genera (maximum 6.50%) and families (minimum 3.56%).

Table 4 Summary of genetic distance for 16S ribosomal DNA gene in mosquitoes at different taxonomic levels

Consistent with the complete separation of the distances within species and genera, the results obtained from the “Barcode Gap Analysis” confirmed the presence of a barcoding gap (Fig. 4a), thereby supporting the capability of the 16S rDNA marker to effectively distinguish between species. The intra-specific diversity was much lower than that observed for COI, being at most 0.35% for Aedes cinereus. The minimum inter-specific distances to the respective NNs were smaller as well, ranging from 0.51% (6 cases) to 6.32% of Coquilletidia richiardii with respect to Culex mimeticus. Nevertheless, none of such distances to NNs outweighed the corresponding maximum intra-specific distance (Additional file 2: Table S2).

Fig. 4
figure 4

a Distance summary analysis for the 16S rDNA, showing the within-species normalised distribution for the divergence within species (red) and within genus (blue). b The scatterplot showing the maximum intra-specific distances vs the inter-specific distance for each species in the 16S rDNA. The red diagonal line represents the equal values for the two distances, as a threshold for adequate species delimitation (barcoding gap). rDNA, Ribosomal DNA

Internal transcribed spacer 2 marker

The amplification and sequencing of the ITS2 was successful for 65 individuals out of the 74 examined. In particular, ITS2 could not be amplified and sequenced for the following specimens: Ae. koreicus (BITMO001_02), Ae. japonicus (BITMO002_02; BITMO002_03), Ae. geniculatus (BITMO010_01), Ae. zammitii (BITMO019_02), Uranotaenia unguiculata (BITMO021; BITMO021_01), Aedes pulcritarsis (BITMO022_01) and Culiseta annulata (BITMO027_01). We obtained sequences of at least one specimen for the 28 investigated species, the missing one being Ur. unguiculata. The obtained sequences ranged from 104 to 401 bp, with approximately half of the sequences (n = 28) being longer than 300 bp.

For consistency with the analyses on COI and 16S rDNA, the two specimens for which no 16S rDNA was obtained were excluded from the phylogeny analysis on ITS2. The obtained phylogenetic tree was partially consistent with those previously inferred on the two mitochondrial markers (Additional file 5: Figure S2). However, the results were unsatisfactory in several respects. In particular, the topology of the ITS2-based tree was poorly supported for several nodes and was partially inconsistent with that of the trees based on the other markers. For example, the separation between Culicinae and Anophelinae had relatively low support (< 80), and the Culex and Aedes genera were non-monophyletic, with the former split into seven different branches. Moreover, five of the 27 included species were non-monophyletic, including Culex mimeticus BITMO026_01, which branched within a clade of sequences affiliated to the Aedes genus as sister group of Aedes rusticus, with high support (> 80).

This apparent lower quality of the phylogeny is likely due to insufficient phylogenetic signal, as a consequence of the short length of the sequences obtained and of the intrinsic properties of this marker. This result is consistent with previous studies, which already evidenced that, due to multiple divergent ITS2 copies present in the same individual, it is frequently impractical to obtain long and high-quality sequences with PCR followed by direct Sanger sequencing only [1].

Based on the short length and deemed insufficient informativeness of the obtained sequences, no further analyses on sequence divergence for species discrimination were performed on this marker.

Discussion

In this study, we aimed to generate and validate reference libraries for DNA barcoding for native and alien species of Culicidae. We designed pan-mosquito 16S rRNA gene primers and we used these set of primers to generate a 16S rRNA gene mosquito reference library focussed on mosquito species present in southern Europe; finally, we successfully compared the discriminatory power of the 16S rRNA gene with two commonly used molecular markers, COI and ITS2. In summary, novel sequences for these three markers were produced, with the 16S rRNA gene demonstrating sufficient informativeness to identify mosquito species, with a discriminatory power comparable to that of the COI gene.

Cytochrome c oxidase 1 and 16S rDNA mitochondrial genes

To verify the suitability of the selected COI primers to effectively distinguish mosquito species (see Results), we investigated the presence of the Barcoding Gap, defined as the difference between the highest intra-specific and the lowest inter-specific distance in congeneric species [16, 70, 71]. Analyses were run directly on the BOLD database; these included only our new sequenced samples, listed in a dedicated project named BITMO (Barcoding of Italian Mosquitoes). Previous investigations on DNA barcoding on mosquitoes were mainly based on this marker (i.e. COI). In the present study, in general, molecular and morphological characterisations were shown to be highly consistent in these studies [72]. We found high consistency between morphology and COI in relation with species identification. However, COI-based DNA barcoding yielded results that have not always been regarded as reliable in Diptera. It is still unclear whether the observed inconsistencies were caused by the lack of phylogenetic signal in the COI gene, particularly at deep taxonomic level, by the methods used for phylogenetic reconstruction or the non-monophyly of investigated taxa due to mitochondrial DNA introgression or incomplete lineage sorting [73]. A further potential issue of COI is the length of the metabarcodes, as they often are > 600 bp, which hampers inexpensive sequencing technologies (e.g. Illumina 2×300; Illumina Inc., San Diego, CA, USA).

Once the efficacy of COI had been assessed in terms of species identification and barcoding gap, the analyses were run on the 16S rDNA sequence obtained from the same samples, with the results confirming that also this marker is suitable for species identification in mosquitoes. For this purpose, more 16S rDNA sequences from other mosquito species, both from Italy and from other geographic locations, would be useful.

Our current study represents the first comprehensive investigation of 16S rDNA sequence diversity in the mosquito fauna of a specific region. Previous to the present study, only a few studies had investigated 16S rDNA in mosquito populations, and these usually included a limited number of species [74,75,76,77,78].

Comparing results obtained on COI and 16S rDNA, we would summarise and comment our results as follows: (i) the barcoding gap in 16S rDNA is less pronounced than in COI, possible due to the lower rate of evolution of the 16S rDNA gene; (ii) the 16S rDNA marker proved suitable for mosquito identification in our setting, despite this less-pronounced gap; (iii) the expected amplified DNA size from the designed 16S rDNA-based marker is shorter than that of the marker based on COI gene (464 bp for 16S; 648 bp for COI). This is extremely valuable for eDNA studies, where shorter DNA fragments are expected to be present in the samples due to DNA degradation [38, 70]. We emphasise that previous studies, which focused on different taxa, showed that 16S rDNA is also a good candidate for eDNA studies [44, 47, 79,80,81].

Internal transcribed spacer 2 marker

Our analysis of the ITS2 marker did not lead to the expected results in terms of its suitability for species differentiation in mosquitoes. Our results are in agreement with those of previous studies that reported the limits of this marker, particularly in relation with standard Sanger-based sequencing. Batovska and colleagues [82] investigated the ITS2 marker in mosquitos, analysing 88 sequences obtained with Sanger sequencing; only 18 of the 26 species considered were resolved monophyletically. In comparison, 24 species were detected with Illumina sequencing. These authors concluded that ITS2 Sanger sequencing is not completely satisfactory for mosquitoes species identification when dealing with numerous species [82]. Due to the pronounced intra-specific variability, the ITS marker could be very useful in delimitating species in mosquito complexes, but, for this characteristic, alignment of distantly related species is often difficult.

Conclusions

In conclusion, this study contributes to the building of national reference libraries for mosquito species present in Italy, targeting three markers: COI, ITS2 and 16S rDNA. COI was confirmed to be an effective barcode for mosquitoes and suitable for species identification. In our experimental settings, ITS2 was associated with a few limitations, likely due to its nature as a multicopy DNA sequence, which entails intra-individual variability. 16S rDNA has proved to be suitable tool for mosquito identification, with results comparable to those based on COI. The fact that the use of shorter amplicons is possible when using this marker as compared with COI make 16S rDNA worthy of further investigation for its application in eDNA-based monitoring. Therefore, in consideration of our results, we would propose that this marker is assessed for its efficacy also in studies on the presence of mosquito DNA in the environment.

Availability of data and materials

The datasets generated in this study were deposited in the NCBI with the accession numbers: COI, PP941650—PP941722; 16 s rDNA, PP835598—PP835668; ITS2, PP835545—PP835597) and in BOLD with the sequence page numbers from BITMO047-23 to BITMO122-23.

References

  1. Beebe NW. DNA barcoding mosquitoes: advice for potential prospectors. Parasitology. 2018;145:622–33.

    Article  PubMed  Google Scholar 

  2. Harbach RE. Mosquito taxonomic inventory. 2023. http://mosquitotaxonomic-inventory.info/. Accessed 20 Jan 2024.

  3. Yee DA, Bermond CD, Reyes-Torres LJ, Fijman NS, Scavo NA, Nelsen J, et al. Robust network stability of mosquitoes and human pathogens of medical importance. Parasit Vectors. 2022;15:1–9.

  4. Severini F, Toma L, Di Luca M. Zanzare in Italia raccolta, identificazione e conservazione delle specie più comuni. Istituto Superiore Sanità: Rome; 2022.

    Google Scholar 

  5. Negri A, Arnoldi I, Brilli M, Bandi C, Gabrieli P, Epis S. Evidence for the spread of the alien species Aedes koreicus in the Lombardy region Italy. Parasit Vectors. 2021;14:1–6.

    Article  Google Scholar 

  6. Montarsi F, Martini S, Michelutti A, Da Rold G, Mazzucato M, Qualizza D, et al. The invasive mosquito Aedes japonicus japonicus is spreading in northeastern Italy. Parasit Vectors. 2019;12:1–8.

    Article  Google Scholar 

  7. Schneider J, Valentini A, Dejean T, Montarsi F, Taberlet P, Glaizot O, et al. Detection of invasive mosquito vectors using environmental DNA (eDNA) from water samples. PLoS ONE. 2016;11:e0162493.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Lühken R, Brattig N, Becker N. Introduction of invasive mosquito species into Europe and prospects for arbovirus transmission and vector control in an era of globalization. Infect Dis Poverty. 2023;12:1–15.

    Article  Google Scholar 

  9. Angelini P, Macini P, Finarelli AC, Pol C, Venturelli C, Bellini R, et al. Chikungunya epidemic outbreak in Emilia-Romagna (Italy) during summer 2007. Parassitologia. 2008;50:97–8.

    CAS  PubMed  Google Scholar 

  10. Lindh E, Argentini C, Remoli ME, Fortuna C, Faggioni G, Benedetti E, et al. The Italian 2017 outbreak chikungunya virus belongs to an emerging Aedes albopictus-adapted virus cluster introduced from the Indian subcontinent. Open Forum Infect Dis. 2019;6:ofy321.

    Article  PubMed  Google Scholar 

  11. Lazzarini L, Barzon L, Foglia F, Manfrin V, Pacenti M, Pavan G, et al. First autochthonous dengue outbreak in Italy. Euro Surveill. 2020;25:36.

    Google Scholar 

  12. Cassaniti I, Ferrari G, Senatore S, Rossetti E, Defilippo F, Maffeo M, et al. Preliminary results on an autochthonous dengue outbreak in Lombardy Region Italy. Euro Surveill. 2023;28:37.

    Article  Google Scholar 

  13. De Carli G, Carletti F, Spaziante M, Gruber CEM, Rueca M, Spezia PG, et al. Outbreaks of autochthonous Dengue in Lazio region, Italy, August to September 2023 preliminary investigation. Euro Surveill. 2023;28:44.

    Article  Google Scholar 

  14. Marrama Rakotoarivony L, Schaffner F. ECDC guidelines for the surveillance of invasive mosquitoes in Europe. Euro Surveill. 2012;17:20265.

    CAS  PubMed  Google Scholar 

  15. Boerlijst SP, Trimbos KB, Van der Beek JG, Dijkstra KDB, Van der Hoorn BB, Schrama M. Field evaluation of dna based biodiversity monitoring of caribbean mosquitoes. Front Ecol Evol. 2019;7:240.

    Article  Google Scholar 

  16. Madeira S, Duarte A, Boinas F, Costa OH. A DNA barcode reference library of Portuguese mosquitoes. Zoonoses Public Health. 2021;68:926–36.

    Article  CAS  PubMed  Google Scholar 

  17. Taberlet P, Coissac E, Pompanon F, Brochmann C, Willerslev E. Towards next-generation biodiversity assessment using DNA metabarcoding. Mol Ecol. 2012;21:2045–50.

    Article  CAS  PubMed  Google Scholar 

  18. Ruiz-Arrondo I, McMahon BJ, Hernández-Triana LM, Santibañez P, Portillo A, Oteo JA. Surveillance of mosquitoes (Diptera, Culicidae) in a northern central region of Spain: implications for the medical community. Front Vet Sci. 2019;6:438636.

    Article  Google Scholar 

  19. Alquezar DE, Hemmerter S, Cooper RD, Beebe NW. Incomplete concerted evolution and reproductive isolation at the rDNA locus uncovers nine cryptic species within Anopheles longirostris from Papua New Guinea. BMC Evol Biol. 2010;10:1–12.

    Article  Google Scholar 

  20. Calzolari M, Desiato R, Albieri A, Bellavia V, Bertola M, Bonilauri P, et al. Mosquitoes of the Maculipennis complex in Northern Italy. Sci Rep. 2021;11:1–12.

    Article  Google Scholar 

  21. Beebe NW, Maung J, van den Hurk AF, Ellis JT, Cooper RD. Ribosomal DNA spacer genotypes of the Anopheles bancroftii group (Diptera: Culicidae) from Australia and Papua New Guinea. Insect Mol Biol. 2001;10:407–13.

    Article  CAS  PubMed  Google Scholar 

  22. Hebert PDN, Cywinska A, Ball SL, deWaard JR. Biological identifications through DNA barcodes. Proc Biol Sci. 2003;270:313–21.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Kumar NP, Rajavel AR, Natarajan R, Jambulingam P. DNA barcodes can distinguish species of Indian mosquitoes (Diptera: Culicidae). J Med Entomol. 2007;44:1–7.

    Article  CAS  PubMed  Google Scholar 

  24. Versteirt V, Nagy ZT, Roelants P, Denis L, Breman FC, Damiens D, et al. Identification of Belgian mosquito species (Diptera: Culicidae) by DNA barcoding. Mol Ecol Resour. 2015;15:449–57.

    Article  CAS  PubMed  Google Scholar 

  25. Ashfaq M, Hebert PDN, Mirza JH, Khan AM, Zafar Y, Mirza MS. Analyzing mosquito (Diptera: culicidae) diversity in Pakistan by DNA barcoding. PLoS ONE. 2014;9:e97268.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Weeraratne TC, Surendran SN, Parakrama Karunaratne SHP. DNA barcoding of morphologically characterized mosquitoes belonging to the subfamily Culicinae from Sri Lanka. Parasit Vectors. 2018;11:1–10.

    Article  Google Scholar 

  27. Talaga S, Leroy C, Guidez A, Dusfour I, Girod R, Dejean A, et al. DNA reference libraries of French Guianese mosquitoes for barcoding and metabarcoding. PLoS ONE. 2017;12:e0176993.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Wang G, Li C, Guo X, Xing D, Dong Y, Wang Z, et al. Identifying the main mosquito species in China based on DNA barcoding. PLoS ONE. 2012;7:e47051.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Hernández-Triana LM, Brugman VA, Nikolova NI, Ruiz-Arrondo I, Barrero E, Thorne L, et al. DNA barcoding of British mosquitoes (Diptera, Culicidae) to support species identification, discovery of cryptic genetic diversity and monitoring invasive species. Zookeys. 2019;832:57–76.

  30. Cywinska A, Hunter FF, Hebert PDN. Identifying Canadian mosquito species through DNA barcodes. Med Vet Entomol. 2006;20:413–24.

    Article  CAS  PubMed  Google Scholar 

  31. Adeniran AA, Hernández-Triana LM, Ortega-Morales AI, Garza-Hernández JA, de la Cruz-Ramos J, Chan-Chable RJ, et al. Identification of mosquitoes (Diptera: Culicidae) from Mexico State, Mexico using morphology and COI DNA barcoding. Acta Trop. 2021;213:105730.

    Article  CAS  PubMed  Google Scholar 

  32. Bušić N, Kučinić M, Merdić E, Bruvo-Mađarić B. Diversity of mosquito fauna (Diptera, Culicidae) in higher-altitude regions of Croatia. J Vector Ecol. 2021;46:65–75.

    Article  PubMed  Google Scholar 

  33. Bušić N, Klobučar A, Landeka N, Žitko T, Vignjević G, Turić N, et al. A DNA barcode reference library of Croatian mosquitoes (Diptera: Culicidae): implications for identification and delimitation of species, with notes on the distribution of potential vector species. Parasit Vectors. 2024;17:216.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Kirik H, Tummeleht L, Kurina O. Rediscovering the mosquito fauna (Diptera: Culicidae) of Estonia: an annotated checklist with distribution maps and DNA evidence. Zootaxa. 2022;5094:261–87.

    Article  PubMed  Google Scholar 

  35. Chaiphongpachara T, Changbunjong T, Laojun S, Nutepsu T, Suwandittakul N, Kuntawong K, et al. Mitochondrial DNA barcoding of mosquito species (Diptera: Culicidae) in Thailand. PLoS ONE. 2022;17:e0275090.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Moraes Zenker M, Portella TP, Pessoa FAC, Bengtsson-Palme J, Galetti PM. Low coverage of species constrains the use of DNA barcoding to assess mosquito biodiversity. Sci Rep. 2024;14:1–11.

    Article  Google Scholar 

  37. Andújar C, Arribas P, Yu DW, Vogler AP, Emerson BC. Why the COI barcode should be the community DNA metabarcode for the metazoa. Mol Ecol. 2018;27:3968–75.

    Article  PubMed  Google Scholar 

  38. Yu DW, Ji Yinqiu, Brent CE, Xiaoyang W, Chengxi Y, Chunyan Y, et al. Biodiversity soup: metabarcoding of arthropods for rapid biodiversity assessment and biomonitoring. Methods Ecol Evol. 2012;3:613–23.

    Article  Google Scholar 

  39. Collins RA, Bakker J, Wangensteen OS, Soto AZ, Corrigan L, Sims DW, et al. Non-specific amplification compromises environmental DNA metabarcoding with COI. Methods Ecol Evol. 2019;10:1985–2001.

    Article  Google Scholar 

  40. Deagle BE, Jarman SN, Coissac E, Pompanon F, Taberlet P. DNA metabarcoding and the cytochrome c oxidase subunit I marker: not a perfect match. Biol Lett. 2014;10:9.

    Article  Google Scholar 

  41. Ficetola GF, Coissac E, Zundel S, Riaz T, Shehzad W, Bessière J, et al. An In silico approach for the evaluation of DNA barcodes. BMC Genomics. 2010;11:1–10.

    Article  Google Scholar 

  42. Ficetola GF, Miaud C, Pompanon F, Taberlet P. Species detection using environmental DNA from water samples. Biol Lett. 2008;4:423–5.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Taberlet P, Bonin A, Zinger L, Coissac E. Environmental DNA for biodiversity research and monitoring. Oxford: Oxford University Press; 2018.

    Book  Google Scholar 

  44. Ficetola GF, Boyer F, Valentini A, Bonin A, Meyer A, Dejean T, et al. Comparison of markers for the monitoring of freshwater benthic biodiversity through DNA metabarcoding. Mol Ecol. 2021;30:3189–202.

    Article  CAS  PubMed  Google Scholar 

  45. Rubinoff D, Cameron S, Will K. Are plant DNA barcodes a search for the Holy Grail? Trends Ecol Evol. 2006;21:1–2.

    Article  PubMed  Google Scholar 

  46. Clarke LJ, Soubrier J, Weyrich LS, Cooper A. Environmental metabarcodes for insects: in silico PCR reveals potential for taxonomic bias. Mol Ecol Resour. 2014;14:1160–70.

    Article  CAS  PubMed  Google Scholar 

  47. Elbrecht V, Taberlet P, Dejean T, Valentini A, Usseglio-Polatera P, Beisel J-N, et al. Testing the potential of a ribosomal 16S marker for DNA metabarcoding of insects. PeerJ. 2016;4:e1966.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Epp LS, Boessenkool S, Bellemain EP, Haile J, Esposito A, Riaz T, et al. New environmental metabarcodes for analysing soil DNA: potential for studying past and present ecosystems. Mol Ecol. 2012;21:1821–33.

    Article  CAS  PubMed  Google Scholar 

  49. Ficetola GF, Guerrieri A, Cantera I, Bonin A. In silico assessment of 18S rDNA metabarcoding markers for the characterization of nematode communities. PLoS ONE. 2024;19:e0298905.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Charrier E, Chen R, Thundathil N, Gilleard JS. A set of nematode rRNA cistron databases and a primer assessment tool to enable more flexible and comprehensive metabarcoding. Mol Ecol Resour. 2024;24:e13965.

    Article  CAS  PubMed  Google Scholar 

  51. Van Nynatten A, Gallage KS, Lujan NK, Mandrak NE, Lovejoy NR. Ichthyoplankton metabarcoding: An efficient tool for early detection of invasive species establishment. Mol Ecol Resour. 2023;23:1319–33.

    Article  PubMed  Google Scholar 

  52. Morinière J, Balke M, Doczkal D, Geiger MF, Hardulak LA, Haszprunar G, et al. A DNA barcode library for 5,200 German flies and midges (Insecta: Diptera) and its implications for metabarcoding-based biomonitoring. Mol Ecol Resour. 2019;19:900–28.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Arnoldi I, Negri A, Soresinetti L, Brambilla M, Carraretto D, Montarsi F, et al. Assessing the distribution of invasive Asian mosquitoes in Northern Italy and modelling the potential spread of Aedes koreicus in Europe. Acta Trop. 2022;232:106536.

    Article  PubMed  Google Scholar 

  54. Cameron EC, Wilkerson RC, Mogi M, Miyagi I, Toma T, Kim H-C, et al. Molecular phylogenetics of Aedes japonicus, a disease vector that recently invaded Western Europe, North America, and the Hawaiian islands. J Med Entomol. 2010;47:527–35.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Farajollahi A, Price DC. A rapid identification guide for larvae of the most common North American container-inhabiting Aedes species of medical importance. J Am Mosq Control Assoc. 2013;29:203–21.

    Article  PubMed  Google Scholar 

  56. Romi R, Pontuale G, Sabatinelli G. Le zanzare italiane: generalità e identificazione degli stadi preimaginali (Diptera, Culicidae). Fragm Entomol. 1997;29:1–141.

    Google Scholar 

  57. Severini F, Toma L, Di Luca M, Romi R. Le zanzare Italiane: generalità e identificazione degli adulti (Diptera, Culicidae). Fragm Entomol. 2009;41:213–372.

    Article  Google Scholar 

  58. Harbach RE, Wilkerson RC. The insupportable validity of mosquito subspecies (Diptera: Culicidae) and their exclusion from culicid classification. Zootaxa. 2023;5303:1–184.

    Article  PubMed  Google Scholar 

  59. Folmer O, Black M, Hoeh W, Lutz R, Vrijenhoek R. DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Mol Mar Biol Biotechnol. 1994;3:294–9.

    CAS  PubMed  Google Scholar 

  60. Collins FH, Paskewitz SM. A review of the use of ribosomal DNA (rDNA) to differentiate among cryptic Anopheles species. Insect Mol Biol. 1996;5:1–9.

    Article  CAS  PubMed  Google Scholar 

  61. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Talavera G, Castresana J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol. 2007;56:564–77.

    Article  CAS  PubMed  Google Scholar 

  63. Gouy M, Tannier E, Comte N, Parsons DP. Seaview Version 5: A multiplatform software for multiple sequence alignment, molecular phylogenetic analyses, and tree reconciliation. Methods Mol Biol. 2021;2231:241–60.

    Article  CAS  PubMed  Google Scholar 

  64. Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 2012;9:772–772.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  65. Guindon S, Gascuel O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003;52:696–704.

    Article  PubMed  Google Scholar 

  66. da Silva AF, Machado LC, de Paula MB, et al. Culicidae evolutionary history focusing on the Culicinae subfamily based on mitochondrial phylogenomics. Sci Rep. 2020;10:18823.

    Article  PubMed  PubMed Central  Google Scholar 

  67. Ratnasingham S, Hebert PDN. A DNA-based registry for all animal species: the barcode index number (BIN) system. PLoS ONE. 2013;8:e66213.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Ratnasingham S, Hebert PDN. BOLD: the Barcode Of Life Data system. Mol Ecol Notes. 2007;7:355–64.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Čandek K, Kuntner M. DNA barcoding gap: reliable species identification over morphological and geographical scales. Mol Ecol Resour. 2015;15:268–77.

    Article  PubMed  Google Scholar 

  70. Rozo-Lopez P, Mengual X. Mosquito species (Diptera, Culicidae) in three ecosystems from the Colombian Andes: identification through DNA barcoding and adult morphology. Zookeys. 2015;39:64.

    Google Scholar 

  71. Renaud AK, Savage J, Adamowicz SJ. DNA barcoding of Northern Nearctic Muscidae (Diptera) reveals high correspondence between morphological and molecular species limits. BMC Ecol. 2012;12:1–15.

    Article  Google Scholar 

  72. Meyer CP, Paulay G. DNA Barcoding: error rates based on comprehensive sampling. PLoS Biol. 2005;3:e422.

    Article  PubMed  PubMed Central  Google Scholar 

  73. Wilson AW, Eberhardt U, Nguyen N, Noffsinger CR, Swenie RA, Loucks JL, et al. Does one size fit all? Variations in the DNA barcode gaps of macrofungal genera. J Fungi. 2023;9:788.

    Article  CAS  Google Scholar 

  74. Kumar G, Reaume AM, Farrell E, Gaither MR. Comparing eDNA metabarcoding primers for assessing fish communities in a biodiverse estuary. PLoS ONE. 2022;17:e0266720.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  75. Martins FMS, Porto M, Feio MJ, Egeter B, Bonin A, Serra SRQ, et al. Modelling technical and biological biases in macroinvertebrate community assessment from bulk preservative using multiple metabarcoding markers. Mol Ecol. 2021;30:3221–38.

    Article  CAS  PubMed  Google Scholar 

  76. Kocher A, Gantier J-C, Gaborit P, Zinger L, Holota H, Valiere S, et al. Vector soup: high-throughput identification of Neotropical phlebotomine sand flies using metabarcoding. Mol Ecol Resour. 2017;17:172–82.

    Article  CAS  PubMed  Google Scholar 

  77. Shouche YS, Patole MS. Sequence analysis of mitochondrial 16S ribosomal RNA gene fragment from seven mosquito species. J Biosci. 2000;25:361–6.

    Article  CAS  PubMed  Google Scholar 

  78. Sharma AK, Mendki MJ, Tikar SN, Kulkarni G, Veer V, Prakash S, et al. Molecular phylogenetic study of Culex quinquefasciatus mosquito from different geographical regions of India using 16S rRNA gene sequences. Acta Trop. 2010;116:89–94.

    Article  CAS  PubMed  Google Scholar 

  79. Iyiola OA, Shaibu RD, Shittu O, Adelaja OJ, Aishat TKI, Fadipe TO, et al. Genetic diversity and molecular characterization of mosquitoes (Diptera: Culicidae) in North-Central Nigeria using ribosomal DNA ITS2 and mitochondrial 16S-DNA sequences. Iraqi J Vet Med. 2020;44:78–91.

    Article  Google Scholar 

  80. Suman DS, Sharma G, De S, Kumar V, Veer V, Chandra K, et al. Description of Culex (Culex) kodaikanalensis sp. nov. (Diptera: Culicidae) from India with phyletic analysis using mitochondrial cytochrome oxidase I and 16S rRNA genes. Int J Trop Insect Sci. 2022;42:2969–78.

    Article  Google Scholar 

  81. Kohli R, Chaudhry S. Sequence analysis of mitochondrial 16s ribosomal RNA Gene fragment in the two populations of Armigeres (Armigeres) subalbatus (Culcidae: Diptera). Cytologia. 2007;72:83–8.

    Article  CAS  Google Scholar 

  82. Batovska J, Cogan NOI, Lynch SE, Blacket MJ. Using next-generation sequencing for DNA barcoding capturing allelic variation in ITS2. G3 (Bethesda). 2017;7:19–29.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

The graphical abstract was created with BioRender.com

Funding

Funding was provided by the MUSA (Multilayered Urban Sustainability Action) project, funded by the European Union–NextGenerationEU under the National Recovery and Resilience Plan (NRRP) Mission 4 Component 2 Investment Line 1.5: Strengthening of research structures and creation of R&D “innovation ecosystems” and “territorial leaders in R&D”; PNRR project PE-13 INF-ACT “One Health Basic and Translational Research Actions addressing Unmet Needs on Emerging Infectious Diseases”; and PNRR Project title “National Biodiversity Future Center—NBFC” Project code CN_00000033, Concession Decree No. 1034 of 17 June 2022 adopted by the Italian Ministry of University and Research.

Author information

Authors and Affiliations

Authors

Contributions

BB processed the samples, designed the primers, performed the PCRs and BOLD analyses and wrote the first draft of the manuscript. MiC contributed to the design of the primers, performed phylogenetic analyses and edited the first draft. SE, LS, AN and IA contributed to the experimental work. FM, FG, EC, VM, MdL, MaC, DP, FD and PG contributed to the design of the study, provided mosquito samples and edited the manuscript prior to submission. DS, PG and FGF contributed to the design of the study and edited the first draft. CB and SE conceptualised and designed the study and contributed to the writing and editing of the manuscript up to the final version. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Sara Epis.

Ethics declarations

Declarations

Ethics approval and consent to participate.

Not applicable.

Consent for publication

Not applicable.

Competing interests

Sara Epis is an Associate Editor for Parasites & Vectors. The authors declare that they have no other competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Details for species comparison for the COI marker; the mean and maximum intra-specific values are compared to the nearest neighbour for each species. When the species is a singleton, the intra-specific values are represented as N/A

Additional file 2: Table S2.

Details for species comparison for the 16S marker; the mean and maximum intra-specific values are compared to the nearest neighbour for each species. When the species is a singleton, the intra-specific values are represented as N/A.

Additional file 3: Table S3.

Available 16S rDNA sequences or complete mitochondrial genomes of mosquitoes present in GenBank as of October 2022 used for the design of new primers.

Additional file 4: Figure S1.

The five primers targeting the mitochondrial 16S rRNA gene of mosquitoes, presented relative to the mitochondrial genome of Aedes albopictus strain Rimini.

Additional file 5: Figure S2.

Maximum likelihood phylogenetic tree with the GTR+I+G model of the newly obtained ITS2 sequences of 28 Italian mosquito species. Sequence names are accompanied by the respective BOLD identifier and NCBI accession number. Bootstrap valuesabove 50% are reported on the branches. Different colours indicate the mosquito genera: Anopheles , Aedes , Culiseta , Coquilletidia , Culex , Uranotaenia .

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bisaglia, B., Castelli, M., Soresinetti, L. et al. Barcoding of Italian mosquitoes (BITMO): generation and validation of DNA barcoding reference libraries for native and alien species of Culicidae. Parasites Vectors 17, 407 (2024). https://doi.org/10.1186/s13071-024-06478-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13071-024-06478-0

Keywords