Mitochondrial phylogenomics provides conclusive evidence that the family Ancyrocephalidae is deeply paraphyletic
Parasites & Vectors volume 16, Article number: 83 (2023)
Unresolved taxonomic classification and paraphyly pervade the flatworm class Monogenea: the class itself may be paraphyletic and split into Polyopisthocotylea and Monopisthocotylea; there are some indications that the monopisthocotylean order Dactylogyridea may also be paraphyletic; single-gene markers and some morphological traits indicate that the family Ancyrocephalidae is paraphyletic and intertwined with the family Dactylogyridae.
To attempt to study the relationships of Ancyrocephalidae and Monopisthocotylea using a phylogenetic marker with high resolution, we sequenced mitochondrial genomes of two fish ectoparasites from the family Dactylogyridae: Dactylogyrus simplex and Dactylogyrus tuba. We conducted phylogenetic analyses using three datasets and three methods. Datasets were ITS1 (nuclear) and nucleotide and amino acid sequences of almost complete mitogenomes of almost all available Monopisthocotylea mitogenomes. Methods were maximum likelihood (IQ-TREE), Bayesian inference (MrBayes) and CAT-GTR (PhyloBayes).
Both mitogenomes exhibited the ancestral gene order for Neodermata, and both were compact, with few and small intergenic regions and many and large overlaps. Gene sequences were remarkably divergent for nominally congeneric species, with only trnI exhibiting an identity value > 80%. Both mitogenomes had exceptionally low A + T base content and AT skews. We found evidence of pervasive compositional heterogeneity in the dataset and indications that base composition biases cause phylogenetic artefacts. All six mitogenomic analyses produced unique topologies, but all nine analyses produced topologies that rendered Ancyrocephalidae deeply paraphyletic. Mitogenomic data consistently resolved the order Capsalidea as nested within the Dactylogyridea.
The analyses indicate that taxonomic revisions are needed for multiple Polyopisthocotylea lineages, from genera to orders. In combination with previous findings, these results offer conclusive evidence that Ancyrocephalidae is a paraphyletic taxon. The most parsimonious solution to resolve this is to create a catch-all Dactylogyridae sensu lato clade comprising the current Ancyrocephalidae, Ancylodiscoididae, Pseudodactylogyridae and Dactylogyridae families, but the revision needs to be confirmed by another marker with a sufficient resolution.
Platyhelminthes (flatworms) is a phylum of largely parasitic animals that cause diseases in a broad range of host animal groups. The major radiation of parasitic flatworms represents the superclass Neodermata, divided into three classes: Monogenea, Trematoda and Cestoda. Monogenea comprises a large part of the flatworm diversity, but there are indications that this class might be paraphyletic and split into two independent subclasses: Polyopisthocotylea and Monopisthocotylea [1,2,3,4]. Monopisthocotylea is divided into five orders: Dactylogyridea, Capsalidea, Gyrodactylidea, Monocotylidea and Montchadskyellidea. Dactylogyridea Bychowsky, 1937, is a large order whose lower level taxonomy remains debated, which might be paraphyletic itself [5,6,7]. Currently, there are 11 valid families within the order, plus Ancyrocephalidae Bychowsky, 1937, whose status remains uncertain . The status of Ancyrocephalidae has a long and complicated history. Initially, it was established as a subfamily (Ancyrocephalinae) of the large family Dactylogyridae Bychowsky, 1933 , and later it was elevated to the status of a family . Following this, several subsequent studies failed to find support for this revision using both morphological and molecular (single locus) data, so its status remains unresolved [11,12,13].
Traditionally, most studies of monogenean phylogeny relied on morphology and single-locus molecular markers, both of which often have limited resolution. Monogenean parasites usually have few reliable morphological traits, and host-induced morphological variability is common [6, 14, 15]. Due to the small amount of information (phylogenetic signal) that they carry, single-locus molecular markers generally have too limited resolving power for deep phylogenies. Phylogenomic datasets offer a much greater resolution, but due to the limited availability of data, their application in Platyhelminthes had been limited so far to two nuclear genomic studies [16, 17] and several studies using complete mitochondrial genomes, e.g. [2, 18, 19]. The mitochondrial genome offers a much higher resolution than single-gene markers, as well as unilinear inheritance and (mostly) absence of recombination, so mitochondrial molecular markers were initially thought to be the ideal molecular marker for studying the evolutionary history of life on earth . Despite these remarkable comparative advantages, mitogenomes are not a fool-proof marker, and in some cases they can produce artefactual results because of heterogeneity in base composition and evolutionary rates, introgression, mutational saturation, etc. [21,22,23,24]. Their applicability for phylogenetic studies is often further hampered by the scarcity of data. Indeed, whereas annotated mitogenomes are currently available for nine Ancyrocephalidae species, only one Dactylogyridae mitogenome has been sequenced so far: Dactylogyrus lamellatus . This is insufficient to resolve the relationships between Dactylogyridae and Ancyrocephalidae using mitogenomic data.
Despite its putatively recent origin, Dactylogyrus (Dactylogyrinae subfamily) is a remarkably speciose (> 900 nominal species) genus of parasites of freshwater fishes with global distribution [25, 26]. To generate data needed to assess the relationship between Ancyrocephalidae and Dactylogyridae, as well as the relationships of Dactylogyridea and Monogenea, herein we sequenced two poorly studied Dactylogyrus species. Dactylogyrus simplex Bychowsky, 1936, is a small parasite that attaches itself to the gill lamellae of a broad range of fish hosts [27, 28]. Despite the almost global distribution (reported from Eurasia, North America and Africa), there are very few mentions of this species in scientific literature, and there are even fewer molecular data available: only two ITS1 (internal transcribed spacer 1) sequences in the GenBank. Dactylogyrus tuba Linstow, 1878, also attaches itself to the gills or nasal cavity of a wide range of Cyprinidae hosts, but predominantly from the Leuciscinae subfamily, and their distribution appears to be limited to Eurasia [26, 28,29,30]. This species is also rarely mentioned in the scientific literature, and in terms of available molecular data, there are only three ITS1 sequences in the GenBank.
Dactylogyrus simplex was sampled from the host Diptychus maculatus Steindachner, 1866 (Cyprinidae), obtained from the Taxkorgan River (a tributary of the Yarkand River), Xinjiang, China (37°41′14″ N; 75°18′9″ E). Dactylogyrus tuba was sampled from the host Leuciscus baicalensis (Dybowski, 1874) (Leuciscinae) obtained from the Haba River (a tributary of the Irtysh River), Xinjiang, China (48°7′58″ N; 86°23′24″ E). Parasites were identified morphologically according to Gussev et al.  (Additional file 1: Figures S1 and S2) and molecularly using the ITS1 data. The cox1 barcode identification did not produce any similarity to the available sequences. Dactylogyrus simplex exhibited 99.46% identity to the two available conspecific ITS1 sequences (MT476981 and MT476980). Dactylogyrus tuba exhibited 99.82% identity to the conspecific ITS1 sequence KJ605445 and 97.2% identity to the other two nominally conspecific sequences (AJ564157 and AJ564158).
DNA extraction, mitogenome amplification and sequencing (Sanger), as well as the sequence annotation and analyses, were conducted exactly as described before [31, 32]. ITS1 sequences were obtained using two primers designed by us based on available orthologues: D (5′-GGNGTCGATGAAGAACGCAG-3′) and B1 (5′-CGGATCCGAATCCTGGTTAG-3′). For the annotation of sequenced mitogenome, we selected the only other available congeneric mitogenome as the reference: Dactylogyrus lamellatus . tRNAs were identified using ARWEN  and MITOS  programmes. ORF-Finder  was further used to search for genes resembling atp8 in large noncoding regions of both mitogenomes. We downloaded all available Monopisthocotylea mitogenomes. As mitogenomes remain unavailable for Monocotylidea and Montchadskyellidea, we could only include representatives of three monopisthocotylean orders: Gyrodactylidea, Dactylogyridea and Capsalidea. To stabilize the topology as much as possible, we treated the entire Gyrodactylidea clade as the outgroup, as previous studies based on different types of data consistently supported it as a closely related but mutually monophyletic clade in relation to Dactylogyridea [4, 7, 18]. From this dataset, we removed all unannotated mitogenomes and left only one mitogenome per species. We also removed Ancyrocephalus morgundae (Ancyrocephalidae), because this mitogenome was incomplete (four PCGs missing: cox2, nad2, nad4, nad4L). After the pruning, there were 39 mitogenomes in the dataset (including the two new sequences). We also made minor taxonomic adjustments: Tetraonchidae is treated as a Gyrodactylidea family in the NCBI and Dactylogyridea family in WORMS, so we changed the taxonomic identity according to WORMS, as this database is updated more regularly . PhyloSuite  was used to standardize the annotation, extract data, generate comparative tables and conduct phylogenetic analyses using several plug-in programmes. The TreeSuite function of PhyloSuite was used to infer the root-to-tip branch lengths, signal-to-noise ratio, relative composition variability score, treeness and long-branch scores [38, 39].
We used two mitogenomic datasets: (i) nucleotide sequences of 12 concatenated PCGs and two rRNA genes (NUC) and (ii) amino acid sequences of 12 concatenated PCGs (AAs). PCGs for the NUC dataset were first aligned using the codon mode and the accurate G-INS-i strategy in MAFFT  and alignments were then refined using MACSE, which is better at processing frameshift mutations . rRNAs and AAs were aligned using only MAFFT (I-INS-i and G-INS-i strategies respectively). rRNAs were included to maximise the resolution. Genes were concatenated and partitioned by PhyloSuite (each gene in its own partition) and best-suited evolutionary models for partitions were inferred using another plug-in PhyloSuite programme: ModelFinder . For phylogenetic reconstruction, we used three different algorithms: (i) maximum likelihood (ML) in IQ-TREE , (ii) Bayesian inference (BI) in MrBayes 3.2.6  and (iii) CAT-GTR algorithm in PhyloBayes, designed to account for compositional heterogeneity . ML analysis was run with 20,000 ultrafast bootstraps . For the BI analysis, we used two parallel runs and 25% burn-in. For the ITS1 phylogenetic analyses, we downloaded all orthologous sequences belonging to Dactylogyridae, Ancyrocephalidae, Ancylodiscoididae and Pseudodactylogyridae. From other Dactylogyridea families, we retrieved representatives if they were available. Two Gyrodactylidae species were used as outgroups. We removed all sequences shorter than 800 bp, most of the conspecific sequences and all sequences that exhibited large deletions or missing data in the alignment. The final dataset comprised 67 species. We used the E-INS-i strategy in MAFFT to align them. iTOL  was used to visualise the phylogeny and architecture using files generated by PhyloSuite.
Comparative mitogenomic architecture
At 14,648 bp in D. simplex and 15,406 bp in D. tuba, the two mitogenomes were average in size within the Monopisthocotylea dataset (Fig. 1, Additional file 2). Both comprised the standard neodermatan 36 genes (atp8 missing from both), all of which are transcribed from the same strand. Both mitogenomes exhibited a single large (> 100 bp) NCR (likely to comprise the control region), but it was located in different places. Comparison with related species (Additional file 1: Figure S3) indicates that D. tuba exhibits the ancestral architecture, with the NCR located downstream from nad5, whereas D. simplex exhibits a derived architecture, with the NCR located between cox1 and rrnL. The NCR was much larger in D. tuba (2,401 bp) than in D. simplex (1532 bp). We confirmed that atp8 is not encoded in the two mitogenomes by searching both large NCRs for open-reading frames (ORFs). Both NCRs contained multiple putative ORFs, but none of them corresponded to the key characteristics of atp8 genes identified in other Lophotrochozoa: starting with MPQM of MPHM motif and length between 38 to 56 AAs . We also examined the identified ORFs encoded on the plus strand using the BLASTp suite, but none exhibited similarity to known genes. In other aspects, the two species exhibited an identical gene order, also common in other Neodermata (Table 1; Additional file 1: Figure S3). Compared to these two species, the congeneric D. lamellatus exhibits a transposition of the trnL2 gene. Mitogenomes were compact in both species, with few and small noncoding regions and a relatively large number of overlaps, two of which were very large (Table 1). The smaller of the two overlaps comprised trnF and trnM, and it was almost perfectly conserved in both species (20 bp in D. simplex and 21 in D. tuba). The largest overlap of 28 bp was found between two PCGs: nad4 and nad4L. Identity values between genes of these two nominally congeneric species were remarkably low (e.g. 74% for cox1), with only trnI exhibiting an identity value of over 80% (Table 1).
The two newly sequenced mitogenomes exhibited exceptionally low A + T base content (D. tuba = 54%, D. simplex = 58.4%) compared to other Dactylogyridea species (60–78.4%; Fig. 1, Additional file 2). The T content was exceptionally low in D. tuba (29.4%) and very low in D. simplex (34%). In comparison, in the rest of the dataset, it ranged between 35.1 and 47.2%. Regarding A content, the two species were merely at the bottom of the range in the dataset, but both species had the highest C and G content in the entire dataset. Similarly, both species had exceptionally low (in absolute terms) AT skews, calculated as (A—T)/(A + T), in the dataset: − 0.087 in D. tuba and − 0.164 in D. simplex (dataset range = − 0.087 to 0.338). Correspondingly, D. tuba had the lowest GC skew, calculated as (G − C)/(G + C) in the dataset (0.085), whereas the GC skew of D. simplex was merely in the lowest quartile (0.18; dataset range = 0.085 to 0.372).
Mitochondrial phylogenomic analyses
The concatenated NUC alignments of all 39 species in the dataset failed the compositional homogeneity test, whereas only ten species passed the test in the AA dataset. After Capsaloides cristatus (0.37) and Cichlidogyrus mbirizei (0.33), D. tuba (0.31) exhibited the third-highest relative composition variability in the dataset. Despite the unusual base composition, the two newly sequenced species had some of the lowest long-branch scores in the dataset (− 7.97 and − 4.70). The highest values were exhibited by Aglaiogyrodactylus forficulatus (34.95) and Paratetraonchoides inermis (19.48) (Fig. 2). The longest root-to-tip branch lengths were exhibited by several Cichlidogyrus and Scutogyrus species, along with Paratetraonchoides inermis, with the three Dactylogyrus species exhibiting relatively high values within the dataset. Signal-to-noise ratio was 2.05 and the treeness was 0.29 (the numbers refer to the BI_NUC tree, but overall results were similar across different trees).
The BI analyses were run until the average standard deviation of split frequencies (ASDSF) was stationary and < 0.01 (a very good indication of convergence according to MrBayes manual). Both BI analyses converged; for the NUC dataset, the final ASDSF was 0.0067 after 500,000 MCMC steps, and for the AA dataset, the final ASDSF was 0.0066 after 1,000,000 MCMC steps. PhyloBayes analysis (CAT-GTR) met the conditions for an acceptable run: AA maxdiff = 0.11 and NUC maxdiff = 0.22 (maxdiff < 0.3 indicates an acceptable run and maxdiff < 0.1 indicates a good run).
All six analyses produced unique topologies (Figs. 3, 4, 5, 6, 7). As the two ML analyses differed only in minor topological details in the Gyrodactylidae clade and relationships between Scutogyrus and Cichlidogyrus, only the AA tree is shown (Fig. 3), whereas the NUC tree is available in Additional file: Figure S4. We also tested whether the inclusion of two rRNA genes affected the topology, so we removed the two genes and conducted an ML analysis using only the 12 PCGs. The topology was unaffected, but support values for some nodes differed: in some cases they were higher and in some lower (Additional file 1: Figure S5). Overall, there was no effect on the main conclusions. Dactylogyrus was monophyletic in all analyses, with D. tuba and D. simplex being sister species in ML and BI analyses. However, in both CAT-GTR analyses, D. tuba and D. lamellatus were sister species. Support values for this node were high in all analyses (≥ 97%), apart from the ML_AA tree, where it was 63% (Fig. 3). Dactylogyridae (monophyletic) were nested within the Ancyrocephalidae (paraphyletic) in all six analyses. All analyses aside from NUC_BI (all ≥ 98%) produced some support values < 90% in the Dactylogyridae + Ancyrocephalidae clade. The Dactylogyridae + Ancyrocephalidae clade was monophyletic and exhibited a sister-group relationship to Ancylodiscoididae in all analyses (with high support). The order Dactylogyridea was rendered paraphyletic by Capsalidea nested within the clade in all analyses except NUC_BI, where Capsalidea and Gyrodactylidea were sister groups (Fig. 4). Node support varied between high and low among the analyses. Disregarding Tetraonchidae, Diplectanidae was resolved as the basal (sister group to the rest of the clade) radiation of Dactylogyridea in all topologies aside from the NUC_BI. Both Dactylogyridea and Gyrodactylidea were rendered paraphyletic by Tetraonchidae and Tetraonchoididae forming a clade at the base of the Dactylogyridea. This clade was resolved with high confidence in all analyses. The Gyrodactylidae clade was not stable either. There were two main topologies regarding the position of Gyrodactylus nyanzae. In one topology, the Gyrodactylidae family was divided into two clades, with G. nyanzae clustering with Paragyrodactylus and Macrogyrodactylus (ML_AAs, both BI analyses, NUC_CAT-GTR). In the other topology (ML_NUC, AAs_CAT-GTR) G. nyanzae was resolved at the base of the main Gyrodactylus clade. Gyrodactylus was further rendered paraphyletic by Gyrodactylus sp. (MW464989) clustering closer to Paragyrodactylus and Macrogyrodactylus than to the rest of the nominally congeneric species.
The newly generated ITS1 sequences (also comprising fragments of 18 s and 5.8S) were 1322 bp for D. tuba and 934 bp for D. simplex. We downloaded all homologous Dactylogyridae sequences from GenBank but kept only the ones exhibiting high coverage to ensure maximum resolution. BI and CAT-GTR analyses both converged. BI and ML analyses produced identical results (Fig. 8, Additional file 1: Figure S6). In this topology, Dactylogyridae and Ancylodiscoididae were monophyletic, while Ancyrocephalidae was deeply paraphyletic and split into three clades. One clade, comprised of Cleidodiscus, Urocleidus and Ancyrocephalus, clustered with Ancylodiscoididae (a sister-group relationship) with high support. None of these genera were available in the mitogenomic dataset, so we cannot make a meaningful comparison between the two datasets. Clade two comprised multiple Cichlidogyrus species, with Scutogyrus nested within, rendering the genus paraphyletic (again, assuming that this species was not misidentified). The third clade comprised Thylacicleidus, Parancyrocephaloides and Gobiocetes, with Pseudodactylogyridae (Pseudodactylogyrus anguillae) nested within. The family Dactylogyridae was split into two major clades. One comprised a group of nine Dactylogyrus species (including D. simplex and D. lamellatus) and Acolpenteron ureterocetes. The other comprised a group of 18 Dactylogyrus species (including D. tuba). Therefore, the family Dactylogyridae was monophyletic, but Dactylogyrus was split into two clades and rendered paraphyletic by A. ureterocetes (assuming that this species was not misidentified). The CAT-GTR phylogeny also produced the above three Ancyrocephalidae clades and the two Dactylogyridae clades, but the overall topology of these clades differed slightly; most notably, the two Dactylogyridae clades were polyphyletic (Fig. 9).
The two newly sequenced mitogenomes generally possessed standard features for neodermatan mitogenomes, including all genes being transcribed from the same strand and the missing atp8, which may be in the ground pattern of Platyhelminthes, but it is missing from all known Neodermata [7, 31, 48,49,50]. An intriguing feature of Monopisthocotylea is that it exhibits mitogenomes with rearranged gene orders, which is uncommon within the Neodermata . However, the two newly sequenced mitogenomes exhibited an identical gene order, which was previously recognised as the ancestral architecture for Neodermata . Both mitogenomes exhibited exceptionally low A + T base content within the dataset. Remarkably, in this aspect, the only other available congeneric species, D. lamellatus, was average among the Dactylogyridea. High A + T bias in flatworms was previously attributed to mutation, as opposed to selection , which suggests that these two species may evolve under lower mutational pressures. This is further supported by these two species exhibiting some of the lowest long-branch scores in the dataset.
The annotation of most genes was relatively straightforward, with cytochrome b (cytb) in D. simplex as an exception. Most of the available cytb orthologues use the TAG stop codon, but using this stop codon in D. simplex would create a 30-base 3’ end elongation compared to D. tuba and a large overlap of 25 bp with the downstream nad4L. As such large overlaps between PCGs other than atp6-atp8 and nad4L-nad4 pairs would be considered highly unusual , we opted to use an abbreviated T– stop codon, which merely caused a five-base abbreviation in D. simplex (1072 bp, compared to 1077 bp in D. tuba). We sequenced this section twice, but we failed to find evidence of a sequencing error (Additional file 1: Figure S7). We suspect that a single-base deletion mutation occurred near the 3’ end of this gene in its evolutionary history, because a single-base elongation mutation in its current sequence would cause a frameshift mutation that would make available two stop codons in the expected locations (the current 10-bp intergenic space between cytb and nad4L): TAA and TAG. It would be interesting to corroborate this putative frameshift-causing deletion on a different conspecific specimen as well as closely related species. As per convention, rrnS was annotated to the adjacent downstream gene (cox2), which generated a 3’ extension of about 30 bp in D. simplex compared to most other orthologues.
Both species exhibited multiple large overlaps between genes. Whereas large overlaps comprising protein-coding genes (PCGs) are rare because of stringent evolutionary constraints, large overlaps between tRNA genes have been observed in multiple animal lineages . However, the largest overlap of 28 bp did comprise two PCGs: nad4 and nad4L. It was perfectly conserved in both species as well as in the third available congeneric mitogenome: D. lamellatus . Therefore, it is highly evolutionarily conserved, and we have no reasons to suspect annotation artefacts. These two genes commonly overlap in other neodermatan species as well, but the size of the overlap varies . This indicates that they are probably translated from an evolutionarily conserved bicistronic transcript, similar to atp6/atp8 in many other animal mitogenomes [51, 53].
For congenerics, the two newly sequenced species exhibited remarkably low identities between orthologous gene pairs . There are two possible explanations for this phenomenon: it may be reflective of the rapid molecular evolution of many flatworm lineages [22, 55,56,57] or it may indicate that the genus Dactylogyrus should be split into two different genera. Either way, the barcode identification thresholds proposed for other animal lineages  are almost certainly not applicable to this lineage. Indeed, it explains why our attempts to identify these two species using the BOLD database  were futile.
Compositional biases are known to affect phylogenetic reconstruction and other types of evolutionary studies [22, 23, 59]. As the two newly sequenced species exhibited atypical base composition within the dataset and most of the concatenated datasets failed to pass the compositional homogeneity test in IQ-TREE, we carefully designed phylogenetic analyses of this dataset with this confounding factor in mind. To test for topological stability, we conducted phylogenetic analyses using two standard algorithms on two different datasets: NUC and AAs. Furthermore, we applied the CAT-GTR algorithm, designed to account for compositional heterogeneity. A study has shown that this algorithm outperforms the standard methods when strong base composition biases produce phylogenetic artefacts . Indeed, our analyses revealed a rather pervasive topological instability, with six analyses producing six different topologies. Intriguingly, the instability was largely confined to the Dactylogyridae/Ancyrocephalidae clade (and some in the Gyrodactylidae clade). Compared to the mitogenomic dataset, the ITS1 dataset had a reduced resolution in terms of the power of the marker, but an improved resolution in terms of the availability of data. As small markers are not suitable for deep phylogenies, we focused only on the order Dactylogyridea. Comparison with the mitogenomic results was rendered difficult by the incomplete overlap of species available for the two datasets.
The addition of two new Dactylogyridae mitogenomes allowed us to study the relationships between Dactylogyridae and Ancyrocephalidae using mitogenomic molecular data. Despite the instability of this clade, all six mitogenomic and three ITS1 analyses produced topologies that rendered Ancyrocephalidae paraphyletic. Considering all nine topologies, the most parsimonious solution to resolve the widespread paraphyly is to create a catch-all Dactylogyridae s.l. clade comprising the current Ancyrocephalidae, Ancylodiscoididae, Pseudodactylogyridae and Dactylogyridae sensu stricto families. The alternative would be to erect a large number of small families with questionable support. A similar conclusion was reached based on 18 homologous series of morphological characters . The lineages included in these two studies did not exhibit a full overlap, so it is difficult to compare the results, but this indicates that morphological and molecular markers produce relatively congruent signals. Ancyrocephalidae was already introduced, and our analyses leave no doubt about its paraphyly. In mitogenomic analyses, the family Ancylodiscoididae Gusev, 1961, was consistently monophyletic and resolved as a closely related sister clade to Dactylogyridae + Ancyrocephalidae. ITS1 analyses supported this topology but indicated that several Ancyrocephalidae genera cluster with this family. Although mitogenomic data did not question the paraphyly of this family, all analyses indicate a very close relationship with other families in the Dactylogyridae s.l. Ancylodiscoididae was initially a subfamily of the Dactylogyridae , but later it was elevated to the family level based on several morphological characteristics . However, this revision was soon contested based on molecular data (28S rRNA fragment), which indicated that this may be merely another subfamily of Dactylogyridae . The family Pseudodactylogyridae Gusev, 1965, also has a similar history: first a genus (Pseudodactylogyrus) of Ancyrocephalinae, later elevated to the subfamily Pseudodactylogyrinae and finally further elevated to the family Pseudodactylogyridae . A 28S-based phylogenetic analysis resolved Pseudodactylogiridae as clustering with some Dactylogyrus species and Ancyrocephalus morgundae . Mitogenomes remain unavailable for this family, so we can only rely on the ITS1 analyses to debate its status, which suggest that these four families may be valid subfamilies of the Dactylogyridae s.l. family if some genera of the highly paraphyletic Ancyrocephalinae were reassigned to other subfamilies. Cleidodiscus, Urocleidus and Ancyrocephalus should be reassigned to Ancylodiscoidinae. Thylacicleidus, Parancyrocephaloides and Gobioecetes should be reassigned to Pseudodactylogyrinae. The current Dactylogyridae s.s. appear monophyletic, so they could be all assigned to Dactylogyrinae. Finally, the subfamily Ancyrocephalinae s.s. would then comprise Cichlidogyrus and Scutogyrus species. As the latter was nested within the former in both mitogenomic and ITS1 analyses, there is strong evidence that these two should be synonymised. This also makes it questionable whether Ancyrocephalinae deserves a subfamily status.
All three ITS1 analyses indicate that the genus Dactylogyrus may be paraphyletic and split into two distinct clades: one comprising a group of 9 Dactylogyrus species (including D. simplex and D. lamellatus) and Acolpenteron ureterocetes and the other comprising a group of 18 Dactylogyrus species (including D. tuba). However, mitogenomic data did not support this, as D. simplex and D. tuba clustered together in all four ML and BI analyses. As both species had unusual base composition biases within the dataset, this may be an artefact of two species with homoplastic base compositions clustering together [22, 61]. This hypothesis was supported by both CAT-GTR analyses, where D. tuba clustered with D. lamellatus. However, gene orders and base composition analyses indicate that D. simplex and D. tuba are more closely related to each other than to D. lamellatus. Therefore, different parameters and datasets produce multiple conflicting hypotheses for the topology of this genus, so this issue should be studied with utmost care before a revision is proposed.
Regarding other taxa, this study offers further evidence that mitogenomic data consistently support the validity of the Tetraonchoidea clade, and consistently they place it at the base of the Dactylogyridea + Capsalidea clade . As there is partial support for this clade from 28S rDNA data and morphological apomorphies , we can conclude that there is very strong molecular support for the validity of the Tetraonchoidea order. However, P. inermis exhibited a pronounced long-branch score, which indicates that we should exclude an LBA artefact before a revision is proposed. Our results consistently resolved the order Capsalidea as nested within the Dactylogyridea, thus rendering the latter paraphyletic. This was also observed in several previous mitochondrial phylogenomic studies [2, 5, 62, 63]. However, multiple studies that relied on morphological data and small molecular markers (28S rRNA) found that Capsalidea was resolved as evolutionarily distant from Dactylogyridea [4, 64,65,66]. As mitochondrial genomes are the most powerful marker so far applied to study this problem, this discrepancy may be attributable to the poor resolution of morphological and single-molecular marker data. This would indicate that Capsalidea should be reassigned to the order Dactylogyridea, where it may be given a suborder status. However, before a revision is proposed, it is necessary to exclude mitochondrial introgression in the evolutionary history of these orders. This may be achieved using a multilocus nuclear dataset. Although Gyrodactylidea was not the focus of our study, it deserves a brief mention that Gyrodactylus was rendered paraphyletic by Gyrodactylus sp. (MW464989) and G. nyanzae clustering outside the main Gyrodactylus clade. The former mitogenome is unpublished, so the paraphyly may be caused by a taxonomic misidentification, but G. nyanzae rendering the genus paraphyletic had been observed in the original publication , so this problem deserves further studies.
This study was hampered by the limited availability of data; for example, there are 12 valid genera in Dactylogyridae, but only one of them was represented in our dataset. We urge the sequencing of further Dactylogyridae mitogenomes. As our results clearly show that mitogenomic data produce a certain level of topological instability and phylogenetic artefacts, future studies should seek congruence between multiple types of markers before they propose further taxonomic revisions in this highly problematic group of parasitic flatworms.
Availability of data and materials
The two mitogenomes are available from GenBank under accession numbers OP058752 (D. simplex) and OP05873 (T. tuba), and the two ITS sequences under OP019371 (D. simplex) and OP019825 (T. tuba).
Dataset comprised of amino acid sequences of 12 PCGs
Average standard deviation of split frequencies
Internal transcribed spacer 1
Dataset comprised of nucleotide sequences of 12 PCGs and two rRNAs
Perkins EM, Donnellan SC, Bertozzi T, Whittington ID. Closing the mitochondrial circle on paraphyly of the Monogenea (Platyhelminthes) infers evolution in the diet of parasitic flatworms. Int J Parasitol. 2010;40:1237–45.
Zhang D, Li WX, Zou H, Wu SG, Li M, Jakovlić I, et al. Homoplasy or plesiomorphy? Reconstruction of the evolutionary history of mitochondrial gene order rearrangements in the subphylum Neodermata. Int J Parasitol. 2019;49:819–29.
Laumer CE, Giribet G. Inclusive taxon sampling suggests a single, stepwise origin of ectolecithality in Platyhelminthes. Biol J Linn Soc. 2014;111:570–88.
Boeger W, Kritsky D. Phylogenetic relationships of the Monogenoidea. In ‘Interrelationships of the Platyhelminthes.’ London and Newyork: Taylor and Francis; 2001. p. 92–102.
Zhang D, Li WX, Zou H, Wu SG, Li M, Jakovlić I, et al. Mitochondrial genomes of two diplectanids (Platyhelminthes: Monogenea) expose paraphyly of the order Dactylogyridea and extensive tRNA gene rearrangements. Parasites Vectors. 2018;11:601.
Zhang D, Zou H, Wu SG, Li M, Jakovlić I, Zhang J, et al. Sequencing of the complete mitochondrial genome of a fish-parasitic flatworm Paratetraonchoides inermis (Platyhelminthes: Monogenea): tRNA gene arrangement reshuffling and implications for phylogeny. Parasites Vectors. 2017;10:462.
Zhang D, Zou H, Wu SG, Li M, Jakovlić I, Zhang J, et al. Sequencing, characterization and phylogenomics of the complete mitochondrial genome of Dactylogyrus lamellatus (Monogenea: Dactylogyridae). J Helminthol. 2018;92:455–66.
WoRMS - World Register of Marine Species - Ancyrocephalidae Bychowsky, 1937 [Internet]. WORMS. https://marinespecies.org/aphia.php?p=taxdetails&id=160481. 13 Nov 2022.
Bychowsky BE. Monogenetic Trematodes: Their Systematics and Phylogeny (Russ.) Acad. Moscow, Leningrad: Nauk SSSR, p. 509; 1957.
Bychowsky B, Nagibina L. Revision of Ancyrocephalinae Bychowsky, 1937. Parazitologicheskii Sbornik. 1978;28:5–15.
Šimková A, Plaisance L, Matějusová I, Morand S, Verneau O. Phylogenetic relationships of the Dactylogyridae Bychowsky, 1933 (Monogenea: Dactylogyridea): the need for the systematic revision of the Ancyrocephalinae Bychowsky, 1937. Syst Parasitol. 1937;54:1–11.
Kritsky DC, Boeger WA. The Phylogenetic Status of the Ancyrocephalidae Bychowsky, 1937 (Monogenea: Dactylogyroidea). J Parasitol. 1989;75:207–11.
Mendoza-Palmero CA, Blasco-Costa I, Scholz T. Molecular phylogeny of Neotropical monogeneans (Platyhelminthes: Monogenea) from catfishes (Siluriformes). Parasites Vectors. 2022;8:164.
Poulin R, Morand S. The diversity of parasites. Q Rev Biol. 2000;75:277–93.
Perkins EM, Donnellan SC, Bertozzi T, Chisholm LA, Whittington ID. Looks can deceive: molecular phylogeny of a family of flatworm ectoparasites (Monogenea: Capsalidae) does not reflect current morphological classification. Mol Phylogenet Evol. 2009;52:705–14.
Egger B, Lapraz F, Tomiczek B, Müller S, Dessimoz C, Girstmair J, et al. A transcriptomic-phylogenomic analysis of the evolutionary relationships of flatworms. Curr Biol. 2022;2015:1347–53.
Laumer CE, Hejnol A, Giribet G. Nuclear genomic signals of the ‘microturbellarian’ roots of platyhelminth evolutionary innovation. eLife. 2015;4:e05503.
Vanhove MPM, Briscoe AG, Jorissen MWP, Littlewood DTJ, Huyse T. The first next-generation sequencing approach to the mitochondrial phylogeny of African monogenean parasites (Platyhelminthes: Gyrodactylidae and Dactylogyridae). BMC Genomics. 2018;19:520.
Solà E, Álvarez-Presas M, Frías-López C, Littlewood DTJ, Rozas J, Riutort M. Evolutionary analysis of mitogenomes from parasitic and free-living flatworms. PLOS ONE. 2015;10:e0120081.
Avise JC, Arnold J, Ball RM, Bermingham E, Lamb T, Neigel JE, et al. Intraspecific phylogeography: the mitochondrial DNA bridge between population genetics and systematics. Ann Rev Ecol Syst. 1987;18:489–522.
Rubinoff D, Holland BS, Savolainen V. Between two extremes: mitochondrial DNA is neither the panacea nor the nemesis of phylogenetic and taxonomic inference. Syst Biol. 2005;54:952–61.
Zhang D, Zou H, Hua C-J, Li W-X, Mahboob S, Al-Ghanim KA, et al. Mitochondrial architecture rearrangements produce asymmetrical nonadaptive mutational pressures that subvert the phylogenetic reconstruction in Isopoda. Gen Biol Evol. 2019;11:1797–812.
Jakovlić I, Zou H, Zhao X-M, Zhang J, Wang G-T, Zhang D. Evolutionary history of inversions in directional mutational pressures in crustacean mitochondrial genomes: implications for evolutionary studies. Molecular Phylogenet Evol. 2021;164:107288.
Rota-Stabelli O, Kayal E, Gleeson D, Daub J, Boore JL, Telford MJ, et al. Ecdysozoan Mitogenomics: evidence for a common origin of the legged invertebrates, the Panarthropoda. Gen Biol Evol. 2010;2:425–40.
Gibson DI, Timofeeva TA, Gerasev PI. A catalogue of the nominal species of the monogenean genus Dactylogyrus Diesing, 1850 and their host genera. Syst Parasitol. 1996;35:3–48.
ŜImková A, Morand S, Jobet E, Gelnar M, Verneau O. Molecular phylogeny of congeneric monogenean parasites (Dactylogyrus): a case of Intrahost speciation. Evolution. 2004;58:1001–18.
Mizelle JD. Ectoparasites of the Blunt-Nosed Minnow (Hyborhynchus notatus). Am Midl Nat. 1937;18:612–21.
Gussev A, Poddubnaya A, Abdeeva V. Key to parasites of freshwater fishes of the USSR. Monogenea. Moscow: Nauka Publications; 1985. p. 87–99.
Sobecka E, Jurkiewicz E, Piasecki W. Parasite fauna of ide, Leuciscus idus [L.] in Lake Dabie Poland. Acta Ichthyologica et Piscatoria. 2004;34:33–42.
Cui-Lan H, Cheng Y, Wei-Jian Y, Jian-Guo Y, Li J, Meng-Ying Z, et al. Population dynamics of Dactylogyrus and a new record on Dactylogyrus in Leuciscue idus Linnaeus in Irtysh river of China. Acta Hydrobiologica Sinica. 2014;38:227–32.
Zhang D, Zou H, Wu SG, Li M, Jakovlić I, Zhang J, et al. Three new Diplozoidae mitogenomes expose unusual compositional biases within the Monogenea class: implications for phylogenetic studies. BMC Evol Biol. 2018;18:133.
Hao C-L, Arken K, Kadir M, Zhang W-R, Rong M-J, Wei N-W, et al. The complete mitochondrial genomes of Paradiplozoon yarkandense and Paradiplozoon homoion confirm that Diplozoidae evolve at an elevated rate. Parasites & Vectors. 2022;15:149.
Laslett D, Canbäck B. ARWEN: a program to detect tRNA genes in metazoan mitochondrial nucleotide sequences. Bioinformatics. 2008;24:172–5.
Bernt M, Donath A, Jühling F, Externbrink F, Florentz C, Fritzsch G, et al. MITOS: Improved de novo metazoan mitochondrial genome annotation. Mol Phylogenet Evol. 2013;69:313–9.
Rombel IT, Sykes KF, Rayner S, Johnston SA. ORF-FINDER: a vector for high-throughput gene identification. Gene. 2022;282:33–41.
WORMS Editorial Board. WoRMS - World Register of Marine Species. http://marinespecies.org. 2021. Accessed Jan 3 2020.
Zhang D, Gao F, Jakovlić I, Zou H, Zhang J, Li WX, et al. PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol Ecol Resour. 2020;20:348–55.
Struck TH. TreSpEx—detection of misleading signal in phylogenetic reconstructions based on tree information. Evol Bioinform Online. 2014;10:51–67.
Steenwyk JL, Buida TJ III, Labella AL, Li Y, Shen X-X, Rokas A. PhyKIT: a broadly applicable UNIX shell toolkit for processing and analyzing phylogenomic data. Bioinformatics. 2021;37:2325–31.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.
Ranwez V, Harispe S, Delsuc F, Douzery EJP. MACSE: multiple alignment of coding sequences accounting for frameshifts and stop codons. PLOS ONE. 2011;6:e22594.
Kalyaanamoorthy S, Minh BQ, Wong TKF, Von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14:587–9.
Trifinopoulos J, Nguyen LT, von Haeseler A, Minh BQ. W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 2016;44:W232–5.
Ronquist F, Teslenko M, Van Der Mark P, Ayres DL, Darling A, Höhna S, et al. Mrbayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61:539–42.
Lartillot N, Brinkmann H, Philippe H. Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model. BMC Evol Biol. 2007. https://doi.org/10.1186/1471-2148-7-S1-S4.
Minh BQ, Nguyen MAT, von Haeseler A. Ultrafast approximation for phylogenetic bootstrap. Mol Biol Evol. 2013;30:1188–95.
Letunic I, Bork P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49:W293-6.
Egger B, Bachmann L, Fromm B. Atp8 is in the ground pattern of flatworm mitochondrial genomes. BMC Genomics. 2017;18:414.
Wey-Fabrizius AR, Podsiadlowski L, Herlyn H, Hankeln T. Platyzoan mitochondrial genomes. Mol Phylogenet Evol. 2013;69:365–75.
Nam U-H, Whang I, Kim J-H. The complete mitochondrial genome sequence of Microcotyle caudata (Platyhelminthes: Monogenea) from dark-banded rockfish (Sebastes inermis) in Korea. Mitochondrial DNA Part B. 2020;5:1817–9.
Boore JL. Animal mitochondrial genomes. Nucleic Acids Res. 1999;27:1767–80.
Doublet V, Ubrig E, Alioua A, Bouchon D, Marcade I, Marechal-Drouard L. Large gene overlaps and tRNA processing in the compact mitochondrial genome of the crustacean Armadillidium vulgare. RNA Biol. 2015;12:1159–68.
Fearnley I, m., Walker J E. Two overlapping genes in bovine mitochondrial DNA encode membrane components of ATP synthase. EMBO J. 1986;5:2003–8.
Hebert PDN, Ratnasingham S, de Waard JR. Barcoding animal life: cytochrome c oxidase subunit 1 divergences among closely related species. Proc R Soc Lond Ser B: Biol Sci. 2003;270:S96-9.
Philippe H, Delsuc F, Brinkmann H, Lartillot N. Phylogenomics. annual review of ecology, evolution, and systematics. Ann Rev. 2005;36:541–62.
Rokas A, Williams BL, King N, Carroll SB. Genome-scale approaches to resolving incongruence in molecular phylogenies. Nature. 2003;425:798–804.
Bernt M, Bleidorn C, Braband A, Dambach J, Donath A, Fritzsch G, et al. A comprehensive analysis of Bilaterian mitochondrial genomes and phylogeny. Mol Phylogenet Evol. 2013;69:352–64.
Ratnasingham S, Hebert PDN. Bold: the barcode of life data system (http://www.barcodinglife.org). Mol Ecol Notes. 2007;7:355–64.
Foster PG. Modeling compositional heterogeneity. Syst Biol. 2004;53:485–95.
Lim LHS, Timofeeva TA, Gibson DI. Dactylogyridean monogeneans of the siluriform fishes of the Old World. Syst Parasitol. 2001;50:159–97.
Zou H, Jakovlić I, Zhang D, Hua C-J, Chen R, Li W-X, et al. Architectural instability, inverted skews and mitochondrial phylogenomics of Isopoda: outgroup choice affects the long-branch attraction artefacts. R Soc Open Sci. 2020;7:191887.
Zhang D, Li WX, Zou H, Wu SG, Li M, Jakovlić I, et al. Mitochondrial genomes and 28S rDNA contradict the proposed obsoletion of the order Tetraonchidea (Platyhelminthes: Monogenea). Int J Biol Macromol. 2020;143:891–901.
Zhang D, Zou H, Wu SG, Li M, Jakovlić I, Zhang J, et al. Evidence for adaptive selection in the Mitogenome of a Mesoparasitic Monogenean Flatworm Enterogyrus malmbergi. Genes. 2019;10:863.
Olson PD, Littlewood DTJ. Phylogenetics of the Monogenea – evidence from a medley of molecules. Int J Parasitol. 2022;32:233–44.
Justine J-L, Lambert A, Mattei X. Spermatozoon ultrastructure and phylogenetic relationships in the monogeneans (Platyhelminthes). Int J Parasitol. 1985;15:601–8.
Boeger WA, Kritsky DC. Phylogeny and a revised classification of the Monogenoidea Bychowsky, 1937 (Platyhelminthes). Syst Parasitol. 1993;26:1–32.
We thank the late Jian-guo Yin for his help during the sampling trip.
This work was supported by the National Natural Science Foundation of China (grant nos. 31960737 and 31860738) and the Natural Science Foundation of Xinjiang Uygur Autonomous Region (grant no. 2021D01B56). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
Ethics approval and consent to participate
All procedures contributing to this work comply with the ethical standards of the relevant national and institutional guides on the care and use of laboratory animals, and the study was approved by the Animal Care and Use Committee of Xinjiang Agricultural University (permit no. 2019021).
Consent for publication
The authors declare that there are no conflicts of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The drawing of a Dactylogyrus simplex Bychowsky, 1936 specimen. Figure S2. The drawing of a Dactylogyrus tuba Linstow, 1878 specimen. Figure S3. The mitogenomic architecture of the studied Monopisthocotylea dataset. Figure S4. The NUC_ML mitochondrial phylogenomic analysis of the Monopisthocotylea. Figure S5. The NUC_ML_12PCGs mitochondrial phylogenomic analysis of the Monopisthocotylea. Figure S6. The ITS1-based BI phylogenetic analysis of the Dactylogyridea. Figure S7. Sequencing chromatogram comprising the ‘problematic’ segment comprising the 3’ end of cytb, 5’ end of nad4L and 10-bp intergenic space between them in D. simplex.
Comparative mitogenomics of Polyopisthocotylea. Taxonomy, mitogenome size and base composition. Worksheet B. Gene comparison in Dactylogyridea. Size, start/stop codons and skews.
About this article
Cite this article
Hao, CL., Wei, NW., Liu, YJ. et al. Mitochondrial phylogenomics provides conclusive evidence that the family Ancyrocephalidae is deeply paraphyletic. Parasites Vectors 16, 83 (2023). https://doi.org/10.1186/s13071-023-05692-6