Horizontal transfer of β-carbonic anhydrase genes from prokaryotes to protozoans, insects, and nematodes
© Zolfaghari Emameh et al. 2016
Received: 3 February 2016
Accepted: 1 March 2016
Published: 16 March 2016
Horizontal gene transfer (HGT) is a movement of genetic information occurring outside of normal mating activities. It is especially common between prokaryotic endosymbionts and their protozoan, insect, and nematode hosts. Although beta carbonic anhydrase (β-CA) plays a crucial role in metabolic functions of many living organisms, the origin of β-CA genes in eukaryotic species remains unclear.
This study was conducted using phylogenetics, prediction of subcellular localization, and identification of β-CA, transposase, integrase, and resolvase genes on the MGEs of bacteria. We also structurally analyzed β-CAs from protozoans, insects, and nematodes and their putative prokaryotic common ancestors, by homology modelling.
Our investigations of a number of target genomes revealed that genes coding for transposase, integrase, resolvase, and conjugation complex proteins have been integrated with β-CA gene sequences on mobile genetic elements (MGEs) which have facilitated the mobility of β-CA genes from bacteria to protozoan, insect, and nematode species. The prokaryotic origin of protozoan, insect, and nematode β-CA enzymes is supported by phylogenetic analyses, prediction of subcellular localization, and homology modelling.
MGEs form a complete set of enzymatic tools, which are relevant to HGT of β-CA gene sequences from prokaryotes to protozoans, insects, and nematodes.
Horizontal, or lateral, gene transfer (HGT or LGT) refers to movement of genetic information across normal mating barriers, between more or less phylogenetically distinct organisms, and thus stands in distinction to the standard vertical transmission of genes from parent to offspring. HGT is proving to be a more influential evolutionary mechanism than 20th-century scientists ever thought . Most early, and even current, evidence for HGT in eukaryotes comes from study of protists [2, 3].
Mobile genetic elements (MGEs) are segments of DNA, encoding enzymes and other proteins, which mediate the movement of DNA in HGT within genomes (intracellular mobility) or between cells (intercellular mobility) . Transposases and site-specific recombinases catalyse the intracellular movement of MGEs. Site-specific recombinases in bacteria fall into one of two very distinct families, the λ integrase-like enzymes and the resolvases/invertases . Recombinase interacts with a specific site in the DNA, brings the sites together in a synapse, and religates exchanged DNA strand to the host genome. Homologous recombination systems of the host also enable them to function in chromosomal deletions and other rearrangements . The majority of horizontally transferred genes are either eventually excluded or rapidly become nonfunctional in the recipient genome. However, there are some reports where horizontally transferred genes have shown high level of transcription [6, 7].
Many protists are phagotrophic and subsist by consuming bacteria. Subsequently, protozoan phagotrophs often live for long periods in environments where they are frequently exposed to bacterial DNA. One such example is the direct contact of bacteria and parasites in digestive system of ruminants .
Examples of HGT of prokaryotic genes to protozoans, insects, and nematodes
Prokaryotic gene donors
Protozoan, insect and nematode gene recipients
Horizontally transfered genes
Aedes aegypti (yellow fever mosquito), Anopheles gambiae (malaria mosquito), and Drosophila melanogaster
Antibiotic-resistance genes 
Anaerobic protozoans: Trichomonas vaginalis, Entamoeba histolytica, and Naegleria gruberi
Dictyostelium discoideum (soil-living amoeba)
18 prokaryotic genes 
Trypanosomatids: Leishmania spp., Angomonas deanei and Strigomonas culicis
Bacterial amino acid pathways 
Mitochondria (initiation point of apoptosis) 
β-proteobacteria and γ-proteobacteria
Trypanosomatids: Leishmania spp., Angomonas deanei and Strigomonas culicis
Heme synthesis gene 
Lateral gene transfer fragment (TvLF) 
Carbonic anhydrases (CAs) are ubiquitous metalloenzymes, which belong to six evolutionary divergent gene families, including α, β, γ, δ, ζ, and η [22, 23]. The active site of most CAs contains a zinc ion (Zn2+) which plays a critical role in the catalytic activity of the enzyme. CAs are involved in many biological processes, such as respiration involving transport of CO2 and bicarbonate between metabolizing tissues, regulation of pH homeostasis, electrolyte transfer, bone resorption, calcification, tumor progression, gluconeogenesis, lipogenesis, and ureagenesis [24–27]. In the past decade, a large number of putative β-CAs have been discovered in protozoans, arthropods, and nematodes [28–32], as well as in bacteria, fungi, algae, and plants . Despite the presence of β-CA sequences in genomes of many, if not most, living organisms, they are absent in vertebrate genomes [28, 29].
In this study, we investigated the possible origin of β-CA gene sequences in protozoans, insects, and nematodes by HGT from ancestral prokaryotes using phylogenetics, prediction of subcellular localization, and identification of β-CA, transposase, integrase, and resolvase genes on the MGEs of bacteria. We also structurally analyzed β-CAs from protozoans, insects, and nematodes and their putative prokaryotic common ancestors, by homology modelling. Our study suggests that HGT likely explains the presence of similar β-CA genes across multiple species living together in distinct environments.
Identification of β-CA gene and protein sequences
We collected all β-CA protein expressing bacteria which are endosymbiotic or pathogenic to a protozoan, insect, or nematode species from Uniprot (http://www.uniprot.org/) and EMBL-EBI databases (http://www.ebi.ac.uk/) (Additional file 1). In addition, we included ten β-CA protein sequences from endosymbiotic bacteria of protozoans, insects, and nematodes to the identification process, including: Afipia spp. (K8NQ88), Anaeromyxobacter spp. (A7HD59), Campylobacter spp. (K0I0K3), Salmonella spp. (Q8ZRS0), Gardnerella spp. (E3D7T4), Emticicia spp. (I2EZ21), Simkania spp. (F8L9G5), Nostoc spp. (Q8YT17), Exiguobacterium spp. (K0ACL8), and Fusobacterium spp. (C6JPI1). Moreover, we performed protein homology BLAST search for β-CA protein sequences from protozoans, insects, and nematodes in the EMBL-EBI BLAST database (http://www.ebi.ac.uk/Tools/sss/fasta/) to define bacterial β-CA protein homologs. A highly conserved region (102 amino acid residues, starting from three amino acid residues prior to the first highly conserved motif (CXDXR) was extracted from bacterial, protozoan, insect, and nematode β-CA protein sequences. These sequences were aligned using the Clustal Omega multiple sequence alignment (MSA) algorithm (http://www.ebi.ac.uk/Tools/msa/clustalo/) , and the results were visualized in Jalview (http://www.jalview.org/) .
Predicted sources of the β-CA genes. The tentative prokaryotic endosymbionts and their hosts are listed
Tentative prokaryotic endosymbiont (donor)
Protozoan, insect, and nematode hosts (acceptor)
Cesiribacter andamanensis (M7MX87)
Acanthamoeba castellanii (L8GR38) (Fig. 2a)
Leptospira kirschneri (M6X652)
Naegleria gruberi (Predicted 1, 2, 3)
Paramecium tetraurelia (A0BD61, A0CEX6, A0C922, A0BDB1, A0E8I0) (Fig. 2a)
Colwellia psychrerythraea (Q47YG3)
Ichthyophthirius multifiliis (G0QYZ1, G0QPN9)
Tetrahymena thermophila (Q22U21, Q22U16, I7M0M0, I7M748, I7LWM1, I7MDL7, Q23AV1, I7MD92)
Dictyostelium spp (Q555A3, Q55BU2, Q94473, F0Z7L1, F4PL43) (Fig. 2a)
Magnetospirillum magneticum (Q2VZD0)
Angomonas daenei (S9WXX9)
Strigominas culicis (S9TM82)
Leishmania spp (A4H4M7 as predicted, E9B8S3, A4HSV2, Q4QJ17, E9AKU0, S0CTX5) (Fig. 2a)
Insects and nematodes (F1LE18, G4V6B2, Q22460, Q5TU56, Q17N64, Q9VHJ5) (Fig. 2b)
Vesicomyosocius okutanii (A5CVM8)
Entamoeba spp (B0E7M0, 1C4LXK3, K2GQM0) (Fig. 2c)
Afipia felis (K8NQ88)
Acanthamoeba castellanii (L8GLS7) (Fig. 2c)
Bradyrhizobium japonicum (G7D846)
Selenomonas ruminantium (I0GLW8)
Trichomonas vaginalis (A2ENQ8, A2DLG4) (Fig. 2d)
Veillonella spp (F9N508)
Prediction of subcellular signals
Prediction of subcellular signals of defined protozoan, insect, and nematode β-CA protein sequences was performed using a subcellular signal prediction tool. Mitochondrial and secretory targeting peptides in β-CA protein sequences were predicted by TargetP 1.1 Server (http://www.cbs.dtu.dk/services/TargetP/) . Even if these targeting systems are only found in eukaryotes, bacterial sequences were analyzed as well to see if they contain regions similar to eukaryotic targeting signals. Based on the phylogenetic tree results, we performed this analysis on only those bacterial β-CA protein sequences, which had a predicted common ancestor with protozoan, insect, or nematode β-CA protein sequences. Specifically, this included Afipia felis (K8NQ88), Bradyrhizobium japonicum (G7D846), Cesiribacter andamanensis (M7MX87), Colwellia psychrerythraea (Q47YG3), Corallococcus coralloides (H8MJ17), Leptospira kirschneri (M6X652), Magnetospirillum magneticum (Q2VZD0), Selenomonas ruminantium (I0GLW8), Veillonella spp. (F9N508), and Vesicomyosocius okutanii (A5CVM8).
Identification of β-CA, transposase, integrase, resolvase, and conjugation complex protein (CCP) genes on the prokaryotic MGEs
Identification of β-CA, transposase, integrase, resolvase, and CCP genes on the bacterial MGEs was carried out using the plasmid database from EMBL-EBI (http://www.ebi.ac.uk/genomes/plasmid.html), and the Jena Prokaryote Genome Viewer (JPGV) (http://jpgv.fli-leibniz.de/cgi/index.pl) . JPGV contains a vast amount of information on most fully sequenced prokaryotic genomes and presents figures of linear and circular genome plots.
Identification of β-CA gene sequences on protozoan, insect, and nematode genomic DNA
Analyses regarding determination of precise locations of protozoan, insect and nematode β-CA genes in genomic DNA were performed using National Center for Biotechnology Information (NCBI) database (http://www.ncbi.nlm.nih.gov/). Furthermore, we utilized the Trichomonas vaginalis genome project database (TrichDB version 1.3) (http://trichdb.org/trichdb/)  and EMBL-EBI database (http://www.ebi.ac.uk/), for detection of β-CA genes in Trichomonas vaginalis (a protozoan parasite and the causative agent of trichomoniasis) and C. elegans respectively. Analysis of mitochondrial coding genes in Acanthamoeba castellanii (the most common free-living amoeba in soil and water) was performed using the NCBI database (http://www.ncbi.nlm.nih.gov/).
Homology models were prepared for β-CAs selected based on the phylogenetic analysis. The most similar eukaryotic and prokaryotic proteins within the phylogeny tree branch in question were selected using the percent identity matrix generated by Clustal Omega (http://www.ebi.ac.uk/Tools/msa/clustalo/) . For each of the selected proteins, the most similar protein structure was obtained using BLAST search targeted for the PDB database (http://www.rcsb.org/pdb/home/home.do). For each protein pair (eukaryotic and prokaryotic) analyzed here, the BLAST search resulted in the same template protein as follows: Clade A: Escherichia coli β-CA PDB 1I6P; Clade B: Pisum sativum β-CA PDB 1EKJ; Clade C: Mycobacterium tuberculosis β-CA PDB 1YM3; and Clade D: Methanobacterium thermoautotrophicum β-CA PDB 1G5C.
Clustal Omega (http://www.ebi.ac.uk/Tools/msa/clustalo/)  was used to prepare a sequence alignment for the modelled protein and the template protein sequence. The homology models were prepared using Modeller program (version 9.14) . The resulting models were structurally aligned by the BODIL program . A figure illustrating the homology models was prepared using the VMD program (version 1.9.1)  and edited with Adobe Photoshop (version 13.0.1).
The evaluation of the conserved residues in the homology models was performed by using multiple sequence alignments prepared by Clustal Omega algorithm (http://www.ebi.ac.uk/Tools/msa/clustalo/)  and by inspecting the homology models using program VMD program (version 1.9.1) .
Identification and phylogenetic analysis of β-CA protein sequences from defined bacterial, protozoan, insect, and nematode species
MrBayes/PhyML Settings and Results of Phylogentic Analysis
PhyML Boot Straps
Std Dev of Split Freq
Prediction of subcellular signals
Prediction of subcellular signals revealed that five protozoan (L8GR38, A4H4M7, S0CTX5, S9TM82, and I7MDL7) and three insect (Q5TU56, Q17N64, and Q9VHJ5) β-CA proteins probably contain mitochondrial targeting peptides. Even three bacterial β-CA proteins (K8NQ88, H8MJ17, and M6X652) contained N-terminal sequences sufficiently similar to mitochondrial targeting peptides so that mitochondrial prediction by TargetP 1.1 Server was positive. In addition, one protozoan β-CA protein (L8H861) sequence from A. castellanii is predicted to contain a signal peptide for the secretory pathway. The prediction tool provided no definitive localization for the other bacterial, protozoan, insect, and nematode β-CA proteins (Additional file 2).
Identification of β-CA, transposase, integrase, resolvase, and conjugation complex protein (CCP) coding sequences on the bacterial MGEs
Identification of β-CA gene sequences on protozoan, insect, and nematode genomic DNA
Analysis of the precise location of β-CA gene sequences in protozoan, insect, and nematode genetic structures revealed that all were located in chromosomal DNA (Additional file 4). Exon counts, for the group of studied β-CA gene sequences, vary in quantity from 1 to 11. The maximum exon counts were 8 for A. castellanii (Entry ID: L8GR38), and 11 for P. pacificus (Entry ID: H3EVA6) for protozoan and nematode species, respectively. Interestingly, some protozoan β-CA gene sequences included only one exon. The definitive locations of β-CA gene sequences are shown on linear genomic DNA from T. vaginalis (A2DLG4) (Additional file 5) and C. elegans (Q22460) (Additional file 6), whereas they are still unknown in many species. Analysis of the genes on circular mitochondrial DNA from A. castellanii revealed that none of the protozoan β-CAs were considered mitochondrial coding genes (data not shown).
Throughout their evolution all eukaryotes have been in close contact with bacteria, and while eukaryotrophs are comparatively rare there are numerous identified bacterial endosymbionts which have adapted to intracellular endosymbiosis with protozoan host species [1, 48–52]. In general, HGT of prokaryotic genes to protozoan genomes is probably much more common than vice versa . Interestingly, many bacteria are able to tolerate harsh conditions, such as presence of digestive enzymes in phagocytic vesicles, and survive inside protozoan species without any problems. The mechanisms of these efficient endosymbiotic and HGT phenomena are still unknown. There are multiple examples of highly efficient HGT, such as: from E. coli to protozoan ciliates, including T. thermophila and T. pyriformis ; from Klebsiella spp. to Salmonella spp. within the endosymbiotic environment of rumen protozoa of ruminants ; and from endosymbiont bacteria to Leishmania spp. during bacterial sepsis .
Multiple sequence alignment (MSA) of suspected protozoan, insect, and nematode β-CA protein sequences with previously defined bacterial β-CA proteins, revealed that all of the evaluated sequences contained the first (CXDXR) and second (HXXC) highly conserved motifs characteristic of β-CA proteins. Phylogenetic analysis revealed that protozoan, insect, and nematode β-CA protein sequences are mostly categorized as clade A or B β-CA protein structures, respectively.
Based on our phylogenetic analysis, A. castellanii possesses two β-CA genes, one from clade A and one from clade C. Our results in Fig. 2c, suggest that the β-CA gene of A. castellanii (L8GLS7) was potentially horizontally transferred from a bacterial species, which probably was a common ancestor of B. japonicum (G7D846) and A. felis (K8NQ88). In addition, previous studies have shown that B. japonicum  and A. felis  are endosymbionts of A. castellanii.
Phylogenetic analysis of clade A β-CAs (Fig. 2a), showed that all β-CAs in N. gruberi and P. tetraurelia protozoa have a common source with the single β-CA from spirochaetes bacteria, L. kirschneri (M6X652). Potentially, after HGT of a β-CA gene from the common source to these two protozoan hosts, the gene duplicated and created three different β-CA genes for N. gruberi (Predicted 1, 2, 3) and five for P. tetraurelia (A0BD61, A0CEX6, A0C922, A0BDB1, A0E8I0).
Among the various prokaryotic endosymbionts it is proposed that I. multifiliis, T. thermophila, and Dictyostelium spp. potentially have a distant common source with gammaproteobacteria C. psychrerythraea (Q47YG3), because there are multiple branch points between C. psychrerythraea and the other prokaryotic species. Gene duplication in these protozoans led to multiple copies of β-CA in I. multifiliis (G0QYZ1, G0QPN9), T. thermophila (Q22U21, Q22U16, I7M0M0, I7M748, I7LWM1, I7MDL7, Q23AV1, I7MD92), and Dictyostelium spp. (Q555A3, Q55BU2, Q94473, F0Z7L1, F4PL43) .
It has been shown earlier that essential amino acid and heme synthesis genes horizontally transferred from endosymbiont alpha, beta, and gammaproteobacteria to Trypanosomatidea [9, 50, 58]. Our phylogenetic results (Fig. 2a) revealed that β-CA genes in Trypanosomatidea, including Leishmania spp. (A4H4M7, E9B8S3, A4HSV2, Q4QJ17, E9AKU0, and S0CTX5), A. daenei (S9WXX9), and S. culicis (S9TM82) have a common source with an alphaproteobacterium similar to M. magneticum (Q2VZD0).
The phylogenetic analysis (Fig. 2b) showed that insect and nematode β-CAs belong to clade B and suggests that they may have a common source with myxobacterial β-CAs. The various myxobacteria Corallococcus, Enhygromyxa, Stigmatella, and Myxococcus, are part of the same subtree that contains insect and nematode β-CAs. However, a larger analysis with more insect, nematode, and plant β-CAs, which also belong to clade B, would be needed to fully resolve the relationships within this clade. Given the apparent distribution within insects and nematodes, in our limited analysis, this HGT would have occurred in the distant past. A single, very old transfer of β-CA gene to insects and nematodes would fit with the idea that heritable transfer to sexually reproducing organisms is significantly more difficult. Due to sequence divergence over 800 million years (estimated divergence time between nematodes and arthropods), our phyologenetic trees do not provide conclusive evidence for this, and it is thus possible to speculate that the β-CAs of clade B, which we see in insects and nematodes, have been retained from an ancestral eukaryote. However, it is tempting to assume that β-CAs of all four clades in protozoans, insects, and nematodes would have been derived by HGT from prokaryotes. In this context, we may also note that the HGT of β-CA gene sequences might have involved several mechanisms and genetic elements in addition to MGEs, such as genomic islands (GIs) and insertion sequence (IS) elements.
Phylogenetic analysis of clade C (Fig. 2c) revealed that β-CA genes from Entamoeba spp. (B0E7M0, 1C4LXK3, K2GQM0) have a common source with the β-CA gene of gammaproteobacterium V. okutanii (A5CVM8). From this result, we propose that β-CA genes horizontally transferred from an ancestral enteric gammaproteobacteria to Entamoeba spp. through a symbiotic or pathogenic relationship in the gut of arthropods, nematodes, or animals.
Phylogenetic analysis of clade D (Fig. 2d) revealed that β-CA genes in T. vaginalis (A2ENQ8, A2DLG4) have a common source with β-CA genes from firmicutes bacteria S. ruminantium (I0GLW8) and Veillonella spp. (F9N508). Previous results have shown that Clostridium sordellii and Veillonella spp. from firmicutes phylum and T. vaginalis have a symbiotic living situation in sexual organs of animals [18, 19], providing the environment in which a transfer of firmicutes bacteria β-CA gene sequence into the T. vaginalis genome is possible.
Prediction of subcellular signals of β-CA protein sequences revealed that some bacterial species (A. felis, L. kirschneri, and C. coralloides), protozoan species (A. castellanii, L. braziliensis, L. guyanensis, S. culicis, and T. thermophila), and insect species (A. gambiae, A. aegypti and D. melanogaster) include mitochondrial signals or similar bacterial sequences in their β-CA protein sequences (Additional file 2). It is well established that prokaryotes and some anaerobic protozoa, such as G. lamblia, E. histolytica, T. vaginalis, C. parvum, Blastocystis hominis, Encephalitozoon cuniculi, Sawyeria marylandensis, Neocallimastix patriciarum, and Mastigamoeba balamuthi completely lack mitochondria. In anaerobic protozoan species, mitochondrion-related organelles (MROs, mitosoms, or hydrogenosomes) replaced mitochondria in oxygen-restricted environments. Many studies have hypothesized that a majority of the mitochondrial genes in anaerobic parasitic protozoa have been acquired from α-proteobacterial genomes . The Monoamine oxidase (a mitochondrial outer membrane enzyme for metabolism of neuromediators) gene is one such example, and its sequence has been investigated thoroughly from bacterial to vertebrate lineages . Therefore, we hypothesize that sequences similar to mitochondrial localization signals emerged in β-CA proteins in prokaryotes, leading to their mitochondrial localization after HGT into protozoans and possibly insects. Supporting this idea, the β-CA of D. melanogaster has been experimentally shown to be localized in mitochondria [28, 29].
Identification of β-CA with transposase, integrase, resolvase, and CCP coding sequences in bacterial MGEs suggests that these genetic elements are a complete set of enzymatic tools, which are relevant to HGT. These accessory enzymes detect target sites on the genome of recipient protozoan species using complex mechanisms and create a conducive environment for integration of β-CA gene sequences. On the other hand, in some MGEs, including pSLT, pOU1113, pSCV50, and pKDSC50 from S. typhimurium (str. LT2), S. enterica, S. enterica (serovar Choleraesuis, str. SC-B67), and S. enterica (serovar Choleraesuis), respectively, β-CA is a virulence factor which is located at 5´ end of the resolvase gene . The MGEs from E. histolytica contain the coding sequence for B2 DNA polymerase . Analysis of the full genomes of protozoans revealed that all β-CA gene sequences were located on a single chromosome, although the precise chromosomal location for some protozoan β-CA genes is still pending (Additional file 4).
In order to evaluate the structural features of the identified β-CA proteins, we first analyzed the functional roles of the conserved residues. β-CAs have only a limited number of conserved residues essential for the protein fold and function . We demonstrated this by creating a MSA of the β-CAs included in the homology modelling analysis. Indeed, this analysis indicated strict conservation of only the active site residues plus one glycine (Fig. 1). We then further analyzed the residues conserved in β-CAs where eukaryotic and prokaryotic versions grouped together in phylogenetic analysis, i.e. those that we suspected were the result of HGT. One would expect that high similarity between proteins in distantly related species would exist due to two reasons: (1) convergent evolution or (2) HGT. In the possible case of convergent evolution, there should be a selective pressure towards a particular structural or functional feature in certain locations of the protein sequence. We analyzed this by selecting residues, which were found to be conserved between each pairing of phylogenetically grouped eukaryotic and prokaryotic β-CAs, but not in the β-CAs used as a template in homology modelling. Because this excludes the well known functional active site residues, the remaining conserved residues (especially the side chains) should have a particularly important role in the protein structure to cause convergent evolution. Within the ten conserved residues from the protein core selected for analysis of each homology model, we typically observed only a few hydrophobic contacts and in particular polar interactions were almost completely missing, even when considering possible rotamers of the surrounding residues. The result of this analysis thus implies that there are no structurally important roles for the majority of the conserved residues common for the protein pairs observed in the phylogeny analysis. This suggests that the proteins share their identical residues due to their origins in a relatively recent identical genetic source (HGT), not because of selection pressure towards the particular residue observed in each position.
Our present findings may shed some light into the question of why β-CA gene sequences are completely absent in the genomes of vertebrates. In protozoan and invertebrate metazoans, including insect and nematode species, β-CA gene sequences have integrated in nuclear chromosomes through the aid of some enzymatic functions included in MGEs, such as transposase, integrase, and resolvase. These enzymes function as site-specific cutters and snip the DNA of the recipient eukaryotic host. There are some possible reasons for the lack of HGT of β-CA gene sequences in vertebrate genomes. First, there may not be a specific transposable element insertion site within vertebrate genomes for these enzymatic cutters. Second, vertebrates are complex multicellular organisms in which evolutionarily stable integration of β-CA gene sequences would need to have taken place in the germ cells that give rise to egg and sperm cells . Finally, supposing successful integration of a β-CA gene sequence in the germ line, it may have then been removed by genetic assortment of the vertebrate hosts. Therefore, the lack of β-CA gene sequences from the vertebrate genomes is understandable, especially because there is no evolutionary pressure for the adoption of another CA class due to the presence of several efficient α-CAs in all vertebrates.
Many prokaryotic MGEs contain necessary enzyme gene sequences, such as transposase, integrase, and resolvase, together with β-CA. These enzymes can facilitate HGT of β-CA genes from prokaryotes to other prokaryotes (Pro-Pro) and eukaryotes (Pro-Euk). The results from both mitochondrial targeting signal prediction and phylogenetic analysis supported our hypothesis of HGT of β-CA gene sequences from endosymbiont bacteria to protozoan, insect, and nematode hosts by MGEs. The phylogenetic analysis suggests that different protozoan β-CA genes have various common ancestors among prokaryotes, divided between clades A, C and D of β-CAs. In contrast, the case of insect and nematode β-CA genes is more complex. We propose that they may have had a single common ancestor from a bacterial β-CA gene, however, their descent from an ancient eukaryote origin cannot be ruled out. In analysis of the conserved residues in the homology models of prokaryote/eukaryote pairs, we observed no particularly important structural reason for the high sequence homology. This finding speaks against convergent evolution as a reason for the high similarity between the proteins and supports the idea of HGT as a source of the β-CA gene in eukaryotic species.
To perform these studies RZE received a scholarship support from the Ministry of Science, Research and Technology, and National Institute of Genetic Engineering and Biotechnology of Islamic Republic of Iran. Also, this work was supported by the Academy of Finland, Finnish Cultural Foundation (Pirkanmaa Regional Fund for RZE and Maili Autio Fund for HRB), Sigrid Juselius Foundation, Jane and Aatos Erkko Foundation, Tampere Tuberculosis Foundation, and Competitive Research Funding of the Tampere University Hospital for SP.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Andersson JO. Lateral gene transfer in eukaryotes. Cell Mol Life Sci. 2005;62(11):1182–97.View ArticlePubMedGoogle Scholar
- Keeling PJ, Palmer JD. Horizontal gene transfer in eukaryotic evolution. Nat Rev Genet. 2008;9(8):605–18.View ArticlePubMedGoogle Scholar
- Top EM, Springael D. The role of mobile genetic elements in bacterial adaptation to xenobiotic organic compounds. Curr Opin Biotechnol. 2003;14(3):262–9.View ArticlePubMedGoogle Scholar
- Frost LS, Leplae R, Summers AO, Toussaint A. Mobile genetic elements: the agents of open source evolution. Nat Rev Microbiol. 2005;3(9):722–32.View ArticlePubMedGoogle Scholar
- Thorpe HM, Smith MC. In vitro site-specific integration of bacteriophage DNA catalyzed by a recombinase of the resolvase/invertase family. Proc Natl Acad Sci U S A. 1998;95(10):5505–10.View ArticlePubMedPubMed CentralGoogle Scholar
- Toussaint A, Merlin C. Mobile elements as a combination of functional modules. Plasmid. 2002;47(1):26–35.View ArticlePubMedGoogle Scholar
- Hou Q, He J, Yu J, Ye Y, Zhou D, Sun Y, et al. A case of horizontal gene transfer from to C6/36 cell line. Mob Genet Elements. 2014;4(1):e28914.View ArticlePubMedPubMed CentralGoogle Scholar
- Wernegreen JJ. Endosymbiosis: lessons in conflict resolution. PLoS Biol. 2004;2(3):E68.View ArticlePubMedPubMed CentralGoogle Scholar
- Catta-Preta CM, Brum FL, da Silva CC, Zuma AA, Elias MC, de Souza W, et al. Endosymbiosis in trypanosomatid protozoa: the bacterium division is controlled during the host cell cycle. Front Microbiol. 2015;6:520.PubMedPubMed CentralGoogle Scholar
- Greub G, Raoult D. Microorganisms resistant to free-living amoebae. Clin Microbiol Rev. 2004;17(2):413–33.View ArticlePubMedPubMed CentralGoogle Scholar
- Skriwan C, Fajardo M, Hagele S, Horn M, Wagner M, Michel R, et al. Various bacterial pathogens and symbionts infect the amoeba Dictyostelium discoideum. Int J Med Microbiol. 2002;291(8):615–24.View ArticlePubMedGoogle Scholar
- Bertelli C, Greub G. Lateral gene exchanges shape the genomes of amoeba-resisting microorganisms. Front Cell Infect Microbiol. 2012;2:110.View ArticlePubMedPubMed CentralGoogle Scholar
- Sun HY, Noe J, Barber J, Coyne RS, Cassidy-Hanley D, Clark TG, et al. Endosymbiotic bacteria in the parasitic ciliate Ichthyophthirius multifiliis. Appl Environ Microbiol. 2009;75(23):7445–52.View ArticlePubMedPubMed CentralGoogle Scholar
- Beier CL, Horn M, Michel R, Schweikert M, Gortz HD, Wagner M. The genus Caedibacter comprises endosymbionts of Paramecium spp. related to the Rickettsiales (Alphaproteobacteria) and to Francisella tularensis (Gammaproteobacteria). Appl Environ Microbiol. 2002;68(12):6043–50.View ArticlePubMedPubMed CentralGoogle Scholar
- Fujishima M, Kodama Y. Endosymbionts in Paramecium. Eur J Protistol. 2012;48(2):124–37.View ArticlePubMedGoogle Scholar
- Nakajima T, Sano A, Matsuoka H. Auto-/heterotrophic endosymbiosis evolves in a mature stage of ecosystem development in a microcosm composed of an alga, a bacterium and a ciliate. Biosystems. 2009;96(2):127–35.View ArticlePubMedGoogle Scholar
- Siegmund L, Burmester A, Fischer MS, Wostemeyer J. A model for endosymbiosis: interaction between Tetrahymena pyriformis and Escherichia coli. Eur J Protistol. 2013;49(4):552–63.View ArticlePubMedGoogle Scholar
- Fichorova RN, Buck OR, Yamamoto HS, Fashemi T, Dawood HY, Fashemi B, et al. The villain team-up or how Trichomonas vaginalis and bacterial vaginosis alter innate immunity in concert. Sex Transm Infect. 2013;89(6):460–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Smutna T, Goncalves VL, Saraiva LM, Tachezy J, Teixeira M, Hrdy I. Flavodiiron protein from Trichomonas vaginalis hydrogenosomes: the terminal oxygen reductase. Eukaryot Cell. 2009;8(1):47–55.View ArticlePubMedPubMed CentralGoogle Scholar
- Boucher Y, Douady CJ, Papke RT, Walsh DA, Boudreau ME, Nesbo CL, et al. Lateral gene transfer and the origins of prokaryotic groups. Annu Rev Genet. 2003;37:283–328.View ArticlePubMedGoogle Scholar
- Kjeldsen KU, Obst M, Nakano H, Funch P, Schramm A. Two types of endosymbiotic bacteria in the enigmatic marine worm Xenoturbella bocki. Appl Environ Microbiol. 2010;76(8):2657–62.View ArticlePubMedPubMed CentralGoogle Scholar
- Elleuche S, Poggeler S. Carbonic anhydrases in fungi. Microbiology. 2010;156(Pt 1):23–9.View ArticlePubMedGoogle Scholar
- Del Prete S, Vullo D, Fisher GM, Andrews KT, Poulsen SA, Capasso C, et al. Discovery of a new family of carbonic anhydrases in the malaria pathogen falciparum- the eta-carbonic anhydrases. Bioorg Med Chem Lett. 2014;24(18):4389–96.View ArticlePubMedGoogle Scholar
- Alterio V, Vitale RM, Monti SM, Pedone C, Scozzafava A, Cecchi A, et al. Carbonic anhydrase inhibitors: X-ray and molecular modeling study for the interaction of a fluorescent antitumor sulfonamide with isozyme II and IX. J Am Chem Soc. 2006;128(25):8329–35.View ArticlePubMedGoogle Scholar
- Nishimori I, Minakuchi T, Onishi S, Vullo D, Scozzafava A, Supuran CT. Carbonic anhydrase inhibitors. DNA cloning, characterization, and inhibition studies of the human secretory isoform VI, a new target for sulfonamide and sulfamate inhibitors. J Med Chem. 2007;50(2):381–8.View ArticlePubMedGoogle Scholar
- Vullo D, Franchi M, Gallori E, Antel J, Scozzafava A, Supuran CT. Carbonic anhydrase inhibitors. Inhibition of mitochondrial isozyme V with aromatic and heterocyclic sulfonamides. J Med Chem. 2004;47(5):1272–9.View ArticlePubMedGoogle Scholar
- Vullo D, Innocenti A, Nishimori I, Pastorek J, Scozzafava A, Pastorekova S, et al. Carbonic anhydrase inhibitors. Inhibition of the transmembrane isozyme XII with sulfonamides-a new target for the design of antitumor and antiglaucoma drugs? Bioorg Med Chem Lett. 2005;15(4):963–9.View ArticlePubMedGoogle Scholar
- Zolfaghari Emameh R, Barker H, Tolvanen ME, Ortutay C, Parkkila S. Bioinformatic analysis of beta carbonic anhydrase sequences from protozoans and metazoans. Parasit Vectors. 2014;7:38.View ArticlePubMedPubMed CentralGoogle Scholar
- Syrjanen L, Tolvanen M, Hilvo M, Olatubosun A, Innocenti A, Scozzafava A, et al. Characterization of the first beta-class carbonic anhydrase from an arthropod (Drosophila melanogaster) and phylogenetic analysis of beta-class carbonic anhydrases in invertebrates. BMC Biochem. 2010;11:28.View ArticlePubMedPubMed CentralGoogle Scholar
- Zolfaghari Emameh R, Syrjanen L, Barker H, Supuran CT, Parkkila S. Drosophila melanogaster: a model organism for controlling Dipteran vectors and pests. J Enzyme Inhib Med Chem. 2015;30(3):505–13.View ArticlePubMedGoogle Scholar
- Zolfaghari Emameh R, Barker H, Hytonen VP, Tolvanen ME, Parkkila S. Beta carbonic anhydrases: novel targets for pesticides and anti-parasitic agents in agriculture and livestock husbandry. Parasit Vectors. 2014;7:403.View ArticlePubMedPubMed CentralGoogle Scholar
- Zolfaghari Emameh R, Kuuslahti M, Vullo D, Barker HR, Supuran CT, Parkkila S. Ascaris lumbricoides beta carbonic anhydrase: a potential target enzyme for treatment of ascariasis. Parasit Vectors. 2015;8:479.View ArticlePubMedPubMed CentralGoogle Scholar
- Smith KS, Jakubzick C, Whittam TS, Ferry JG. Carbonic anhydrase is an ancient enzyme widespread in prokaryotes. Proc Natl Acad Sci U S A. 1999;96(26):15184–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Sievers F, Higgins DG. Clustal omega. Curr Protoc Bioinformatics. 2014;48:3.13.1–16.View ArticleGoogle Scholar
- Waterhouse AM, Procter JB, Martin DM, Clamp M, Barton GJ. Jalview Version 2--a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25(9):1189–91.View ArticlePubMedPubMed CentralGoogle Scholar
- Marchler-Bauer A, Derbyshire MK, Gonzales NR, Lu S, Chitsaz F, Geer LY, et al. CDD: NCBI’s conserved domain database. Nucleic Acids Res. 2015;43(Database issue):D222–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Slater GS, Birney E. Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics. 2005;6:31.View ArticlePubMedPubMed CentralGoogle Scholar
- Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34(Web Server issue):W609–12.View ArticlePubMedPubMed CentralGoogle Scholar
- Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Hohna S, et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61(3):539–42.View ArticlePubMedPubMed CentralGoogle Scholar
- Guindon S, Delsuc F, Dufayard JF, Gascuel O. Estimating maximum likelihood phylogenies with PhyML. Methods Mol Biol. 2009;537:113–37.View ArticlePubMedGoogle Scholar
- Emanuelsson O, Brunak S, von Heijne G, Nielsen H. Locating proteins in the cell using TargetP, SignalP and related tools. Nat Protoc. 2007;2(4):953–71.View ArticlePubMedGoogle Scholar
- Romualdi A, Felder M, Rose D, Gausmann U, Schilhabel M, Glockner G, et al. GenColors: annotation and comparative genomics of prokaryotes made easy. Methods Mol Biol. 2007;395:75–96.View ArticlePubMedGoogle Scholar
- Aurrecoechea C, Brestelli J, Brunk BP, Carlton JM, Dommer J, Fischer S, et al. GiardiaDB and TrichDB: integrated genomic resources for the eukaryotic protist pathogens Giardia lamblia and Trichomonas vaginalis. Nucleic Acids Res. 2009;37(Database issue):D526–30.View ArticlePubMedPubMed CentralGoogle Scholar
- Eswar N, Webb B, Marti-Renom MA, Madhusudhan MS, Eramian D, Shen MY, et al. Comparative protein structure modeling using Modeller. Curr Protoc Bioinformatics. 2006;Chapter 5:Unit 5.6.Google Scholar
- Lehtonen JV, Still DJ, Rantanen VV, Ekholm J, Bjorklund D, Iftikhar Z, et al. BODIL: a molecular modeling environment for structure-function analysis and drug design. J Comput Aided Mol Des. 2004;18(6):401–19.View ArticlePubMedGoogle Scholar
- Humphrey W, Dalke A, Schulten K. VMD: visual molecular dynamics. J Mol Graph. 1996;14(1):33–8. 27–8.View ArticlePubMedGoogle Scholar
- Leplae R, Lima-Mendez G, Toussaint A. ACLAME: a CLAssification of Mobile genetic Elements, update 2010. Nucleic Acids Res. 2010;38(Database issue):D57–61.View ArticlePubMedPubMed CentralGoogle Scholar
- Eichinger L, Pachebat JA, Glockner G, Rajandream MA, Sucgang R, Berriman M, et al. The genome of the social amoeba Dictyostelium discoideum. Nature. 2005;435(7038):43–57.View ArticlePubMedPubMed CentralGoogle Scholar
- Huang J, Mullapudi N, Lancto CA, Scott M, Abrahamsen MS, Kissinger JC. Phylogenomic evidence supports past endosymbiosis, intracellular and horizontal gene transfer in Cryptosporidium parvum. Genome Biol. 2004;5(11):R88.View ArticlePubMedPubMed CentralGoogle Scholar
- Alves JM, Voegtly L, Matveyev AV, Lara AM, da Silva FM, Serrano MG, et al. Identification and phylogenetic analysis of heme synthesis genes in trypanosomatids and their bacterial endosymbionts. PLoS One. 2011;6(8):e23518.View ArticlePubMedPubMed CentralGoogle Scholar
- Strese A, Backlund A, Alsmark C. A recently transferred cluster of bacterial genes in Trichomonas vaginalis-lateral gene transfer and the fate of acquired genes. BMC Evol Biol. 2014;14:119.View ArticlePubMedPubMed CentralGoogle Scholar
- Motta MC, Martins AC, de Souza SS, Catta-Preta CM, Silva R, Klein CC, et al. Predicting the proteins of Angomonas deanei, Strigomonas culicis and their respective endosymbionts reveals new aspects of the trypanosomatidae family. PLoS One. 2013;8(4):e60209.View ArticlePubMedPubMed CentralGoogle Scholar
- Matsuo J, Oguri S, Nakamura S, Hanawa T, Fukumoto T, Hayashi Y, et al. Ciliates rapidly enhance the frequency of conjugation between Escherichia coli strains through bacterial accumulation in vesicles. Res Microbiol. 2010;161(8):711–9.View ArticlePubMedGoogle Scholar
- McCuddin ZP, Carlson SA, Rasmussen MA, Franklin SK. Klebsiella to Salmonella gene transfer within rumen protozoa: implications for antibiotic resistance and rumen defaunation. Vet Microbiol. 2006;114(3–4):275–84.View ArticlePubMedGoogle Scholar
- Endris M, Takele Y, Woldeyohannes D, Tiruneh M, Mohammed R, Moges F, et al. Bacterial sepsis in patients with visceral leishmaniasis in Northwest Ethiopia. Biomed Res Int. 2014;2014:361058.View ArticlePubMedPubMed CentralGoogle Scholar
- La Scola B, Mezi L, Auffray JP, Berland Y, Raoult D. Patients in the intensive care unit are exposed to amoeba-associated pathogens. Infect Control Hosp Epidemiol. 2002;23(8):462–5.View ArticlePubMedGoogle Scholar
- La Scola B, Raoult D. Afipia felis in hospital water supply in association with free-living amoebae. Lancet. 1999;353(9161):1330.View ArticlePubMedGoogle Scholar
- Alves JM, Klein CC, da Silva FM, Costa-Martins AG, Serrano MG, Buck GA, et al. Endosymbiosis in trypanosomatids: the genomic cooperation between bacterium and host in the synthesis of essential amino acids is heavily influenced by multiple horizontal gene transfers. BMC Evol Biol. 2013;13:190.View ArticlePubMedPubMed CentralGoogle Scholar
- Makiuchi T, Nozaki T. Highly divergent mitochondrion-related organelles in anaerobic parasitic protozoa. Biochimie. 2014;100:3–17.View ArticlePubMedGoogle Scholar
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409(6822):860–921.View ArticlePubMedGoogle Scholar
- Stavrinides J, Kirzinger MW, Beasley FC, Guttman DS. E622, a miniature, virulence-associated mobile element. J Bacteriol. 2012;194(2):509–17.View ArticlePubMedPubMed CentralGoogle Scholar
- Pastor-Palacios G, Lopez-Ramirez V, Cardona-Felix CS, Brieba LG. A transposon-derived DNA polymerase from Entamoeba histolytica displays intrinsic strand displacement, processivity and lesion bypass. PLoS One. 2012;7(11):e49964.View ArticlePubMedPubMed CentralGoogle Scholar
- Sawaya MR, Cannon GC, Heinhorst S, Tanaka S, Williams EB, Yeates TO, et al. The structure of beta-carbonic anhydrase from the carboxysomal shell reveals a distinct subclass with one active site for the price of two. J Biol Chem. 2006;281(11):7546–55.View ArticlePubMedGoogle Scholar
- Andersson JO, Doolittle WF, Nesbo CL. Genomics. Are there bugs in our genome? Science. 2001;292(5523):1848–50.View ArticlePubMedGoogle Scholar
- Baldini F, Segata N, Pompon J, Marcenac P, Robert Shaw W, Dabire RK, et al. Evidence of natural Wolbachia infections in field populations of Anopheles gambiae. Nat Commun. 2014;5:3985.View ArticlePubMedPubMed CentralGoogle Scholar
- Dunning Hotopp JC, Clark ME, Oliveira DC, Foster JM, Fischer P, Munoz Torres MC, et al. Widespread lateral gene transfer from intracellular bacteria to multicellular eukaryotes. Science. 2007;317(5845):1753–6.View ArticlePubMedGoogle Scholar
- Portal-Celhay C, Nehrke K, Blaser MJ. Effect of Caenorhabditis elegans age and genotype on horizontal gene transfer in intestinal bacteria. FASEB J. 2013;27(2):760–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Loftus B, Anderson I, Davies R, Alsmark UC, Samuelson J, Amedeo P, et al. The genome of the protist parasite Entamoeba histolytica. Nature. 2005;433(7028):865–8.View ArticlePubMedGoogle Scholar
- Taylor-Brown E, Hurd H. The first suicides: a legacy inherited by parasitic protozoans from prokaryote ancestors. Parasit Vectors. 2013;6:108.View ArticlePubMedPubMed CentralGoogle Scholar