Suitability of internal transcribed spacers (ITS) as markers for the population genetic structure of Blastocystis spp
Parasites & Vectors volume 7, Article number: 461 (2014)
The purpose of this study was to assess the genetic variation and differentiation of Blastocystis subtypes (STs) recovered from symptomatic children by analysing partial sequences of the small subunit rDNA gene region (SSUrDNA) and internal transcribed spacers (1 and 2) plus the 5.8S region (ITS, ITS1 + 5.8S + ITS2) and comparing with isolates from other countries.
Faecal samples from 47 Blastocystis-infected children with gastrointestinal symptoms and negative for pathogenic enterobacteria were analysed. PCR was performed on DNA from all the samples to identify Blastocystis STs, amplifying a fragment of SSUrDNA and the ITS region. The amplicons were purified and sequenced, and consensus sequences were submitted to GenBank; afterwards, SSUrDNA sequences were analysed for genetic diversity according to geographic area. Regarding the Blastocystis STs found, 51% were ST1, 23% ST2, 19% ST3 and 2% ST7. For ITS, a haplotype network tree and Bayesian inference revealed the presence of two novel variants of ST1, clustering some sequences into ST1A and ST1B. The values of nucleotide diversity (π) and haplotype polymorphism (θ) for ST1, ST2 and ST3 ranged from 0 to 1, whereas the ratio of genetic differentiation (FST)/migration index (Nm) showed the highest differentiation between Libya and Thailand-Philippines for ST2 (0.282/0.63). In contrast, a high flow gene was observed between Czech Republic-Denmark-Holland-Spain and USA-Mexico-Colombia for ST1 (0.003/84).
Our data on genetic differentiation and gene flow might explain the differences for the prevalence of Blastocystis STs. Moreover, the ITS region could be used as a genetic marker to assess genetic variation in this parasite.
Blastocystis spp are very common intestinal parasites of humans and animals, with a prevalence that varies from country to country and among various communities within the same country -. In addition, this parasite shows extensive polymorphisms, as reflected by microscopy or molecular analysis. Such polymorphisms have allowed for the recognition of at least 17 genetic subtypes (STs) in the genus Blastocystis based on the analysis of its small subunit rDNA (SSUrDNA) ,-. In eukaryotes, DNA sequences coding for rDNA genes constitute transcription units located on nuclear chromosomes, which are organised to form a simple multigenic family comprising a high number of repeated copies in tandem. The internal transcribed spacer includes two spacers, ITS1 and ITS2, separated by the 5.8S rRNA gene. These spacer regions evolve much faster than coding regions because substitutions occurring in spacers may be considered neutral mutations without any constraints. Therefore, the large amount of research available with regard to the usefulness of ITS sequences as excellent markers for species distinction has led scientists to consider both, but mainly ITS-2, as the best tool for problematic taxa, such as cryptic species. Furthermore, the 5.8S rRNA gene, although being too short to effectively indicate robust phylogenies across large time scales, shows levels of gene conservation similar to SSUrDNA, which is the most frequently used region for genetic typing analyses in Blastocystis,.
As there is scarce knowledge of the evolution, ecology, and population genetics in this parasite, the purpose of the present study was to assess the genetic variation of Blastocystis STs recovered from symptomatic children from Michoacan state, Mexico, by analysing partial SSUrDNA sequences and internal transcribed spacers (1 and 2) plus the 5.8S region (ITS, ITS1 + 5.8S + ITS2) and comparing with isolates from other countries.
Frozen faecal samples from 47 children infected by Blastocystis (22 males, 25 females, mean age of 8 ± 3.3 years), who were treated at the Hospital Infantil de Morelia “Eva Samano de Lopez Mateos” due to gastrointestinal symptoms and were negative for pathogenic enterobacteria, were analysed. Clinical and demographic data were obtained from medical records, and information about the Blastocystis load (reported as “more than” or “less than” five Blastocystis cells per 40X field) was obtained from laboratory files; variables were analysed by χ2 tests. The Ethics and Research Committee of the hospital approved the study, and informed consent was obtained from one parent of each child.
Stool DNA was extracted from approximately 250 mg faeces using QIAamp DNA Stool Mini Kit (QIAGEN Inc., Germany). A partial SSUrDNA sequence (~500 bp) was obtained using the primers reported by Santín et al. A new specific set of primers for amplifying the ITS1 + 5.8S + ITS2 (ITS) region was designed based on the alignment of highly conserved areas that delimited this region of available Blastocystis sequences and other related organisms in GenBank (Blastocystis hominis, AY125914-AY125919; Proteromonas lacertae, AY224080; Fucus vesiculosus, EF625890, AF102930, AF102913, AF102920, AF102929; Thalassiosira pseudonana, HF565129-HF565132; Laminaria digitata, FJ042772, FJ042773, FJ042764). The designed forward and reverse primers were ITS_Blas_F (5’-GGAAGGTGAAGTCGTAACAAG-3’) and ITS_Blas_R (5’-CAGCAGGTCTTCTTRCTTGA-3’).
DNA (2 μL) was used to amplify the genomic sequences in a 25-μL PCR reaction. For ITS1-5.8S-ITS2 region amplification, the reaction contained 200 pmol of each nucleotide, 1X PCR buffer (8 mM Tris–HCl, pH 8, 20 mM KCl, 1 mM MgCl2), 1X dNTPs, 0.01 mg BSA, and 1 U Taq DNA Polymerase (Promega). After the first denaturation step at 94°C for 5 min, 35 cycles of denaturation at 94°C for 30 s, annealing at 60°C for 45 s, and extension at 72°C for 30 s were performed, followed by a final extension step at 72°C for 10 min. The amplicons were subjected to electrophoresis in 1.2% agarose gels and then purified and sequenced on both strands by a commercial service.
All sequences were subjected to a BLAST search in the GenBank database; multiple alignments were performed using the CLUSTAL W  and Muscle  programmes with manual adjustment in MEGA 5.05 software . The best fit model of nucleotide substitution was determined using the Akaike Information Criterion in Modeltest version 3.7 software ; for both markers, it was the Hasegawa Kishino Yano model with gamma distribution and invariable sites. The phylogenetic reconstruction using Bayesian inference was performed with the Mr. Bayes 3.1.2 programme -. The analysis was performed for 10 million generations with sampling trees every 100 generations. Trees with scores lower than those at the stationary phase (burn-in) were discarded, and the trees that reached the stationary phase were collected and used to build majority consensus trees. Other sequences of both markers were obtained from GenBank and used as subtype references. A Median Joining Network analysis  was generated using NETWORK 4.611 software (fluxus-engineering.com) with the default settings and assumptions. A genetic diversity analysis within and between populations was performed using DnaSPv4  and included nucleotide diversity (π), haplotype polymorphism (θ), gene flow (Nm) and genetic differentiation index (FST). These indexes refer to the following: π, the average proportion of nucleotide differences between all possible pairs of sequences in the sample; θ, the proportion of nucleotide sites that are expected to be polymorphic in any suitable sample from this region of the genome. Both indexes are used to assess polymorphisms at the DNA level and to monitor diversity within or between ecological populations and examine the genetic variation in related species or their evolutionary relationships. FST is a typical genetic statistic used to measure differentiation between or among populations. The common used values for genetic differentiation are as follows: 0 to 0.05, small; 0.05 to 0.15, moderate; 0.15 to 0.25, great; values above 0.25 indicate huge genetic differentiation. The gene flow or migration index (Nm) refers to the movement of organisms among subpopulations; those strongly differentiated have an Nm < < 1, whereas an Nm > 4 behaves as a single panmictic unit .
Table 1 summarises the clinical, parasitological and genetic data of the infected children. Although 47 faecal samples were analysed, clinical information for 7 patients was not recovered because the clinical records were incomplete and the parent who gave informed consent was not sure of the symptoms. Co-infection was common with other commensal single protozoa including Endolimax nana (19%), Entamoeba coli (15%) and Entamoeba hartmanni (13%). Only 10 samples (21%) presented >5 Blastocystis stages per 40X field, with the vacuolar form being the most frequent stage found. Regarding Blastocystis STs, 43 sequences were obtained for SSUrDNA and 46 for ITS. For five samples, we did not have a sufficient amount of DNA with adequate quality to obtain amplicons, most likely due to the activity of several DNases that degrade nucleic acids and therefore decreased the DNA concentration in the samples . A BLAST search confirmed that each sequence matched Blastocystis. All amplicons for SSUrDNA were ~500 bp. For ITS, the amplicons ranged from 530 to 620 bp, showing products of ~550, ~530, ~620 and ~590 bp for ST1, 2, 3 and 7, respectively. Regarding Blastocystis STs, 51% were ST1, 23% ST2, 19% ST3 and 2% ST7 by SSUrDNA and/or ITS. Only two cases (samples 17 and 34) did not match, which might have been due to a phenomenon of co-infection between STs because the sequencing of several clones is required to detect potential co-infections by different STs when SSUrDNA markers are used. No statistical associations between the symptoms, parasite load and Blastocystis STs were found.
Figure 1 shows selected alignments of the highly polymorphic ITS region used in this study, including representative sequences of each ST found. Clear nucleotide differences can be observed along the alignments within STs, particularly for ST1A and ST1B at nucleotide positions 181–242. Phylogenetic trees were built for the SSUrDNA (Figure not shown) and ITS (Figure 2) regions using all available sequences recorded in GenBank. Our ITS sequences were grouped into the clades ST1, −2, −3 and −7; interestingly, this inference also revealed the presence of two novel variants of ST1, clustering some sequences into ST1A and ST1B. The haplotype network (Figure 3) also exhibited the presence of two novel variants of ST1, which allowed for clearly distinguishing within each ST and showed the great genetic variability between them. The SSUrDNA (270) and ITS (56) sequences of Blastocystis ST1, −2 and −3 from Libya (Africa), Thailand-Philippines-Japan (Asia), Czech Republic-Denmark-Holland-Spain (Europe), and USA-Colombia (North and Latin America) deposited in the GenBank databases, as well as our sequences (accession numbers from KM213425 to KM213513), were analysed for their genetic variability. The average nucleotide diversity (π) and haplotype polymorphism (θ) for SSUrDNA was 0.0179 ± 0.0112 and 0.0355 ± 0.0076, respectively (Figure 4), with 0.0533 ± 0.0339 and 0.0642 ± 0.0428, respectively, found for ITS. The analysis regarding continents showed that π and θ were highest in the European population and lowest in the Libyan population (Figure 4, values inside of the continent image). Information on the π and θ data for human intestinal parasites is rather limited: for helminths, these values range from 0.004 to 0.025 for π and from 0.005 to 0.02 for θ -; for Giardia lamblia, π ranges from 0 to 0.07 ; and for Entamoeba histolytica, these values are π = 0.004 and θ = 0.51 (Martinez-Hernandez F, unpublished data).
The ratio coancestry coefficient (FST)/migration index (Nm) for SSUrDNA showed the highest differentiation between the Libyan and Thailand-Philippines populations for ST2 (0.282/0.63); in contrast, a high flow gene was observed between the European and American populations for ST1 (0.003/84) (Figure 4, values over the arrows). As Africa had few sequences (ranged from 4 to 7), the differences in the genetic variation indexes were too large (ranging from 0 to 1).
In the present study, ITS was found to be more polymorphic than SSUrDNA (the ITS sequences showed 3 times more variants than SSUrDNA) and thus is a good marker for research on genetic diversity, as for other parasites ,,,,. Therefore, its use in population genetic analyses offers reliable results on the genetic variability within populations of this parasite. Regardless, as there are only a few sequences of Blastocystis ITS regions available in GenBank, a robust population genetic analysis using this marker is not possible. In addition, because ITS allows for distinguishing between ST1, −2, −3 and −7 using only one PCR directly applied to faecal samples, this marker could be used for Blastocystis ST diagnosis, avoiding the use of PCR-sequencing . However, a limiting factor for this genetic marker is the lack of information about the length of the ITS region of other relevant human STs, such as ST4. An interesting finding in the present study was that the ITS analysis allowed for the clear identification of two novel variants of ST1, as recognised by Bayesian and haplotype network inferences. Recently, Ramírez et al.  analysed several faecal samples containing Blastocystis recovered from humans, birds and other mammals; the ST identification was performed according to DNA barcoding, and a phylogenetic reconstruction using a Maximum Composite Likelihood was performed. Interestingly, the ST1 cluster exhibited two clades, indicating variants, but this was not sufficiently discussed.
The SSUrDNA analysis revealed that ST1 and ST2 showed the lowest genetic differentiation, with a high gene flow between the American and European populations, whereas the match between ST1 and ST2 for the Libyan and American populations showed a higher genetic differentiation with little gene flow. Interestingly, ST3 exhibited the most gene flow (migrants) for Thailand-Philippines vs Libya, America and Europe. Libya had a low number of sequences (ranging from 4 to 7); thus, differences in the variation of the genetic values were also large (ranging from 0 to 1). Alfellani et al. determined the prevalence of Blastocystis STs in populations from Africa and UK and compiled and contrasted the relative distribution of Blastocystis STs across continents, finding a significant variation in ST prevalence between populations. Our results support the analysis of Alfellani et al.  because for ST3, these authors found frequencies up to 40% for major geographic regions, except for America, where the frequency was approximately 25%. According to our FST/Nm data, the gene flow (Nm) of ≈ 4 for America-Europe and America-Africa as well as for America-Asia (Nm ≈ 9) suggest a high migrant flow, but less than Europe-Asia (Nm ≈ 12) and Africa-Asia (Nm ≈ ∞), indicating a vast migrant flow among these areas. For ST2, America shows the highest frequency (up to 30%), with low values of Nm <4 for America-Africa and America-Asia but Nm = 33 for America-Europe; these results could mean that ST2 from America is more isolated than in other major geographic regions. Finally, ST1 shows contrasting values because it was more frequent in America and East/Southeast Asia and lower in Europe and Africa. The FST/Nm data showed Nm ≤ 4 among Africa vs Europe-America-Asia, whereas Nm ranged from 4 to 84 between Europe and America-Asia-Africa. Therefore, due to the current globalisation, the variable frequencies observed in ST prevalence might indirectly show the genetic differentiation and gene flow of each ST population around the world.
Poirier et al.  described high polymorphism between copies of the SSUrDNA gene of Blastocystis ST7 and Stensvold et al., emphasised a remarkable intra-ST genetic diversity and some cryptic host specificity for Blastocystis. Considering that we used part of SSUrDNA in our genetic analyses, we can speculate that these biological features may be a secondary effect due to variation in other genes; most likely by a direct genetic hitchhiking phenomenon  or by an indirect association, similar to what has been proposed for other parasites .
Internal transcribed spacers
Small subunit rDNA gene
Genetic differentiation index
Boorom KF, Smith H, Nimri L, Viscogliosi E, Spanakos G, Parkar U, Li LH, Zhou XN, Ok UZ, Leelayoova S, Jones MS: Oh my aching gut: irritable bowel syndrome, Blastocystis, and asymptomatic infection. Parasit Vectors. 2008, 1: 40-10.1186/1756-3305-1-40.
Alfellani MA, Stensvold CR, Vidal-Lapiedra A, Onuoha ES, Fagbenro-Beyioku AF, Clark CG: Variable geographic distribution of Blastocystis subtypes and its potential implications. Acta Trop. 2013, 126: 11-18. 10.1016/j.actatropica.2012.12.011.
Scanlan PD, Stensvold CR:Blastocystis: getting to grips with our guileful guest. Trends Parasitol. 2013, 29: 523-529. 10.1016/j.pt.2013.08.006.
Dunn LA, Boreham PF, Stenzel DJ: Ultrastructural variation of Blastocystis hominis stocks in culture. Int J Parasitol. 1989, 19: 43-56. 10.1016/0020-7519(89)90020-9.
Tan KS: New insights on classification, identification, and clinical relevance of Blastocystis spp. Clin Microbiol Rev. 2008, 21: 639-665. 10.1128/CMR.00022-08.
Ragavan ND, Govind SK, Chye TT, Mahadeva S: Phenotypic variation in Blastocystis sp. ST3. Parasit Vectors. 2014, 7: 404-10.1186/1756-3305-7-404.
Stensvold CR, Alfellani M, Clark CG: Levels of genetic diversity vary dramatically between Blastocystis subtypes. Infect Genet Evol. 2012, 12: 263-273. 10.1016/j.meegid.2011.11.002.
Alfellani MA, Taner-Mulla D, Jacob AS, Imeede CA, Yoshikawa H, Stensvold CR, Clark CG: Genetic diversity of Blastocystis in livestock and zoo animals. Protist. 2013, 164: 497-509. 10.1016/j.protis.2013.05.003.
Mas-Coma S, Bargues MD: Populations, hybrids and the systematic concepts of species and subspecies in Chagas disease triatomine vectors inferred from nuclear ribosomal and mitochondrial DNA. Acta Trop. 2009, 110: 112-136. 10.1016/j.actatropica.2008.10.013.
Nolan MJ, Cribb TH: The use and implications of ribosomal DNA sequencing for the discrimination of digenean species. Adv Parasitol. 2005, 60: 101-163. 10.1016/S0065-308X(05)60002-4.
Santín M, Gómez-Muñoz MT, Solano-Aguilar G, Fayer R: Development of a new PCR protocol to detect and subtype Blastocystis spp. from humans and animals. Parasitol Res. 2011, 109: 205-212. 10.1007/s00436-010-2244-9.
Thompson JD, Higgins DG, Gibson TJ, CLUSTAL W: Improving the sensitivity and progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.
Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Posada D, Crandall KA: Modeltest: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-818. 10.1093/bioinformatics/14.9.817.
Huelsenbeck JP, Ronquist F, Nielsen R, Bollback JP: Bayesian inference of phylogeny and its impact on evolutionary biology. Science. 2001, 14: 2310-2314. 10.1126/science.1065889.
Holder M, Lewis PO: Phylogeny estimation: traditional and Bayesian approaches. Nat Rev Genet. 2003, 4: 275-284. 10.1038/nrg1044.
Ronquis F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574. 10.1093/bioinformatics/btg180.
Bandelt HJ, Forster P, Röhl A: Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999, 16: 37-48. 10.1093/oxfordjournals.molbev.a026036.
Rozas J, Sanchez-DelBarrio JC, Messeguer X, Rozas R: DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics. 2003, 19: 2496-2497. 10.1093/bioinformatics/btg359.
Hartl DL, Clark AG: Principles of Population Genetics. 1997, Sinauer Associates, Inc Publishers, Sunderland, Massachusetts
Mathis A, Deplazes P: Copro-DNA tests for diagnosis of animal taeniid cestodes. Parasitol Int. 2006, 55: S87-S90. 10.1016/j.parint.2005.11.012.
Miranda RR, Tennessen JA, Blouin MS, Rabelo EM: Mitochondrial DNA variation of the dog hookworm Ancylostoma caninum in Brazilian populations. Vet Parasitol. 2008, 151: 61-67. 10.1016/j.vetpar.2007.09.027.
Martinez-Hernandez F, Jimenez-Gonzalez DE, Chenillo P, Alonso-Fernandez C, Maravilla P, Flisser A: Geographical widespread of two lineages of Taenia solium due to human migrations: can population genetic analysis strengthen this hypothesis?. Infect Genet Evol. 2009, 9: 1108-1114. 10.1016/j.meegid.2009.09.005.
Sharma M, Fomda BA, Mazta S, Sehgal R, Bagicha Singh B, Malla N: Genetic diversity and population genetic structure analysis of Echinococcus granulosus sensu stricto complex based on mitochondrial DNA signature. PLoS One. 2013, 8: 12-
Teodorovic S, Braverman JM, Elmendorf HG: Unusually low levels of genetic variation among Giardia lamblia isolates. Eukaryot Cell. 2007, 6: 1421-1430. 10.1128/EC.00138-07.
Pomajbíková K, Oborník M, Horák A, Petrželková KJ, Grim JN, Levecke B, Todd A, Mulama M, Kiyang J, Modrý D: Novel insights into the genetic diversity of balantidium and balantidium-like cyst-forming ciliates. PLoS Negl Trop Dis. 2013, 7: e2140-e2150. 10.1371/journal.pntd.0002140.
Lopez-Escamilla E, Sanchez-Aguillon F, Alatorre-Fernandez CP, Aguilar-Zapata D, Arroyo-Escalante S, Arellano T, Moncada-Barron D, Romero-Valdovinos M, Martinez-Hernandez F, Rodriguez-Zulueta P, Maravilla P: New Tetratrichomonas species in two patients with pleural empyema. J Clin Microbiol. 2013, 51: 3143-3146. 10.1128/JCM.01056-13.
Ramírez JD, Sánchez LV, Bautista DC, Corredor AF, Flórez AC, Stensvold CR:Blastocystis subtypes detected in humans and animals from Colombia. Infect Genet Evol. 2014, 22: 223-228. 10.1016/j.meegid.2013.07.020.
Poirier P, Wawrzyniak I, Albert A, El Alaoui H, Delbac F, Livrelli V: Development and evaluation of a real-time PCR assay for detection and quantification of Blastocystis parasites in human stool samples: prospective study of patients with hematological malignancies. J Clin Microbiol. 2011, 49: 975-983. 10.1128/JCM.01392-10.
Stensvold CR, Suresh GK, Tan KS, Thompson RC, Traub RJ, Viscogliosi E, Yoshikawa H, Clark CG: Terminology for Blastocystis subtypes–a consensus. Trends Parasitol. 2007, 23: 93-96. 10.1016/j.pt.2007.01.004.
Smith JM, Haigh J: The hitch-hiking effect of a favourable gene. Genet Res. 1974, 23: 23-35. 10.1017/S0016672300014634.
Palafox-Fonseca H, Zúñiga G, Bobes RJ, Govezensky T, Piñero D, Texco-Martínez L, Fleury A, Proaño J, Cárdenas G, Hernández M, Sciutto E, Fragoso G: Genetic variation in the Cytb gene of human cerebral Taenia solium cysticerci recovered from clinically and radiologically heterogeneous patients with neurocysticercosis. Mem Inst Oswaldo Cruz. 2013, 108: 914-920. 10.1590/0074-0276130308.
The authors thank Ana Flisser for her critical comments. This work was sponsored by Grants Conacyt 182089 and 168619.
The authors declare that they have no competing interests.
GV, GEO-M, ML-P and EL-E collected the samples and performed the coprological assays. GV, MLP and ELE performed the PCR, purification of amplicons and sequencing assays. GV and FMH performed the genetic analysis. AC-A, LR-G, AO-D, MR-V, PM and FM-H formulated the idea, obtained the authorisations and improved the analysis of data. All authors participated during the discussion and writing of the manuscript and approved its final version.
About this article
Cite this article
Villalobos, G., Orozco-Mosqueda, G.E., Lopez-Perez, M. et al. Suitability of internal transcribed spacers (ITS) as markers for the population genetic structure of Blastocystis spp. Parasites Vectors 7, 461 (2014). https://doi.org/10.1186/s13071-014-0461-2