- Open Access
Genetic diversity in the transmission-blocking vaccine candidate Plasmodium vivax gametocyte protein Pvs230 from the China–Myanmar border area and central Myanmar
Parasites & Vectors volume 15, Article number: 371 (2022)
Sexual stage surface antigens are potential targets of transmission-blocking vaccines (TBVs). The gametocyte and gamete surface antigen P230, a leading TBV candidate, is critical for red blood cell binding during exflagellation and subsequent oocyst development. Here, the genetic diversity of Pvs230 was studied in Plasmodium vivax parasite isolates from the China–Myanmar border (CMB) and central Myanmar.
Plasmodium vivax isolates were collected in clinics from malaria-endemic areas of the CMB (143 samples) and Myanmar (23 samples). The interspecies variable part (IVP, nucleotides 1–807) and interspecies conserved part (ICP, 808–2862) of Pvs230 were amplified by PCR and sequenced. Molecular evolution studies were conducted to evaluate the genetic diversity, signature of selection, population differentiation, haplotype network, and population structure of the study parasite populations and publicly available Pvs230 sequences from six global P. vivax populations.
Limited genetic diversity was observed for the CMB (π = 0.002) and Myanmar (π = 0.001) isolates. Most amino acid substitutions were located in the IVP and cysteine-rich domain of Pvs230. Evidence of positive selection was observed for IVP and purifying selection for ICP. Codon-based tests identified specific codons under natural selection in both IVP and ICP. The fixation index (FST) showed low genetic differentiation between East and Southeast Asian populations, with FST ranging from 0.018 to 0.119. The highest FST value (FST = 0.503) was detected between the Turkey and Papua New Guinea populations. A total of 92 haplotypes were identified in global isolates, with the major haplotypes 2 and 9 being the most abundant and circulating in East and Southeast Asia populations. Several detected non-synonymous substitutions were mapped in the predicted structure and B-cell epitopes of Pvs230.
We detected low levels of genetic diversity of Pvs230 in global P. vivax populations. Geographically specific haplotypes were identified for Pvs230. Some mutations are located within a potential B-cell epitope region and need to be considered in future TBV designs.
The National Malaria Control Programs (NMCPs) in the Greater Mekong Subregion (GMS) have made substantial strides in malaria control, and in the past 9 years, most of the GMS countries endemic for malaria are making progress in reducing their malaria burden . However, this success is faltering in some areas because of multidrug-resistant parasites, cross-border human migration, and the epidemics of the novel coronavirus disease 2019 (COVID-19) [2, 3, 4, 5]. Therefore, to ensure the most effective outcomes, the NMCPs in the GMS countries should design and adopt new strategies to eliminate malaria. The development of a highly effective malaria vaccine is an essential component of malaria control and eventual eradication. Transmission-blocking vaccines (TBVs) that reduce parasite transmission from humans to mosquitoes would be an important component toward the goal of malaria elimination.
TBVs induce antibodies that target the malaria antigens expressed in sexual stages (gametocyte, gamete, zygote, and ookinete), and are responsible for reducing the oocyst density in the mosquito midgut . Based on the expression profile of target antigens, TBV candidates can be categorized into pre- (before zygote formation) and post-fertilization antigens. The expression of post-fertilization antigens, such as the zygote and ookinete surface antigens, P25/28, is typically limited to the mosquito stage of the parasite life cycle . P25 and P28 are effective for inducing transmission-blocking activity (TBA), and have exhibited a transmission-blocking effect in phase I clinical trials, albeit with some adverse events . Post-fertilization antigens are not expressed in blood-stage parasites and hence are not exposed to the human immune response by natural malaria infections. In contrast, pre-fertilization antigens P230 and P48/45, which belong to a family of 6-cysteine-containing domain (6-cys) proteins, are both found on the surface of gametocytes and gametes during infection, and therefore are exposed to the host immune system . Pfs230 is a 363-kDa protein containing 14 cysteine-motif (CM) domains and elicits a transmission-reducing humoral immune response during natural malaria infection [10, 11, 12]. In Pfs230-disrupted parasites, the exflagellation centers, oocyst intensity, and mosquito infectivity are significantly reduced . In standard membrane-feeding assays (SMFA), monoclonal antibodies against Pfs230 exhibited strong transmission-reducing activity (TRA) [14, 15]. Antibodies raised against both the N-terminal pro-domain (443–558 amino acids) and CM domains (443–1132 aa) and the CM1 domains (589–730 aa) of the Pfs230 protein exhibited efficient TBA [16, 17, 18]. Collectively, these results indicate that Pfs230 is a promising target for TBV.
A 6-cys domain protein in Plasmodium vivax has been identified as an orthologue of P230 in Plasmodium falciparum. Although Pfs230 has been studied extensively, limited studies have focused on Pvs230. Genetic diversity studies for vaccine candidate genes in malaria-endemic areas will provide useful information for vaccine design. Studies on asexual-stage vaccine candidate genes, such as ama1, revealed that the polymorphic antigen-induced immune response is allelic variant-dependent, which will limit the efficacy of the vaccine design based on this antigen [19, 20]. To date, no genetic diversity data exist for Pvs230 in the China–Myanmar border (CMB) region and Myanmar, where P. vivax transmission persists. Thus, in the present study, the polymorphisms, neutral selections, genetic differentiation, haplotype network, and population structure of Pvs230 were analyzed in P. vivax populations from this region. These analyses revealed limited genetic diversity in the Pv230 gene. Neutrality tests identified Pvs230 domains that are under positive and purifying selection. Although Pvs230 haplotypes seem to be continent-specific, little genetic differentiation was found among East and Southeast Asia populations. The same major haplotypes were circulating in these endemic areas, which may facilitate TBV design for controlling malaria in these regions.
Sample collection and ethics
A total of 166 blood samples were collected by finger-pricking between April 2011 and October 2012 in the CMB region . Twenty-three samples were collected near Mandalay in Myanmar in 2015. Plasmodium vivax infections were determined by either rapid diagnostic test (RDT) or microscopic examination of Giemsa-stained thick smears. All research involving human subjects in this study received ethical approval from the Health Department of Kachin in Myanmar and the institutional ethics committees of China Medical University, China. All volunteers gave written informed consent to participate in the study and provide blood samples.
DNA extraction, polymerase chain reaction (PCR), and sequencing of the Pvs230 gene
Genomic DNA was extracted from the collected P. vivax samples using the TIANamp Genomic DNA kit (TIANGEN, Beijing, China). The interspecies variable part (IVP, corresponding to nucleotides [nt] 1–807) and the interspecies conserved part (ICP, nt 808–2862) of the Pvs230 gene, based on the Sal-I reference (PVX_003905 in PlasmoDB), were amplified by two independent PCRs using two sets of primers (Additional file 1: Table S1). The 20 μl PCRs consisted of 1× KOD-Plus-Neo buffer, 200 μM dNTPs, 1 mM MgSO4, 0.25 µM each specific forward and reverse primer, 0.4 units of KOD-Plus-Neo DNA polymerase (Toyobo, Osaka, Japan), and 1 μl of sample DNA. The reaction was run at 94 °C for 5 min, followed by 45 cycles of 94 °C for 30 s, 55 °C for 15 s, and 68 °C for 3 min, with a final extension at 68 °C for 5 min. The PCR products were purified using the ExoSAP-IT reagent (Thermo Fisher, MA, USA) and subjected to Sanger sequencing (BGI Science, Beijing, China) on both strands with primers listed in Additional file 1: Table S1 using an ABI Prism® BigDye™ cycle sequencing kit (Applied Biosystems, CA, USA) on an ABI 3730XL DNA analyzer.
Sequence assembling and polymorphism analysis
Raw sequence data of Pvs230 were assembled, edited, and aligned to the P. vivax Sal-I reference sequence using the CLUSTALW program in MEGA 7.0 . A 2811-base-pair (bp) region (nt 1–2862) encompassing the IVP and ICP of Pvs230 (containing cysteine-rich domain [CRD]I–IV) was assembled, excluding the primer sequences. To investigate the global population structure of Pvs230, 108 publicly available Pvs230 sequences were retrieved from GenBank, representing six locations: Papua New Guinea (PNG) (AB574595–AB574615), Thailand (AB574575–AB574594), Central China (Hubei province, AB574574–AB574555), Madagascar (AB574548–AB574554), Turkey (AB574528–AB574547), and Brazil (AB574508–AB574527) . Molecular evolution analysis of the Pvs230 gene, including the number of polymorphic sites (S), the total number of mutations (η), the average number of nucleotide differences (k), nucleotide diversity (π), the number of haplotypes (H), haplotype diversity (Hd), and the number of synonymous mutations (SP) and non-synonymous mutations (NS), was performed using DnaSP, version 6.12.03 . Sliding window plots of π values across the IVP and ICP of Pvs230 were generated using a 90-bp sliding window with 3-bp step size . Sequences generated in this study were submitted to GenBank (accession numbers OP429256–OP429421).
To test the neutral theory of evolution in the Pvs230 gene, dN–dS, Tajima’s D, Fu and Li’s D* and F*, and McDonald–Kreitman (MK) indices were calculated using DnaSP v6.12.03 . The rates of synonymous (dS) and non-synonymous (dN) mutations were estimated and compared by the Z-test (P < 0.05) in MEGA 7 using the Nei–Gojobori method with Jukes and Cantor correction . A value of dN–dS > 0 implies a positive selection, whereas dN–dS < 0 designates a purifying selection . In Tajima’s D test, the departure from neutrality in the nucleotide frequency distributions was determined by comparing the values of θ (estimated nucleotide diversity) derived from π and S . Fu and Li’s D* and F* tests were used to test the hypothesis that all mutations are selectively neutral, and the mutations in the external branches were compared to the genealogy branches . Positive values of Tajima’s D and Fu and Li’s D* and F* tests correspond to positive diversifying selection or population structuring due to an excess of variants at intermediate frequencies, whereas a negative value indicates population size expansion and/or negative/purifying selection. Sliding window plots with a window size of 90 and a step size of 3 were also generated for the Tajima’s D and Fu and Li’s D* and F* values, and dN–dS was used to identify regions of Pvs230 under selection . The sequence of Plasmodium cynomolgi P230 (GenBank accession no. AB574620) was used as the interspecies outgroup for the MK test, which allows for determination of the ratio of synonymous substitutions to non-synonymous substitutions between and within species . An excess of the ratio between species versus within species suggests purifying selection. A two-tailed Fisher’s exact test was computed to determine statistical significance (P < 0.05). A codon-based test using SLAC (Single-Likelihood Ancestor Counting)  and FUBAR (Fast, Unconstrained Bayesian AppRoximation)  methods implemented in the Datamonkey web server  was used to identify the existence of positive selection pressure on individual amino acids of the protein. Sites were considered under positive selection if the dN–dS indicated high statistical significance (P < 0.05).
F ST, haplotype network construction, and phylogenetic analysis
The degree of genetic differentiation in Pvs230 among global isolates was estimated by calculating Wright’s fixation index (FST) using Arlequin software, version 188.8.131.52 . Interpretation of FST values is defined as described previously , with no differentiation (0), low genetic differentiation (≤ 0.15), moderate genetic differentiation (0.15–0.25), and high differentiation (≥ 0.25). A P-value < 0.05 was considered significant difference. To investigate the genetic relatedness among the Pvs230 haplotypes, a haplotype network was constructed using the median-joining (MJ) method implemented in the PopART v1.7 software with a haplotype frequency > 1 . A phylogenetic tree of Pvs230 sequences (nt 52–2862) from the global isolates  was constructed by the NJ method  with a bootstrap of 1000 replicates using MEGA 7.0 software . Final trees were drawn and edited using iTOL version 6 (https://itol.embl.de/).
Population structure and clusters of global isolates based on Pvs230 haplotypes were investigated using the Bayesian model-based clustering method implemented in STRUCTURE software, version 2.3.4 . Ten independent runs were performed with values of assumed clusters (K) ranging from 1 to 10. Each run was carried out with a “burn-in” period of 105 and Markov chain Monte Carlo (MCMC) length of 2 × 105 iterations under an admixture model with correlated allele frequencies among populations. The most probable K was inferred by the Evanno et al. method  using the online Structure Harvester v0.6.94 . CLUMPP software  was used to address label switching within the determined value of K, using the greedy algorithm to increase speed, and a run of 1000 random repeats of the data. The graphical depiction of the STRUCTURE results was generated using Distruct software .
Prediction of B-cell epitopes concerning single-nucleotide polymorphisms (SNPs) in IVP and ICP of Pvs230
The B-cell epitopes in IVP and ICP of Pvs230 were predicted using the ABCpred server (www.imtech.res.in/raghava/abcpred) with a threshold of 0.7 and a window length of 16 , and using BepiPred-2.0 (https://services.healthtech.dtu.dk/service.php?BepiPred-2.0) with a threshold of 0.5 and peptide length > 10 . The three-dimensional (3D) structure of Pvs230 protein was predicted using the I-TASSER meta-threading procedure (https://zhanglab.ccmb.med.umich.edu/I-TASSER/) . The overlapped regions in the B-cell epitopes from the prediction software and the mutation sites were mapped on a 3D structure of Pvs230 using PyMOL software (http://www.scalacs.org/TeacherResources).
Sequence polymorphism in Pvs230 gene
The Pvs230 gene was successfully amplified and sequenced from 166 P. vivax samples collected in the CMB area (n = 143) and central Myanmar (n = 23). For the sequenced region (nt 52–2862), there were 30 and 19 polymorphic sites in isolates from the CMB and Myanmar, respectively (Table 1). Of these, 14 and 13 polymorphic sites were in the IVP (nt 52–808), and 16 and 6 polymorphic sites were in the ICP (CRDI–IV, 808–2862 bp), respectively (Table 1). The average number of pairwise nucleotide differences (k) of the CMB isolates for the entire sequence, IVP, and ICP were 5.558, 3.518, and 2.041, respectively (Table 1). In comparison, the k values for these Pvs230 fragments of the central Myanmar isolates were lower, at 3.553, 2.071, and 1.482, respectively. Nucleotide diversity (π) for the entire sequence, IVP, and ICP of Pvs230 among the 143 P. vivax isolates of the CMB isolates were 0.002, 0.006, and 0.001, respectively (Table 1). For the Myanmar isolates, these values were 0.001, 0.004, and 0.001. A sliding window plot of π revealed values ranging from 0.000 to 0.023 for the CMB isolates and 0.000 to 0.016 for the Myanmar isolates (Fig. 1a). The CMB isolates had 47 Pvs230 haplotypes, with an overall Hd of 0.935 (Table 1). The 23 Myanmar isolates had 15 haplotypes, with an Hd value of 0.925 (Table 1).
The Pvs230 sequences obtained from this study were compared with published sequences from six other geographical regions. A total of 48 mutations were identified among the 274 global and reference Pvs230 sequences, including 16 synonymous SNPs (SP SNPs) and 32 non-synonymous SNPs (NS SNPs) (Table 2). The majority of the NS SNPs (17 sites, 53.1%) were clustered within the IVP, resulting in a peak π of 0.026 for this region (Fig. 1a). A total of 92 haplotypes were identified among the global isolates, demonstrating a high level of haplotype diversity (Hd = 0.968). Nucleotide diversity was 0.002 for the global samples, higher in East/Southeast Asia and Africa than in South America and Oceania, and the lowest in West Asia (Turkey) (Table 2). Except for the CMB and Thailand isolates, the π value in each population was lower than that in the global samples, suggesting country-specific substitutions. A sliding window plot of Pvs230 in the global isolates revealed a peak π value of 0.026 at nt 207–479 of the IVP (Fig. 1a), including 10 of 46 amino acid changes (Fig. 1b).
Two repeat sequences (RGXXXGXHXVIH and RVVH/RVIH/CVVA/RVIQ/RDVH) and degenerative repeats (DEDGD and DGND) were detected in the IVP of Pvs230 (Fig. 2 and Additional file 2: Table S2). The number of repeats was variable among isolates: 2–5 times for the RGXXXGXHXVIH motif, and 0–5 times for the RVVH/RVIH/CVVA/RVIQ/RDVH motif in the CMB and Myanmar isolates. The absence of the RVVH/RVIH/CVVA/RVIQ/RDVH motif was only observed in Myanmar, Thailand, and PNG isolates. Variation in the number of the second repeat DEDGD–VDDD–DGND was limited and geographically restricted. One DEDGD was observed in all countries except Myanmar and PNG, where the repeat number was 0–2 (Fig. 2 and Additional file 2: Table S2). In addition, 1–2 repeats of DGND were observed in the CMB and Madagascar isolates.
Evidence of positive diversifying selection on Pvs230 gene
To examine whether natural selection contributed to the generation of the diversity in Pvs230 of the CMB and Myanmar isolates, dN–dS, Tajima’s D, Fu and Li’s D* and F*, and MK test were performed. Although the values of Tajima’s D, and D* and F* were not significant for either the CMB or Myanmar isolates, differences in Pvs230 domains existed, including positive values in the IVP for the CMB isolates and negative values in the ICP in both regions. Significant values of dN over dS were observed in the entire Pvs230 sequence and IVP in isolates from both regions, suggesting positive diversifying selection. Sliding window plots depicted significant negative D, D*, and F* values in the IVP of Myanmar isolates. Meanwhile, significant negative D* and F* test values were detected in the ICP of CMB isolates, supporting purifying selection acting on this domain (Fig. 3). These observations were further supported by the SLAC and FUBAR tests identifying purifying selection on site 627, positive selection on 287 and 308 in the ICP of the CMB isolates, and positive selection on 287 in the Myanmar isolates (Additional file 3: Table S3). The MK test of the CMB isolates showed that the entire sequenced region (neutrality index [NI] = 6.410, P < 0.001), particularly within IVP (7.233, P < 0.05), had significantly more non-synonymous than synonymous substitutions expected from the comparison with P. cynomolgi. A similar pattern from the MK test was observed in the Myanmar isolates (Table 1), although the values were not significant. Together, these results suggest that polymorphisms found for Pvs230, especially in the IVP, are maintained by diversifying selection, presumably due to host immune pressure.
F ST analysis
Plasmodium vivax population differentiation was evaluated using the FST test. Pairwise population comparisons showed the highest level of genetic differentiation (FST = 0.609) between the West Asia (Turkey) and the Oceania (PNG) populations. High FST values (0.261–0.609) were detected between the East Asia (Central China) and Southeast Asia (Myanmar) populations, between West Asia and Brazil and other Asia populations, and between Oceania with Asia (Thailand, Myanmar, Turkey), Africa, and Brazil populations. A moderate range of FST values (0.173–0.249) were detected when comparing the CMB Myanmar, Oceania, and Africa populations with the Myanmar and West Asia populations, and with South America, Thailand, and Africa populations. FST values among East and Southeast Asia populations approached zero, indicating a high degree of genetic similarity. The lowest FST value (0.038) was observed between CMB and Thailand populations, suggesting extensive gene flow between these populations (Table 3).
Genetic relationships between global isolates
A total of 92 Pvs230 haplotypes were identified in 274 global isolates, demonstrating a high level of haplotype diversity across the eight analyzed populations (Hd = 0.968). Of the 92 haplotypes, 63% (58/92) were single, and 84.8% (78/92) were region-specific. There was also significant haplotype sharing between regions and continents (Fig. 4a and b). For example, haplotypes 2, 3, 4, 5, 7, and 9 were shared between East and Southeast Asia populations. Haplotype 8 was the only haplotype present in three continents (Fig. 4a and b). The haplotype network showed three clusters, consistent with the phylogenetic analysis (Additional file 4: Fig. S1). The NJ tree revealed geographical clustering by countries or regions, with the Madagascar, Turkey, and Brazil haplotypes only observed in Group I. Group II consisted of admixed haplotypes from Oceania and three adjacent Asia populations (CMB, Myanmar, and Thailand), while 97.4% (37/38) of group III haplotypes were from CMB, Myanmar, Thailand, and Central China populations (Additional file 4: Fig. S1). Similar to the phylogenetic results, the STRUCTURE plot showed that 274 isolates from the eight global populations were grouped into three clusters (Fig. 5a, b). The parasite populations from the global isolates had admixed haplotypes, except for Madagascar and Brazil, where haplotypes were classified into only one cluster (Fig. 5c).
Evaluating the haplotypes within the ICP revealed 44 haplotypes from 274 global isolates, of which 45.5% were single (Additional file 5: Fig. S2). Individual haplotype prevalence ranged from 0.36 to 24.7%. There were region-specific haplotypes (e.g., haplotypes 1, 19, and 20 in Brazil). Haplotypes 4 and 5 were shared among the GMS populations. There was also haplotype sharing among continents (e.g., haplotypes 10, 11, and 14).
Analysis of Pvs230 structure and antigenicity
The modeled structure of Pvs230 was used to determine the distribution of the B-cell epitopes and the polymorphic residues. Six predicted potential B-cell epitopes were identified in the ICP of Pvs230 (Fig. 6 and Table 4). The overlap region of the putative B-cell epitopes from two prediction software algorithms is illustrated in the deduced 3D model shown in red (with polymorphic residues, N623–K649, K693–N764, and K836–S851) and yellow (without polymorphic residues, T333–A354, E419–M448, and N792–E818), respectively (Fig. 6). A moderate-level nucleotide and haplotype diversity was found in the B-cell epitopes E229–V275 (π = 0.006 ± 0.000, Hd = 0.590 ± 0.024) and N623–K649 (π = 0.006 ± 0.000, Hd = 0.457 ± 0.018), which contained two (V236F and V275M) and one (E631K) polymorphic residue, respectively. Lower nucleotide and haplotype diversity was found in epitopes E419–M448, K693–N764, N792–E818, and K836–S851, while the T333–A354 epitope was conserved, suggesting functional constraints of these regions.
Due to the surface location and functional essentiality in forming exflagellation centers, Pvs230 has been considered a malaria TBV candidate and has undergone extensive studies [46, 47, 48]. The N-terminal tandem repeats of Pvs230 are species-specific in Plasmodium. In this study, we found that the presence and extension of the N-terminal tandem repeats in the IVP of Pvs230 differed by geographical origin, with most repeated patterns restricted to the region of origin. We found that replicate 12 (r12) was the most prevalent variant in the CMB isolates and was shared with Central China, Southeast Asia, and Oceania populations. Tandem repeats in some malaria vaccine candidate genes, such as circumsporozoite protein (CSP), differ according to transmission intensity and show an association with the response to treatment [49, 50, 51]. The function for the N-terminal repeats in Pvs230 of P. vivax remains elusive.
Polymorphic sites were distributed unevenly across Pvs230, with most variations accumulated in IVP and CRDI–II in ICP, suggesting that these regions are potentially more exposed to the host immune system. Immunological studies indicate that the pro-domain (downstream of the cleavage site in IVP) and CRDI–II domain in ICP could elevate a naturally acquired antibody response in endemic areas [11, 52, 53]. Evidence showed that antibodies against the Pvs230 ICP exhibit a profound inhibitory effect on oocyst formation . Recombinant Pvs230 proteins with varying CRD domain combinations showed different TBA, with antibodies targeting the CRDI domain triggering the strongest TB activity, suggesting that this domain is a promising target for TBV development . In our study, several non-synonymous substitutions under positive selection, including A287D/T and L308I/V, were identified in the CRDI domain in the CMB and Myanmar isolates. Whether these substitutions will affect the efficiency of a TBV that targets these regions needs further experimental examination. This study identified 92 Pvs230 haplotypes in the global isolates, with varying levels of haplotype diversity among regions. The highest diversity was detected in Asia, where 73 haplotypes were reported, including the two major haplotypes (2 and 9) and several minor haplotypes, indicating a close genetic relatedness between these P. vivax populations, comparable to the analysis of Pvhap2 sequences . The lowest haplotype diversity was observed in Turkey, indicating more genetically related parasite isolates.
Pv230 showed low genetic diversity (π = 0.001–0.002), consistent with a previous report . This result is also comparable to the low genetic diversity observed for other TBV candidates reported in P. vivax, including Pvs28/25 (π = 0.0024/0.0017, Asia; π = 0.0024/0.0018, Americas; ), Pvs48/45 (π = 0.00173, global isolates, ), Pvhap2 (π = 0.0002, CMB; π = 0.0005, Myanmar, ), and Pvceltos (π = 0.001, Brazil ). This high level of conservation may be a consequence of the functional constraint of these proteins in sexual development. Neutrality tests including dN–dS and the MK test detected positive selection in the IVP, which was further supported by the codon-based selection tests, indicating that the IVP may be targeted by the host immune system. The N-terminal region upstream of the proteolytic cleavage site of Pfs230 is released into the medium after processing during gametogenesis, and might be important for immune evasion . Pfs230 protein sequences downstream of the cleavage site in the IVP could elicit TBA comparable to that in the ICP . Although the cleavage site is not conserved between Pfs230 and Pvs230, the evolutionarily conserved feature of predicted B-cell epitopes mapped around this region (E299–V275) may be beneficial for TBV design. In contrast, significant values of ds over dN in ICP of the pvs230 gene from both the CMB and Myanmar isolates suggest that purifying selection acts on this region. Earlier studies showed that an intact ICP is important for the gamete surface localization of Pfs230 , and the CRDI within ICP is critical for the generation of an effective antibody with TBA [15, 18, 60, 61], suggesting that this region may be critical for the gamete fertilization function of Pfs230, and could be a promising target for TBV development. We identified several non-synonymous substitutions in the B-cell epitopes predicted in the ICP. The codon-based test for selection also revealed two sites (287 and 308) under positive selection in the CRDI domain in the CMB isolates. Therefore, future investigations are needed to determine whether these mutations could interfere with TBV efficiency.
Despite low sequence diversity, Pvs230 showed divergent haplotype distribution patterns in different P. vivax populations. Population differentiation, especially among the Asian populations, was supported by both Wright’s FST statistic and phylogenetic analysis, which indicated population subdivision among the P. vivax populations. The FST estimation between East/Southeast Asia (Central China, CMB, and Thailand) and West Asia (Turkey) populations was remarkable (0.41–0.558), whereas parasites from the GMS exhibited only slight geographical genetic differentiation. Although antigens under selection may not be suitable markers for population genetic studies, the results from Pvs230 analysis are highly congruent with the population genomics studies conducted using whole genome sequencing data from the GMS . These less differentiated P. vivax populations suggest the lack of gene flow barriers in the GMS, which may be further enhanced by the extensive human migration in this region [63, 64]. In addition, although the world P. vivax populations were grouped into three clusters by structure, phylogenetic, and haplotype network analyses, similar haplotypes in the Pvs230 sequences were found in many parts of the world. Therefore, geographical differentiation of the parasite population and conservation of certain Pvs230 haplotypes should be considered for Pv230-based vaccine design.
This study revealed limited genetic diversity in the Pvs230 gene, and neutrality tests showed that the gene is under positive selection on the IVP and under purifying selection on the ICP. The study revealed continent-specific and shared Pvs230 haplotypes in the global isolates, and further reinforced the observation from genomic analysis, showing little genetic differentiation in the GMS. The presence of dominant haplotypes in the ICP circulating in East and Southeast Asia parasite populations suggests the possibility of developing a universal Pvs230-based TBV in this region.
Availability of data and materials
The datasets supporting the conclusions of this article are included within the article.
Interspecies variable part
Interspecies conserved part
National Malaria Control Programs
Standard membrane-feeding assay
- F ST :
- SP SNPs:
Synonymous single-nucleotide polymorphisms
- NS SNPs:
Non-synonymous single-nucleotide polymorphisms
Papua New Guinea
Markov chain Monte Carlo
Baird JK. Asia-Pacific malaria is singular, pervasive, diverse and invisible. Int J Parasitol. 2017;47:371–7. https://doi.org/10.1016/j.ijpara.2016.06.006.
Azizi H, Esmaeili ED. Is COVID-19 posed great challenges for malaria control and elimination? Iran J Parasitol. 2021;16:346–7. https://doi.org/10.18502/ijpa.v16i2.6289.
Nghochuzie NN, Olwal CO, Udoakang AJ, Amenga-Etego LN, Amambua-Ngwa A. Pausing the fight against malaria to combat the COVID-19 pandemic in Africa: is the future of malaria bleak? Front Microbiol. 2020;11:1476. https://doi.org/10.3389/fmicb.2020.01476.
Menard D, Dondorp A. Antimalarial drug resistance: a threat to malaria elimination. Cold Spring Harb Perspect Med. 2017;7:a025619. https://doi.org/10.1101/cshperspect.a025619.
Wang J, Xu C, Wong YK, He Y, Adegnika AA, Kremsner PG, et al. Preparedness is essential for malaria-endemic regions during the COVID-19 pandemic. Lancet. 2020;395:1094–6. https://doi.org/10.1016/S0140-6736(20)30561-4.
Carter R, Chen DH. Malaria transmission blocked by immunisation with gametes of the malaria parasite. Nature. 1976;263:57–60. https://doi.org/10.1038/263057a0.
Tomas AM, Margos G, Dimopoulos G, van Lin LH, de Koning-Ward TF, Sinha R, et al. P25 and P28 proteins of the malaria ookinete surface have multiple and partially redundant functions. EMBO J. 2001;20:3975–83. https://doi.org/10.1093/emboj/20.15.3975.
Wu Y, Ellis RD, Shaffer D, Fontes E, Malkin EM, Mahanty S, et al. Phase 1 trial of malaria transmission blocking vaccine candidates Pfs25 and Pvs25 formulated with montanide ISA 51. PLoS ONE. 2008;3:e2636. https://doi.org/10.1371/journal.pone.0002636.
Williamson KC. Pfs230: from malaria transmission-blocking vaccine candidate toward function. Parasite Immunol. 2003;25:351–9. https://doi.org/10.1046/j.1365-3024.2003.00643.x.
Drakeley CJ, Bousema JT, Akim NI, Teelen K, Roeffen W, Lensen AH, et al. Transmission-reducing immunity is inversely related to age in Plasmodium falciparum gametocyte carriers. Parasite Immunol. 2006;28:185–90. https://doi.org/10.1111/j.1365-3024.2005.00818.x.
Bousema T, Roeffen W, Meijerink H, Mwerinde H, Mwakalinga S, van Gemert GJ, et al. The dynamics of naturally acquired immune responses to Plasmodium falciparum sexual stage antigens Pfs230 & Pfs48/45 in a low endemic area in Tanzania. PLoS ONE. 2010;5:e14114. https://doi.org/10.1371/journal.pone.0014114.
Bousema JT, Roeffen W, van der Kolk M, de Vlas SJ, van de Vegte-Bolmer M, Bangs MJ, et al. Rapid onset of transmission-reducing antibodies in Javanese migrants exposed to malaria in Papua, Indonesia. Am J Trop Med Hyg. 2006;74:425–31.
Eksi S, Czesny B, van Gemert GJ, Sauerwein RW, Eling W, Williamson KC. Malaria transmission-blocking antigen, Pfs230, mediates human red blood cell binding to exflagellating male parasites and oocyst production. Mol Microbiol. 2006;61:991–8. https://doi.org/10.1111/j.1365-2958.2006.05284.x.
Vermeulen AN, Ponnudurai T, Beckers PJ, Verhave JP, Smits MA, Meuwissen JH. Sequential expression of antigens on sexual stages of Plasmodium falciparum accessible to transmission-blocking antibodies in the mosquito. J Exp Med. 1985;162:1460–76. https://doi.org/10.1084/jem.162.5.1460.
Rener J, Graves PM, Carter R, Williams JL, Burkot TR. Target antigens of transmission-blocking immunity on gametes of plasmodium falciparum. J Exp Med. 1983;158:976–81.
Tachibana M, Wu Y, Iriko H, Muratova O, MacDonald NJ, Sattabongkot J, et al. N-terminal prodomain of Pfs230 synthesized using a cell-free system is sufficient to induce complement-dependent malaria transmission-blocking activity. Clin Vaccine Immunol. 2011;18:1343–50. https://doi.org/10.1128/CVI.05104-11.
Williamson KC, Keister DB, Muratova O, Kaslow DC. Recombinant Pfs230, a Plasmodium falciparum gametocyte protein, induces antisera that reduce the infectivity of Plasmodium falciparum to mosquitoes. Mol Biochem Parasitol. 1995;75:33–42. https://doi.org/10.1016/0166-6851(95)02507-3.
Tachibana M, Miura K, Takashima E, Morita M, Nagaoka H, Zhou L, et al. Identification of domains within Pfs230 that elicit transmission blocking antibody responses. Vaccine. 2019;37:1799–806. https://doi.org/10.1016/j.vaccine.2019.02.021.
Healer J, Murphy V, Hodder AN, Masciantonio R, Gemmill AW, Anders RF, et al. Allelic polymorphisms in apical membrane antigen-1 are responsible for evasion of antibody-mediated inhibition in Plasmodium falciparum. Mol Microbiol. 2004;52:159–68. https://doi.org/10.1111/j.1365-2958.2003.03974.x.
Martinelli A, Cheesman S, Hunt P, Culleton R, Raza A, Mackinnon M, et al. A genetic approach to the de novo identification of targets of strain-specific immunity in malaria parasites. Proc Natl Acad Sci USA. 2005;102:814–9. https://doi.org/10.1073/pnas.0405097102.
Li N, Parker DM, Yang Z, Fan Q, Zhou G, Ai G, et al. Risk factors associated with slide positivity among febrile patients in a conflict zone of north-eastern Myanmar along the China-Myanmar border. Malar J. 2013;12:361. https://doi.org/10.1186/1475-2875-12-361.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33:1870–4. https://doi.org/10.1093/molbev/msw054.
Doi M, Tanabe K, Tachibana S, Hamai M, Tachibana M, Mita T, et al. Worldwide sequence conservation of transmission-blocking vaccine candidate Pvs230 in Plasmodium vivax. Vaccine. 2011;29:4308–15. https://doi.org/10.1016/j.vaccine.2011.04.028.
Rozas J, Ferrer-Mata A, Sanchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017;34:3299–302. https://doi.org/10.1093/molbev/msx248.
Kryazhimskiy S, Plotkin JB. The population genetics of dN/dS. PLoS Genet. 2008;4:e1000304. https://doi.org/10.1371/journal.pgen.1000304.
Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989;123:585–95.
Fu YX, Li WH. Statistical tests of neutrality of mutations. Genetics. 1993;133:693–709.
Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–2. https://doi.org/10.1093/bioinformatics/btp187.
McDonald JH, Kreitman M. Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991;351:652–4. https://doi.org/10.1038/351652a0.
Kosakovsky Pond SL, Frost SD. Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol. 2005;22:1208–22. https://doi.org/10.1093/molbev/msi105.
Murrell B, Moola S, Mabona A, Weighill T, Sheward D, Kosakovsky Pond SL, et al. FUBAR: a fast, unconstrained bayesian approximation for inferring selection. Mol Biol Evol. 2013;30:1196–205. https://doi.org/10.1093/molbev/mst030.
Weaver S, Shank SD, Spielman SJ, Li M, Muse SV, Kosakovsky Pond SL. Datamonkey 2.0: a modern web application for characterizing selective and other evolutionary processes. Mol Biol Evol. 2018;35:773–7. https://doi.org/10.1093/molbev/msx335.
Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10:564–7. https://doi.org/10.1111/j.1755-0998.2010.02847.x.
Balloux F, Lugon-Moulin N. The estimation of population differentiation with microsatellite markers. Mol Ecol. 2002;11:155–65. https://doi.org/10.1046/j.0962-1083.2001.01436.x.
Bandelt HJ, Forster P, Rohl A. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999;16:37–48.
Moon SU, Na BK, Kang JM, Kim JY, Cho SH, Park YK, et al. Genetic polymorphism and effect of natural selection at domain I of apical membrane antigen-1 (AMA-1) in Plasmodium vivax isolates from Myanmar. Acta Trop. 2010;114:71–5. https://doi.org/10.1016/j.actatropica.2010.01.006.
Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987;4:406–25. https://doi.org/10.1093/oxfordjournals.molbev.a040454.
Hubisz MJ, Falush D, Stephens M, Pritchard JK. Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour. 2009;9:1322–32. https://doi.org/10.1111/j.1755-0998.2009.02591.x.
Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14:2611–20. https://doi.org/10.1111/j.1365-294X.2005.02553.x.
Earl DA, Bridgett M. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour. 2012;4:2. https://doi.org/10.1007/s12686-011-9548-7.
Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23:1801–6. https://doi.org/10.1093/bioinformatics/btm233.
Na R. DISTRUCT: a program for the graphical display of population structure. Mol Ecol Notes Wiley Online Library. 2004;4:2. https://doi.org/10.1046/j.1471-8286.2003.00566.x.
Saha S, Raghava GP. Prediction of continuous B-cell epitopes in an antigen using recurrent neural network. Proteins. 2006;65:40–8. https://doi.org/10.1002/prot.21078.
Jespersen MC, Peters B, Nielsen M, Marcatili P. BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes. Nucleic Acids Res. 2017;45:W24–9. https://doi.org/10.1093/nar/gkx346.
Yang J, Yan R, Roy A, Xu D, Poisson J, Zhang Y. The I-TASSER Suite: protein structure and function prediction. Nat Methods. 2015;12:7–8. https://doi.org/10.1038/nmeth.3213.
Singh K, Burkhardt M, Nakuchima S, Herrera R, Muratova O, Gittis AG, et al. Structure and function of a malaria transmission blocking vaccine targeting Pfs230 and Pfs230-Pfs48/45 proteins. Commun Biol. 2020;3:395. https://doi.org/10.1038/s42003-020-01123-9.
Coelho CH, Galson JD, Truck J, Duffy PE. B cell clonal expansion and mutation in the immunoglobulin heavy chain variable domain in response to Pfs230 and Pfs25 malaria vaccines. Int J Parasitol. 2021. https://doi.org/10.1016/j.ijpara.2021.11.008.
Healy SA, Anderson C, Swihart BJ, Mwakingwe A, Gabriel EE, Decederfelt H, et al. Pfs230 yields higher malaria transmission-blocking vaccine activity than Pfs25 in humans but not mice. J Clin Invest. 2021;131:e146221. https://doi.org/10.1172/JCI146221.
Machado RL, Povoa MM. Distribution of Plasmodium vivax variants (VK210, VK247 and P. vivax-like) in three endemic areas of the Amazon region of Brazil and their correlation with chloroquine treatment. Trans R Soc Trop Med Hyg. 2000;94:377–81. https://doi.org/10.1016/s0035-9203(00)90110-x.
Ribeiro Bde P, Cassiano GC, de Souza RM, Cysne DN, Grisotto MA, de Azevedo dos Santos AP, et al. Polymorphisms in Plasmodium vivax circumsporozoite protein (CSP) influence parasite burden and cytokine balance in a pre-amazon endemic area from Brazil. PLoS Negl Trop Dis. 2016;10:e0004479. https://doi.org/10.1371/journal.pntd.0004479.
Cui L, Mascorro CN, Fan Q, Rzomp KA, Khuntirat B, Zhou G, et al. Genetic diversity and multiple infections of Plasmodium vivax malaria in Western Thailand. Am J Trop Med Hyg. 2003;68:613–9. https://doi.org/10.4269/ajtmh.2003.68.613.
Jones S, Grignard L, Nebie I, Chilongola J, Dodoo D, Sauerwein R, et al. Naturally acquired antibody responses to recombinant Pfs230 and Pfs48/45 transmission blocking vaccine candidates. J Infect. 2015;71:117–27. https://doi.org/10.1016/j.jinf.2015.03.007.
Ouedraogo AL, Roeffen W, Luty AJ, de Vlas SJ, Nebie I, Ilboudo-Sanogo E, et al. Naturally acquired immune responses to Plasmodium falciparum sexual stage antigens Pfs48/45 and Pfs230 in an area of seasonal transmission. Infect Immun. 2011;79:4957–64. https://doi.org/10.1128/IAI.05288-11.
Mayumi T, Chiho S, Hitoshi O, Jetsumon S, Osamu K, Motomi T, et al. Plasmodium vivax gametocyte protein Pvs230 is a transmission blocking vaccine candidate. Vaccine. 2012;30(10):1807–12.
Li D, Yu C, Guo J, Wang Y, Zhao Y, Wang L, et al. Plasmodium vivax HAP2/GCS1 gene exhibits limited genetic diversity among parasite isolates from the Greater Mekong Subregion. Parasit Vectors. 2020;13:175. https://doi.org/10.1186/s13071-020-04050-0.
Chaurio RA, Pacheco MA, Cornejo OE, Durrego E, Stanley CE Jr, Castillo AI, et al. Evolution of the transmission-blocking vaccine candidates Pvs28 and Pvs25 in Plasmodium vivax: geographic differentiation and evidence of positive selection. PLoS Negl Trop Dis. 2016;10:e0004786. https://doi.org/10.1371/journal.pntd.0004786.
Bitencourt Chaves L, Perce-da-Silva DS, Rodrigues-da-Silva RN, Martins da Silva JH, Cassiano GC, Machado RL, et al. Plasmodium vivax cell traversal protein for ookinetes and sporozoites (PvCelTOS) gene sequence and potential epitopes are highly conserved among isolates from different regions of Brazilian Amazon. PLoS Negl Trop Dis. 2017;11:e0005344. https://doi.org/10.1371/journal.pntd.0005344.
Brooks SR, Williamson KC. Proteolysis of Plasmodium falciparum surface antigen, Pfs230, during gametogenesis. Mol Biochem Parasitol. 2000;106:77–82. https://doi.org/10.1016/s0166-6851(99)00201-7.
Eksi S, Stump A, Fanning SL, Shenouda MI, Fujioka H, Williamson KC. Targeting and sequestration of truncated Pfs230 in an intraerythrocytic compartment during Plasmodium falciparum gametocytogenesis. Mol Microbiol. 2002;44:1507–16. https://doi.org/10.1046/j.1365-2958.2002.02986.x.
Quakyi IA, Carter R, Rener J, Kumar N, Good MF, Miller LH. The 230-kDa gamete surface protein of Plasmodium falciparum is also a target for transmission-blocking antibodies. J Immunol. 1987;139:4213–7.
Read D, Lensen AH, Begarnie S, Haley S, Raza A, Carter R. Transmission-blocking antibodies against multiple, non-variant target epitopes of the Plasmodium falciparum gamete surface antigen Pfs230 are all complement-fixing. Parasite Immunol. 1994;16:511–9. https://doi.org/10.1111/j.1365-3024.1994.tb00305.x.
Brashear AM, Fan Q, Hu Y, Li Y, Zhao Y, Wang Z, et al. Population genomics identifies a distinct Plasmodium vivax population on the China–Myanmar border of Southeast Asia. PLoS Negl Trop Dis. 2020;14:e0008506. https://doi.org/10.1371/journal.pntd.0008506.
Cui L, Yan G, Sattabongkot J, Cao Y, Chen B, Chen X, et al. Malaria in the Greater Mekong Subregion: heterogeneity and complexity. Acta Trop. 2012;121:227–39. https://doi.org/10.1016/j.actatropica.2011.02.016.
Shetty AC, Jacob CG, Huang F, Li Y, Agrawal S, Saunders DL, et al. Genomic structure and diversity of Plasmodium falciparum in Southeast Asia reveal recent parasite migration patterns. Nat Commun. 2019;10:2665. https://doi.org/10.1038/s41467-019-10121-3.
This work was supported by grants from the NIAID, National Institutes of Health (R01AI150553 and U19AI089672), and the National Natural Science Foundation of China (Grant no. 81971961).
Ethics approval and consent to participate
The study protocol was approved by China Medical University and by the Health Department of Kachin.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1: Table S1.
Primers used in Pvs230 amplification and sequencing.
Additional file 2: Table S2.
The diversity pattern of the N-terminal repeat region in IVP of Pvs230 from global isolates. Related to Fig. 2.
Additional file 3: Table S3.
Codon-based tests for selection on Pvs230 in China–Myanmar border and Myanmar isolates.
Additional file 4: Figure S1.
Phylogenetic analysis of Pvs230 sequences from global P. vivax populations. The maximum-likelihood phylogenetic tree reconstructed based on alignment by ClustalW with bootstrap analysis to assess clade support (500 replicates) was shown for Pvs230 global isolates. The global isolates were clustered into three main groups, the branches of which are shown in red (group 1), green (group 2), and blue (group 3).
Additional file 5: Figure S2.
Median-joining network of Pv230 among eight geographically diverse populations. Haplotypes composed of nucleotide polymorphism in ICP of Pv230 with a frequency > 1 were used to create a median-joining network. Each node represents one haplotype, node size indicates haplotype frequency, and node color corresponds to the country of origin. Line length is proportional to genetic distance.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Zhao, X., Hu, Y., Zhao, Y. et al. Genetic diversity in the transmission-blocking vaccine candidate Plasmodium vivax gametocyte protein Pvs230 from the China–Myanmar border area and central Myanmar. Parasites Vectors 15, 371 (2022). https://doi.org/10.1186/s13071-022-05523-0
- Plasmodium vivax
- Transmission-blocking vaccine
- Genetic diversity