Genetic clustering and polymorphism of the merozoite surface protein-3 of Plasmodium knowlesi clinical isolates from Peninsular Malaysia
Parasites & Vectors volume 10, Article number: 2 (2017)
The simian malaria parasite Plasmodium knowlesi has been reported to cause significant numbers of human infections in South East Asia. Its merozoite surface protein-3 (MSP3) belongs to a multi-gene family of proteins first found in Plasmodium falciparum. Several studies have evaluated P. falciparum MSP3 as a potential vaccine candidate. However, to date no detailed studies have been carried out on the P. knowlesi MSP3 gene (pkmsp3). The present study investigates the genetic diversity and haplotype groups of pkmsp3 in P. knowlesi clinical samples from Peninsular Malaysia.
Blood samples were collected from P. knowlesi malaria patients within a period of 4 years (2008–2012). The pkmsp3 gene of the isolates was amplified via PCR, and subsequently cloned and sequenced. The full length pkmsp3 sequence was divided into Domain A and Domain B. Natural selection, genetic diversity, and haplotypes of pkmsp3 were analysed using MEGA6 and DnaSP ver. 5.10.00 programmes.
From 23 samples, 48 pkmsp3 sequences were successfully obtained. At the nucleotide level, 101 synonymous and 238 non-synonymous mutations were observed. Tests of neutrality were not significant for the full length, Domain A or Domain B sequences. However, the dN/dS ratio of Domain B indicates purifying selection for this domain. Analysis of the deduced amino acid sequences revealed 42 different haplotypes. Neighbour Joining phylogenetic tree and haplotype network analyses revealed that the haplotypes clustered into two distinct groups.
A moderate level of genetic diversity was observed in the pkmsp3 and only the C-terminal region (Domain B) appeared to be under purifying selection. The separation of the pkmsp3 into two haplotype groups provides further evidence of the existence of two distinct P. knowlesi types or lineages. Future studies should investigate the diversity of pkmsp3 among P. knowlesi isolates in North Borneo, where large numbers of human knowlesi malaria infection still occur.
Malaria is a disease caused by infection with blood protozoa belonging to the genus Plasmodium. Molecular evidence suggests that the simian malaria agent Plasmodium knowlesi evolved from a group which included Plasmodium cynomolgi and P. vivax some 30.5 million years ago. The first natural transmission of P. knowlesi to humans was reported in 1965, when a US Army surveyor acquired the infection while working in Peninsular Malaysia. It was observed that the parasite could be transmitted to humans through blood inoculation and thus the authors designated it the human strain or strain H. A second case was reported in southern Peninsular Malaysia 5 years later. A large number of human knowlesi malaria cases were reported in Malaysian Borneo in 2004, and reports have also been published on this infection in several neighbouring Asian countries such as Singapore, the Philippines and Thailand. However, the majority of the infections have been recorded in Malaysia. More than 300 human cases have been detected in Peninsular Malaysia since 2005 [8–10]. Recently, a study reported that more than half of the malaria cases in Malaysia were caused by P. knowlesi. The highest proportion of P. knowlesi cases was found in Malaysian Borneo as well as in the Peninsular Malaysia states of Kelantan, Pahang, Terengganu and Johor.
Malaria parasites invade the red blood cells (RBC) of many vertebrate hosts including humans and simians. The proteins involved in the invasion process have been studied to gain deeper insights into the invasion mechanism, and also to identify potential vaccine candidates against malaria. One of these proteins, the merozoite surface protein-3 (MSP3), was identified in P. falciparum in 1994 [13, 14]. Subsequently, a novel surface antigen was discovered in P. vivax and was named MSP3α, due to its putative similarity to the MSP3 of P. falciparum. Two paralogs of the P. vivax MSP3 protein were further identified, designated as PvMSP3β and PvMSP3γ. Due to the presence of more than one such protein in a species, the P. vivax MSP3 proteins were grouped into a multi-gene family. Full genome analysis of P. vivax (Salvador I strain) revealed 12 msp3 paralogs which cluster on chromosome 10. Surprisingly, these paralogs have limited similarity to the P. knowlesi MSP3 and the four P. falciparum MSP3 proteins. Although a number of studies have suggested that the msp3 genes in P. vivax and P. falciparum are related, a closer comparison of the domain organizations on chromosome 10 as well as the syntenic loci of pvmsp3, pfmsp3 and P. knowlesi putative msp3 genes suggests that these genes are not homologues.
Structurally, the protein is characterized by a putative signal peptide and lacks a transmembrane domain or a GPI-lipid modification to anchor it to the outer membrane of the parasite. Another characteristic of the protein family is the presence of an alanine-rich central domain containing a series of heptad coiled-coil repeats [15, 20]. Recent studies have predicted that the MSP3 proteins in P. vivax form oligomeric and elongated molecules, suggesting the protein may mediate interactions between host proteins and other merozoite surface proteins.
Genetic diversity in a natural population is usually generated by the introduction of new alleles through migration, mutation, or recombination. The frequency of these alleles, on the other hand, is governed by the actions of selection and genetic drift. For pathogens that infect humans, the host’s immune responses as well as the modes of treatment administered are major components of selection; thus, genetic diversity can be an important indicator of how a pathogen responds to modes of intervention such as vaccines or drugs. In this instance, directional selection leads towards fixing beneficial alleles in the population, resulting in reduced diversity. Conversely, naturally acquired host immunity can exert balancing selection, which tends to preserve or increase the allelic diversity of antigen genes. This occurs within the functional constraints of the encoded protein, which prevent the protein from losing its native function [26, 27]. The modelling of neutral processes in a population with a constant size allows for the prediction of expected frequencies of a particular allele. Departures from this neutrality can thus be utilised to pinpoint alleles that are targets for directional or balancing selection [28–31].
Several studies have been carried out on MSP3 proteins of P. falciparum and P. vivax; however, studies on P. knowlesi MSP3 lag far behind. In this study, the genetic diversity, natural selection and haplotype groups of the pkmsp3 gene of P. knowlesi clinical isolates from Peninsular Malaysia were studied. Evidence of purifying selection in the C-terminal domain and of haplotype grouping of P. knowlesi MSP3 was found. These data will be useful in understanding the genetic variation and natural selection forces acting on this gene and may indicate the gene’s potential as a vaccine candidate.
Blood sample collection
Twenty-three blood samples from knowlesi malaria patients were obtained from the University of Malaya Medical Centre (UMMC), Kuala Lumpur as well as from private clinics in Peninsular Malaysia between July 2008 and July 2012. Each blood sample was assigned a reference code for laboratory record. Knowlesi malaria infection was re-confirmed using several tests including microscopic examination of Giemsa-stained thick and thin blood smears, BinaxNOW® malaria rapid diagnostic test (Inverness Medical International, Stockport, United Kingdom) and polymerase chain reaction (PCR) based on the Plasmodium small subunit ribosomal RNA gene .
Genomic DNA extraction
Genomic DNA was extracted from the blood samples using a commercial blood extraction kit (QIAGEN, Hilden, Germany). One hundred μl of blood were used per extraction and the DNA was eluted into 50 μl of TE Buffer.
PCR, cloning and sequencing of pkmsp3
The pkmsp3 gene was amplified by nested PCR. For the primary PCR, oligonucleotide primers MSP3N1F: 5′-CCT CTT CAA CCA CAC ACA CA-3′ and MSP3N1R: 5′-GTT CAT TCT GGC GGA TAA GG-3′ were used. Oligonucleotide primers MSP3N2F: 5′-CCC GTG AAA TAA CAC CCA-3′ and MSP3BN2R: 5′-CCA CCA TCT TAC GTT CAG-3′ were used for the secondary PCR. Approximately 0.5 μg of genomic DNA was used in a final volume of 20 μl which also contained 0.2 mM of dNTP, 0.4 μM of forward and reverse primers, 2 mM MgCl2 and 1 unit of Taq DNA polymerase in buffer provided by the commercial kit (Promega, Madison, WI, USA). The PCR thermal profile for nest 1 was as follows: an initial denaturation of one cycle at 95 °C for 5 min, followed by 30 cycles of 1 min at 94 °C, 1 min at 50 °C for annealing and 1 min 30 s at 72 °C. Cycling for nest 2 consisted of a 5 min initial denaturation at 95 °C and 30 cycles of 1 min at 94 °C, 1 min at 48 °C, 1 min 30 s at 72 °C, and a final extension step at 72 °C for 10 min. PCR products were analysed by gel electrophoresis on a 1.5% agarose gel stained with SYBR® Safe DNA gel stain (Invitrogen, Eugene, USA).
Purification of PCR products and DNA cloning
PCR products were purified using QIAquick PCR purification kit (Qiagen, Hilden, Germany) per the manufacturer’s instructions. The purified PCR products were then ligated into the pGEM-T® TA cloning vector (Promega, Madison, Wisconsin, USA) and transformed into Escherichia coli TOP10F’ competent cells; colonies were then screened for the presence of recombinant plasmids harbouring the pkmsp3 fragment. These plasmids were then sequenced in a commercial laboratory (MyTACG Bioscience Enterprise, Malaysia). Between 3 and 5 recombinant plasmids were sent for sequencing per isolate. DNA for isolates showing clonal sequence variations (singletons or rare substitutions) was re-amplified and re-sequenced in order to confirm that the variations were genuine, and not the result of incorporation errors of the Taq DNA polymerase.
Analysis of pkmsp3 sequences
Editing and alignment of the pkmsp3 nucleotide sequences (including the sequence of reference P. knowlesi strain H, GenBank: XM_002259752) were performed using the BioEdit sequence alignment editor ver. 7.2.0. Gene Runner ver. 126.96.36.199 was used to deduce the respective amino acid sequences. The Neighbour Joining method implemented in MEGA6 was used to construct a phylogenetic tree with 1000 bootstrap replicates. The Median-Joining method in the NETWORK v188.8.131.52 program was used to establish the genetic relationship among pkmsp3 haplotypes and construct the haplotype network. All newly-generated sequences were deposited in the GenBank database (KT900798–KT900845).
Sequence polymorphism analysis of pkmsp3
The programme DnaSP ver. 5.10.01 was used to determine pkmsp3 genetic polymorphism by calculating the number of nucleotide differences per site (π), singleton sites (S), segregating sites (Ss), haplotypes (H), parsimony-informative sites (Ps), and haplotype diversity (Hd).
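Two of the summary statistics listed above can be illustrated with a minimal Python sketch. It assumes a gap-free toy alignment and uses Nei's standard estimator for haplotype diversity; the function names and the toy sequences are hypothetical and are not taken from the study or from DnaSP itself.

```python
from collections import Counter


def haplotype_diversity(sequences):
    """Nei's haplotype diversity: Hd = n / (n - 1) * (1 - sum(p_i ** 2))."""
    n = len(sequences)
    if n < 2:
        raise ValueError("at least two sequences are required")
    counts = Counter(sequences)  # identical strings count as one haplotype
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def segregating_sites(sequences):
    """Number of alignment columns with more than one state (gapped columns skipped)."""
    return sum(
        1
        for column in zip(*sequences)
        if "-" not in column and len(set(column)) > 1
    )


# Hypothetical toy alignment: three haplotypes among four sequences.
toy = ["ACGT", "ACGT", "ACGA", "TCGA"]
print(haplotype_diversity(toy), segregating_sites(toy))
```

An Hd close to 1 means that nearly every sampled sequence is a distinct haplotype.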
Departure of pkmsp3 from the neutral model of molecular evolution was tested using Fu and Li’s D* and F* tests, which are based on nucleotide polymorphisms and haplotype distribution. Tajima’s D was calculated to test the hypothesis that all mutations are selectively neutral. Tajima’s D test is based on the difference between Ss and π, where significantly positive values indicate balancing selection and significantly negative values indicate directional or purifying selection. In all tests carried out, sites that had gaps were excluded from the analysis. In tests requiring an outgroup, Plasmodium cynomolgi MSP3 was used (GenBank: KC907504). The FST fixation index in DnaSP 5.10.00 was used to measure the genetic differentiation between the different clustering groups observed in the pkmsp3 phylogenetic tree and haplotype network.
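As an illustration of how Tajima's D relates the number of segregating sites to the average number of pairwise differences, the following Python sketch implements the standard formula from Tajima (1989). It is not DnaSP's code; the function names and the toy alignment are hypothetical, and the alignment is assumed to be gap-free.

```python
from itertools import combinations
from math import sqrt


def mean_pairwise_differences(sequences):
    """Average number of nucleotide differences over all sequence pairs (k)."""
    pairs = list(combinations(sequences, 2))
    total = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return total / len(pairs)


def tajimas_d(n, seg_sites, k):
    """Tajima's D from sample size n, segregating sites, and mean pairwise differences k."""
    if n < 4 or seg_sites < 1:
        # For n < 4 the variance term is zero, so D is undefined.
        raise ValueError("need n >= 4 and at least one segregating site")
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (k - seg_sites / a1) / sqrt(variance)


# Hypothetical toy alignment with three segregating sites.
toy = ["AAAA", "AAAT", "AATT", "ATTT"]
k = mean_pairwise_differences(toy)
d = tajimas_d(len(toy), 3, k)
print(round(k, 4), round(d, 3))
```

D is zero when k equals the number of segregating sites divided by a1, positive when there is an excess of intermediate-frequency variants, and negative when there is an excess of rare variants. Significance has to be assessed separately, which this sketch does not attempt.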
The effect of natural selection was evaluated by the codon-based Z-test, which determines whether selection is negative or positive. Probability (P) values of less than 0.05 were considered significant. The variance of the differences was computed using the bootstrap method with 1000 replicates. The ratio between the average number of non-synonymous substitutions per non-synonymous site (dN) and the average number of synonymous substitutions per synonymous site (dS) was also calculated using the Nei-Gojobori method with Jukes and Cantor correction. MEGA6 was used to calculate the Z-test and dN/dS ratio.
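The final steps of this calculation can be sketched in Python as follows. The sketch assumes that the counts of synonymous and non-synonymous differences and sites have already been obtained (the Nei-Gojobori counting step is not reproduced here), applies the Jukes and Cantor correction to each proportion, and forms the ratio and a Z statistic. All input numbers are hypothetical and the function names are illustrative only.

```python
from math import log, sqrt


def jukes_cantor(p):
    """Jukes-Cantor corrected distance for an observed proportion of differences p."""
    if not 0 <= p < 0.75:
        raise ValueError("p must lie in [0, 0.75) for the correction to be defined")
    return -0.75 * log(1.0 - 4.0 * p / 3.0)


def dn_ds(syn_diffs, syn_sites, nonsyn_diffs, nonsyn_sites):
    """Return (dN, dS, dN/dS) from difference and site counts."""
    d_s = jukes_cantor(syn_diffs / syn_sites)
    d_n = jukes_cantor(nonsyn_diffs / nonsyn_sites)
    return d_n, d_s, d_n / d_s


def z_statistic(d_n, d_s, var_dn, var_ds):
    """Z = (dN - dS) / sqrt(Var(dN) + Var(dS)); variances come from bootstrapping."""
    return (d_n - d_s) / sqrt(var_dn + var_ds)


# Hypothetical counts, chosen only to illustrate a ratio below 1.
d_n, d_s, ratio = dn_ds(syn_diffs=10, syn_sites=250, nonsyn_diffs=20, nonsyn_sites=750)
print(round(d_n, 4), round(d_s, 4), round(ratio, 2))
```

A ratio below 1 with a significantly negative Z would be read as purifying selection, and a ratio above 1 with a significantly positive Z as positive selection.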
The Interpro programme (http://www.ebi.ac.uk/interpro) predicted the P. knowlesi MSP3 to have a large coiled-coil region. Genetic diversity and selection analyses were also performed separately on the coiled-coil region (Domain A) and the C-terminal (Domain B) of the protein (Fig. 1). This was carried out to investigate domain specific selective pressure.
Genetic diversity at the nucleotide level
Successful PCR amplification produced DNA fragments of 1077 bp. This fragment contained a region coding a protein sequence of 338 amino acids. A total of 48 sequences were obtained for analysis.
Table 1 gives the estimates of genetic diversity for the full length pkmsp3 sequence, Domain A and Domain B. In the full length sequence, 384 segregating sites were observed; of these, 320 were parsimony-informative and 64 were singleton sites. When separated into Domain A and B, however, Domain B contained more segregating sites as compared to Domain A (273 vs 104). As for diversity, the full length sequence had haplotype diversity (Hd) of 0.997 ± 0.005. Both Domains A and B had similar Hd of 0.989 ± 0.007.
Nucleotide diversity (π: 0.046 ± 0.011) for the full length sequence was found to be several times higher compared to other P. knowlesi functional genes such as PkDBPαII (π: 0.012), PkAMA-1 (π: 0.00501) and PkRAP-1 (π: 0.01298). Diversity for Domain B (π: 0.067 ± 0.025) was found to be higher than that for Domain A (π: 0.039 ± 0.002). A sliding window plot with a window length of 100 bp and a step size of 25 bp provided a detailed analysis of the full length sequence, with π ranging from 0.012 to 0.087 (Fig. 2). The highest peak diversity was within nucleotide positions 801–975 in Domain B, whereas in Domain A, the most conserved region was within nucleotide positions 51–150.
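The sliding-window calculation can be sketched in Python as below. The defaults mirror the window length (100 bp) and step size (25 bp) stated above. Everything else is an assumption for illustration: the function names are hypothetical, the alignment is assumed to be gap-free, and π is computed as the uncorrected average pairwise difference per site, which may differ slightly from DnaSP's estimator.

```python
from itertools import combinations


def nucleotide_diversity(sequences):
    """Average pairwise differences per site (uncorrected pi)."""
    length = len(sequences[0])
    pairs = list(combinations(sequences, 2))
    diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return diffs / (len(pairs) * length)


def sliding_window_pi(sequences, window=100, step=25):
    """Return (first_site, last_site, pi) per window, using 1-based coordinates."""
    length = len(sequences[0])
    results = []
    for start in range(0, length - window + 1, step):
        chunk = [s[start:start + window] for s in sequences]
        results.append((start + 1, start + window, nucleotide_diversity(chunk)))
    return results


# Hypothetical toy alignment; a small window is used so the output is readable.
toy = ["AAAAAAAA", "AAAAAAAT", "AAAAAATT"]
for first, last, pi in sliding_window_pi(toy, window=4, step=2):
    print(first, last, round(pi, 4))
```

Plotting π against the window midpoint gives a profile of the kind described for Fig. 2.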
Genetic diversity at the amino acid level
Comparisons and analysis with P. knowlesi strain H as a reference sequence showed mutations at 339 positions. Of these positions, 101 were synonymous changes and 238 were non-synonymous. When translated into deduced amino acids, high level polymorphism was observed (Fig. 3 and Additional file 1: Table S1). Among the 119 polymorphic sites, 100 were monomorphic mutations with a change into one amino acid type, and 19 showed dimorphic mutations with change in two amino acid types (K33R/N, T38I/S, N59E/G, L62E/Q, N66T/Y, N68D/G, T72A/M, A78K/E, V82M/A, K118N/R, K155E/R, E158Q/R, H173N/Y, Y197W/C, N228H/K, A281V/T, E307G/A, E317D/G and H319Y/P). The amino acid sequences could be categorised into 42 haplotypes (H1-42) (Fig. 3) with haplotype 11 having the highest frequency. Fifteen of the 23 patient samples had mixed haplotype infections (Table 2).
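The distinction between synonymous and non-synonymous changes, and the substitution labels used above (for example K33R), can be illustrated with a short Python sketch based on the standard genetic code. The helper names and example codons are hypothetical and are not taken from the study's sequences.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_codon_change(ref_codon, alt_codon):
    """Return (class, reference amino acid, alternative amino acid)."""
    ref_aa = CODON_TABLE[ref_codon.upper()]
    alt_aa = CODON_TABLE[alt_codon.upper()]
    kind = "synonymous" if ref_aa == alt_aa else "non-synonymous"
    return kind, ref_aa, alt_aa


def substitution_label(ref_codon, alt_codon, position):
    """Label such as 'K33R' for a non-synonymous change at a 1-based residue position."""
    kind, ref_aa, alt_aa = classify_codon_change(ref_codon, alt_codon)
    return f"{ref_aa}{position}{alt_aa}" if kind == "non-synonymous" else None


# Hypothetical codon pairs: AAA -> AAG keeps lysine, AAA -> AGA changes it to arginine.
print(classify_codon_change("AAA", "AAG"))
print(substitution_label("AAA", "AGA", 33))
```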
Phylogenetic analysis of pkmsp3
Analysis of the phylogenetic tree (Fig. 4) and haplotype network (Fig. 5) revealed that the haplotypes clustered into two main groups (Group 1 and Group 2), which contained an almost equal number of haplotypes. Furthermore, mixed haplotypes from the same blood sample were found to cluster into the same group in both the phylogenetic tree (Fig. 4) and the haplotype network (Fig. 5).
Further analysis was carried out to determine whether Domain A or Domain B contributed to the haplotype clustering. A Neighbour Joining tree was constructed for each domain (Fig. 6), and it was observed that polymorphisms in Domain A contributed to the haplotype clustering, as the clustering observed in this domain mirrored the tree constructed using the full length pkmsp3 sequences.
Analysis on the diversity parameters and natural selection of members in Groups 1 and 2 was also carried out (Table 3). Haplotype diversity (Group 1: 0.993; Group 2: 0.995) and nucleotide diversity (Group 1: 0.02276; Group 2: 0.02418) of both groups were quite similar, as was the average number of nucleotide differences (Group 1: 24.31; Group 2: 25.97). The FST value between the groups was 0.402, indicating high genetic differentiation between these two groups. However, analysis of the phylogenetic tree did not indicate any temporal distribution between the two groups.
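A sequence-based FST of the kind reported here can be sketched in Python following the estimator of Hudson, Slatkin and Maddison, FST = 1 − Hw/Hb, where Hw is the mean number of pairwise differences within groups and Hb the mean number between groups. This is an assumption about the form of the calculation, not DnaSP's exact implementation. Hw is taken as the simple average of the two within-group means, and the toy sequences and function names are hypothetical.

```python
from itertools import combinations, product


def hamming(s, t):
    """Number of differing positions between two aligned sequences."""
    return sum(a != b for a, b in zip(s, t))


def mean_within(group):
    """Mean pairwise differences among sequences of one group."""
    pairs = list(combinations(group, 2))
    return sum(hamming(s, t) for s, t in pairs) / len(pairs)


def mean_between(group1, group2):
    """Mean pairwise differences between sequences of two groups."""
    pairs = list(product(group1, group2))
    return sum(hamming(s, t) for s, t in pairs) / len(pairs)


def fst(group1, group2):
    """FST = 1 - Hw / Hb for two groups of aligned sequences."""
    h_w = (mean_within(group1) + mean_within(group2)) / 2.0
    h_b = mean_between(group1, group2)
    return 1.0 - h_w / h_b


# Hypothetical toy groups.
group1 = ["AAAA", "AAAT"]
group2 = ["AATT", "ATTT"]
print(fst(group1, group2))
```

Values near 0 indicate little differentiation between the groups, whereas values approaching 1 indicate that most of the variation lies between groups rather than within them.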
Tests of selection for pkmsp3
Tests were carried out to determine whether the diversity in pkmsp3 was due to natural selection. Tajima’s D and Fu & Li’s D* and F* tests showed no significant departure from neutrality in the full length pkmsp3, Domain A or Domain B (Table 1), suggesting that these regions may be evolving neutrally. Similarly, Tajima’s D test carried out on Groups 1 and 2 showed no significant departure from neutrality (Table 3). This was reinforced by estimation of the dN/dS ratio: the ratios for the full length sequence and for Domain A were just slightly above 1, consistent with neutral evolution. However, the dN/dS ratio for Domain B was 0.6, suggesting that this domain may be under purifying selection.
Vaccine development against malaria parasites is not a straightforward procedure. Multistage vaccines have recently been proposed because unique antigens are produced during the different stages of the parasite’s life-cycle. The merozoite has been identified as an important vaccine target due to its mobile and invasive nature, which exposes this stage to the host’s immune responses. Many of the merozoite surface proteins contain polymorphic domains that signify diversifying selection, and conserved domains which indicate functional constraints of the protein. Furthermore, different strains within a Plasmodium species have been found to co-exist; thus, vaccine candidates would need to be strain-transcending, as an antibody generated against the protein from one strain may be ineffective against another. Antigenic diversity in vaccine candidates is one of the hurdles to designing an effective malaria vaccine. In vaccine development, it is a prerequisite to survey the genetic polymorphism of the candidate antigens, particularly across a wide range of field isolates. Furthermore, genetic polymorphism is also an important epidemiological tool. Plasmodium knowlesi has emerged in south-east Asia within the recent decade, and molecular epidemiological investigation may help explain the reasons for this recent emergence.
Although the biological functions of P. vivax and P. knowlesi MSP3 are not fully understood at this juncture, the alanine-rich central core in both proteins is predicted to form a coiled-coil tertiary structure . Being located on the surface of the merozoites, the P. vivax MSP3 has been suggested to interact with other merozoite surface proteins, possibly mediated through protein-protein interactions involving the coiled-coil structure [18, 19] which is similar to what is observed in P. falciparum MSP3 . In the present study, the coiled-coil region of P. knowlesi MSP3 was observed to be conserved. Therefore, similar to P. falciparum and P. vivax MSP3, the P. knowlesi MSP3 coiled-coil region may also utilise protein-protein interaction type bonds to interact with other merozoite surface proteins.
The nucleotide diversity (π: 0.046 ± 0.011) was found to be high when compared to other P. knowlesi functional genes [39–41], considering that most of the haplotypes discovered in this study were unique. A similar observation has also been reported for other merozoite surface antigens such as eba175, and this suggests that even where functional constraints exist, a range of haplotypes can still occur . The low nucleotide diversity in Domain A as compared to that of the full length sequence, suggests limited polymorphism in the domain due to the presence of the coiled-coil region. Sliding window plot analysis (Fig. 2) showed high nucleotide diversity in the C-terminal, a finding also reported in pvmsp3β . Temporal distribution of the haplotypes was not detected and this may be due to the fact that the P. knowlesi isolates were recent and collected within a 4-year period (2008–2012). The possibility of temporal distribution happening within such a short time is unlikely.
The pkmsp3 gene shares significant homology with the P. vivax pvmsp3. A study on pvmsp3 of P. vivax isolates from Korea revealed nucleotide diversity of 0.0727 ± 0.002 and 0.0304 ± 0.001 at the N- and C-terminal domains respectively, which contrasts with the nucleotide diversity of the pkmsp3 domains (N-terminal π: 0.039 ± 0.002; C-terminal π: 0.067 ± 0.025). However, similar to pkmsp3, the C-terminal of pvmsp3 had a dN/dS ratio < 1, indicating purifying selection in that region. A study on pvmsp3 of P. vivax isolates from Thailand found nucleotide diversity of 0.0877 ± 0.005, which is higher than the nucleotide diversity of pkmsp3 (π: 0.046 ± 0.011). Like pkmsp3, the C-terminal of pvmsp3 of the Thailand isolates also showed purifying selection (dN/dS < 1).
Phylogenetic and haplotype network analyses revealed that the P. knowlesi MSP3 haplotypes were clustered into two main groups. The Domain A in particular contributed to this clustering (Fig. 6). To gain a clearer picture of selection, the Z-test and Tajima’s D test for all three sets of sequences were analysed. In this instance, results for both the Z-test and Tajima’s D were not significant for the full length gene, Domain A or Domain B, indicating neutral selection. The dN/dS ratio is widely used to evaluate the effect of natural selection on genes where a lack of dN relative to dS (dN/dS < 1) suggests negative or purifying selection. Conversely, a higher value of dN compared to dS (dN/dS > 1) is indicative of positive selection. The dN/dS ratio for the full length gene as well as Domain A marginally exceeded 1, indicating neutral selection. Domain B, however, had a ratio of 0.6, indicating purifying selection on this part of the gene. Thus, it could be postulated that the P. knowlesi MSP3 has a functionally restricted Domain A which is protected from immune responses by an exposed and polymorphic Domain B.
In the present study, the phylogenetic tree showed separation of the P. knowlesi MSP3 haplotypes into two groups. Studies on P. knowlesi proteins such as the Duffy binding protein (PkDBPαII), Pknbpxa and PkAMA-1 domain I have also reported bifurcation of haplotypes, indicating dimorphism of the genes. These findings provide support to the notion that two distinct P. knowlesi types or lineages exist in south-east Asia. Microsatellite DNA analysis revealed two divergent P. knowlesi populations which have been associated with different macaque reservoir host species. Recently, a whole-genome population study highlighted two major subgroups of P. knowlesi clinical isolates.
To the best of our knowledge, the present study is the first to investigate genetic diversity of the pkmsp3 gene as well as the natural selection acting on it. A moderate level of genetic diversity was observed in the pkmsp3 and only the C-terminal region (Domain B) appeared to be under purifying selection. The separation of the pkmsp3 into two groups of haplotypes provides further evidence of the existence of two distinct P. knowlesi types or lineages. Future studies should investigate the diversity of pkmsp3 among P. knowlesi isolates in North Borneo, a region with reports of the highest number of human knowlesi malaria cases to date.
Escalante AA, Barrio E, Ayala FJ. Evolutionary origin of human and primate malarias: evidence from the circumsporozoite protein gene. Mol Biol Evol. 1995;12:616–26.
Chin W, Contacos PG, Coatney GR, Kimball HR. A naturally acquired quotidian-type malaria in man transferable to monkeys. Science. 1965;149:865.
Fong YL, Cadigan FC, Coatney GR. A presumptive case of naturally occurring Plasmodium knowlesi malaria in man in Malaysia. Trans R Soc Trop Med Hyg. 1971;65:839–40.
Singh B, Kim Sung L, Matusop A, Radhakrishnan A, Shamsul SS, Cox-Singh J, et al. A large focus of naturally acquired Plasmodium knowlesi infections in human beings. Lancet. 2004;363:1017–24.
Ng OT, Ooi EE, Lee CC, Lee PJ, Ng LC, Pei SW, et al. Naturally acquired human Plasmodium knowlesi infection, Singapore. Emerg Infect Dis. 2008;14:814–6.
Luchavez J, Espino F, Curameng P, Espina R, Bell D, Chiodini P, et al. Human Infections with Plasmodium knowlesi, the Philippines. Emerg Infect Dis. 2008;14:811–3.
Jongwutiwes S, Putaporntip C, Iwasaki T, Sata T, Kanbara H. Naturally acquired Plasmodium knowlesi malaria in human, Thailand. Emerg Infect Dis. 2004;10:2211–3.
Vythilingam I, Noorazian YM, Huat TC, Jiram AI, Yusri YM, Azahari AH, et al. Plasmodium knowlesi in humans, macaques and mosquitoes in peninsular Malaysia. Parasit Vectors. 2008;1:26.
Lau YL, Tan LH, Chin LC, Fong MY, Noraishah MA, Rohela M. Plasmodium knowlesi reinfection in human. Emerg Infect Dis. 2011;17:1314–5.
Lee WC, Chin PW, Lau YL, Chin LC, Fong MY, Yap CJ, et al. Hyperparasitaemic human Plasmodium knowlesi infection with atypical morphology in peninsular Malaysia. Malar J. 2013;12:88.
Yusof R, Lau YL, Mahmud R, Fong MY, Jelip J, Ngian HU, et al. High proportion of knowlesi malaria in recent malaria cases in Malaysia. Malar J. 2014;13:168.
Conway DJ. Molecular epidemiology of malaria. Clin Microbiol Rev. 2007;20:188–204.
McColl DJ, Silva A, Foley M, Kun JF, Favaloro JM, Thompson JK, et al. Molecular variation in a novel polymorphic antigen associated with Plasmodium falciparum merozoites. Mol Biochem Parasitol. 1994;68:53–67.
Oeuvray C, Bouharoun-Tayoun H, Grass-Masse H, Lepers JP, Ralamboranto L, Tartar A, et al. A novel merozoite surface antigen of Plasmodium falciparum (MSP-3) identified by cellular-antibody cooperative mechanism antigenicity and biological activity of antibodies. Mem Inst Oswaldo Cruz. 1994;89 Suppl 2:77–80.
Galinski MR, Corredor-Medina C, Povoa M, Crosby J, Ingravallo P, Barnwell JW. Plasmodium vivax merozoite surface protein-3 contains coiled-coil motifs in an alanine-rich central domain. Mol Biochem Parasitol. 1999;101:131–47.
Galinski MR, Ingravallo P, Corredor-Medina C, Al-Khedery B, Povoa M, Barnwell JW. Plasmodium vivax merozoite surface proteins-3beta and-3gamma share structural similarities with P. vivax merozoite surface protein-3alpha and define a new gene family. Mol Biochem Parasitol. 2001;115:41–53.
Jiang J, Barnwell JW, Meyer EV, Galinski MR. Plasmodium vivax merozoite surface protein-3 (PvMSP3): expression of an 11 member multigene family in blood-stage parasites. PLoS ONE. 2013;8:e63888.
Carlton JM, Adams JH, Silva JC, Bidwell SL, Lorenzi H, Caler E, et al. Comparative genomics of the neglected human malaria parasite Plasmodium vivax. Nature. 2008;455:757–63.
Rice BL, Acosta MM, Pacheco MA, Carlton JM, Barnwell JW, Escalante AA. The origin and diversification of the merozoite surface protein 3 (msp3) multi-gene family in Plasmodium vivax and related parasites. Mol Phylogenet Evol. 2014;78:172–84.
Rayner JC, Huber CS, Feldman D, Ingravallo P, Galinski MR, Barnwell JW. Plasmodium vivax merozoite surface protein PvMSP-3 beta is radically polymorphic through mutation and large insertions and deletions. Infect Genet Evol. 2004;4:309–19.
Jimenez MC, Ramos CH, Barbosa JA, Galinski MR, Barnwell JW, Rodrigues MM, et al. Biophysical characterization of the recombinant merozoite surface protein-3 of Plasmodium vivax. Biochim Biophys Acta. 2008;1780:983–8.
Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989;123:585–95.
Hahn MW, Rausher MD, Cunningham CW. Distinguishing between selection and population expansion in an experimental lineage of bacteriophage T7. Genetics. 2002;161:11–20.
Clark AG. Population genetics: malaria variorum. Nature. 2002;418:283–5.
Paul RE, Day KP. Mating patterns of Plasmodium falciparum. Parasitol Today. 1998;14:197–202.
Kimura M. The neutral theory of molecular evolution: a review of recent evidence. Jpn J Genet. 1991;66:367–86.
Brunham RC, Plummer FA, Stephens RS. Bacterial antigenic variation, host immune response, and pathogen-host coevolution. Infect Immun. 1993;61:2273–6.
Conway DJ. Natural selection on polymorphic malaria antigens and the search for a vaccine. Parasitol Today. 1997;13:26–9.
Conway DJ, Cavanagh DR, Tanabe K, Roper C, Mikes ZS, Sakihama N, et al. A principal target of human immunity to malaria identified by molecular population genetic and immunological analyses. Nat Med. 2000;6:689–92.
Conway DJ, Polley SD. Measuring immune selection. Parasitology. 2002;125(Suppl):S3–16.
Polley SD, Conway DJ. Strong diversifying selection on domains of the Plasmodium falciparum apical membrane antigen 1 gene. Genetics. 2001;158:1505–12.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.
NETWORK v184.108.40.206, a programme for haplotype analysis downloaded from http://www.fluxus-engineering.com. Accessed 1 Dec 2016.
Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–2.
Depaulis F, Veuille M. Neutrality tests based on the distribution of haplotypes under an infinite-site model. Mol Biol Evol. 1998;15:1788–90.
Fu YX, Li WH. Statistical tests of neutrality of mutations. Genetics. 1993;133:693–709.
Hudson RR, Slatkin M, Maddison WP. Estimation of levels of gene flow from DNA sequence data. Genetics. 1992;132:583–9.
Nei M, Gojobori T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. 1986;3:418–26.
Fong MY, Lau YL, Chang PY, Anthony CN. Genetic diversity, haplotypes and allele groups of Duffy binding protein (PkDBPαII) of Plasmodium knowlesi clinical isolates from Peninsular Malaysia. Parasit Vectors. 2014;7:161.
Faber BW, Abdul Kadir K, Rodriguez-Garcia R, Remarque EJ, Saul FA, Vulliez-Le Normand B, et al. Low levels of polymorphisms and no evidence for diversifying selection on the Plasmodium knowlesi Apical Membrane Antigen 1 gene. PLoS ONE. 2015;10:e0124400.
Rawa MS, Fong MY, Lau YL. Genetic diversity and natural selection in the rhoptry-associated protein 1 (RAP-1) of recent Plasmodium knowlesi clinical isolates from Malaysia. Malar J. 2016;15:62.
Escalante AA, Lal AA, Ayala FJ. Genetic polymorphism and natural selection in the malaria parasite Plasmodium falciparum. Genetics. 1998;149:189–202.
Snounou G, White NJ. The co-existence of Plasmodium: sidelights from falciparum and vivax malaria in Thailand. Trends Parasitol. 2004;20:333–9.
McColl DJ, Anders RF. Conservation of structural motifs and antigenic diversity in the Plasmodium falciparum merozoite surface protein-3 (MSP-3). Mol Biochem Parasitol. 1997;90:21–31.
Schultz L, Wapling J, Mueller I, Ntsuke PO, Senn N, Nale J, et al. Multilocus haplotypes reveal variable levels of diversity and population structure of Plasmodium falciparum in Papua New Guinea, a region of intense perennial transmission. Malar J. 2010;9:336.
Escalante AA, Cornejo OE, Rojas A, Udhayakumar V, Lal AA. Assessing the effect of natural selection in malaria parasites. Trends Parasitol. 2004;20:388–95.
Kang JM, Ju HL, Cho PY, Moon SU, Ahn SK, Sohn WM, et al. Polymorphic patterns of the merozoite surface protein-3β in Korean isolates of Plasmodium vivax. Malar J. 2014;13:104.
Putaporntip C, Miao J, Kuamsab N, Sattabongkot J, Sirichaisinthop J, Jongwutiwes S, et al. The Plasmodium vivax merozoite surface protein 3β sequence reveals contrasting parasite populations in southern and northwestern Thailand. PLoS Negl Trop Dis. 2014;8:e3336.
Pinheiro MM, Ahmed MA, Millar SB, Sanderson T, Otto TD, Lu WC, et al. Plasmodium knowlesi genome sequences from clinical isolates reveal extensive genomic dimorphism. PLoS ONE. 2015;10:e0121303.
Fong MY, Wong SS, De Silva JR, Lau YL. Genetic polymorphism in domain I of the apical membrane antigen-1 among Plasmodium knowlesi clinical isolates from Peninsular Malaysia. Acta Trop. 2015;152:145–50.
Muehlenbein MP, Pacheco MA, Taylor JE, Prall SP, Ambu L, Nathan S, et al. Accelerated diversification of nonhuman primate malarias in southeast Asia: adaptive radiation or geographic speciation? Mol Biol Evol. 2015;32:422–39.
Divis PC, Singh B, Anderios F, Hisam S, Matusop A, Kocken CH, et al. Admixture in humans of two divergent Plasmodium knowlesi populations associated with different macaque host species. PLoS Pathog. 2015;11:e1004888.
Assefa S, Lim C, Preston MD, Duffy CW, Nair MB, Adroub SA, et al. Population genomic structure and adaptation in the zoonotic malaria parasite Plasmodium knowlesi. Proc Natl Acad Sci U S A. 2015;112:13027–32.
We thank the Department of Parasitology Diagnostic Laboratory, Faculty of Medicine, University of Malaya and University of Malaya Medical Centre for providing patient blood samples.
This research project was supported by the University Malaya Postgraduate Research Grant (PG054-2016A) awarded to JRDS.
Availability of data and materials
The data supporting the conclusions of this article are included within the article and its Additional file 1. The nucleotide sequences of the pkmsp3 gene generated in this study are available in the GenBank database under accession numbers KT900798–KT900845.
MYF and YLL designed the study and supervised the study process. JRDS performed all the experiments and analyzed the sequence data. JRDS and MYF performed sequence and phylogenetic analyses. JRDS, MYF and YLL wrote the manuscript. All authors read and approved the final version of the manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Ethical clearance for this study was obtained from University of Malaya Medical Ethics Committee (Ref No. 817.18). Consent was obtained from patients prior to collection and they were informed of the use of these samples for research. This consent procedure was approved by the ethics committee.
Multiple alignment of full amino acid sequences of pkmsp3. The yellow columns are the variable amino acid positions. The region highlighted in red at the top of the alignment indicates Domain A, and the region in green indicates Domain B. (XLS 389 kb)