Skip to main content

Analysis of codon usage bias of thioredoxin in apicomplexan protozoa

Abstract

Background

Apicomplexan protozoa are a diverse group of obligate intracellular parasites causing many diseases that affect humans and animals, such as malaria, toxoplasmosis, and cryptosporidiosis. Apicomplexan protozoa possess unique thioredoxins (Trxs) that have been shown to regulate various cellular processes including metabolic redox regulation, parasite survival, and host immune evasion. However, it is still unknown how synonymous codons are used by apicomplexan protozoa Trxs.

Methods

Codon usage bias (CUB) is the unequal usage of synonymous codons during translation which leads to the over- or underrepresentation of certain nucleotide patterns. This imbalance in CUB can impact a variety of cellular processes including protein expression levels and genetic variation. This study analyzed the CUB of 32 Trx coding sequences (CDS) from 11 apicomplexan protozoa.

Results

The results showed that both codon base composition and relative synonymous codon usage (RSCU) analysis revealed that AT-ended codons were more frequently used in Cryptosporidium spp. and Plasmodium spp., while the Eimeria spp., Babesia spp., Hammondia hammondi, Neospora caninum, and Toxoplasma gondii tended to end in G/C. The average effective number of codon (ENC) value of these apicomplexan protozoa is 46.59, which is > 35, indicating a weak codon preference among apicomplexan protozoa Trxs. Furthermore, the correlation analysis among codon base composition (GC1, GC2, GC3, GCs), codon adaptation index (CAI), codon bias index (CBI), frequency of optimal codons (FOP), ENC, general average hydropathicity (GRAVY), aromaticity (AROMO), length of synonymous codons (L_sym), and length of amino acids (L_aa) indicated the influence of base composition and codon usage indices on CUB. Additionally, the neutrality plot analysis, PR2-bias plot analysis, and ENC-GC3 plot analysis further demonstrated that natural selection plays an important role in apicomplexan protozoa Trxs codon bias.

Conclusions

In conclusion, this study increased the understanding of codon usage characteristics and genetic evolution of apicomplexan protozoa Trxs, which expanded new ideas for vaccine and drug research.

Graphical Abstract

Background

Proteins serve as the primary agents responsible for biological functions and are primarily comprised of 20 standard amino acids. The 20 standard amino acids are denoted by 64 codons, out of which 61 encode amino acids, while the remaining three represent translation stop signals. With the exception of methionine (Met) and tryptophan (Trp), which are represented by a solitary codon, most species employ various synonymous codons to encode the remaining 18 amino acids [1,2,3]. Despite undergoing evolution over time, the genetic code remains highly conserved and permits the use of diverse codons or synonymous codons for encoding the same amino acid [4,5,6]. The frequency of synonymous codon usage is non-uniform and often random across various organisms, genes, or even the same gene among different species. In many cases, some codons are favored over others for amino acid encoding purposes [7,8,9]. Codon usage bias (CUB) is a prevalent occurrence wherein synonymous codons manifest with distinct frequencies [10,11,12,13]. Throughout the course of evolution, synonymous mutations, also known as “silent mutations,” are not anticipated to modify the original sequence or primary structure of proteins. Therefore, variations in synonymous codons among organisms can significantly contribute to genome evolution [14]. Many previous studies have noted that multiple factors affect CUB in different organisms, of which the basic factors for CUB are assumed to be a balance between natural selection (e.g. translational selection, gene length, and gene function) and mutation bias (such as GC content and mutation position of base) as well as the influence of random genetic drift [14,15,16,17,18]. CUB is known to have a significant impact on a wide range of cellular processes such as mRNA stability, transcription, translation efficiency and accuracy, as well as protein structure, folding, expression, and function. Additionally, there are various significant practical applications for understanding CUB, including heterologous gene expression [19], identifying species origins [6, 20], designing degenerate primers [21], predicting gene expression levels [22, 23], predicting gene functions [24, 25], and designing synthetic genes for biotechnological applications [26]. However, most of the numerous studies on CUB have focused on bacterium, fungi, viruses, and mycoplasma [27,28,29,30,31]. Thus far, the genetic features of codon bias in parasites, particularly in apicomplexan protozoa, have not been comprehensively comprehended.

Apicomplexans are a diverse group of protozoa that are obligate intracellular parasites and are responsible for causing many diseases that affect humans and animals, including Toxoplasma gondii, Neospora caninum, Plasmodium spp., Cryptosporidium spp., Eimeria spp., Babesia spp., Theileria spp. [32,33,34,35,36,37,38]. They have a complex life cycle involving multiple hosts and typically have an apical complex that aids in penetrating host cells. Their cell structure includes a complex organelle called the apicoplast, which is derived from secondary endosymbiosis and is essential for parasite survival. Apicomplexans have been a topic of research due to their unique features, pathogenicity, and impact on global health. Multiple studies imply the invasion process of apicomplexans is mediated by many invasion-related protein molecules, including microneme proteins, rhoptry proteins, dense granule proteins, surface antigen proteins [39,40,41,42,43,44]. In recent years, many studies have shown that thioredoxin (Trx) is also involved in the invasion process of apicomplexan protozoa. Trx is a redox enzyme that regulates cellular redox homeostasis by catalyzing the reduction of disulfide bonds in proteins. Apicomplexan protozoa possess unique Trxs that have been shown to regulate various cellular processes including metabolic redox regulation, parasite survival, and host immune evasion in T. gondii, Plasmodium falciparum, N. caninum, Babesia spp., and Cryptosporidium spp. [45,46,47,48,49,50,51]. The Trx systems in apicomplexan protozoa have been identified as potential targets for the development of novel antiparasitic drugs. The functional domain of Trx in apicomplexan protozoa is conserved, but the coding sequences are vastly different, and research on apicomplexan protozoa Trx codon usage is rare. In this study, we systematically analyzed and compared the CUB of Trxs of 32 sequences from 11 apicomplexan protozoa, including Babesia spp., Besnoitia besnoiti, Cryptosporidium spp., Cyclospora cayetanensis, Eimeria spp., Gregarina niphandrodes, Hammondia hammondi, N. caninum Liverpool, Plasmodium spp., Theileria spp., and T. gondii. The phylogenetic tree of apicomplexan protozoa was constructed based on the relative synonymous codon usage of Trxs, which was compared with the phylogenetic tree constructed according to the Trx coding sequences (CDSs). Therefore, analyzing the CUB can provide further information on the genetics and evolution of species and help accurately predict the function and expression regulation mechanisms of related genes.

Methods

Sequences

A total of 32 Trxs complete coding sequences from 11 apicomplexan protozoa were retrieved from the National Center for Biotechnology Information (NCBI) GenBank database (https://www.ncbi.nlm.nih.gov/genbank/) for subsequent CUB analysis. Detailed information about the overall 32 Trx CDS is listed in Additional file 1: Table S1.

Analysis of codon base composition

In this study, CodonW software was used to determine the contents of the nucleotide at the third codon location (C3, T3, G3, and A3%) for all synonymous codons in apicomplexan protozoa Trxs. Furthermore, the GC% contents of all three codon locations (GC1, GC2, and GC3%) and total GCs% and ATs% contents were measured. Only 59 synonymous codons encoding 18 amino acids were considered for the present study, not including the first AUG codon (Met), the codon (UGG) encoding Trp, and the three termination codons (UAG, UAA, and UGA), respectively [52].

Analysis of codon usage indices

Mutational pressure and natural selection are two key factors for codon bias. For this, many statistical methods have been proposed to analyze the codon usage indices and then determine which one is the driving force in this study. The codon adaptation index (CAI) is applied to calculate the gene expression level depending on its codon-based sequence through an online tool used for CAI calculation. It ranges from zero to one; the larger the value is, the more frequent the CUB. Thus, CAI is useful for predicting the expression level of a particular gene [53]. The codon bias index (CBI) is used as a standard to evaluate gene expression, which reflects the components of highly expressed superior codons in a specific gene [54]. The frequency of optimal codons (FOP) is calculated by counting the ratio of the optimal codon number to the total synonymous codon number in one specific gene. The FOP value varies and ranges from 0.36 (which means the codon usage bias is weak) to 1 (which means the codon usage bias is strong). The value of CBI near zero indicates all codons are completely randomly used [55]. The effective number of codons (ENC) refers to the number of effective codons used in one specific gene. The ENC value varies and ranges from 20 (which means that only one codon is used for each amino acid) to 61 (which means that each codon is used on average). In addition, if the value of ENC is < 35, the codon usage bias is strong; if it is > 35, the codon is randomly used [56]. The general average hydropathicity (GRAVY) values were calculated by the arithmetic mean of the sum of the hydropathic indices of each amino acid. GRAVY values range from − 2 to 2; positive and negative values represent hydrophobic and hydrophilic proteins, respectively [57]. The aromaticity (AROMO) value represents the frequency of aromatic amino acids (Phe, Tyr, and Trp) in a specific gene [58]. The length of synonymous codons (L_sym) and length of amino acids (L_aa) are the two indices which represent the number of synonymous codons and the number of translatable codons, respectively. The variation in amino acid composition can also influence the analysis results of codon usage [59].

Analysis of relative synonymous codon usage

Relative synonymous codon usage (RSCU) value was calculated by dividing the amino acids encoded by the same codons and their probability of appearing in the same codons. An RSCU value > 1 indicates a positive codon bias (RSCU value > 1.6 indicates a strong positive codon bias), an RSCU value < 1 indicates a negative codon bias, and an RSCU value = 1 indicates a random codon usage [60].

Neutrality plot analysis

The neutrality plot can explain the balance between mutation pressure and natural selection in specific genes. The line of regression slope between GC3 and GC12 (the average GC codon content in GC1 and GC2) indicates that mutation pressure is the major factor affecting CUB when values come close to 1. In contrast, if there is no correlation between GC12 and GC3, the value comes close to 0, and then the main driving force of the tested gene is natural selection [61].

PR2-bias plot analysis

Parity Rule 2 bias (PR2-Bias) plot analyses were performed based on [A3/(A3 + U3) vs. G3/(G3 + C3)]. If the codon had no usage bias, A = T and C = G, the value was in the center point of the plot. In contrast, the other vectors emitted from the center point indicate the degree and direction of the gene bias [62].

ENC-GC3 plot analysis

The ENC-GC3 plot (ENC vs. GC3) is usually used to analyze the influencing factor of CUB in a specific gene, such as mutation pressure and natural selection. The ENC-GC3 diagram consists of the ordinate ENC value and abscissa GC3 value, and the standard curve shows the functional relation between ENC and GC3. If the corresponding points are distributed around or on the standard curve, we can conclude that the mutation pressure is an independent force in CUB. If the corresponding point is lower or far from the standard curve, the natural selection factor may play a key role in the formation of codon bias [63].

Correlation analysis

Correlation analysis was performed to illustrate the relationship among codon base composition (GC1, GC2, GC3, GCs), CAI, CBI, FOP, ENC, GRAVY, AROMO, L_sym, and L_aa of apicomplexan protozoa Trxs. Spearman’s rank correlation method was applied in correlation analysis. All processes were executed using the R corrplot package [64].

Phylogenetic analysis

The clustering analysis to the RSCU of Trxs was made among 32 representative apicomplexan protozoa using the method of squared Euclidean distance [2]. The phylogenetic tree was constructed using the neighbor-joining method by MEGA 11.0 (https://www.megasoftware.net/), and a cluster heat map was generated by Hemi 1.0 software (http://hemi.biocuckoo.org/down.php).

Software used

All indices of codon usage bias above were calculated in the data set using the program CodonW 1.4.2 (http://codonw.sourceforge.net/). Clustering and correlation analyses were conducted using the statistical software SPSS 18.0. Graphs were generated in GraphPad Prism 6.01 (http://www.graphpad.com/scientific-software/prism/).

Results

Results of codon base composition in apicomplexan protozoa Trxs

CUB can be considerably influenced by the general base composition of genomes. We selected 32 Trxs from the 11 apicomplexan protozoa for codon usage analysis (Additional file 1: Table S1). Statistical analysis found that the encoding region length of these Trx ranged from 255 to 1665 bp, with the Plasmodium vivax Trx gene having the longest length and the Eimeria necatrix Trx gene having the shortest. We further calculated the base composition of 32 Trxs, and our outcomes disclosed that Plasmodium spp. and Cryptosporidium spp. are rich in the A3, T3, and ATs bases, and Eimeria spp. and Babesia spp. are rich in the G3, C3, and GCs bases (Fig. 1, Additional file 2: Table S2). The content of T3% is most in Cryptosporidium muris (56.52%) and least in E. necatrix (9.86%). The A3% content of Plasmodium yoelii (69.15%) is at a maximum level higher than that in other apicomplexan protozoa, while the content of G3% (3.27%) and C3% (9.46%) in P. yoelii is least among these apicomplexan protozoa (Fig. 1A, Additional file 2: Table S2). In addition, nucleotide content analysis at the first, second, and third synonymous codon positions showed that the values of GC1% ranged from 31.31 to 63.12% (mean: 44.52%), while GC3% ranged from 12.11 to 85.78% (mean: 46.98%). However, the GC2% values ranged from 23.1 to 62.75%; the average value is the lowest (mean: 32.57%, Fig. 1B, Additional file 2: Table S2).

Fig. 1
figure 1

Codon base composition in apicomplexan protozoa Trxs. The relative content of codon base composition in apicomplexan protozoa Trxs were calculated. A List the contents of the nucleotide at the third codon location (C3, T3, G3, and A3%) in apicomplexan protozoa Trxs. B GC% contents of all three codon locations (GC1, GC2, and GC3%) and total GCs% and ATs% contents in apicomplexan protozoa Trxs. X-axis represents percentages, while Y-axis represents different apicomplexan protozoa

Results of codon usage index analysis in apicomplexan protozoa Trxs

We calculated the CAI values of 32 Trxs from the 11 apicomplexan protozoa and found that the CAI values of Trxs ranged from 0.171 to 0.373 (Table 1). Among them, C. muris had the lowest CAI value, while the E. necatrix had the highest CAI value, indicating that the E. necatrix gene had a high codon bias. In terms of species, the Trxs of Eimeria spp. has the highest CAI value, followed by H. hammondi, while the CAI values of Cryptosporidium spp. are the lowest, indicating that Eimeria spp. have strong codon bias. The CBI values of the 32 Trxs that were detected ranged from − 0.209 to 0.415 (Table 1). Among them, Cryptosporidium ubiquitum had the lowest CBI value, and E. necatrix had the highest, which has a strong codon bias. The average FOP values ranged from 0.305 to 0.679 among the 32 Trxs detected, while C. muris had the lowest FOP value and E. necatrix had the highest FOP value with strong codon bias (Table 1). We further calculated the GRAVY values of 32 Trxs and the results showed that 25 of the 32 Trxs had negative GRAVY values, indicating that they might be hydrophilic proteins, while most Theileria spp. were considered hydrophobic (Table 1). The frequency of aromatic amino acids (AROMO value) ranges from 0.049 to 0.153 (Table 1). Babesia bovis has the highest AROMO value, while Eimeria mitis is the lowest. The AROMO values of different apicomplexan protozoa Trxs varied obviously, with an average of 0.103. The average ENC value of all 32 Trxs detected ranged from 30.77 to 61, with an average ENC value of 46.59. Only the ENC value of E. necatrix was < 35, and the others were more than 35, even equal to 61, indicating that these genes had a weak codon usage preference (Table 1). The data of L_sym (range from 81 to 534) and L_aa (range from 85 to 555) are listed in Table 1.

Table 1 Codon usage indices in apicomplexan protozoa Trxs

Defining codon usage patterns in apicomplexan protozoa Trxs

An RSCU analysis was used to regulate the identical pattern of codon usage in the Trxs of apicomplexan protozoa. CUB was found to occur among these parasites, and 31 of the 32 apicomplexan protozoa contained > 24 positive codon bias (RSCU ≥ 1), except Babesia bigemina (including 23 positive codon bias, Fig. 2, Additional file 3: Table S3). In addition, > 6 high-frequency codons (RSCU ≥ 1.6) among the 32 apicomplexan protozoa with 19 high-frequency codons in Plasmodium berghei ANKA, P. yoelii, and B. besnoiti indicate Plasmodium spp. have a stronger positive codon bias and only six high-frequency codons in B. bovis. Furthermore, from the RSCU analysis, we found that the most abundantly used codons in 32 apicomplexan protozoa are AGC (Ser) and UUA (Leu), while CGG (Arg) is seldom used, even never used in Cryptosporidium spp., B. besnoiti, C. cayetanensis, and H. hammondi. Among the optimal codons, the AGA (Arg), AGC (Ser), and AGG (Arg) have the highest value (RSCU = 6), followed by AGC (Ser, RSCU = 5.36) and CGC (Arg, RSCU = 4.5), indicating the strongest positive codon bias, while AAG (Lys) has the lowest value (RSCU = 0.04) among the 59 synonymous codons. In addition, GCA (Ala) is used as the optimal codon in Cryptosporidium spp., AGC (Ser) is used as the optimal codon in Eimeria spp., and CGC (Arg) is used as the optimal codon in B. besnoiti, H. hammondi, N. caninum, and T. gondii.

Fig. 2
figure 2

RSCU values in apicomplexan protozoa Trxs. Relative synonymous codon usage (RSCU) value was calculated by dividing the amino acids encoded by the same codons and their probability of appearing in the same codons. The color of the color block changes from blue to red, indicating that the RSCU values are increasing, of which an RSCU value > 1 indicates a positive codon bias. The homology of codons are also shown on the right side of the figure

Results of neutrality plot analysis in apicomplexan protozoa Trxs

A plot of neutrality was performed, which implied the relationships between GC12 and GC3 composition to determine the position of mutation pressure and natural selection that has an impact on the CUB form. The GC12 content varied from 27.21 to 56.54%, and the GC3 content varied from 12.11 to 85.78% (Additional file 2: Table S2). To observe the association, we programmed a paradigm on the plot of neutrality between GC12 and GC3 for the 32 Trxs in apicomplexan protozoa. These 32 apicomplexan protozoa were divided into six groups: (A) Babesia spp., (B) Cryptosporidium spp., (C) Eimeria spp., (D) Plasmodium spp., (E) Theileria spp., and (F) others (including B. besnoiti, C. cayetanensis, G. niphandrodes, H. hammondi, N. caninum, and T. gondii). The slopes of the regression lines ranged from − 0.1598 (Eimeria spp.) to 0.5124 (Theileria spp.), indicating that the content of GC12 and GC3 in apicomplexan protozoa Trxs is weakly associated (Fig. 3). In addition, the R2 value of the standard curve ranged from 0.993 (Eimeria spp.) to 0.8491 (Plasmodium spp.). There was no significant correlation between GC12 value and GC3 value (p > 0.05), which indicated that natural selection may play an important role in driving the evolution of Trxs in apicomplexan protozoa. This phenomenon is similar to the findings of previous studies.

Fig. 3
figure 3

Neutrality plot analysis in apicomplexan protozoa Trxs. The correlations between the average GC codon content in GC1 and GC2 (GC12) and the third codon position (GC3) were analyzed and the standard curve and R2 in apicomplexan protozoa Trxs, respectively. X-axis represents GC3%, while Y-axis represents GC12%. A Babesia spp., B Cryptosporidium spp., C Eimeria spp., D Plasmodium spp., E Theileria spp., F others (including Besnoitia besnoiti, Cyclospora cayetanensis, Gregarina niphandrodes, Hammondia hammondi, Neospora caninum, and Toxoplasma gondii)

Results of PR2-bias plot analysis in apicomplexan protozoa Trxs

To determine whether Trxs in apicomplexan protozoa have biases, we further performed a Parity Rule 2 (PR2) plot analysis (Fig. 4). Both axes were centered on 0.5 to divide the plot into four quadrants. In the first quadrant, the optimal codons are A and G, and the optimal codons are T and C in the third quadrant. The Babesia spp., B. besnoiti, C. cayetanensis, H. hammondi, N. caninum, and T. gondii prefer codon T to A (Fig. 4A, F). The optimal codon in Cryptosporidium spp. is G (Fig. 4B). Most of the dots were found to be distributed in the second quadrant of the Eimeria spp. and Plasmodium spp. (preferring A to T and C to G, Fig. 4C, D), with random codon usage in Theileria spp. (Fig. 4E). The analysis results showed that other factors, such as natural selection, play an important role in the process of codon bias in apicomplexan protozoa.

Fig. 4
figure 4

PR2-bias plot analysis in apicomplexan protozoa Trxs. The correlations between A3/(A3 + U3) and G3/(G3 + C3) were analyzed in apicomplexan protozoa Trxs, respectively. If the codon has no usage bias, A = T and C = G, the value was in the center point of the plot. The first quadrant represents the codon preference of A/G, and the third quadrant represents T/C preference. X-axis represents GC3%, while Y-axis represents GC12%. A Babesia spp., B Cryptosporidium spp., C Eimeria spp., D Plasmodium spp., E Theileria spp., F others (including Besnoitia besnoiti, Cyclospora cayetanensis, Gregarina niphandrodes, Hammondia hammondi, Neospora caninum, and Toxoplasma gondii)

Results of ENC-GC3 plot analysis in apicomplexan protozoa Trxs

To further confirm the influence of GC3s on the codon bias of Trxs in apicomplexan protozoa, a distribution plot was employed that deviated from the same usage of indistinguishable codons (Fig. 5). In this study, ENC values were used against the GC3, and the standard curve indicates that the functional relationship between ENC and GC3 is influenced by mutation pressure rather than natural selection. If the GC subject of the gene exhibits mutational pressure, all the points in this plot will lie on the expected curve, indicating random codon usage. However, if there was natural selection pressure on the gene, most of the points were below the expected curve and just a few points beyond it (Babesia bovis, B. microti, B. ovata, Eimeria tenella, Plasmodium knowlesi, P. malariae, G. niphandrodes). The results showed that all of the points were closed to the standard curve without lying on it, which indicates that mutation pressure is not the only factor that shapes codon bias, and natural selection also plays a key role in codon bias formation.

Fig. 5
figure 5

ENC-GC3 plot analysis in apicomplexan protozoa Trxs. The correlations between the effective number of codons (ENC) and the contents of the nucleotide G/C at the third codon location (GC3) were analyzed in apicomplexan protozoa Trxs, respectively. The standard curve represents the functional relationship between ENC and GC3 under mutation pressure rather than natural selection. X-axis represents GC3%, while Y-axis represents ENC. A Babesia spp., B Cryptosporidium spp., C Eimeria spp., D Plasmodium spp., E Theileria spp., F others (including Besnoitia besnoiti, Cyclospora cayetanensis, Gregarina niphandrodes, Hammondia hammondi, Neospora caninum, and Toxoplasma gondii)

Results of correlation analysis in apicomplexan protozoa Trxs

To intuitively display the indices related to the 12 main contributors, correlations of the important indices were calculated to determine the important factors that result in codon bias (Fig. 6). In Babesia spp., the values of GC1, GC2, ENC, GRAVY, and AROMO did not correlate with other indices, while GC3 was correlated with GCs, CAI, CBI, and FOP (p < 0.05, Fig. 6A). In addition, we did not observe a significant correlation between GC1 and GC2 or GC3 in Babesia spp., Cryptosporidium spp., Eimeria spp., and Theileria spp., except Plamodium spp. (Fig. 6). CBI value was significantly correlated with the FOP among these apicomplexan protozoa (p < 0.01). There was a significant correlation between the ENC and GC1 contents in Eimeria spp. and Plasmodium spp., which might lead to an assumption about the usage of synonymous codons suffered from natural selection (Fig. 6C, D). Furthermore, only a few indices correlate with Theileria spp. (Fig. 6E); however, almost all indices correlate with Plasmodium spp. (Fig. 6D), which indicated both mutation pressure and natural selection play a key role in codon bias formation.

Fig. 6
figure 6

Correlation analysis in apicomplexan protozoa Trxs. The correlations among codon base composition (GC1, GC2, GC3, GCs), codon adaptation index (CAI), codon bias index (CBI), frequency of optimal codons (FOP), effective number of codons (ENC), general average hydropathicity (GRAVY), aromaticity (AROMO), length of synonymous codons (L_sym), and length of amino acids (L_aa) were analyzed in apicomplexan protozoa Trxs, respectively. The color of the color block changes from blue to red, indicating that the correlation is increasing. X-axis represents correlation, while Y-axis represents different apicomplexan protozoa. A Babesia spp., B Cryptosporidium spp., C Eimeria spp., D Plasmodium spp., E Theileria spp., F others (including Besnoitia besnoiti, Cyclospora cayetanensis, Gregarina niphandrodes, Hammondia hammondi, Neospora caninum, and Toxoplasma gondii). One asterisk (*) indicates a significant correlation among indices at p < 0.05; two asterisks (**) indicate the correlation at p < 0.01; Three asterisks (***) indicate the correlation at p < 0.001

Results of phylogenetic analysis in apicomplexan protozoa Trxs

To assess the consequence of evolutionary procedures on the Trxs in apicomplexan protozoa codon usage patterns, 32 apicomplexan protozoa RSCU values of Trxs were used for the cluster analysis (Fig. 7A). The results showed that all the species are divided into two big clusters at the evolution distance; the Babesia spp. and Theileria spp. were also separated into different clusters, respectively, while Plasmodium spp. and Cryptosporidium spp. were in the same cluster. Compared with the phylogenetic relationship based on RSCU, a phylogenetic analysis was used by CDS through the neighbor-joining method (Fig. 7B). Based on CDS phylogenetic analysis, the Babesia spp. and Theileria spp. were in different evolutionary clades of the same cluster, which is closer to the real evolution.

Fig. 7
figure 7

Phylogenetic analysis in apicomplexan protozoa Trxs. The evolution analyses among apicomplexan protozoa Trxs were clustered by relative synonymous codon usage (RSCU) value (A, the color of the color block changes from blue to red, indicating that the RSCU values are increasing) and coding sequences (B, through the neighbor-joining method), respectively

Discussion

Across long-term evolution, organisms will eventually develop a specific set of codon usages, which preserves the conveyance of genetic information between nucleotides and amino acids [65, 66]. Nevertheless, disparate genes of the same or distinct species display varying predilections towards codon usage [67]. Consequently, CUB analysis offers valuable insights into the regulatory mechanisms of translation processes and facilitate exogenous gene prediction and optimization for improved expression levels through industrial modification [59, 68]. To date, the characteristics of codon usage for thioredoxin genes of apicomplexan protozoa have not been fully understood.

Trx is a type of redox protein, which plays an important role in metabolic redox regulation, parasite survival, host immune evasion, and the invasion process of apicomplexan protozoa [69,70,71,72,73,74]. The length and codon base composition of Trxs in apicomplexan protozoa showed large variations, indicating the differentiation of apicomplexan protozoa Trxs. It is reported that the difference in synonymous codons is mainly reflected in the difference in the third codon. In this study, we found that the Cryptosporidium spp. and Plasmodium spp. tend to end with A/T, which is similar to previous research on P. falciparum, Mycoplasma capricolum, and Onchocerca volvulus, being enriched with A and T. Eimeria spp., Babesia spp., H. hammondi, N. caninum, and T. gondii, were rich in C3/G3, which proved that one specific gene shows diverse codon usage bias in different species and the results are consistent with the feature of apicomplexan protozoa codon usage in other genes [75, 76]. Most high-frequency Trx codons analyzed by RSCU also show the same tendency of using the third codon in apicomplexan protozoa. In addition, the CAI, CBI, and Fop values of E. necatrix were the highest, which indicates a strong codon bias. An ENC value < 35 indicates a strong codon preference [77, 78]. The average ENC of these 32 apicomplexan protozoa was 46.59 in this study; all ENC values except Eimeria necatrix (30.77) were > 35, which indicates a weak codon preference among apicomplexan protozoa. Furthermore, we detected the correlations among codon base composition (GC1, GC2, GC3, GCs), CAI, CBI, FOP, ENC, GRAVY, AROMO, L_sym, and L_aa, indicating the influence of base composition and codon usage indices on CUB, which show a significant correlation in Plasmodium spp. The neutrality plot analysis, PR2-bias plot analysis, and ENC-GC3 plot analysis further demonstrated that natural selection plays an important role in Trxs of apicomplexan protozoa codon bias. Despite some differences in codon usage indices among apicomplexan protozoa, their common point was that CUB of Trx was affected by strong natural selection.

Apicomplexans are a class of obligate intracellular parasitic protozoa, with a large geographical distribution, which are important pathogens for humans and animals and can cause serious zoonotic diseases such as malaria, toxoplasmosis, and cryptosporidiosis [35, 39,40,41,42,43,44, 79]. Besides, apicomplexans are believed to have been obtained from Protista, dividing into aconoidasida and conoidasida, including T. gondii, Plasmodium spp., Cryptosporidium spp., Eimeria spp., Babesia spp., Theileria spp., and N. caninum. At present, the RSCU clustering and CDS phylogenetic tree are widely used for analyzing the evolutionary relationship of the same gene in different species. These two clustering analysis methods have consistent results in some species, while others differ significantly [2]. In this study, we analyzed the relationship of Trxs in different apicomplexan protozoa based on CDS and RSCU, respectively. Actually, the phylogenetic relationships based on CDS are more reliable, which is different from the RSCU-based relationships, especially for the Babesia spp. and Theileria spp. However, the genetic relationship between some species was correctly interpreted according to the RSCU value, which was consistent with other studies [60, 80]. The results show that the phylogenetic results based on RSCU can be an important supplement to the phylogenetic results based on the sequence.

Conclusions

Many factors can result in the CUB of organisms. For the Trxs in apicomplexan protozoa, natural selection is found to dominate the high CUB. We believe that mutation pressure only plays a relatively minor role. Moreover, our study provides new insight into the exploration of setting up new methods for species taxonomy, though a trial still needs to be conducted in the future.

Availability of data and materials

All data associated with this study are present in the paper or the Additional files. Any other relevant data are available from the corresponding author upon reasonable request.

References

  1. Brule CE, Grayhack EJ. Synonymous codons: choose wisely for expression. Trends Genet. 2017;33:283–97. https://doi.org/10.1016/j.tig.2017.02.001.

    Article  CAS  PubMed  Google Scholar 

  2. Jiang Y, Neti SS, Sitarik I, Pradhan P, To P, Xia Y, et al. How synonymous mutations alter enzyme structure and function over long timescales. Nat Chem. 2023;15:308–18. https://doi.org/10.1038/s41557-022-01091-z.

    Article  CAS  PubMed  Google Scholar 

  3. Parvathy ST, Udayasuriyan V, Bhadana V. Codon usage bias. Mol Biol Rep. 2022;49:539–65. https://doi.org/10.1007/s11033-021-06749-4.

    Article  CAS  PubMed  Google Scholar 

  4. Bailey SF, Alonso Morales LA, Kassen R. Effects of synonymous mutations beyond codon bias: The evidence for adaptive synonymous substitutions from microbial evolution experiments. Genome Biol Evol. 2021;13:141. https://doi.org/10.1093/gbe/evab141.

    Article  CAS  Google Scholar 

  5. Chaney JL, Clark PL. Roles for synonymous codon usage in protein biogenesis. Annu Rev Biophys. 2015;44:143–66. https://doi.org/10.1146/annurev-biophys-060414-034333.

    Article  CAS  PubMed  Google Scholar 

  6. Yao H, Chen M, Tang Z. Analysis of synonymous codon usage bias in Flaviviridae virus. Biomed Res Int. 2019. https://doi.org/10.1155/2019/5857285.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Pakrashi A, Patidar A, Singha D, Kumar V, Tyagi K. Comparative analysis of the two suborders of Thysanoptera and characterization of the complete mitochondrial genome of Thrips parvispinus. Arch Insect Biochem Physiol. 2023. https://doi.org/10.1002/arch.22010.

    Article  PubMed  Google Scholar 

  8. Wang H, Liu S, Lv Y, Wei W. Codon usage bias of Venezuelan equine encephalitis virus and its host adaption. Virus Res. 2023;328:199081. https://doi.org/10.1016/j.virusres.2023.199081.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Zhao ZY, Yu D, Ji CM, Zheng Q, Huang YW, Wang B. Comparative analysis of newly identified rodent arteriviruses and porcine reproductive and respiratory syndrome virus to characterize their evolutionary relationships. Front Vet Sci. 2023;10:1174031. https://doi.org/10.3389/fvets.2023.1174031.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Alqahtani T, Khandia R, Puranik N, Alqahtani AM, Chidambaram K, Kamal MA. Codon usage is influenced by compositional constraints in genes associated with dementia. Front Genet. 2022;13:884348. https://doi.org/10.3389/fgene.2022.884348.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Chen F, Yang JR. Distinct codon usage bias evolutionary patterns between weakly and strongly virulent respiratory viruses. iScience. 2022;25:103682. https://doi.org/10.1016/j.isci.2021.103682.

    Article  CAS  PubMed  Google Scholar 

  12. Iriarte A, Lamolle G, Musto H. Codon usage bias: an endless tale. J Mol Evol. 2021;89:589–93. https://doi.org/10.1007/s00239-021-10027-z.

    Article  CAS  PubMed  Google Scholar 

  13. Khandia R, Saeed M, Alharbi AM, Ashraf GM, Greig NH, Kamal MA. Codon usage bias correlates with gene length in neurodegeneration associated genes. Front Neurosci. 2022;16:895607. https://doi.org/10.3389/fnins.2022.895607.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Bhattacharyya D, Uddin A, Das S, Chakraborty S. Mutation pressure and natural selection on codon usage in chloroplast genes of two species in Pisum L. (Fabaceae: Faboideae). Mitochondrial DNA A DNA Mapp Seq Anal. 2019;30:664–73. https://doi.org/10.1080/24701394.2019.1616701.

    Article  CAS  PubMed  Google Scholar 

  15. Hu H, Dong B, Fan X, Wang M, Wang T, Liu Q. Mutational bias and natural selection driving the synonymous codon usage of single-exon genes in rice (Oryza sativa L.). Rice (N Y). 2023;16:11. https://doi.org/10.1186/s12284-023-00627-2.

    Article  CAS  PubMed  Google Scholar 

  16. Matsushita T, Kano-Sueoka T. Non-random codon usage of synonymous and non-synonymous mutations in the human HLA-A gene. J Mol Evol. 2023;91:169–91. https://doi.org/10.1007/s00239-023-10093-5.

    Article  CAS  PubMed  Google Scholar 

  17. Shen G, Gao M, Cao Q, Li W. The molecular basis of FIX deficiency in hemophilia B. Int J Mol Sci. 2022;23:2762. https://doi.org/10.3390/ijms23052762.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Shen X, Song S, Li C, Zhang J. Synonymous mutations in representative yeast genes are mostly strongly non-neutral. Nature. 2022;606:725–31. https://doi.org/10.1038/s41586-022-04823-w.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Wang W, Blenner MA. Engineering heterologous enzyme secretion in Yarrowia lipolytica. Microb Cell Fact. 2022;21:134. https://doi.org/10.1186/s12934-022-01863-9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Yu D, Zhao ZY, Yang YL, Qin Y, Pan D, Yuan LX, et al. The origin and evolution of emerged swine acute diarrhea syndrome coronavirus with zoonotic potential. J Med Virol. 2023;95:e28672. https://doi.org/10.1002/jmv.28672.

    Article  CAS  PubMed  Google Scholar 

  21. Chassalevris T, Chaintoutis SC, Apostolidi ED, Giadinis ND, Vlemmas I, Brellou GD, et al. A highly sensitive semi-nested real-time PCR utilizing oligospermine-conjugated degenerate primers for the detection of diverse strains of small ruminant lentiviruses. Mol Cell Probes. 2020;51:101528. https://doi.org/10.1016/j.mcp.2020.101528.

    Article  CAS  PubMed  Google Scholar 

  22. Manjunath LE, Singh A, Som S, Eswarappa SM. Mammalian proteome expansion by stop codon readthrough. Wiley Interdiscip Rev RNA. 2023;14:e1739. https://doi.org/10.1002/wrna.1739.

    Article  CAS  PubMed  Google Scholar 

  23. Vaz PK, Armat M, Hartley CA, Devlin JM. Codon pair bias deoptimization of essential genes in infectious laryngotracheitis virus reduces protein expression. J Gen Virol. 2023. https://doi.org/10.1099/jgv.0.001836.

    Article  PubMed  Google Scholar 

  24. Bu Y, Wu X, Sun N, Man Y, Jing Y. Codon usage bias predicts the functional MYB10 gene in Populus. J Plant Physiol. 2021;265:153491. https://doi.org/10.1016/j.jplph.2021.153491.

    Article  CAS  PubMed  Google Scholar 

  25. Gorlov IP, Pikielny CW, Frost HR, Her SC, Cole MD, Strohbehn SD, et al. Gene characteristics predicting missense, nonsense and frameshift mutations in tumor samples. BMC Bioinformatics. 2018;19:430. https://doi.org/10.1186/s12859-018-2455-0.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Hernandez-Alias X, Benisty H, Radusky LG, Serrano L, Schaefer MH. Using protein-per-mRNA differences among human tissues in codon optimization. Genome Biol. 2023;24:34. https://doi.org/10.1186/s13059-023-02868-2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Dilucca M, Pavlopoulou A, Georgakilas AG, Giansanti A. Codon usage bias in radioresistant bacteria. Gene. 2020;742:144554. https://doi.org/10.1016/j.gene.2020.144554.

    Article  CAS  PubMed  Google Scholar 

  28. Hou W. Characterization of codon usage pattern in SARS-CoV-2. Virol J. 2020;17:138. https://doi.org/10.1186/s12985-020-01395-x.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Li G, Zhang L, Xue P. Codon usage divergence in Delta variants (B.1.617.2) of SARS-CoV-2. Infect Genet Evol. 2022;97:105175. https://doi.org/10.1016/j.meegid.2021.105175.

    Article  CAS  PubMed  Google Scholar 

  30. Wang W, Huang P, Jiang N, Lu H, Zhang D, Wang D, et al. A thioredoxin homologous protein of Plasmodium falciparum participates in erythrocyte invasion. Infect Immun. 2018;86:e00289-e318. https://doi.org/10.1128/IAI.00289-18.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Wu Y, Jin L, Li Y, Zhang D, Zhao Y, Chu Y, et al. The nucleotide usages significantly impact synonymous codon usage in Mycoplasma hyorhinis. J Basic Microbiol. 2021;61:133–46. https://doi.org/10.1002/jobm.202000592.

    Article  CAS  PubMed  Google Scholar 

  32. Ahmadpour E, Rahimi MT, Ghojoghi A, Rezaei F, Hatam-Nahavandi K, Oliveira SMR, et al. Toxoplasma gondii infection in marine animal species, as a potential source of food contamination: a systematic review and meta-analysis. Acta Parasitol. 2022;67:592–605. https://doi.org/10.1007/s11686-021-00507-z.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Ayana D, Temesgen K, Kumsa B, Alkadir G. Dry season Eimeria infection in dairy cattle and sheep in and around Adama and Bishoftu Towns, Oromia, Ethiopia. Vet Med (Auckl). 2022;13:235–45. https://doi.org/10.2147/VMRR.S377017.

    Article  PubMed  Google Scholar 

  34. Daily JP, Minuti A, Khan N. Diagnosis, treatment, and prevention of malaria in the US: a review. JAMA. 2022;328:460–71. https://doi.org/10.1001/jama.2022.12366.

    Article  PubMed  Google Scholar 

  35. Guven E, Akyuz M, Kirman R, Balkaya I, Avcioglu H. Zoonotic Babesia microti infection in wild rodents in Erzurum province, Northeastern Turkey. Zoonoses Public Health. 2022;69:875–83. https://doi.org/10.1111/zph.12983.

    Article  CAS  PubMed  Google Scholar 

  36. Huang M, Yin Y, Shi K, Zhang H, Cao X, Song X. Neospora caninum seroprevalence in water buffaloes in Guangxi, China. Anim Biotechnol. 2022. https://doi.org/10.1080/10495398.2022.2126369.

    Article  PubMed  Google Scholar 

  37. Murnik LC, Daugschies A, Delling C. Cryptosporidium infection in young dogs from Germany. Parasitol Res. 2022;121:2985–93. https://doi.org/10.1007/s00436-022-07632-2.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Sojka D, Jalovecká M, Perner J. Babesia, Theileria, Plasmodium and hemoglobin. Microorganisms. 2022;10:1651. https://doi.org/10.3390/microorganisms10081651.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Afriat A, Zuzarte-Luís V, Bahar Halpern K, Buchauer L, Marques S, Chora ÂF, et al. A spatiotemporally resolved single-cell atlas of the Plasmodium liver stage. Nature. 2022;611:563–9. https://doi.org/10.1038/s41586-022-05406-5.

    Article  CAS  PubMed  Google Scholar 

  40. Dear JD, Birkenheuer A. Babesia in North America: an update. Vet Clin North Am Small Anim Pract. 2022;52:1193–209. https://doi.org/10.1016/j.cvsm.2022.07.016.

    Article  PubMed  Google Scholar 

  41. Gondim LFP, McAllister MM. Experimental Neospora caninum infection in pregnant cattle: different outcomes between inoculation with tachyzoites and oocysts. Front Vet Sci. 2022;9:911015. https://doi.org/10.3389/fvets.2022.911015.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Melo LRB, Sousa LC, Lima BA, Silva ALP, Lima EF, Ferreira LC, et al. The diversity of Eimeria spp. in cattle in the Brazilian semiarid region. Rev Bras Parasitol Vet. 2022;31:e006422. https://doi.org/10.1590/S1984-29612022037.

    Article  PubMed  Google Scholar 

  43. Scorza AV, Tyrrell P, Wennogle S, Chandrashekar R, Lappin MR. Experimental infection of cats with Cryptosporidium felis. J Feline Med Surg. 2022;24:1060–4. https://doi.org/10.1177/1098612X211053477.

    Article  PubMed  Google Scholar 

  44. Teimouri A, Goudarzi F, Goudarzi K, Alimi R, Sahebi K, Foroozand H, et al. Toxoplasma gondii infection in immunocompromised patients in Iran (2013–2022): a systematic review and meta-analysis. Iran J Parasitol. 2022;17:443–57. https://doi.org/10.18502/ijpa.v17i4.11271.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Han H, Dong H, Zhu S, Zhao Q, Jiang L, Wang Y, et al. Molecular characterization and analysis of a novel protein disulfide isomerase-like protein of Eimeria tenella. PLoS ONE. 2014;9:e99914. https://doi.org/10.1371/journal.pone.0099914.

    Article  CAS  PubMed  Google Scholar 

  46. Mfeka MS, Martínez-Oyanedel J, Chen W, Achilonu I, Syed K, Khoza T. Comparative analyses and structural insights of new class glutathione transferases in Cryptosporidium species. Sci Rep. 2020;10:20370. https://doi.org/10.1038/s41598-020-77233-5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Piao X, Ma Y, Liu S, Hou N, Chen Q. A novel thioredoxin-like protein of Babesia microti involved in parasite pathogenicity. Front Cell Infect Microbiol. 2022;12:826818. https://doi.org/10.3389/fcimb.2022.826818.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Shahzad M, Garg R, Yadav S, Devi A, Ram H, Banerjee PS. Comparative evaluation of Babesia bigemina truncated C-terminal rhoptry associated protein-1 and 200 kDa merozoite protein in indirect enzyme-linked immunosorbent assay. Ticks Tick Borne Dis. 2021;12:101783. https://doi.org/10.1016/j.ttbdis.2021.101783.

    Article  PubMed  Google Scholar 

  49. Venancio Brochi JC, Pereira LM, Yatsuda AP. Extracellular H2O2, peroxiredoxin, and glutathione reductase alter Neospora caninum invasion and proliferation in Vero cells. Exp Parasitol. 2022;242:108381. https://doi.org/10.1016/j.exppara.2022.108381.

    Article  CAS  PubMed  Google Scholar 

  50. Wang Q, Lyu X, Cheng J, Fu Y, Lin Y, Abdoulaye AH, et al. Codon usage provides insights into the adaptive evolution of mycoviruses in their associated fungi host. Int J Mol Sci. 2022;23:7441. https://doi.org/10.3390/ijms23137441.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Zhang ZW, Li TT, Wang JL, Liang QL, Zhang HS, Sun LX, et al. Functional characterization of two thioredoxin proteins of Toxoplasma gondii using the CRISPR-Cas9 system. Front Vet Sci. 2021;7:614759. https://doi.org/10.3389/fvets.2020.614759.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Boissinot S. On the base composition of transposable elements. Int J Mol Sci. 2022;23:4755. https://doi.org/10.3390/ijms23094755.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Zhou H, Ren R, Yau SS. Utilizing the codon adaptation index to evaluate the susceptibility to HIV-1 and SARS-CoV-2 related coronaviruses in possible target cells in humans. Front Cell Infect Microbiol. 2023;12:1085397. https://doi.org/10.3389/fcimb.2022.1085397.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Masłowska-Górnicz A, van den Bosch MRM, Saccenti E, Suarez-Diez M. A large-scale analysis of codon usage bias in 4868 bacterial genomes shows association of codon adaptation index with GC content, protein functional domains and bacterial phenotypes. Biochim Biophys Acta Gene Regul Mech. 2022;1865:194826. https://doi.org/10.1016/j.bbagrm.2022.194826.

    Article  CAS  PubMed  Google Scholar 

  55. Carpentier F, Rodríguez de la Vega RC, Jay P, Duhamel M, Shykoff JA, Perlin MH, et al. Tempo of degeneration across independently evolved nonrecombining regions. Mol Biol Evol. 2022;39:msac060. https://doi.org/10.1093/molbev/msac060.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Tyagi A, Nagar V. Genome dynamics, codon usage patterns and influencing factors in Aeromonas hydrophila phages. Virus Res. 2022;320:198900. https://doi.org/10.1016/j.virusres.2022.198900.

    Article  CAS  PubMed  Google Scholar 

  57. Munjal A, Khandia R, Shende KK, Das J. Mycobacterium lepromatosis genome exhibits unusually high CpG dinucleotide content and selection is key force in shaping codon usage. Infect Genet Evol. 2020;84:104399. https://doi.org/10.1016/j.meegid.2020.104399.

    Article  CAS  PubMed  Google Scholar 

  58. Khandia R, Singhal S, Kumar U, Ansari A, Tiwari R, Dhama K, et al. Analysis of Nipah virus codon usage and adaptation to hosts. Front Microbiol. 2019;10:886. https://doi.org/10.3389/fmicb.2019.00886.

    Article  PubMed  Google Scholar 

  59. Yang C, Zhao Q, Wang Y, Zhao J, Qiao L, Wu B, et al. Comparative analysis of genomic and transcriptome sequences reveals divergent patterns of codon bias in wheat and its ancestor species. Front Genet. 2021;12:732432. https://doi.org/10.3389/fgene.2021.732432.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Beelagi MS, Kumar SS, Indrabalan UB, Patil SS, Prasad A, Suresh KP, et al. Synonymous codon usage pattern among the S, M, and L segments in Crimean-congo hemorrhagic fever virus. Bioinformation. 2021;17:479–91. https://doi.org/10.6026/97320630017479.

    Article  PubMed  PubMed Central  Google Scholar 

  61. Patil SS, Indrabalan UB, Suresh KP, Shome BR. Analysis of codon usage bias of classical swine fever virus. Vet World. 2021;14:1450–8. https://doi.org/10.14202/vetworld.2021.1450-1458.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Huang X, Jiao Y, Guo J, Wang Y, Chu G, Wang M. Analysis of codon usage patterns in Haloxylon ammodendron based on genomic and transcriptomic data. Gene. 2022;845:146842. https://doi.org/10.1016/j.gene.2022.146842.

    Article  CAS  PubMed  Google Scholar 

  63. Kumar U, Khandia R, Singhal S, Puranik N, Tripathi M, Pateriya AK, et al. Insight into codon utilization pattern of tumor suppressor gene EPB41L3 from different mammalian species indicates dominant role of selection force. Cancers (Basel). 2021;13:2739. https://doi.org/10.3390/cancers13112739.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Liu H, Lu Y, Lan B, Xu J. Codon usage by chloroplast gene is bias in Hemiptelea davidii. J Genet. 2020;99:8.

    Article  PubMed  Google Scholar 

  65. Chakraborty S, Yengkhom S, Uddin A. Analysis of codon usage bias of chloroplast genes in Oryza species: codon usage of chloroplast genes in Oryza species. Planta. 2020;252:67. https://doi.org/10.1007/s00425-020-03470-7.

    Article  CAS  PubMed  Google Scholar 

  66. Hershberg R, Petrov DA. Selection on codon bias. Annu Rev Genet. 2008;42:287–99. https://doi.org/10.1146/annurev.genet.42.110807.091442.

    Article  CAS  PubMed  Google Scholar 

  67. Liu XY, Li Y, Ji KK, Zhu J, Ling P, Zhou T, et al. Genome-wide codon usage pattern analysis reveals the correlation between codon usage bias and gene expression in Cuscuta australis. Genomics. 2020;112:2695–702. https://doi.org/10.1016/j.ygeno.2020.03.002.

    Article  CAS  PubMed  Google Scholar 

  68. Yu X, Liu J, Li H, Liu B, Zhao B, Ning Z. Comprehensive analysis of synonymous codon usage bias for complete genomes and E2 gene of atypical porcine Pestivirus. Biochem Genet. 2021;59:799–812. https://doi.org/10.1007/s10528-021-10037-y.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Li H, Sun L, Jiang Y, Wang B, Wu Z, Sun J, et al. Identification and characterization of Eimeria tenella EtTrx1 protein. Vet Parasitol. 2022;310:109785. https://doi.org/10.1016/j.vetpar.2022.109785.

    Article  CAS  PubMed  Google Scholar 

  70. Lu J, Wei N, Cao J, Zhou Y, Gong H, Zhang H, et al. Evaluation of enzymatic activity of Babesia microti thioredoxin reductase (Bmi TrxR)-mutants and screening of its potential inhibitors. Ticks Tick Borne Dis. 2021;12:101623. https://doi.org/10.1016/j.ttbdis.2020.101623.

    Article  PubMed  Google Scholar 

  71. Narayan A, Mastud P, Thakur V, Rathod PK, Mohmmed A, Patankar S. Heterologous expression in Toxoplasma gondii reveals a topogenic signal anchor in a Plasmodium apicoplast protein. FEBS Open Bio. 2018;8:1746–62. https://doi.org/10.1002/2211-5463.12527.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  72. Song X, Yang X, Xue Y, Yang C, Wu K, Liu J, et al. Glutaredoxin 1 deficiency leads to microneme protein-mediated growth defects in Neospora caninum. Front Microbiol. 2020;11:536044. https://doi.org/10.3389/fmicb.2020.536044.

    Article  PubMed  PubMed Central  Google Scholar 

  73. Temesgen TT, Tysnes KR, Robertson LJ. Use of oxidative stress responses to determine the efficacy of inactivation treatments on Cryptosporidium oocysts. Microorganisms. 2021;9:1463. https://doi.org/10.3390/microorganisms9071463.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  74. Tiwari S, Sharma N, Sharma GP, Mishra N. Redox interactome in malaria parasite Plasmodium falciparum. Parasitol Res. 2021;120:423–34. https://doi.org/10.1007/s00436-021-07051-9.

    Article  PubMed  Google Scholar 

  75. Benisty H, Hernandez-Alias X, Weber M, Anglada-Girotto M, Mantica F, Radusky L, et al. Genes enriched in A/T-ending codons are co-regulated and conserved across mammals. Cell Syst. 2023;14:312-323.e3. https://doi.org/10.1016/j.cels.2023.02.002.

    Article  CAS  PubMed  Google Scholar 

  76. Lamolle G, Iriarte A, Musto H. Codon usage in the flatworm Schistosoma mansoni is shaped by the mutational bias towards A+T and translational selection, which increases GC-ending codons in highly expressed genes. Mol Biochem Parasitol. 2021;247:111445. https://doi.org/10.1016/j.molbiopara.2021.111445.

    Article  CAS  PubMed  Google Scholar 

  77. Prabha R, Singh DP, Sinha S, Ahmad K, Rai A. Genome-wide comparative analysis of codon usage bias and codon context patterns among cyanobacterial genomes. Mar Genomics. 2017;32:31–9. https://doi.org/10.1016/j.margen.2016.10.001.

    Article  PubMed  Google Scholar 

  78. Pepe D, de Keersmaecker K. Codon bias analyses on thyroid carcinoma genes. Minerva Endocrinol. 2020;45:295–305. https://doi.org/10.23736/S0391-1977.20.03252-6.

    Article  PubMed  Google Scholar 

  79. Rakwong P, Keawchana N, Ngasaman R, Kamyingkird K. Theileria infection in bullfighting cattle in Thailand. Vet World. 2022;15:2917–21. https://doi.org/10.14202/vetworld.2022.2917-2921.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  80. Cepeda AS, Andreína Pacheco M, Escalante AA, Alzate JF, Matta NE. The apicoplast of Haemoproteus columbae: a comparative study of this organelle genome in Haemosporida. Mol Phylogenet Evol. 2021;161:107185. https://doi.org/10.1016/j.ympev.2021.107185.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

This research was funded by Natural Science Foundation of Liaoning Province of China (2022-BS-323, 2022-BS-325); 2021 Youth Science and Technology Talents Support Plan from Boze Project of Jinzhou Medical University (JYBZQT2109); National College Students Innovation and Entrepreneurship Training Program (X202210160022, X202210160050).

Author information

Authors and Affiliations

Authors

Contributions

DW and BY contributed to conception and design of the study. DW and BY performed the statistical analysis. DW wrote the first draft of the manuscript. BY wrote sections of the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version.

Corresponding author

Correspondence to Baoling Yang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Sources of the coding sequence in apicomplexan protozoa Trxs.

Additional file 2: Table S2.

Codon base composition in apicomplexan protozoa Trxs.

Additional file 3: Table S3.

Relative synonymous codon usage in apicomplexan protozoa Trxs.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, D., Yang, B. Analysis of codon usage bias of thioredoxin in apicomplexan protozoa. Parasites Vectors 16, 431 (2023). https://doi.org/10.1186/s13071-023-06002-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13071-023-06002-w

Keywords