Tissue-specific transcriptomes of Anisakis simplex (sensu stricto) and Anisakis pegreffii reveal potential molecular mechanisms involved in pathogenicity
Parasites & Vectorsvolume 11, Article number: 31 (2018)
Larval stages of the sibling species of parasitic nematodes Anisakis simplex (sensu stricto) (s.s.) (AS) and Anisakis pegreffii (AP) are responsible for a fish-borne zoonosis, known as anisakiasis, that humans aquire via the ingestion of raw or undercooked infected fish or fish-based products. These two species differ in geographical distribution, genetic background and peculiar traits involved in pathogenicity. However, thus far little is known of key molecules potentially involved in host-parasite interactions. Here, high-throughput RNA-Seq and bioinformatics analyses of sequence data were applied to the characterization of the whole sets of transcripts expressed by infective larvae of AS and AP, as well as of their pharyngeal tissues, in a bid to identify transcripts potentially involved in tissue invasion and host-pathogen interplay.
Approximately 34,000,000 single-end reads were generated from cDNA libraries for each species. Transcripts identified in AS and AP encoded 19,403 and 10,424 putative peptides, respectively, and were classified based on homology searches, protein motifs, gene ontology and biological pathway mapping. Differential gene expression analysis yielded 226 and 339 transcripts upregulated in the pharyngeal regions of AS and AP, respectively, compared with their corresponding whole-larvae datasets. These included proteolytic enzymes, molecules encoding anesthetics, inhibitors of primary hemostasis and virulence factors, anticoagulants and immunomodulatory peptides.
This work provides the scientific community with a list of key transcripts expressed by AS and AP pharyngeal tissues and corresponding annotation information which represents a ready-to-use resource for future functional studies of biological pathways specifically involved in host-parasite interplay.
Food-borne diseases are caused by a variety of chemical insults and pathogenic organisms, including parasites, with Anisakis spp. being the only fish-borne parasites able to trigger an allergic response in humans . Species of Anisakis are indeed responsible for a relatively poorly known food-borne zoonosis, known as ‘anisakiasis’, that occurs in large areas of the globe, including Japan and other easternmost regions, as well as the Netherlands, Germany, France, Spain, Croatia  and Italy, amongst others .
Anisakis spp. (Ascaridoidea: Anisakidae) are nematodes with a cosmopolitan distribution whose life-cycle depends on aquatic hosts . Definitive and intermediate hosts are marine mammals and crustaceans, respectively, while fish and squids can act as paratenic hosts, harbouring infective third-stage larvae mostly in their body cavities. However, larvae are often found in fish muscles (fillets), where they migrate before the death of the host . The occurrence of larval nematodes in fish fillets is of particular medical and economic concern; indeed, beside the effects on the marketability of marine products, Anisakis third-stage larvae (L3 s) are the causative agents of a human disease known as anisakiasis. This occurs as a consequence of accidental ingestion of L3 s, and consists of a mild to severe disease classified as gastric anisakiasis (GA), intestinal (IA) and extraintestinal anisakiasis depending on the localization of the larva [6, 7]. In addition, infection with Anisakis spp. may cause sensitisation to parasite allergens [8, 9] that, following subsequent exposure, can result in a variety of systemic reactions . Moreover, Anisakis spp. have been recently observed in the same localization with gastro-intestinal tumors [11,12,13]. While, thus far most reports of anisakiasis originate from regions of the world where consumption of raw or undercooked fish is common (e.g. Japan) , the global prevalence of gastrointestinal and allergic anisakiasis is likely to be severely underestimated, particularly because of the intrinsic limitations of currently available diagnostic tools.
Anisakis simplex (Rudolphi, 1809) (sensu stricto) (s.s.) is responsible for almost all of the reported human cases of anisakiasis in Japan, whereas its sibling species, Anisakis pegreffii Campana-Rouget & Biocca, 1955, is responsible for most cases of anisakiasis in southern Europe . Anisakis simplex (s.s.) displays higher tolerance for artificial gastric juice and acid [14, 15] and penetrates both agar and fish muscle [15, 16], as well as digestive tract tissues of Wistar rats  more efficiently than A. pegreffii. Interestingly, recent comparative analyses of the transcriptomes and proteomes of A. simplex (s.s.) and A. pegreffii larvae revealed both qualitative and quantitative differences of the potential allergens responsible for the onset of allergic anisakiasis [18, 19], thus calling on more investigations into potential adverse effects elicited by these two species.
Indeed, in spite of growing concerns for public health due to anisakiasis, the molecular mechanisms responsible for the pathogenicity of Anisakis spp. remain largely unknown. Pharyngeal excretory glands of the Anisakis larval stages have long been hypothesized to be implicated in such mechanisms through the release of proteolytic enzymes [20,21,22], and data from other parasitic nematodes of the same superfamily (Ascaridoidea) suggest that peptidases could play key roles in biological pathways linked to fundamental host-pathogen interactions . However, thus far and to the best of our knowledge, no data are available on the molecules transcribed by the pharynx of Anisakis spp. The identification of these molecules and the characterization of their expression profiles compared to other larval tissues may provide clues as to their role(s) in biological pathways associated with the pathogenicity of these parasites. Therefore, in the present study, an in-depth analysis of differential gene expression between the whole larva and the pharyngeal tissues of both A. simplex (s.s.) and A. pegreffii was carried out, transcriptomes were obtained and molecules putatively responsible for the pathogenicity of these two species were identified.
Three fish species were collected and analyzed between 2015 and 2016: the Atlantic mackerel Scomber japonicus from the major FAO 27 fishing area (North East Atlantic region), the European anchovy Engraulis encrasicolus and the European pilchard Sardina pilchardus from the FAO 37 (Mediterranean). Viscera and fillets of each fish were visually inspected under a stereomicroscope for the detection of Anisakis larvae. Parasites were collected, washed and stored in sterile PBS for subsequent dissection of the pharyngeal tissues (defined as the apical portion starting with the linear pharynx and continuing with the globular ventricular oesophageal appendix (see Fig. 1) henceforth designated as ‘PX’). Tissues were dissected under a stereomicroscope and the remainder of the larva (‘WL’) was stored separately for further DNA and RNA extractions (see below). Samples corresponding to PX and WL of each larva were homogenized and processed using the TRIsure reagent (Bioline, London, UK), according to the manufacturer’s instructions, for simultaneous isolation of DNA and RNA, thus overcoming potential biases due to partial removal of larval tissues or organs.
Genomic DNA was used as a template for molecular identification of parasite species, based on PCR-RFLP analysis of the nuclear ribosomal marker internal transcribed spacer (ITS) according to published diagnostic keys .
RNA extraction and quality check
Total RNA was extracted using the TRIsure™ reagent (Bioline) and contaminating genomic DNA was removed using the DNAse in the Turbo DNA-free kit (Life Technologies, Waltham, USA). Three independent biological replicates were analysed, each one consisting of total RNA extracted from three WL and ten PX of each A. simplex (s.s.) (‘AS’) and A. pegreffii (‘AP’), respectively. The quality of the extracted RNA was verified visually on agarose gel, while resulting quantities were measured using the Take3 Module of Synergy HT Multi-Detection Microplate Reader (Biotek, Winooski, USA) and a Bioanalyzer 2100 (Agilent Technologies, Waldbronn, Germany).
cDNA library construction and RNA-sequencing
RNA strand-specific libraries were created using the New England Biolabs Kit (NEBNext® Ultra™, Ipswich, USA), according to manufacturer’s instructions (Illumina, San Diego, California). In brief, polyadenylated (PolyA+) RNA was purified using the NEB (NEBNext® Poly(A) mRNA Magnetic Isolation Module) from 3 μg of total RNA from each WL and PX sample, fragmented and reverse transcribed into cDNA. Following fragmentation of the mRNA, first and dUTP-based second strand synthesis were carried out, followed by end-repair, A-tailing, ligation of the indexed Illumina adapters and digestion of the dUTP-strand. Ligated products of 150–400 bp were excised from agarose gels and PCR amplified. Products were cleaned using a MinElute PCR purification kit (Qiagen, Hilden, Germany) and the quality of each cDNA library was verified on an Agilent 2100 Bioanalyzer. For RNA-Seq, pooled libraries were single-end sequenced on a HiSeq2500 platform (Illumina) using HiSeq Flow Cell v4 and 125 bp read length.
Bioinformatics analyses of sequence data
Raw reads were trimmed and filtered for Illumina adaptor sequences, sequences with suboptimal read quality (i.e. PHRED score ≥ 25.0) and sequences shorter than 125 bp using Trimmomatic v.0.36 .
For AS, the high-quality single-end reads representing both WL and PX were mapped to the available A. simplex reference genome (Assembly GCA_000951095.1) with BWA-MEM version 0.7.12-r1039  with default parameters. Resulting alignments were extracted and information linked to genome regions to which reads were mapped (e.g. intron-exon boundaries) were obtained from the GTF annotation file available from the WormBase repository database at http://parasite.wormbase.org/Anisakis_simplex_prjeb496/. Then, the number of raw reads mapping to each A. simplex gene was calculated using the Rsubread package in R (https://www.r-project.org/). Transcripts differentially expressed in WL and PX were identified by DESeq2 package in R (|logFC| > 2). In order to control for errors associated with multiple pairwise comparisons, a false-discovery rate correction (FDR < 0.05) was applied to the data set .
For AP, clean reads obtained from high-throughput sequencing of WL and PX were pooled and assembled de novo using the Trinity software . Subsequently, reads from each WL and PX were individually aligned to the assembled transcriptome using Bowtie, and the relative expression of each transcript was estimated via the RSEM package. The trimmed mean of M-values normalization method (TMM) was used for cross sample normalization, and the normalized expression of each transcript was presented as transcripts per million (TPM). Assembled transcripts with < 10 aligned reads were eliminated from subsequent analyses (http://deweylab.biostat.wisc.edu/rsem/). Differentially expressed transcripts between WL and PX were identified by DESeq2 package in R (|logFC| > 2; FDR < 0.05).
Putative ortholog transcripts shared between AS and AP were identified using a reciprocal best hit analysis on the longest ORF, and by retaining only the sequences aligned with a continuous region covering at least 30% of the query sequence . To better define the secretory nature of predicted proteins in these tissue-specific repertoires, we refined the search of predicted signal peptides in both oesophageal and whole body-enriched catalogues. Among enriched subsets, only contigs encoding for peptides starting with Met were retrieved and the probability to carry signal peptides was evaluated using the on-line prediction server SignalP .
Transcripts encoding for proteins belonging to five families, i.e. astacins, ShK toxin, EB cystein-rich modules, Kunitz and CRISPs (CAP superfamily), were selected amongst molecules upregulated in the PX of each AS and AP and subjected to further analyses using the BLAST search tool, Clustal Omega, Artemis and domain organization platform available from the EMBL-EBI Pfam website. These molecules were selected based on their documented roles in key mechanisms of host-parasite interactions .
For AP, transcript annotation was performed by comparing sequences to data available in the nucleotide sequence collection (Nt) of NCBI and in the non-redundant (Nr) (www.ncbi.nlm.nih.gov) and SwissProt (http://expasy.org/) databases using BLASTn and BLASTx algorithms, respectively. The software Trinotate (Transcriptome Functional Annotation and Analysis) (https://trinotate.github.io/) was used to extract annotation information from the eggNOG, KEGG and Gene Ontology databases using default settings. The coding regions of transcripts were predicted using TransDecoder .
Samples and species identification
A total of 85 anisakid larvae were collected, i.e. 31 from Atlantic mackerels, 32 from European anchovies and 22 from European pilchards. Of these, 31 larvae collected from the Atlantic mackerel were identified as AS, and 34 larvae collected from European anchovies and European pilchards were identified as AP following restriction analysis with HinfI . Twenty larvae collected from European pilchards were identified as Hysterothylacium aduncum and were thus excluded from the study. Three WL and PX samples from each AS and AP were subjected to high-throughput RNA-Seq.
The Anisakis simplex (s.s.) and A. pegreffii transcriptomes
A total of 32,859,191 and 37,606,372,125 bp reads were generated for each WL and PX of AS, while 23,397,005 and 36,382,470 reads were generated from WL and PX of AP, respectively (Table 1). Following pre-processing for quality, a total number of 29,011,662 (WL-AS) and 32,853,551 (PX-AS), 21,128,142 (WL-AP) and 32,711,755 (PX-AP) reads were retained for further analyses. Raw reads generated in the present study have been submitted in the Sequence Read Archive (SRA) database of NCBI (http://www. ncbi.nlm.nih.gov/sra) under accession number PRJNA374530.
Given the availability of a draft genome sequence for A. simplex  (Wormbase BioProject PRJEB496; http://parasite.wormbase.org/Anisakis_simplex_prjeb496/Info/Index/), raw reads generated for AS were mapped to the existing genome assembly sequences (Wormbase BioProject PRJEB496); approximately 90% of reads for AS-WL and AS-PX could be mapped to the available genome sequences, with 86% of these mapping only once. The 61,865,213 pooled reads obtained from both WL and PX of AS, mapped to a total of 19,403 genes (out of 20,971) in the AS genome. Conversely, due to the unavailability of a draft genome sequence for AP, raw reads generated from WL and PX samples for this species were assembled de novo (see Methods). The pooled 53,839,897 reads for AP-WL and AP-PX yielded, following quality-filtering, 21,195 contigs (Table 1). Shared transcripts (putative orthologs) between AS and AP were identified by pairwise BLASTP reciprocal best hit analysis (E-value cut-off: 1e-06; minimum hit coverage 30%), performed on the longest ORF identified for each transcript, resulting in a list of 4635 putative orthologs (Additional file 1: Table S1).
Transcript annotation and differential gene expression analysis
Annotation information linked to AS transcript-encoding genes were derived from data available from the WormBase repository database at http://parasite.wormbase.org/Anisakis_simplex_prjeb496/. For AP, a total number of 10,424 predicted peptides could be inferred from the 21,195 assembled transcripts (Table 1). Of these transcripts, ~35% could be annotated via BLAST searches against the nr, SwissProt, KEGG and GO databases (Table 1). Detailed annotation information for individual AP transcripts identified in this study is available in Additional file 2: Table S2.
For each AS and AP, complete lists of differentially expressed genes (DEGs) in WL and PX containing annotations informations are available in Additional file 3: Table S3 and Additional file 4: Table S4.
The search of signal peptides in both pharyngeal and whole larva enriched catalogues was carried out with the aim to better define the secretory nature of predicted proteins. In both AS and AP, the pharynx and the whole larva subsets (Table 2) were therefore sorted to retrieve only peptides starting with Met and the probability to carry signal peptides was evaluated using the SignalP Server (Table 2). Putative secreted proteins were detected in transcripts upregulated in the PX of each AS and AP (20 and 36% of the total number of upregulated transcripts, respectively) pointing out a secretory aptitude of pharyngeal repertoires. Totals of 16 and 22% of transcripts upregulated in WL of each AS and AP were predicted to encode a signal peptide.
Pfam enrichment analyses were also performed in both AS and AP by comparing the pharyngeal upregulated subset (FDR < 0.05) with the whole transcript sequence datasets (Table 3).
For AS, a total number of 335 differentially expressed genes were identified (Fig. 2a), of which 109 were upregulated in WL and 226 in PX (Additional file 2: Table S2, Additional file 4: Table S4). Seventy-one and 107 of these genes, respectively, were annotated with Pfam information (Additional file 2: Table S2, Additional file 4: Table S4). The remaining genes could not be annotated based on the information available in current public databases. Transcripts upregulated in WL encoded mostly enzymes with proteolytic activity, e.g. peptidases M1 and S10 (Additional file 2: Table S2). In the PX subset, 34.5% contigs encoded for proteins with putative roles in worm metabolism and physiology, including sugar transport, nervous system and motility. This reflects the biological function of this anatomical region, which includes, for instance, the nervous ring and the digestive oesophagous (see Fig. 1). A further 16% (36/226) of transcripts upregulated in the PX of this species encoded for (i) peptidases; (ii) members of the CAP superfamily, including proteins similar to known allergens from the venom of wasps, honeybees (such as histidine phosphatase superfamily members and cysteine-rich secretory proteins) and snakes (such as those carrying a pancreatic trypsin inhibitor Kunitz domain); and (iii) molecules with putative immune-modulatory function (e.g. leucine-rich repeat; immunoglobulin subtype, immunoglobulin-like domain, immunity-related GTPases-like) (Table 3). In particular, putative peptidases included (i) aspartic peptidases M1 (n = 1), (ii) astacin peptidase M12A (n = 5), (iii) peptidase M13 (n = 2), (iv) serine carboxypeptidase S10 (n = 1), as well as hemopexin-like metallopeptidases, carboxylesterases and ShKT Stichodactyla helianthus toxin (n = 3). Other predicted proteins upregulated in the PX subset with similarity to known allergens included molecules encoding for cysteine-rich domains, such as Kunitz, lustrins, trypsin inhibitors and ‘Clostridium epsilon toxin’; the latter was only detected only in the PX subset of AS (Table 3 and Additional file 1: Table S1).
A total of 502 DEGs were identified in AP, of which 163 were upregulated in WL and 339 in PX (Fig. 2b and Additional file 3: Table S3, Additional file 4: Table S4). Of these, 80 and 88 could be annotated with Pfam terms, while 83 and 100, respectively, matched amino acid sequences available in the NCBI nr database (Additional file 3: Table S3, Additional file 4: Table S4).
Amongst transcripts upregulated in the PX subset, 43 (48%) encoded for proteins with putative structural and motility functions (i.e. myosins and tropomyosins) or playing roles in worm physiology and metabolism. Similarly to AS, transcripts encoding for (i) peptidases, (ii) members of the CAP superfamily, and (iii) “toxins” were enriched in AP-PX subset. In particular, peptidases M1 (n = 1), M12A (n = 6) and M13 (n = 1), members of the CAP superfamily (n = 8) and peptides containing ‘ShK toxin’ domains (n = 2) were identified (Additional file 2: Table S2).
GO terms enrichment analysis was performed to compare groups of transcripts upregulated in the WL and PX of each AS and AP (cf. Additional file 5: Figure S1 and Additional file 6: Figure S2, respectively).
Putative orthologous transcripts encoding for proteins upregulated in the PX of both AS and AP, of interest based on their putative biological function in the tissues analysed, were characterized in further detail. Two transcripts in AP and one in AS encoded for putative ShK toxin peptides, with different domain organization as revealed by amino acid sequences alignments (Fig. 3). The first ShK peptide identified in AP (AP1) contained three adjacent ShK domains followed by a cysteine-rich portion at the N-terminus, in contrast to AS predicted ortholog that contain only two ShK domains. The second putative ShK peptide contained two adjacent domains in both AS and AP (Fig. 3). Interestingly, the C-terminus of ShKgene1 partially matched known allergens of A. simplex (s.l.), as revealed by BLAST analysis (Anis 7 GenBank accession number ABL77410, query coverage 93% and identity 25%; Anis 12 GenBank accession number AGC60031, query coverage 46% and identity 33%; Anis 14 GenBank accession number BAT62430, query coverage 71% and identity 34%). A transcript encoding for a similar ShK C-term could not be identified in AS using the Artemis tool, likely as a result of the current fragmentation of available genomic scaffolds.
ShK toxin domains were also found associated to astacin peptides (Fig. 4). In particular, five putative orthologous peptides encoding for astacin metalloendopeptidase were identified in AS and AP, with a different number of ShK modules (one or two, Fig. 5). Such modular structures were observed in other organisms: while 38% of sequences with only astacin domain retrieved in the Pfam database belonged to parasitic nematodes, around 80 and 83% of sequences with astacin domain associated to one or two ShK modules, respectively, were represented by parasitic nematodes.
Among transcripts encoding for cysteine-rich proteins, orthologous members of the CRISP-SCP family (CAP-superfamily) were detected in both AS and AP (Additional file 7: Figure S3). In particular, one transcript significantly upregulated in the PX of both species encoded for typical conserved residues of SCP-like proteins. Conversely, no differential expression was detected for the remaining transcripts encoding for the other three CRISP orthologues (alignments of peptides are available in Additional file 7: Figure S3).
Transcripts encoding putative proteins containing an EB module, often associated with Kunitz domains, were upregulated in the PX of both AS and AP: two orthologues were identified, with a further paralogue in AS (Additional file 8: Figure S4). In particular, five EB modules were identified in AP1 (Anisakis EB module 1), four in AS_EB1A and three in AS_EB1B (Additional file 8: Figure S4a) whereas, for Anisakis EB module 2, four adjacent EB modules were observed in both AP and AS (Additional file 8: Figure S4b).
Finally, transcripts encoding putative peptides carrying Kunitz domains were enriched in AS PX only. Two putative peptides, both containing conserved a cysteine-rich region known as lustrin, showed homology to “major allergen” Anis1 found in Anisakis, Ascaris and Toxocara (Additional file 9: Figure S5). One of the two peptides encompasses two Kunitz domains followed by two lustrin domains (Additional file 9: Figure S5a), while the second contained seven repeates of lustrin-Kunitz domains (Additional file 9: Figure S5b).
Parasitic nematodes infect more than one billion people globally, causing substantial suffering and massive economic losses . Anisakidosis is an emerging fish-borne zoonosis of major public health concern. Nevertheless, the impact of the disease is largely underestimated and our current understanding of the mechanisms of infection and clinical disease in humans is still scant.
While knowledge of the complex mechanisms underlying host-pathogen interplay in several parasitic nematodes is expanding [33,34,35], studies of the interactions between Anisakis and its accidental hosts are in their infancy [18, 19]. Previous evidence suggested that secretions from the pharyngeal subventral and esophageal dorsal glands of the parasites may play roles in the intraluminal and extracorporeal digestion of tissues of natural (intermediate or paratenic) hosts  and in the invasion of the gastrointestinal mucosa of human accidental hosts .
In an effort to investigate the nature and the potential role(s) of molecules expressed by the pharyngeal tissues of Anisakis spp. in pathways linked to pathogenicity, we analyzed the whole larval and pharyngeal transcriptomes of the two pathogenic sibling species AS and AP. The analysis of transcriptomes provided a number of transcripts similar to those inferred from previous studies of Anisakis spp. and of other non-model species of parasitic nematodes [19, 35, 36]. In addition, several shared transcripts were identified in these two species, albeit the total number of orthologous genes may be underestimated due to the unavailability of a draft genome sequence for A. pegreffii. The number of full-length peptides predicted from assembled transcript sequences for AP was significantly lower than the corresponding number for AS, thus indicating that a significant number of AP transcripts could not be assembled with confidence.
Nevertheless, comparative analyses of levels of gene transcription in the WL and PX of these two species led to the identification of groups of molecules with putative roles in mechanisms of parasite pathogenicity. Amongst these molecules, the cysteine-rich secretory proteins (CRISPs), belonging to the CAP superfamily (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins), were signifincatly upregulated in the PX of both AS and AP. Members of this family are secreted proteins with extracellular endocrine or paracrine functions, identified in several eukaryotes, including arthropods, flatworms and roundworms (reviewed by Cantacessi et al. ). In nematodes, members of this family are also known as Ancylostoma-secreted proteins or activation-associated secreted proteins (ASPs) (reviewed by Cantacessi et al. ). Increased transcription of ASPs is observed during the transition of the ensheathed, free living L3 of the dog hookworm, Ancylostoma caninum, to its exsheathed parasitic form . While the exact function(s) of nematode ASPs are yet to be uncovered, this observation suggests that these molecules may be involved in biological pathways associated with penetration of host tissues and/or interactions with the host immune system (cf. [35, 36]).
A group of proteins with roles in host tissue penetration and digestion encoded by transcripts were differentially expressed in the PX of both AS and AP is represented by the peptidases. In particular, transcripts encoding for metalloproteinases (i.e. aminopeptidases, astacins and neprilysins) were particularly abundant in AS. Aminopeptidases catalyze the removal of single amino acids from the amino N-terminus of small peptides, thus playing a role in their final digestion . In addition, members of this family hydrolyse the epoxide leukotriene-A4 to form an inflammatory mediator . The Anisakis orthologue might be involved in eliciting a host immune and/or inflammatory response. Astacins are implicated in the activation of growth factors, degradation of polypeptides, and processing of extracellular proteins, also sharing common features with serralysins, matrix metalloendopeptidases, and snake venom proteases . Astacins might also play a role as anticoagulants; indeed, the hematophagus mollusk Colubraria secretes Astacin/ShKT-containing proteins that may inhibit ion channels, interfere with hemostasis and facilitate the spreading of the toxins in the body of the prey by degrading extracellular matrix molecules and inactivating several endogenous vasoactive peptides . In the free-living nematode Caenorhabditis elegans, an astacin enzyme is expressed in the hypodermal tissue and is required for normal collagen secretion . Astacins are also found in the parasitic filarioid Brugia malayi and in the trichostrongylid Haemonchus contortus  where they play a role in collagen processing and cuticle formation. Proteases belonging to the M13 family include, amongst other proteins, the neprilysins, molecules found in a wide range of vertebrates (including mammals), and invertebrates (e.g. Drosophila and C. elegans [44, 45]). In mammals, these proteins are involved in cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of neuropeptides in the central nervous system, while their biological functions in nematodes are yet unclear. However, in experimentally infected mice, M13 peptidases secreted by Trichinella spiralis have been demonstrated to participate in the early onset of intestinal inflammation . Similar mechanisms may occur over the penetration of intestinal tissues by Anisakis larvae. Amongst other peptidase-encoding transcripts upregulated in the PX of both AS and AP and with putative roles in host tissue digestion, serine carboxypeptidases have been previously detected in Angiostrongylus cantonensis  and Strongyloides ratti  where, similarly to aminopeptidases, they also serve as digestive enzymes.
Molecules encoding for potential anaesthetics were also detected amongst transcripts differentially expressed in the PX of both AS and AP, including predicted peptides containing a ShK toxin domain. In venomous organisms, the ShKT is a signature of potent ion channel blockers , whereas the presence of this protein motif in oxidoreductases and prolyl hydroxylases of plants, and in astacin-like metalloproteases and trypsin-like serines proteases of helminths confers these enzymes potential roles in potassium channel-modulatory activities. However, in the zoonotic dog roundworm Toxocara canis, secreted mucins implicated in immune evasion mechanisms are characterized by the presence of evolutionarily distinct modules, i.e. the mucin and ShKT domains , which raises questions on the possible roles on Anisakis ShKT domain-encoding molecules in interactions between the parasite and the host immune system. Moreover, the association of astacin and ShK toxin domain appears to be particularly common in parasitic nematodes.
Besides transcripts of interest detected in both AS and AP, a relatively small number of molecules were exclusively upregulated in the PX of AS. These molecules included peptides encoding Kunitz domains, with trypsin inhibitor activity, and aspartic proteases. Proteins with Kunitz domains are degrading enzymes that act as protease inhibitors, with a functionally conserved role in cuticle formation in a diverse range of nematodes . In blood-sucking organisms such as ticks [51, 52] and molluscs , proteins containing Kunitz domains are possibly involved in the inhibition of thrombin and coagulation factors. Functions in mechanisms aimed to prevent attack by host proteolytic enzymes and coagulation factors in the host blood stream are suggested for the zoonotic tapeworm Echinococcus granulosus , and for the blood flukes Schistosoma japonicum and S. mansoni [54, 55]. Excretory/secretory products from A. simplex (s.l.) larvae contain molecules with anticoagulant activities that may play a role in the pathogenesis of anisakiasis in accidental hosts . In Anisakis, proteins with Kunitz domains with trypsin-like proteinase properties have been previously identified in studies examining the proteolytic enzymatic activity of this parasite [20, 57, 58]; their optimal activity was recorded at pH 7.5 (approximately the pH of the terminal ileum) and a host body temperature of 36–37 °C, thus supporting the hypothesis that these enzymes are activated during invasion of the intestine in the homeothermic hosts. In addition, one of the major known Anisakis allergens, i.e. Anis1, contains a Kunitz-domain . A transcript upregulated in AS PX showed high homology to this antigen (Additional file 9: Figure S5a, b). Indeed, it is possible to hypothesize that novel molecules other than the already known peptides with antigenic and pathogenic properties may elicit the inflammatory and immune response observed in humans. Aspartic proteases (pepsin-like) were also included amongst the upregulated AS molecules. The best known source of aspartic proteases is the stomach of mammals , and pepsin was identified as a positive regulator of cathepsin-B involved in the penetration of rat gut by Angiostrongylus cantonensis [60, 61]. Results from our study suggest that transcripts encoding selected Kunitz-domain containing proteins and aspartic proteases were exclusively expressed by AS. However, given that the set of AP transcripts were assembled de novo in the absence of a genome reference sequence, and the significantly lower number of predicted peptides, their presence in the genome and transcriptome of AP cannot be fully excluded.
By undertaking the first large-scale analysis of molecules expressed in the PX of Anisakis, i.e. the organ putatively involved in host recognition/adhesion/invasion, we herein provide the community with a ready-to-use molecular groundwork for in-depth studies of biological pathways specifically involved in host-parasite interplay. The identification of species- and/or tissue-specific molecules in two Anisakis species provides new information on the expression of genes potentially involved in host tissue invasion. The results obtained may assist in the characterization of properties or functions of these molecules, and their exploration as potential novel targets for treatments and control strategies . Further investigations using experimental in vitro controlled conditions as well as differential gene expression between species and in other developmental life cycle stages, are needed to reliably asses the roles of these moleculesin the pathogenesis of anisakiasis.
Anisakis simplex (s.s.)
Ancylostoma-secreted proteins or activation-associated secreted proteins
Superfamily of proteins (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins)
Count per million reads
Cysteine-rich secretory proteins
Differentially expressed genes
False discovery rate
Anisakis third-stage larvae
Trimmed mean of M-values normalization method
Transcripts per million
Panel EFSA. On biological hazards (BIOHAZ) scientific opinion on risk assessment of parasites in fishery products. EFSA J. 2010;8(4):1543.
Mladineo I, Popović M, Drmić-Hofman I, Poljak V. A case report of Anisakis pegreffii (Nematoda, Anisakidae) identified from archival paraffin sections of a Croatian patient. BMC Infect Dis. 2016;1(16):42.
Mattiucci S, D’Amelio S. Anisakiasis. In: Bruschi F, editor. Helminth infection and their impact on global public health. Vienna: Springer; 2014.
Anderson RC. Nematode parasites of vertebrates: their development and transmission. Wallingford, UK: CAB International; 1992.
Cipriani P, Smaldone G, Acerra V, D'Angelo L, Anastasio A, Bellisario B, et al. Genetic identification and distribution of the parasitic larvae of Anisakis pegreffii and Anisakis simplex s.s. in European hake Merluccius merluccius from the Tyrrhenian Sea and Spanish Atlantic coast: implications for food safety. Int J Food Microbiol. 2015;198:1–8.
Audicana MT, Ansotegui IJ, de Corres LF, Kennedy MW. Anisakis simplex: dangerous-dead and alive? Trends Parasitol. 2002;18(1):20.
Nogami Y, Fujii-Nishimura Y, Banno K, Suzuki A, Susumu N, Hibi T, et al. Anisakiasis mimics cancer recurrence: two cases of extragastrointestinal anisakiasis suspected to be recurrence of gynecological cancer on PET-CT and molecular biological investigation. BMC Med Imaging. 2016;16:31.
Mattiucci S, Fazii P, De Rosa A, Paoletti M, Megna AS, Glielmo A, et al. Anisakiasis and gastroallergic reactions associated with Anisakis pegreffii infection, Italy. Emerg Infect Dis. 2013;19(3):496.
Chung YB, Lee J. Clinical characteristics of gastroallergic anisakiasis and diagnostic implications of immunologic tests. Allergy Asthma Immunol Res. 2014;6(3):228.
Nieuwenhuizen NE. Anisakis - immunology of a foodborne parasitosis. Parasite Immunol. 2016;38(9):548.
Mineta S, Shimanuki K, Sugiura A, Tsuchiya Y, Kaneko M, Sugiyama Y, et al. Chronic anisakiasis of the ascending colon associated with carcinoma. J Nippon Med Sch. 2006;73(3):169.
Garcia-Perez JC, Rodríguez-Perez R, Ballestero A, Zuloaga J, Fernandez-Puntero B, Arias-Díaz J, Caballero ML. Previous exposure to the fish parasite Anisakis as a potential risk factor for gastric or colon adenocarcinoma. Medicine (Baltimore). 2015;94(40):e1699.
Sonoda H, Yamamoto K, Ozeki K, Inoye H, Toda S, Maehara Y. An Anisakis larva attached to early gastric cancer: report of a case. Surg Today. 2015;45(10):1321–5.
Quiazon KM, Yoshinaga T, Ogawa K. Experimental challenge of Anisakis simplex sensu stricto and Anisakis pegreffii (Nematoda: Anisakidae) in rainbow trout and olive flounder. Parasitol Int. 2011;60(2):126–31.
Arizono N, Yamada M, Tegoshi T, Yoshikawa M. Anisakis simplex sensu stricto and Anisakis pegreffii: biological characteristics and pathogenetic potential in human anisakiasis. Foodborne Pathog Dis. 2012;9(6):517–21.
Suzuki J, Murata R, Hosaka M, Araki J. Risk factors for human Anisakis infection and association between the geographic origins of Scomber japonicus and anisakid nematodes. Int J Food Microbiol. 2010;137(1):88–93.
del Carmen Romero M, Valero A, Navarro-Moll MC, Martín-Sánchez J. Experimental comparison of pathogenic potential of two sibling species Anisakis simplex s.s. and Anisakis pegreffii in Wistar rat. Tropical Med Int Health. 2013;18(8):979–84.
Arcos SC, Ciordia S, Roberston L, Zapico I, Jiménez-Ruiz Y, Gonzalez-Muñoz M, et al. Proteomic profiling and characterization of differential allergens in the nematodes Anisakis simplex sensu stricto and A. pegreffii. Proteomics. 2014;14(12):1547–68.
Baird FJ, Su X, Aibinu I, Nolan MJ, Sugiyama H, Otranto D, et al. The Anisakis transcriptome provides a resource for fundamental and applied studies on allergy-causing parasites. PLoS Negl Trop Dis. 2016;10(7):e0004845.
Matthews B. The source, release and specificity of proteolytic enzyme activity produced by Anisakis simplex larvae (Nematoda: Ascaridida) in vitro. J Helminthol. 1984;58:175–85.
Buzzell GR, Sommerville RI. The structure of the esophagus in the third-stage infective larva of Anisakis sp. (Nematoda: Anisakidae). Trans Am Microsc Soc. 1985;104(1):86–94.
Sakanari JA, McKerrow JH. Identification of the secreted neutral proteases from Anisakis simplex. J Parasitol. 1990;76(5):625–30.
Tort J, Brindley PJ, Knox D, Woklfe KH, Dalton JP. Proteinases and associated genes of parasitic helminths. Adv Parasitol. 1999;43:161–266.
D'Amelio S, Mathiopoulos KD, Santos CP, Pugachev ON, Webb SC, Picanço M, Paggi L. Genetic markers in ribosomal DNA for the identification of members of the genus Anisakis (Nematoda: Ascaridoidea) defined by polymerase-chain-reaction-based restriction fragment length polymorphism. Int J Parasitol. 2000;30(2):223–6.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:1303.3997v2 [q-bio.GN]; 2013.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B. 1995;57:289–300.
Haas BJ, Papanicolaou A, Yassour M, et al. De novo transcript sequence reconstruction from RNA-seq: reference generation and analysis with trinity. Nat Protoc. 2013;8:1494–512.
Petrella V, Aceto S, Musacchia F, Colonna V, Robinson M, Benes V, et al. De novo assembly and sex-specific transcriptome profiling in the sand fly Phlebotomus perniciosus (Diptera, Phlebotominae), a major old world vector of Leishmania infantum. BMC Genomics. 2015;16:847.
Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011;8:785–6.
Howe KL, Bolt BJ, Shafie M, Kersey P, Berriman M. WormBase ParaSite - a comprehensive resource for helminth genomics. Mol Biochem Parasitol. 2017;215:2–10.
Lustigman S, Prichard RK, Gazzinelli A, Grant WN, Boatin BA, McCarthy JS, Basáñez MGA. Research agenda for helminth diseases of humans: the problem of helminthiases. PLoS Negl Trop Dis. 2012;6(4):e1582.
Jex AR, Liu S, Li B, Young ND, Hall RS, Li Y, et al. Ascaris suum draft genome. Nature. 2011;479:529–33.
Wang T, Van Steendam K, Dhaenens M, Vlaminck J, Deforce D, Jex AR. Proteomic analysis of the excretory-secretory products from larval stages of Ascaris suum reveals high abundance of glycosyl hydrolases. PLoS Negl Trop Dis. 2013;3;7(10):e2467.
Zhou RQ, Ma GX, Korhonen PK, Luo YL, Zhu HH, Luo YF, et al. Comparative transcriptomic analyses of male and female adult Toxocara canis. Gene. 2017;600:85–9.
Ma X, Zhu Y, Li C, Shang Y, Meng F, Chen S, Miao L. Comparative transcriptome sequencing of germline and somatic tissues of the Ascaris suum gonad. BMC Genomics. 2011;12:481.
Cantacessi C, Campbell BE, Visser A, Geldhof P, Nolan MJ, Nisbet AJ, et al. A portrait of the "SCP/TAPS" proteins of eukaryotes - developing a framework for fundamental research and biotechnological outcomes. Biotechnol Adv. 2009;27(4):376–88.
Datu BJ, Gasser RB, Nagaraj SH, Ong EK, O'Donoghue P, McInnes R, et al. Transcriptional changes in the hookworm, Ancylostoma caninum, during the transition from a free-living to a parasitic larva. PLoS Negl Trop Dis. 2008;2(1):e130.
Rawlings ND, Barrett AJ, Bateman A. MEROPS: the database of proteolytic enzymes, their substrates and inhibitors. Nucleic Acids Res. 2012;40(Database issue):D343–50.
Bond JS, Beynon RJ. The astacin family of metalloendopeptidases. Protein Sci. 1995;4(7):1247–61.
Modica MV, Lombardo F, Franchini P, Oliverio M. The venomous cocktail of the vampire snail Colubraria reticulata (Mollusca, Gastropoda). BMC Genomics. 2015;16:441.
Stepek G, McCormack G, Page AP. Collagen processing and cuticle formation is catalysed by the astacin metalloprotease DPY-31 in free-living and parasitic nematodes. Int J Parasitol. 2010;40:533–42.
Stepek G, McCormack G, Page AP. The Kunitz domain protein BLI-5 plays a functionally conserved role in cuticle formation in a diverse range of nematodes. Mol Biochem Parasitol. 2010;169(1):1–11.
Sitnik JL, Francis C, Hens K, Huybrechts R, Wolfner MF, Callaerts P. Neprilysins: an evolutionarily conserved family of metalloproteases that play important roles in reproduction in Drosophila. Genetics. 2014;196(3):781–97.
Yamada K, Hirotsu T, Matsuki M, Butcher RA, Tomioka M, Ishihara T, et al. Olfactory plasticity is regulated by pheromonal signaling in Caenorhabditis elegans. Science. 2010;24(329(5999)):1647–50.
Barbara G, De Giorgio R, Stanghellini V, Corinaldesi R, Cremon C, Gerard N, et al. Neutral endopeptidase (EC 220.127.116.11) downregulates the onset of intestinal inflammation in the nematode infected mouse. Gut. 2003;52(10):1457–64.
Morassutti AL, Levert K, Pinto PM, da Silva AJ, Wilkins P, Graeff-Teixeira C. Characterization of Angiostrongylus cantonensis excretory-secretory proteins as potential diagnostic targets. Exp Parasitol. 2012;130(1):26–31.
Soblik H, Younis AE, Mitreva M, Renard BY, Kirchner M, Geisinger F, et al. Life cycle stage-resolved proteomic analysis of the excretome/secretome from Strongyloides ratti - identification of stage-specific proteases. Mol Cell Proteomics. 2011;10(12):M111.010157.
Rachamim T, Sher D. What Hydra can teach us about chemical ecology - how a simple, soft organism survives in a hostile aqueous environment. Int J Dev Biol. 2012;56(6–8):605.
Maizels RM. Toxocara canis: molecular basis of immune recognition and evasion. Vet Parasitol. 2013;193(4):365–74.
Liao M, Zhou J, Gong H, Boldbaatar D, Shirafuji R, et al. Hemalin, a thrombin inhibitor isolated from a midgut cDNA library from the hard tick Haemaphysalis longicornis. J Insect Physiol. 2009;55:164–73.
Schwarz A, Cabezas-Cruz A, Kopecký J, Valdés JJ. Understanding the evolutionary structural variability and target specificity of tick salivary Kunitz peptides using next generation transcriptome data. BMC Evol Biol. 2014;14:4.
Ranasinghe SL, Fischer K, Zhang W, Gobert GN, McManus DP. Cloning and characterization of two potent Kunitz type protease inhibitors from Echinococcus granulosus. PLoS Negl Trop Dis. 2015;9(12):e0004268.
Ranasinghe SL, Fischer K, Gobert GN, McManus DPA. Novel coagulation inhibitor from Schistosoma japonicum. Parasitology. 2015;142(14):1663–72.
Ranasinghe SL, Fischer K, Gobert GN, McManus DP. Functional expression of a novel Kunitz type protease inhibitor from the human blood fluke Schistosoma mansoni. Parasit Vectors. 2015;8:408.
Perteguer MJ, Raposo R, Cuellar C. vitro study on the effect of larval excretory/secretory products and crude extracts from Anisakis simplex on blood coagulation. Int J Parasitol. 1996;26:105–8.
Kennedy MW. Parasitic nematodes: antigens, membranes and genes. Boca Raton: CRC Press: Taylor and Francis Ltd; 1991.
Matthews B. Behaviour and enzyme release by Anisakis sp. larvae (Nematoda: Ascaridida). J Helminthol. 1982;56:177–83.
Moneo I, Caballero ML, Gómez F, Ortega E, Alonso MJ. Isolation and characterization of a major allergen from the fish parasite Anisakis simplex. J Allergy Clin Immunol. 2000;106(1):177–82.
Szecsi PB. The aspartic proteases. Scand J Clin Lab Invest Suppl. 1992;210:5–22.
Long Y, Cao B, Wang Y, Luo D. Pepsin is a positive regulator of ac-cathB-2 involved in the rat gut penetration of Angiostrongylus cantonensis. Parasit Vectors. 2016;9:286.
Rosa BA, Jasmer DP, Mitreva M. Genome-wide tissue-specific gene expression, co-expression and regulation of co-expressed genes in adult nematode Ascaris suum. PLoS Negl Trop Dis. 2014;8(2):e2678.
Berland B. Nematodes from some Norwegian marine fishes. Sarsia. 1961;2:1–50.
The authors thank Carlo Taccari for graphical support.
This work has been funded by the European Society of Clinical Microbiology and Infectious Diseases (ESCMID) who funded a Research Grant  to SC. Research in CC laboratory is supported by grants by the Royal Society and the Isaac Newton Trust/Wellcome Trust/University of Cambridge.
Availability of data and materials
Raw sequence data analysed in this article have been deposited in the NCBI Sequence Read Archive database under the project number PRJNA374530, while annotated sequence data are available from Additional file 2: Table S2, Additional file 3: Table S3, Additional file 4: Table S4.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Putative orthologs of Anisakis simplex (s.s.) (AS) identified in the transcriptome of Anisakis pegreffii (AP) and vice versa. (XLS 3657 kb)
Annotation information of Anisakis simplex (s.s.) (AS) differentially expressed genes (DEGs) in the whole larva (WL) and in the pharynx (PX). Corresponding peptide sequences are also included. (XLSX 9154 kb)
Anisakis pegreffii transcript sequences and, where available, corresponding peptides and annotation information, and differentially expressed genes (DEGs) upregulated in the whole larva (WL) and in the pharynx (PX). (XLSX 170 kb)
Data including TPM and FDR of Anisakis simplex (s.s.) (AS) and Anisakis pegreffii (AP) differentially expressed genes (DEGs) in the whole larva (WL) and in the pharynx (PX). (XLSX 197 kb)
Gene Ontology term distribution of differentially expressed genes (DEGs) in Anisakis simplex (s.s.). (JPEG 579 kb)
Gene Ontology term distribution of of differentially expressed genes (DEGs) in Anisakis pegreffii. (JPEG 982 kb)
Alignment of four orthologous putative peptides belonging to CRISPs (CAP superfamily) in Anisakis simplex (s.s.) (AS) and A. pegreffii (AP). (TIFF 674 kb)
Alignment of two EB-module containing orthologues in Anisakis simplex (s.s.) (AS) and A. pegreffii (AP), indicated as module 1 and 2. Conserved cysteines of EB domain are highlighted in red and the whole domain is highlighted in violet. (TIFF 2315 kb)
Alignement of Anisakis simplex (s.s.) (AS) putative peptide containing Kunitz domain (a: ASIM_0001030201 and b: ASIM_0001813801) with sequences of major allergens AniS1 of A. simplex (AGC60037), A. pegreffii (AGC60032), Toxocara canis (KHN76536), Ascaris suum (ERG81619), Trichuris trichiura (CDW53412), Necator americanus (XP_013296851) Brugia malayi (XP_001901728), T. canis (KHN72724), A. suum (ERG79727), Strongyloides ratti (CEF68915) and Caenorhabditis elegans (NP_001256870). (JPEG 201 kb)
About this article
- Anisakis simplex (sensu stricto)
- Anisakis pegreffii
- Pharyngeal transcriptome
- Differential gene expression