Spatial distribution and risk factors for human cysticercosis in Colombia

Background Cysticercosis is a zoonotic neglected tropical disease (NTD) that affects humans and pigs following the ingestion of Taenia solium eggs. Human cysticercosis poses a substantial public health burden in endemic countries. The World Health Organization (WHO) aims to target high-endemicity settings with enhanced interventions in 17 countries by 2030. Between 2008 and 2010, Colombia undertook a national baseline serosurvey of unprecedented scale, which led to an estimated seroprevalence of T. solium cysticercus antibodies among the general population of 8.6%. Here, we use contemporary geostatistical approaches to analyse this unique dataset with the aim of understanding the spatial distribution and risk factors associated with human cysticercosis in Colombia to inform how best to target intervention strategies. Methods We used a geostatistical model to estimate individual and household risk factors associated with seropositivity to T. solium cysticercus antibodies from 29,253 people from 133 municipalities in Colombia. We used both independent and spatially structured random effects at neighbourhood/village and municipality levels to account for potential clustering of exposure to T. solium. We present estimates of the distribution and residual correlation of seropositivity at the municipality level. Results High seroprevalence was identified in municipalities located in the north and south of Colombia, with spatial correlation in seropositivity estimated up to approximately 140 km. Statistically significant risk factors associated with seropositivity to T. solium cysticercus were related to age, sex, educational level, socioeconomic status, use of rainwater, consumption of partially cooked/raw pork meat and possession of dogs. Conclusions In Colombia, the distribution of human cysticercosis is influenced by socioeconomic considerations, education and environmental factors related to the spread of T. solium eggs. 
This information can be used to tailor national intervention strategies, such as targeting spatial hotspots and more highly exposed groups, including displaced people and women. Large-scale seroprevalence surveys accompanied by geospatial mapping are an essential step towards reaching the WHO’s 2021‒2030 NTD roadmap targets.
Supplementary Information The online version contains supplementary material available at 10.1186/s13071-021-05092-8.


Background
The zoonotic tapeworm, Taenia solium, is responsible for taeniasis/cysticercosis which is included in the World Health Organization's (WHO's) list of prioritised neglected tropical diseases (NTDs) [1]. Humans are the definitive hosts of T. solium and harbour the adult tapeworm in their bowel. Pigs are intermediate hosts, infected by larval cysts (cysticerci) following ingestion of parasite eggs and proglottids [2] in human faeces. Eggs hatch in the pig's digestive system, and the released oncospheres first penetrate the intestinal wall, entering the bloodstream, and then become encysted in striated muscle, brain, liver and subcutaneous and other tissues. Porcine cysticercosis is often asymptomatic [2,3], although cysts in pig brain tissue can cause neurocysticercosis (NCC) and epileptic seizures [4]. Humans contract taeniasis following consumption of tissue cysts in poorly cooked pork meat. Taeniasis is usually asymptomatic, but mild symptoms, including abdominal pain, distension, diarrhoea and nausea, may appear [2]. Humans can also be infected with T. solium eggs, typically from ingestion of food contaminated with human faecal material [5] or food washed with contaminated water [6]. Internal auto-infestation following regurgitation of proglottids in the stomach has also been suggested as an additional route of infection [2,5,7]. Infection with T. solium eggs causes cysticercosis which manifests most severely when cysts migrate to the central nervous system, resulting in NCC [2]. Morbidity from NCC associated with seizures, epilepsy and other neurological sequelae is driven by the number and location of cysts or following the degeneration of viable cysts [8].
Taeniasis/cysticercosis is widely endemic globally. Taenia solium cysticercosis antibody seroprevalence, indicative of exposure, ranges from 1.8 to 31.2% in Latin America, from 12.6 to 19.2% in Asia and from 7.7 to 34.5% in Africa (as measured using an enzyme-linked immunoelectrotransfer blot [EITB] assay) [9], which highlights substantial variation in exposure to T. solium eggs across settings. NCC is responsible for the predominant disease burden associated with T. solium infection, accounting for approximately 30% of epilepsy cases in endemic countries and 3% globally [10]. In addition, this zoonosis impacts the pork meat market, with small producers experiencing economic losses due to the reduction in value of infected pork meat [4] and a market shift towards home slaughtering and selling [11].
In Colombia, taeniasis/cysticercosis poses a substantial public health problem [12], with an estimated lifetime prevalence of epilepsy of 20.9 per 1000 individuals and a prevalence of neurocysticercosis (by computed tomography scan) of 13.9% [13]. The country-wide prevalence of T. solium cysticercus antibodies was estimated at 8.6% from a national serosurvey of more than 29,000 people conducted between 2008 and 2010 [14]. Despite the unprecedented scale of this epidemiological survey, and the development by the Pan American Health Organization in 2015 of a formal plan of surveillance and control in Colombia [12], there has been little implementation of systematic surveillance or intervention activities. Consequently, the epidemiology of T. solium in Colombia is unlikely to have changed substantively during the decade since these data were generated. Thus, the dataset remains the most comprehensive and relevant countrywide cross-sectional 'snapshot' of T. solium epidemiology anywhere in the world and a unique information resource.
Here, we analyse this dataset using a contemporary geostatistical approach to understand the spatial distribution of T. solium cysticercus seropositivity in Colombia, as well as individual and household risk factors associated with exposure to the parasite. This work extends the original analysis of these data [14] by integrating the effects of individual covariates and spatial clustering at multiple hierarchical levels within a single statistical framework. We present maps of the spatial distribution of T. solium cysticercus seropositivity in Colombia, estimates of spatial correlation and demographic, socioeconomic, behavioural and other risk factors associated with exposure to this zoonotic NTD.

Study design
The data were collected by the Colombian National Health Institute (Instituto Nacional de Salud) between 2008 and 2010 with the aim of estimating T. solium human cysticercosis antibody seroprevalence and associated risk factors. Details of the original data collection can be found in [14]. Briefly, individuals aged from 2 to 64 years, from 23 departments and Bogotá district, living in 133 municipalities with > 5000 inhabitants and a health centre were eligible for inclusion. The small proportion of total municipalities sampled (133/1122) was due to logistical and financial constraints. A three-stage cluster random sampling approach was used, covering 23 out of Colombia's 32 departments (first administrative level unit) and Bogotá district (Additional file 1: Figure S1). The municipality constituted the primary sample unit (PSU) and was stratified according to level of urbanization, rural and urban population composition and the Unsatisfied Basic Needs Index (Índice de Necesidades Básicas Insatisfechas) [15]. Within each stratum, the secondary sample unit (SSU) was defined as a neighbourhood (urban) or village (rural) with > 10 households and selected by random sampling. Finally, 10 households in each SSU were randomly selected, and one person belonging to each household (between the age of 2 and 64 years) was selected at random from those present at the interview. Following informed consent, finger-prick blood samples were obtained from 29,360 participants, and each sample was assessed for the presence of circulating T. solium cysticercus antibodies at the National Health Institute Reference Laboratory (Laboratorio de Parasitología del Instituto Nacional de Salud) by enzyme-linked immunosorbent assay (ELISA), with a reported sensitivity of 100% and specificity of 97.5% [16]. Participants also completed a questionnaire on sociodemographic information, hygiene habits, health conditions, food consumption habits, living conditions and animal ownership and management. 
The questionnaire was developed by the research team in Colombia, with input from experts on cysticercosis. It was first tested in a pilot survey carried out in 216 homes in the municipality of Caqueza (Department of Cundinamarca), from 28 August to 2 September 2008 and adjusted accordingly. Teams in the field were trained on the use of the questionnaire before it was applied on the whole sample. Details on the cleaning and coding of this dataset can be found in Additional file 1: Text S1.
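The three-stage selection described in this section can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the survey team's procedure: the nested sampling frame, the function name `three_stage_sample` and the numbers of units drawn at each stage are hypothetical, and the stratification of municipalities is omitted for brevity.

```python
import random

def three_stage_sample(frame, n_psu, n_ssu, n_households=10, seed=0):
    """Select municipalities (PSU), then neighbourhoods/villages (SSU),
    then households, then one eligible person per household.

    `frame` is a hypothetical nested dict:
    {municipality: {ssu: {household: [ages of people present]}}}
    """
    rng = random.Random(seed)
    selected = []
    for muni in rng.sample(sorted(frame), min(n_psu, len(frame))):
        # Only SSUs with more than 10 households are eligible.
        eligible = sorted(s for s, hh in frame[muni].items() if len(hh) > 10)
        for ssu in rng.sample(eligible, min(n_ssu, len(eligible))):
            households = sorted(frame[muni][ssu])
            for hh in rng.sample(households, n_households):
                # One person aged 2-64 years among those present.
                present = [a for a in frame[muni][ssu][hh] if 2 <= a <= 64]
                if present:
                    selected.append((muni, ssu, hh, rng.choice(present)))
    return selected
```

Because only people present at the interview enter the final draw, the composition of the sample depends on who is at home, a point the Discussion returns to when considering the high proportion of women.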

Model-building and analysis of residual spatial correlation
Before performing the geospatial analysis, an initial exploratory analysis was undertaken (using R version 4.0.5 [17]). Given the clustered nature of the data, a hierarchical univariate mixed-effects logistic regression model was fitted to test the association between each explanatory variable (covariate) and human seropositivity to T. solium cysticerci, with each model including two independent random effects terms to capture correlation at the municipality and neighbourhood/village (depending on urban or rural location) levels. Explanatory variables with a P-value ≤ 0.25 (a conservative cut-off to avoid missing potentially important variables), derived from a likelihood ratio test, were retained in the subsequent hierarchical multivariable mixed-effects logistic regression model.
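The screening rule above (retain a covariate when its likelihood ratio test gives P ≤ 0.25) can be illustrated with a short sketch. It assumes that the log-likelihoods of the mixed-effects models with and without each covariate have already been obtained from fitted models; the function names and the example log-likelihood values are hypothetical, and the chi-square tail probability is computed with the standard library only.

```python
import math

def chi2_sf(x, df):
    """Upper tail probability of a chi-square distribution (integer df)."""
    if x <= 0:
        return 1.0
    z = x / 2.0
    if df % 2 == 0:
        # Closed form for even degrees of freedom.
        return math.exp(-z) * sum(z ** i / math.factorial(i) for i in range(df // 2))
    # Odd degrees of freedom: start from df = 1 and apply the recurrence.
    total = math.erfc(math.sqrt(z))
    for i in range(1, (df - 1) // 2 + 1):
        total += math.exp(-z) * z ** (i - 0.5) / math.gamma(i + 0.5)
    return total

def screen_covariates(loglik_null, fits, threshold=0.25):
    """Keep covariates whose likelihood ratio test P-value is <= threshold.

    fits: {name: (log-likelihood with the covariate, extra parameters)}
    """
    retained = {}
    for name, (loglik, df) in fits.items():
        statistic = 2.0 * (loglik - loglik_null)
        p_value = chi2_sf(statistic, df)
        if p_value <= threshold:
            retained[name] = p_value
    return retained

# Hypothetical log-likelihoods, for illustration only.
example = screen_covariates(-1000.0, {"sex": (-995.0, 1), "street_food": (-999.9, 1)})
```

In this example `sex` is retained and `street_food` is dropped. Categorical covariates with several levels contribute more than one extra parameter, which is why the degrees of freedom are passed explicitly.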
The generic structure of all models is given by:
\[ Y \sim \mathrm{Bernoulli}(\mu), \qquad \mathrm{logit}(\mu) = X\beta + U + Z, \qquad U \sim N(0, \sigma^{2}), \qquad Z \sim N(0, \tau^{2}) \]
where Y is a binary vector of observations indicating whether an individual tested positive for T. solium cysticercus antibodies, assuming a Bernoulli distribution; µ is a vector of probabilities for testing positive; β is a vector of regression coefficients, and X is the design matrix of explanatory variables; U and Z are vectors of independent and normally distributed random effects terms associated with municipalities and neighbourhoods/villages, respectively; and σ and τ are the standard deviations of the respective random effects terms (indicative of the degree of variability at each hierarchical level). From the final fitted models, adjusted odds ratios (ORs), 95% confidence intervals (95% CIs) and P-values were obtained for each risk factor. All notations/parameters are summarised in Additional file 1: Table S1. A sub-analysis on risk factors in those individuals owning pigs (n = 3154) was also conducted (methodological details are given in Additional file 1: Text S1).
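To make the hierarchical structure concrete, the sketch below simulates binary outcomes from a model of this form: a linear predictor on the logit scale made up of fixed effects plus one normally distributed random effect per municipality (standard deviation σ) and one per neighbourhood/village (standard deviation τ). The function name, the single covariate (sex) and all parameter values are illustrative assumptions and are not estimates from this study.

```python
import math
import random

def simulate_seropositivity(n_muni, n_nbhd, n_people, beta0, beta_female,
                            sigma, tau, seed=1):
    """Simulate (municipality, neighbourhood, female, y) records."""
    rng = random.Random(seed)
    records = []
    for m in range(n_muni):
        u = rng.gauss(0.0, sigma)        # municipality effect, SD sigma
        for v in range(n_nbhd):
            z = rng.gauss(0.0, tau)      # neighbourhood/village effect, SD tau
            for _ in range(n_people):
                female = rng.random() < 0.5
                # Linear predictor on the logit scale.
                eta = beta0 + beta_female * female + u + z
                p = 1.0 / (1.0 + math.exp(-eta))
                y = 1 if rng.random() < p else 0   # Bernoulli draw
                records.append((m, v, int(female), y))
    return records

# Illustrative values only.
data = simulate_seropositivity(20, 5, 10, beta0=-2.3, beta_female=0.25,
                               sigma=0.8, tau=0.5)
```

In such a model, exp(beta_female) is the adjusted odds ratio for females relative to males, and larger values of sigma or tau produce stronger clustering of positives within municipalities or neighbourhoods/villages.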
Following fitting of the multivariable mixed-effects model, a variogram analysis was performed to assess the presence of residual spatial correlation [17]. Since the geographical coordinates were available only for the municipalities and not for the neighbourhoods/villages, the empirical variogram was computed only on U, the estimated random effects at the municipality level. A Monte Carlo test for the null hypothesis of spatial independence was performed based on 10,000 random permutations of U amongst the sampled municipalities. The variograms computed on the permuted random effects represent the sampling distribution of the estimated variogram in the absence of spatial correlation. If the empirical variogram ordinates fall outside of the 95% CI obtained from the Monte Carlo test, then there is some evidence of spatial correlation at municipality level.
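The permutation test described above can be sketched as follows. This is a minimal illustration rather than the code used in the analysis: it assumes planar coordinates (for example, projected centroids in km), user-chosen distance bins that each contain at least one pair of municipalities, and hypothetical function names.

```python
import math
import random

def empirical_variogram(coords, values, bin_edges):
    """Mean of 0.5 * (u_i - u_j)^2 over pairs in each distance bin."""
    sums = [0.0] * (len(bin_edges) - 1)
    counts = [0] * (len(bin_edges) - 1)
    n = len(values)
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(coords[i], coords[j])
            for b in range(len(sums)):
                if bin_edges[b] <= d < bin_edges[b + 1]:
                    sums[b] += 0.5 * (values[i] - values[j]) ** 2
                    counts[b] += 1
                    break
    return [s / c if c else float("nan") for s, c in zip(sums, counts)]

def permutation_envelope(coords, values, bin_edges, n_perm=1000, seed=0):
    """2.5% and 97.5% quantiles of the variogram under spatial independence.

    Random effects are shuffled among locations; coordinates stay fixed.
    Assumes every bin contains at least one pair of locations.
    """
    rng = random.Random(seed)
    shuffled = list(values)
    sims = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        sims.append(empirical_variogram(coords, shuffled, bin_edges))
    lower, upper = [], []
    for b in range(len(bin_edges) - 1):
        column = sorted(s[b] for s in sims)
        lower.append(column[int(0.025 * (n_perm - 1))])
        upper.append(column[int(0.975 * (n_perm - 1))])
    return lower, upper
```

Observed variogram ordinates that fall below the lower limit at short distances indicate that nearby municipalities have more similar random effects than would be expected by chance.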

Incorporating spatial structure
In the presence of spatial correlation, the independent random effects at the municipality level, U, were replaced with a set of spatially structured random effects, S(x), where x is a vector with the centroids of the sampled municipalities. S(x) is a spatial Gaussian process with variance $\sigma^{2}$ and correlation function $\rho(u) = \exp(-u/\phi)$, where u is the distance between a pair of municipality centroids and ϕ is a parameter that controls the rate at which the spatial correlation decays with increasing distance. Conditional on these spatially structured random effects, the observations can still be considered as independent Bernoulli random variables [18]. The spatially structured model was fitted using the integrated nested Laplace approximation (INLA) and stochastic partial differential equation (SPDE) approaches [19,20], which implement approximate Bayesian inference in a computationally less intensive manner than alternative Markov chain Monte Carlo (MCMC) approaches. A flat Gaussian prior with mean and precision equal to zero was assigned to the model intercept term; other fixed effects were assigned independent vague Gaussian priors with mean zero and precision equal to 0.001. For the precision of the independent neighbourhood/village random effects, $1/\tau^{2}$, a vague Gamma prior was used, and for the parameters $\sigma^{2}$ and ϕ of the spatially structured random effects, we adopted penalised complexity priors [21]. Adjusted ORs and 95% credible intervals (95% CrIs) were obtained for each risk factor from the final fitted model.
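The exponential correlation function is simple to evaluate directly. The sketch below computes the correlation between pairs of centroids for a given scale parameter; the function names, the planar coordinates and the value of the scale parameter (50 km) are assumptions made for illustration. Under this function the correlation equals exp(−1), about 0.37, at a distance equal to the scale parameter and falls to about 0.05 at three times that distance. The text does not state which correlation threshold defines the reported range, so no value of the scale parameter is inferred here.

```python
import math

def exponential_correlation(u, phi):
    """Correlation between two locations separated by distance u."""
    return math.exp(-u / phi)

def correlation_matrix(centroids, phi):
    """Pairwise correlation matrix for planar centroids (same units as phi)."""
    return [[exponential_correlation(math.dist(a, b), phi) for b in centroids]
            for a in centroids]

# Hypothetical centroids (km) and scale parameter, for illustration only.
centroids = [(0.0, 0.0), (50.0, 0.0), (0.0, 150.0)]
R = correlation_matrix(centroids, phi=50.0)
```

Multiplying each entry of this matrix by the process variance gives the covariance matrix of the spatially structured random effects at the sampled centroids.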

Study population and seroprevalence distribution
Of the 29,360 observations, 29,253 (99.6%) observations were kept for analysis, with 107 removed due to missing covariate values. Participants were mostly located in urban areas (77.9%), mostly aged 21-50 years (64.4%) and mostly women (68.5%); the main occupational activity was housewife/houseman (44.5%). Socioeconomic stratum 1 (lowest of 4 socioeconomic strata, excluding displaced people) was the most frequently represented socioeconomic stratum (49.5%), and participants most frequently had a partial or complete secondary school educational level (45.8%) (Table 1). The mean seroprevalence of T. solium cysticercus antibodies was 9.6%, ranging from 0.5% in the Department of Caldas to 38.7% in the Department of Vaupés (Additional file 1: Table S2). Municipalities with the highest seroprevalence were located in the north and south of Colombia (Fig. 1), while municipalities with lower seroprevalence were concentrated in the central part of the country.

Risk factors for human seropositivity without spatial structure
From the univariate mixed-effects logistic regression model with two random effects, food consumption in streets, washing hands after toilet usage and owning animals other than dogs and pigs (cattle, cats, birds) were excluded from further (multivariable) analysis, having a P-value > 0.25 (Additional file 1: Table S3). Consequently, 16 explanatory variables were included in the multivariable mixed-effects logistic regression with two random effects. Increasing age (as age categories), being female, owning dogs and using rainwater as a water source were significantly associated with increased odds of being seropositive for T. solium cysticercus antibodies; increasing education level, socioeconomic status and consuming partially cooked/raw pork meat once per week were significantly associated with decreased odds of being seropositive (Additional file 1: Table S4). Risk factor analysis results from the sub-analysis of those owning pigs (n = 3154) are reported in Additional file 1: Text S2 and Additional file 1: Table S5. Figure 2 shows a map of the residual variation in the seroprevalence of T. solium cysticercus antibodies at the municipality level that is unexplained by the covariates in the non-spatial mixed-effects model. Figure 3 shows a variogram analysis carried out on the municipalities' estimated random effects. The empirical variogram falls partially outside of the 95% confidence bands, suggesting the presence of spatial correlation in seroprevalence at the municipality level (unexplained by the covariates) up to approximately 120-140 km; beyond this distance, the variation between two spatial points starts to plateau. This estimate was determined more precisely from the fitted geostatistical model (see below) to a value of 139 km.

Geostatistical model
The geostatistical model estimated a strong spatial correlation at the municipality level of up to 139 km. The ORs and 95% CrIs associated with each covariate included in the final multivariable model (which accounts for spatial correlation at the municipality level) are given in Table 2.
Notably, the odds of testing positive for T. solium cysticercus antibodies were 1.29-fold (95% CrI = 1.15-1.46) greater for females than for males, and the odds of testing positive generally increased with age. For example, adults aged between 21 and 60 years had approximately twofold higher odds of testing positive than children in the age range 2-10 years. Lower educational levels were significantly associated with increased odds of seropositivity, with the highest estimated odds associated with no formal education. Displaced people had 2.20-fold (95% CrI = 1.15-4.28) higher odds of being seropositive than people in the highest socioeconomic stratum; there was no significant difference among other socioeconomic strata. The use of rainwater as a water source was associated with 1.6-fold (95% CrI = 1.21-2.13) higher odds of being positive compared to the use of a well or cistern, and dog owners had significantly higher odds of testing positive (OR = 1.19, 95% CrI = 1.08-1.31) than non-owners. Consumption of partially cooked/raw pork meat once per week was associated with significantly decreased odds of testing positive (OR = 0.59, 95% CrI = 0.36-0.90) compared to no consumption. Place of residence, occupation, frequency of washing vegetables, excreta elimination and owning animals other than dogs (including pigs) were not significantly associated with testing positive for T. solium cysticercus antibodies.
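Because odds ratios act on odds rather than on probabilities, a short worked example may help with interpretation. The sketch below converts a baseline probability to odds, applies an odds ratio and converts back. The function name and the baseline probability of 8% are hypothetical and are not model estimates; the odds ratio of 1.29 is the value reported above for females relative to males.

```python
def apply_odds_ratio(baseline_prob, odds_ratio):
    """Probability implied by applying an odds ratio to a baseline probability."""
    odds = baseline_prob / (1.0 - baseline_prob)
    new_odds = odds * odds_ratio
    return new_odds / (1.0 + new_odds)

# Hypothetical baseline of 8% combined with the reported OR of 1.29.
p_female = apply_odds_ratio(0.08, 1.29)   # about 0.10
```

At low baseline probabilities the odds ratio is close to the risk ratio, but the two diverge as the baseline probability increases, so a 1.29-fold increase in odds should not be read as a 1.29-fold increase in seroprevalence in high-prevalence settings.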

Discussion
The 2008-2010 Colombian cysticercosis serosurvey generated unique and unprecedented information on exposure to T. solium cysticercosis at a national scale. The work presented here extends the original analysis of these data [14] by using contemporary geostatistical techniques to evaluate individual-level risk factors associated with seropositivity to T. solium cysticerci and, simultaneously, spatial clustering at a sub-national (municipality) scale. The results contribute important information on factors associated with exposure to T. solium cysticerci. They also indicate that similar large-scale epidemiological surveys will be needed if hyperendemic foci of transmission are to be identified and targeted for intensified interventions in 17 endemic countries, as per the WHO's 2021-2030 NTD roadmap targets for taeniasis/cysticercosis [22]. Here, and in the original analysis of these data [14], women were more likely than men to be positive for T. solium cysticercus antibodies. This finding is consistent with the results of numerous other studies undertaken in Latin America [2,9,23-27]; by contrast, in other endemic regions, such as sub-Saharan Africa, being male is associated with an increased risk of exposure [28] and of antigen positivity [29,30]. The mechanisms underlying these epidemiological patterns remain unclear. Different household roles associated with handling household-owned animals, food and water may be important, although many variables pertaining to these activities were accounted for in this analysis. Notwithstanding the underlying cause, women could be an important target for educational campaigns in Colombia, not just because of their apparent increased risk of exposure, but also because they are often responsible for the majority of food handling and preparation activities, which would be all the more important if they were also tapeworm carriers.
Fig. 2 Residual variation in Taenia solium cysticercus seroprevalence at the municipality level across Colombia. The map represents the residual variation in cysticercus seroprevalence at the municipality level that is not explained by the covariates in the non-spatial mixed-effects model
The trend for increasing seropositivity with age is unsurprising given that T. solium cysticercus antibodies probably persist for several years. Seropositivity may thus be considered as an indicator of lifetime prior exposure. Praet et al. [31] explored age-dependent dynamics of T. solium cysticercus antibody positivity in more depth by fitting mathematical models to similar age-seroprevalence data collected in Ecuador. Their results suggested that higher antibody seroreversion rates occur following first exposure (representing the primary humoral response), followed by a lower seroreversion rate after the boosting effect of subsequent exposures (representing secondary humoral response), causing saturation in antibody seroprevalence with age. Hence, where transmission is relatively intense and repeated exposures are common, one might expect to see similar saturating age-seroprevalence profiles. By contrast, in lower transmission settings, the effect of seroreversion following first exposure, together with the less frequent boosting effect of subsequent exposures, may be more evident in seroprevalence profiles, possibly resulting in a decline in seropositivity in older age groups.
Exposure to T. solium is known to be greater for individuals with lower educational levels, those from lower socioeconomic strata [6,32] and those facing social marginalisation [9,33-35]. Our findings are consistent with these previously reported findings, with the odds of displaced people testing positive being approximately twofold higher (OR = 2.20) than those of people in the highest socioeconomic stratum. Internal displacement in Colombia is a major issue that often involves the poorest and most disadvantaged people [36], and if the control of T. solium is to become comprehensive, displaced people may require enhanced interventions. Health education could be one such option for control in specific populations using tools such as "The Vicious Worm" [37], as there is some evidence that health education campaigns specific to T. solium can impact transmission [38]. It is, however, likely that to achieve substantial, sustained reductions in the prevalence of T. solium or elimination, particularly in highly endemic areas, a One Health approach targeting the whole T. solium system, including infections in pigs, humans and the environment, will be required [39,40], as recently shown by intervention trials in Peru and Zambia [41,42].
The only variable related to food and water sources or hygiene practices that was significantly associated with seropositivity to T. solium cysticercus antibodies was the use of rainwater. Individuals in households using rainwater as opposed to water stored in wells or cisterns had 1.6-fold higher odds of seropositivity. Waterborne cysticercosis transmission is supported in the literature, given that the eggs can survive in fresh, brackish and salt waters [32,43-45] and can contaminate vegetables [45]. Other variables, such as open-field defecation or the use of unsanitary latrines [46,47], that one might also expect to be associated with exposure to T. solium were not identified in our analysis as significant risk factors. We also found that the odds of seropositivity significantly decreased when individuals consumed partially cooked/raw pork meat once per week, an observation possibly confounded by wealth (i.e., wealthier individuals consuming more meat). One might expect that consumption of partially cooked/raw pork meat would be associated with increased odds of seropositivity, given that taeniasis (adult tapeworm) carriers are at risk of autoinfection. However, more research is needed to understand the relative contribution of this route of transmission to overall cysticercosis risk [48]. A particularly striking finding of our analysis was the association between owning dogs and significantly increased odds of test positivity. Dogs in Asia have been reported to test positive for T. solium antibodies [49,50], potentially implicating them as alternative intermediate hosts. Transmission to humans has also been suggested to occur via the consumption of raw or undercooked canine meat [51], although this practice is thought to be extremely rare and not widely reported in Latin America. Moreover, the role of dogs as potential hosts for T. solium remains somewhat speculative. 
Given the coprophagic habits of dogs and their close interaction with humans, it is also possible (and perhaps more likely) that dogs act as mechanical vectors of T. solium eggs.
A further striking finding is that among the 10.8% (n = 3154) of individuals owning pigs, we did not find a significantly increased odds of seropositivity, only a nonsignificant increase in those owning fewer than 10 pigs (possibly indicative of smallholder, subsistence farmers, compared to individuals owning > 10 pigs, which may represent wealthier farmers). A further sub-analysis of pig owners (Additional file 1: Text S2) found no association between seropositivity and pig management practices (e.g. free roaming, feeding wastes, drinking free water, among others). These findings contrast with those reported in other studies in Latin America and other geographical settings, in which human cysticercosis has been associated with owning pigs [2,33,52]. Some farming practices, such as using waste or water and mix concentrate as feed, and the lack of drainage systems were nonsignificantly associated with increased seropositivity. However, because this sub-analysis was based on a much smaller sample (n = 3154) with only 388 seropositive individuals, there was limited power to detect significant associations.
In addition to exploring individual and household risk factors associated with exposure to T. solium, our geostatistical approach enabled identification of spatial clusters where seropositivity was higher, so-called hotspots (in the north and south of Colombia), or lower (in the central and western areas of the country) than could be explained by the included covariates (Fig. 2). Hotspots where seropositivity was higher than could be explained by the covariates coincided with areas with higher seroprevalence (16-40%) in the northern coastal area and areas bordering Venezuela (Departments of Atlántico, Magdalena, Cesar, La Guajira), in the northern-central region (Departments of Antioquia and Bolívar), in Vaupés (south-east, bordering Brazil) and in the south, in regions bordering Peru and Brazil (Department of Amazonas; Fig. 1). Neither human nor pig population density was explicitly included in the model and, therefore, these variables could help to explain some of this clustering (because of the potential for increased contamination of the environment with T. solium eggs where humans and pigs are abundant). While population densities are heterogeneous across Colombia, some of the highest human population densities are generally found in the north and north-east of the country [53], alongside the highest pig populations in the Pacific (west coastal), Andean (north-east/north-west) and Caribbean (north) regions, as estimated from the Gridded Livestock Database in 2007 [54]. Furthermore, it should be noted that given the level of spatial analysis, we were only able to detect spatial variation at the municipality level.
Local climatic, environmental and ecological conditions may also play a role in the observed clustering. In a recent systematic review, Jansen et al. [45] identified that Taenia spp. eggs can survive in the environment for up to 1 year in favourable conditions of high humidity, moderate temperatures (5-25 °C) and presence of surface water. Moreover, invertebrates, including dung beetles (Ammophorus rubripes), can also act as mechanical vectors for the dispersal of Taenia spp. eggs [55,56]. Hence, it is highly likely that local conditions, unaccounted for in our statistical model, will influence spatial patterns of exposure.
Although the serosurvey data analysed here are unique in presenting a picture of exposure to T. solium cysticercosis at a national scale, geographical coverage is incomplete and the sampling approach may have introduced some biases. In particular, the selection of municipalities with > 5000 individuals and a health centre is likely to have created a bias towards sampling in more densely populated urban areas. This led to an underrepresentation of rural communities, which may typically have had less access to health care and possibly lower overall health. In addition, nine departments were excluded from sampling (due to logistical and resource constraints) and overall, only a relatively small fraction (12%) of Colombia's municipalities were sampled (133/1122). Women are highly represented, and this is likely due to the decision to randomise only the individuals present at the interview for inclusion in the study. Also, the data were collected in 2008-2010, over a decade ago, and may therefore not reflect precisely contemporary epidemiological conditions. Nonetheless, we believe that, in the absence of widespread national control efforts, the distribution and endemic situation of T. solium are unlikely to have changed substantively over the past decade and, therefore, the data provide a useful snapshot of endemic conditions across the country. Due to the nature of surveys, other forms of bias and reverse causation are also possible.
Moreover, it cannot be excluded that some of the observed associations are confounded by unmeasured or unknown risk factors, or that the a priori decision to drop a certain number of variables increased the residual variation by omitting possible confounders. On the other hand, the unstructured nature of some variables or their probable collinearity with other exposures made this choice desirable. Despite the lack of data for some geographical areas in Colombia, we still consider the study outcomes valuable and indicative of the situation of cysticercosis in the country. In addition, the information provided in the current study could be further used to build models that can spatially predict the disease seroprevalence in nonsampled areas [17], offering a cost-effective tool for decision-makers in places where direct sampling did not take place.
Mapping the distribution and seroprevalence of T. solium in endemic countries is a crucial next step in realising the WHO's goals of implementing intensified control in hyperendemic areas of 17 countries by 2030 [22]. Currently, country-wide data on the transmission of T. solium, such as those analysed here for Colombia, are scarce, and thus there is a great deal of work to be done to identify hyperendemic areas in which to implement intensified interventions. Moreover, although working definitions of 'hyperendemicity' have been proposed [57], there is not yet a consensus on the definition of endemicity levels for T. solium infection. Geostatistical approaches will play an important role in identifying areas of high transmission, particularly if they can be parameterized to identify likely areas of high transmission using Geographical Information System (GIS) data that have comprehensive global coverage. Although our study focused on the identification of risk factors associated with exposure to T. solium and residual degrees of spatial clustering, similar geostatistical and machine learning approaches can be used that focus on predicting the spatial distribution of disease using GIS data [17]. Such approaches, conducted at national and global scales, will be crucial in assisting progress towards the WHO's 2030 goals [22,58].

Conclusions
Taeniasis/cysticercosis is a major public health problem and an important cause of epilepsy and other neurological sequelae in many regions of the world. The WHO aims to target this zoonotic NTD with enhanced control where transmission is most intense, although epidemiological data at national and subnational scales remain scarce. The 2008-2010 baseline epidemiological survey undertaken by the Colombian government remains unprecedented in scale and geographical coverage, generating data that are unique and provide a highly valuable resource for understanding the spatial epidemiology of T. solium cysticercosis. By taking a contemporary geostatistical approach, we have highlighted key associations between human cysticercosis antibody seropositivity and individual- and household-level risk factors, while also identifying spatial hotspots of exposure, unexplained by the measured covariates. These findings could be used to inform the design of intervention strategies in Colombia, such as targeting spatial hotspots and more highly exposed groups (such as displaced people and women), and also to illustrate how important geostatistical modelling will be as a tool to inform and support the WHO NTD roadmap in its 2021-2030 goals for taeniasis/cysticercosis.
Additional file 1: Table S1. Notation of parameters used for model building (analysis of residual spatial correlation) and incorporation of spatial structure. Table S2. Seroprevalence of Taenia solium cysticercus antibodies in Colombia. Table S3. Distribution of seropositive individuals, crude odds ratios (ORs) of testing positive for Taenia solium cysticercus antibodies by ELISA and associated 95% confidence intervals (CIs) from the univariate mixed-effects model. Table S4. Distribution of seropositive individuals, multivariable mixed-effects logistic regression adjusted ORs of testing positive for Taenia solium cysticercus antibodies by ELISA and associated CIs. Table S5. Pig management sub-set analysis (n = 3154). Pig management practices, distribution of seropositive individuals, crude odds ratios (ORs) of testing positive for Taenia solium cysticercus antibodies by ELISA and associated 95% CIs. Figure S1. Map displaying the sampled municipalities in Colombia (2008-2010). Text S1. Supplementary methods. Text S2. Results: pig management sub-analysis