BioRssay: an R package for analyses of bioassays and probit graphs

Karunarathne, Piyal; Pocquet, Nicolas; Labbé, Pierrick; Milesi, Pascal

doi:10.1186/s13071-021-05146-x

Short report
Open access
Published: 24 January 2022

BioRssay: an R package for analyses of bioassays and probit graphs

Parasites & Vectors volume 15, Article number: 35 (2022) Cite this article

4847 Accesses
12 Citations
5 Altmetric
Metrics details

Abstract

Dose–response relationships reflect the effects of a substance on organisms, and are widely used in broad research areas, from medicine and physiology, to vector control and pest management in agronomy. Furthermore, reporting on the response of organisms to stressors is an essential component of many public policies (e.g. public health, environment), and assessment of xenobiotic responses is an integral part of World Health Organization recommendations. Building upon an R script that we previously made available, and considering its popularity, we have now developed a software package in the R environment, BioRssay, to efficiently analyze dose–response relationships. It has more user-friendly functions and more flexibility, and proposes an easy interpretation of the results. The functions in the BioRssay package are built on robust statistical analyses to compare the dose/exposure–response of various bioassays and effectively visualize them in probit-graphs.

Graphical Abstract

Background

Bioassays aim at evaluating the potency of a compound. They usually consist in measuring the response of a “population” (e.g. organisms, populations, tissues, cells lines, strains, etc.) to increasing doses (or intensities or exposure times) of a stimulus (most of the time a xenobiotic or a chemical) to quantify specific dose–response relationships (also known as exposure–response relationships). Because of their effectiveness, dose–response relationship analyses are widely used in a large spectrum of scientific disciplines (e.g. epidemiology, microbiology, toxicology, environment quality monitoring, vector and pest control, and parasite biology). A prime example of such analyses is the monitoring of resistance to xenobiotics. Since the 1950s, xenobiotics (e.g. insecticides, pesticides, fungicides) have been widely used to control populations of vectors or pests [1]. However, as a counter-result, resistance mechanisms to such substances have been selected in targeted populations, undermining their efficiency [2]. Therefore, establishing and comparing the resistance levels of various populations to various xenobiotics is at the core of the World Health Organization (WHO) recommendations in order to define/adjust vector control strategies. These resistance assessments are usually done by exposing batches of individuals (adults or larvae) to varying doses of the xenobiotic to assess their responses (e.g. mortality or knockdown effect).

Despite bioassays being used in many fields, there has been a substantial lack of easily accessible statistical infrastructure for their analysis. In 2013, we developed an R script with a robust statistical background to describe and compare dose–mortality relationships [3], and it has since been used in several similar studies (e.g. [4,5,6,7,8,9,10]). In order to make it more user-friendly and easily accessible to the scientific community, we have now developed it into an R package called BioRssay, with more flexibility and an improved presentation of the results.

Workflow

BioRssay is a comprehensive compilation of scripts in R language [11] designed to analyze dose–response relationships (or exposure–response: mortality, knockdown effect, etc.) from bioassays of one or more strains, lines, tissues, cells, etc. [hereafter referred to as “population(s)”]. This package provides a complete analytic workflow, from data quality assessment to statistical analyses and data visualization, as follows (see also Fig. 1).

a.
In the first step, the base mortality in the controls (i.e. a sample of the population not exposed to the tested stimulus) is taken into account to control for the mortality (or effect) associated with the experiment itself, regardless of the exposure (this can be critical when the exposure requires a long period). Data are adjusted following Abbott's formula using a maximum likelihood approach (optim function, method “L-BFGS-B”, stats R package [11]), to estimate correction factors (Eq. 1), while taking into account heterogeneity in the mortality rates between replicates (see example in Table 1; NB: users should be cautious in their use of the data if the heterogeneity is high) [12]:
$$1- \frac{{m}_{T}}{{m}_{C}},$$
(1)

where m_T and m_C are the survival rates in the treatments and the controls, respectively. By default, the correction is applied if mortality in the control is higher than 5%, but any threshold can be specified.
b.
Then, for each population independently, a generalized linear model (GLM) is fitted to the probit-transformed mortality rates, with the quasi-binomial family to account for possible overdispersion (Eq. 2); if Abbott’s correction has been applied, the model is fitted to the adjusted mortality rates (see above, point a).
$$E\left({\Phi }^{-1}(P(Y=1|X)\right)=\beta X,$$
(2)

where ${\Phi }^{-1}$ is the probit function, Y is the mortality rates, X is a design matrix, and $\beta$ the vector of effects of the covariables included in the design matrix (intercept and dose). The estimated effects (i.e. slope and intercept, with their standard errors) are reported. The slope and the intercept are used to test the linearity of the log-dose response using a Chi-square test of homogeneity (Eq. 2) between the model predictions and the probit-transformed data; significant deviations from linearity may for example reflect mixed populations, or a threshold effect.
$${\chi }^{2}=\sum \frac{{\left(\mathrm{observed}-\mathrm{predicted}\right)}^{2}}{\mathrm{predicted}},$$
(3)
c.
The parameters estimated using Eq. 2 are then used to compute lethal doses (LD, also lethal concentrations, LC) with their corresponding 95% confidence intervals (CI), following the approach developed by Johnson et al. [13] from Finney [14], which allows us to take the heterogeneity of the data (h) into account, with larger h leading to larger CI. Heterogeneity-related parameters (h and g in Table 2) are reported; according to Finney's recommendation: “With almost all good sets of data, g will be substantially smaller than 1.0 and seldom greater than 0.4.” By default, LD are computed at 25%, 50%, and 95% of mortality; alternatively, any level of LD and CI can be specified by the user. Resistance ratios (RR), with their 95% CI, are calculated according to Robertson and Preisler [15]. They measure the magnitude of the difference in dose–responses of two populations when exposed to the same stimulus. For a given LD, the LD of a focal population is divided by the LD of the population with the lowest one (usually a susceptible reference, Table 2).
d.
When several populations (or strains, lines, etc.) are exposed to the same stimulus, we offer the possibility to test whether they present significant differences in their responses (slope and/or magnitude). We use a likelihood ratio test (LRT) to compare a null model, assuming that all the data come from the same population (i.e. where the dose is the only covariable in the design matrix, X in Eq. 2), with the complete model where the dose, the population effect, and their interaction are included in the design matrix X (Eq. 2). As in point b, the effects are fitted with the glm function in R, with the quasi-binomial family to account for overdispersion.

With only two populations (e.g. comparison between an unknown population and a reference), a significant difference between the two models indicates that these populations differ in the magnitude of the response and/or slope. With more than two populations, it means that at least two populations show differing responses in the magnitude of the response and/or slope. In this case, pairwise comparisons (post hoc LRT) are implemented to identify the differing populations, testing for statistical significance of the difference between all population pairs; the Holm–Bonferroni method is then used to control for the family-wise error inherent in multiple testing [16].

Note that only populations that pass the linearity test should be compared (see above point b): If at least one of the populations failed the linearity test, the validity of comparing the slope or magnitude of their dose-mortality response is, at best, highly questionable (not recommended). These populations should be presented, but only as such (accordingly, no regression is fitted to these populations by BioRssay, see point e). Nonetheless, it is still valid to compare the slope and magnitude between the remaining populations, i.e. those that show a linear log-dose response.
e.
Customizable (confidence levels, colors, symbols, etc.) plots of the probit-transformed regressions are drawn in the last step (Fig. 2a, b). By default, if the probit-transformed response significantly deviates from linear regression, the data are connected by segments; the (invalid) regression and associated CI are not plotted (e.g. Fig. 2b, DZOU population, red squares). Confidence intervals around the regressions can be removed or added, and users can specify any levels of CI.

Table 1 A subset of bioassays conducted on three strains of Anopheles gambiae mosquitoes exposed to increasing doses of temephos insecticide (data from [5])

Full size table

Table 2 Parameters estimated from the probit-transformed data

Full size table

Benchmarking BioRssay

What makes BioRssay stand out for dose–response analyses and how can it complement other similar R packages and functions? We discuss three R packages that provide dose–response analysis: drc [17], protti [18] and lava [19].

The core of the drc package is to provide the user with a comprehensive set of model fitting followed by dose–response analysis [17], while BioRssay has been designed primarily to facilitate the analysis of bioassay data in a ready-to-go approach. As such, BioRssay is optimized to compare dose–response between many different populations exposed to the same stimulus. Relevant quantities, as LD and RR, along with their CI, are automatically generated, and a function (model.signif) is dedicated to the comparison of dose–responses. Producing these results would require combining the output of several functions from drc packages (e.g. drm, ED, ED.comp, comped, comParm, predict.drc, anova.drc). Another main difference stems from the use of a quasi-binomial for model fitting in BioRssay to account for possible data overdispersion, which is not possible in drc, despite the variety of models implemented. Finally, BioRssay also includes data preprocessing by implementing Abbott’s correction for mortality in controls before fitting the dose–response. The drm function in drc is much more thorough in terms of error model selection, allowing a dose–response analysis using for example binomial, Poisson, four- and five-parameter log-logistic models, and Weibull models. The mselect function of drc is handy in identifying the correct dose–response model for populations that fail the linearity test.
The function fit_drc_4p from the package protti [18] is a wrapper function for the drc package’s drm function implementing a four-parameter log-logistic model.
The function PD in the lava package [19] allows for dose–response calculation for binomial regression models.

Package accessibility and concluding remarks

The package is available freely on GitHub at https://github.com/milesilab/BioRssay (https://doi.org/10.5281/zenodo.5172072). We also provide a comprehensive workflow and tutorials, from data preparation to the interpretation of the results, in a dedicated GitHub page https://milesilab.github.io/BioRssay/. Further, the package carries example data sets for self-tests, and more data can be downloaded at https://github.com/milesilab/DATA. The code is maintained by P. Karunarathne (piyalkarumail@yahoo.com). For suggestions or further development please contact the corresponding authors, P. Labbé and P. Milesi.

The BioRssay package can be installed in the R environment using the following code:

Availability of data and materials

Accessible at https://github.com/milesilab/DATA; Example 1 and 2 (Fig. 2 and Table 2) can be reproduced using the following dataset included in the package:

Abbreviations

LD:: Lethal dose
WHO:: World Health Organization
CI:: Confidence interval
LRT:: Likelihood ratio tests
RR:: Resistance ratio

References

Carson R. Silent Spring. Houghton Mifflin Company; 1962.
Labbé P, David J-P, Alout H, Milesi P, Djogbénou L, Pasteur N, et al. 14 - Evolution of Resistance to Insecticide in Disease Vectors. In: Tibayrenc MBT-G and E of ID Second E, editor. London: Elsevier; 2017. p. 313–39
Milesi P, Pocquet N, Labbé P. BioRssay: a R script for bioassay analyses. 2013. https://drive.google.com/file/d/1qMNC2EQlxBnOunuaauta1BCQcLesnrFX/view?usp=sharing
Alout H, Labbé P, Berthomieu A, Makoundou P, Fort P, Pasteur N, et al. High chlorpyrifos resistance in Culex pipiens mosquitoes: strong synergy between resistance genes. Heredity (Edinb). 2016;116:224–31.
Article CAS Google Scholar
Pocquet N, Darriet F, Zumbo B, Milesi P, Thiria J, Bernard V, et al. Insecticide resistance in disease vectors from Mayotte: an opportunity for integrated vector management. Parasit Vectors Springer. 2014;7:1–12.
Article Google Scholar
Badolo A, Sombié A, Pignatelli PM, Sanon A, Yaméogo F, Wangrawa DW, et al. Insecticide resistance levels and mechanisms in Aedes aegypti populations in and around Ouagadougou. Burkina Faso PLoS Negl Trop Dis. 2019;13:e0007439.
Article CAS Google Scholar
Assogba BS, Milesi P, Djogbénou LS, Berthomieu A, Makoundou P, Baba-Moussa LS, et al. The ace-1 locus is amplified in all resistant Anopheles gambiae mosquitoes: fitness consequences of homogeneous and heterogeneous duplications. PLoS Biol. 2016;14:e2000618.
Article Google Scholar
Yaméogo F, Wangrawa DW, Sombié A, Sanon A, Badolo A. Insecticidal activity of essential oils from six aromatic plants against Aedes aegypti, dengue vector from two localities of Ouagadougou. Burkina Faso Arthropod Plant Interact. 2021;1:8.
Google Scholar
Epelboin Y, Wang L, Giai Gianetto Q, Choumet V, Gaborit P, Issaly J, et al. CYP450 core involvement in multiple resistance strains of Aedes aegypti from French Guiana highlighted by proteomics, molecular and biochemical studies. PLoS ONE. 2021;16:e0243992.
Article CAS Google Scholar
Perrier S, Moreau E, Deshayes C, El-Adouzi M, Goven D, Chandre F, et al. Compensatory mechanisms in resistant Anopheles gambiae AcerKis and KdrKis neurons modulate insecticide-based mosquito control. Commun Biol. 2021;4:1–16.
Article Google Scholar
R Core Team. A language and environment for statistical computing, R Foundation for Statistical Computing; 2013. 2020.
Abbott WS. A method of computing the effectiveness of an insecticide. J Econ Entomol College Park. 1925;18:265–7.
Article CAS Google Scholar
Johnson RM, Dahlgren L, Siegfried BD, Ellis MD. Acaricide, fungicide and drug interactions in honey bees (Apis mellifera). PLoS ONE. 2013;8:e54092.
Article CAS Google Scholar
Finney DJ. Probit analysis. Cambridge: Cambridge University Press; 1971.
Google Scholar
Robertson JL, Preisler HK. Pesticide bioassays with arthropods. Boca Raton: CRC; 1992.
Google Scholar
Holm S. A simple sequentially rejective multiple test procedure. Scand J Stat. 1979;65–70:98.
Google Scholar
Ritz C, Baty F, Streibig JC, Gerhard D. Dose-response analysis using R. PLoS ONE. 2015;10:1–13.
Article Google Scholar
Quast J-P, Schuster D. Protti: Bottom-up Proteomics and LiP-MS Quality Control and Data Analysis Tools. 2021.
Holst KK, Budtz-Jørgensen E. Linear latent variable models: The lava-package. Comput Stat. 2013.

Download references

Acknowledgements

We want to thank Jérôme Chopard, Haoues Alout, Mathieu Tiret, Mylène Weill, and Nicole Pasteur for their valuable comments on earlier versions of the script and its outputs. We are indebted to the two anonymous reviewers for their suggestions that helped improve the package. This study was funded by the Agence Nationale de la Recherche, ANR-20-CE34-0007.

Funding

Not applicable. Open access funding provided by Uppsala University.

Author information

Pierrick Labbé and Pascal Milesi contributed equally to the work and are listed in alphabetic order

Authors and Affiliations

Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36, Uppsala, Sweden
Piyal Karunarathne & Pascal Milesi
Institut Pasteur de Nouvelle-Calédonie, URE-Entomologie Médicale, Nouméa, New Caledonia
Nicolas Pocquet
Institut Universitaire de France, 1 Rue Descartes, 75231 Cedex 05, Paris, France
Pierrick Labbé
Institut Des Sciences de L’Evolution de Montpellier (UMR 5554, CNRS-UM-IRD-EPHE), Université de Montpellier, 34095 Cedex 5, Montpellier, France
Pierrick Labbé
SciLifelab, Uppsala, Sweden
Pascal Milesi

Authors

Piyal Karunarathne
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Pocquet
View author publications
You can also search for this author in PubMed Google Scholar
Pierrick Labbé
View author publications
You can also search for this author in PubMed Google Scholar
Pascal Milesi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

PM and PL compiled the methodology and statistical approach; PL, PM, NP, and PK wrote the scripts, and PK compiled the functions and the software package. PK, PL, and PM prepared the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Pierrick Labbé or Pascal Milesi.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Karunarathne, P., Pocquet, N., Labbé, P. et al. BioRssay: an R package for analyses of bioassays and probit graphs. Parasites Vectors 15, 35 (2022). https://doi.org/10.1186/s13071-021-05146-x

Download citation

Received: 15 September 2021
Accepted: 28 December 2021
Published: 24 January 2022
DOI: https://doi.org/10.1186/s13071-021-05146-x

BioRssay: an R package for analyses of bioassays and probit graphs