 Short report
 Open access
 Published:
BioRssay: an R package for analyses of bioassays and probit graphs
Parasites & Vectors volume 15, Article number: 35 (2022)
Abstract
Dose–response relationships reflect the effects of a substance on organisms, and are widely used in broad research areas, from medicine and physiology, to vector control and pest management in agronomy. Furthermore, reporting on the response of organisms to stressors is an essential component of many public policies (e.g. public health, environment), and assessment of xenobiotic responses is an integral part of World Health Organization recommendations. Building upon an R script that we previously made available, and considering its popularity, we have now developed a software package in the R environment, BioRssay, to efficiently analyze dose–response relationships. It has more userfriendly functions and more flexibility, and proposes an easy interpretation of the results. The functions in the BioRssay package are built on robust statistical analyses to compare the dose/exposure–response of various bioassays and effectively visualize them in probitgraphs.
Graphical Abstract
Background
Bioassays aim at evaluating the potency of a compound. They usually consist in measuring the response of a “population” (e.g. organisms, populations, tissues, cells lines, strains, etc.) to increasing doses (or intensities or exposure times) of a stimulus (most of the time a xenobiotic or a chemical) to quantify specific dose–response relationships (also known as exposure–response relationships). Because of their effectiveness, dose–response relationship analyses are widely used in a large spectrum of scientific disciplines (e.g. epidemiology, microbiology, toxicology, environment quality monitoring, vector and pest control, and parasite biology). A prime example of such analyses is the monitoring of resistance to xenobiotics. Since the 1950s, xenobiotics (e.g. insecticides, pesticides, fungicides) have been widely used to control populations of vectors or pests [1]. However, as a counterresult, resistance mechanisms to such substances have been selected in targeted populations, undermining their efficiency [2]. Therefore, establishing and comparing the resistance levels of various populations to various xenobiotics is at the core of the World Health Organization (WHO) recommendations in order to define/adjust vector control strategies. These resistance assessments are usually done by exposing batches of individuals (adults or larvae) to varying doses of the xenobiotic to assess their responses (e.g. mortality or knockdown effect).
Despite bioassays being used in many fields, there has been a substantial lack of easily accessible statistical infrastructure for their analysis. In 2013, we developed an R script with a robust statistical background to describe and compare dose–mortality relationships [3], and it has since been used in several similar studies (e.g. [4,5,6,7,8,9,10]). In order to make it more userfriendly and easily accessible to the scientific community, we have now developed it into an R package called BioRssay, with more flexibility and an improved presentation of the results.
Workflow
BioRssay is a comprehensive compilation of scripts in R language [11] designed to analyze dose–response relationships (or exposure–response: mortality, knockdown effect, etc.) from bioassays of one or more strains, lines, tissues, cells, etc. [hereafter referred to as “population(s)”]. This package provides a complete analytic workflow, from data quality assessment to statistical analyses and data visualization, as follows (see also Fig. 1).

a.
In the first step, the base mortality in the controls (i.e. a sample of the population not exposed to the tested stimulus) is taken into account to control for the mortality (or effect) associated with the experiment itself, regardless of the exposure (this can be critical when the exposure requires a long period). Data are adjusted following Abbott's formula using a maximum likelihood approach (optim function, method “LBFGSB”, stats R package [11]), to estimate correction factors (Eq. 1), while taking into account heterogeneity in the mortality rates between replicates (see example in Table 1; NB: users should be cautious in their use of the data if the heterogeneity is high) [12]:
$$1 \frac{{m}_{T}}{{m}_{C}},$$(1)where m_{T} and m_{C} are the survival rates in the treatments and the controls, respectively. By default, the correction is applied if mortality in the control is higher than 5%, but any threshold can be specified.

b.
Then, for each population independently, a generalized linear model (GLM) is fitted to the probittransformed mortality rates, with the quasibinomial family to account for possible overdispersion (Eq. 2); if Abbott’s correction has been applied, the model is fitted to the adjusted mortality rates (see above, point a).
$$E\left({\Phi }^{1}(P(Y=1X)\right)=\beta X,$$(2)where \({\Phi }^{1}\) is the probit function, Y is the mortality rates, X is a design matrix, and \(\beta\) the vector of effects of the covariables included in the design matrix (intercept and dose). The estimated effects (i.e. slope and intercept, with their standard errors) are reported. The slope and the intercept are used to test the linearity of the logdose response using a Chisquare test of homogeneity (Eq. 2) between the model predictions and the probittransformed data; significant deviations from linearity may for example reflect mixed populations, or a threshold effect.
$${\chi }^{2}=\sum \frac{{\left(\mathrm{observed}\mathrm{predicted}\right)}^{2}}{\mathrm{predicted}},$$(3) 
c.
The parameters estimated using Eq. 2 are then used to compute lethal doses (LD, also lethal concentrations, LC) with their corresponding 95% confidence intervals (CI), following the approach developed by Johnson et al. [13] from Finney [14], which allows us to take the heterogeneity of the data (h) into account, with larger h leading to larger CI. Heterogeneityrelated parameters (h and g in Table 2) are reported; according to Finney's recommendation: “With almost all good sets of data, g will be substantially smaller than 1.0 and seldom greater than 0.4.” By default, LD are computed at 25%, 50%, and 95% of mortality; alternatively, any level of LD and CI can be specified by the user. Resistance ratios (RR), with their 95% CI, are calculated according to Robertson and Preisler [15]. They measure the magnitude of the difference in dose–responses of two populations when exposed to the same stimulus. For a given LD, the LD of a focal population is divided by the LD of the population with the lowest one (usually a susceptible reference, Table 2).

d.
When several populations (or strains, lines, etc.) are exposed to the same stimulus, we offer the possibility to test whether they present significant differences in their responses (slope and/or magnitude). We use a likelihood ratio test (LRT) to compare a null model, assuming that all the data come from the same population (i.e. where the dose is the only covariable in the design matrix, X in Eq. 2), with the complete model where the dose, the population effect, and their interaction are included in the design matrix X (Eq. 2). As in point b, the effects are fitted with the glm function in R, with the quasibinomial family to account for overdispersion.
With only two populations (e.g. comparison between an unknown population and a reference), a significant difference between the two models indicates that these populations differ in the magnitude of the response and/or slope. With more than two populations, it means that at least two populations show differing responses in the magnitude of the response and/or slope. In this case, pairwise comparisons (post hoc LRT) are implemented to identify the differing populations, testing for statistical significance of the difference between all population pairs; the Holm–Bonferroni method is then used to control for the familywise error inherent in multiple testing [16].
Note that only populations that pass the linearity test should be compared (see above point b): If at least one of the populations failed the linearity test, the validity of comparing the slope or magnitude of their dosemortality response is, at best, highly questionable (not recommended). These populations should be presented, but only as such (accordingly, no regression is fitted to these populations by BioRssay, see point e). Nonetheless, it is still valid to compare the slope and magnitude between the remaining populations, i.e. those that show a linear logdose response.

e.
Customizable (confidence levels, colors, symbols, etc.) plots of the probittransformed regressions are drawn in the last step (Fig. 2a, b). By default, if the probittransformed response significantly deviates from linear regression, the data are connected by segments; the (invalid) regression and associated CI are not plotted (e.g. Fig. 2b, DZOU population, red squares). Confidence intervals around the regressions can be removed or added, and users can specify any levels of CI.
Benchmarking BioRssay
What makes BioRssay stand out for dose–response analyses and how can it complement other similar R packages and functions? We discuss three R packages that provide dose–response analysis: drc [17], protti [18] and lava [19].

The core of the drc package is to provide the user with a comprehensive set of model fitting followed by dose–response analysis [17], while BioRssay has been designed primarily to facilitate the analysis of bioassay data in a readytogo approach. As such, BioRssay is optimized to compare dose–response between many different populations exposed to the same stimulus. Relevant quantities, as LD and RR, along with their CI, are automatically generated, and a function (model.signif) is dedicated to the comparison of dose–responses. Producing these results would require combining the output of several functions from drc packages (e.g. drm, ED, ED.comp, comped, comParm, predict.drc, anova.drc). Another main difference stems from the use of a quasibinomial for model fitting in BioRssay to account for possible data overdispersion, which is not possible in drc, despite the variety of models implemented. Finally, BioRssay also includes data preprocessing by implementing Abbott’s correction for mortality in controls before fitting the dose–response. The drm function in drc is much more thorough in terms of error model selection, allowing a dose–response analysis using for example binomial, Poisson, four and fiveparameter loglogistic models, and Weibull models. The mselect function of drc is handy in identifying the correct dose–response model for populations that fail the linearity test.

The function fit_drc_4p from the package protti [18] is a wrapper function for the drc package’s drm function implementing a fourparameter loglogistic model.

The function PD in the lava package [19] allows for dose–response calculation for binomial regression models.
Package accessibility and concluding remarks
The package is available freely on GitHub at https://github.com/milesilab/BioRssay (https://doi.org/10.5281/zenodo.5172072). We also provide a comprehensive workflow and tutorials, from data preparation to the interpretation of the results, in a dedicated GitHub page https://milesilab.github.io/BioRssay/. Further, the package carries example data sets for selftests, and more data can be downloaded at https://github.com/milesilab/DATA. The code is maintained by P. Karunarathne (piyalkarumail@yahoo.com). For suggestions or further development please contact the corresponding authors, P. Labbé and P. Milesi.
The BioRssay package can be installed in the R environment using the following code:
Availability of data and materials
Accessible at https://github.com/milesilab/DATA; Example 1 and 2 (Fig. 2 and Table 2) can be reproduced using the following dataset included in the package:
Abbreviations
 LD:

Lethal dose
 WHO:

World Health Organization
 CI:

Confidence interval
 LRT:

Likelihood ratio tests
 RR:

Resistance ratio
References
Carson R. Silent Spring. Houghton Mifflin Company; 1962.
Labbé P, David JP, Alout H, Milesi P, Djogbénou L, Pasteur N, et al. 14  Evolution of Resistance to Insecticide in Disease Vectors. In: Tibayrenc MBTG and E of ID Second E, editor. London: Elsevier; 2017. p. 313–39
Milesi P, Pocquet N, Labbé P. BioRssay: a R script for bioassay analyses. 2013. https://drive.google.com/file/d/1qMNC2EQlxBnOunuaauta1BCQcLesnrFX/view?usp=sharing
Alout H, Labbé P, Berthomieu A, Makoundou P, Fort P, Pasteur N, et al. High chlorpyrifos resistance in Culex pipiens mosquitoes: strong synergy between resistance genes. Heredity (Edinb). 2016;116:224–31.
Pocquet N, Darriet F, Zumbo B, Milesi P, Thiria J, Bernard V, et al. Insecticide resistance in disease vectors from Mayotte: an opportunity for integrated vector management. Parasit Vectors Springer. 2014;7:1–12.
Badolo A, Sombié A, Pignatelli PM, Sanon A, Yaméogo F, Wangrawa DW, et al. Insecticide resistance levels and mechanisms in Aedes aegypti populations in and around Ouagadougou. Burkina Faso PLoS Negl Trop Dis. 2019;13:e0007439.
Assogba BS, Milesi P, Djogbénou LS, Berthomieu A, Makoundou P, BabaMoussa LS, et al. The ace1 locus is amplified in all resistant Anopheles gambiae mosquitoes: fitness consequences of homogeneous and heterogeneous duplications. PLoS Biol. 2016;14:e2000618.
Yaméogo F, Wangrawa DW, Sombié A, Sanon A, Badolo A. Insecticidal activity of essential oils from six aromatic plants against Aedes aegypti, dengue vector from two localities of Ouagadougou. Burkina Faso Arthropod Plant Interact. 2021;1:8.
Epelboin Y, Wang L, Giai Gianetto Q, Choumet V, Gaborit P, Issaly J, et al. CYP450 core involvement in multiple resistance strains of Aedes aegypti from French Guiana highlighted by proteomics, molecular and biochemical studies. PLoS ONE. 2021;16:e0243992.
Perrier S, Moreau E, Deshayes C, ElAdouzi M, Goven D, Chandre F, et al. Compensatory mechanisms in resistant Anopheles gambiae AcerKis and KdrKis neurons modulate insecticidebased mosquito control. Commun Biol. 2021;4:1–16.
R Core Team. A language and environment for statistical computing, R Foundation for Statistical Computing; 2013. 2020.
Abbott WS. A method of computing the effectiveness of an insecticide. J Econ Entomol College Park. 1925;18:265–7.
Johnson RM, Dahlgren L, Siegfried BD, Ellis MD. Acaricide, fungicide and drug interactions in honey bees (Apis mellifera). PLoS ONE. 2013;8:e54092.
Finney DJ. Probit analysis. Cambridge: Cambridge University Press; 1971.
Robertson JL, Preisler HK. Pesticide bioassays with arthropods. Boca Raton: CRC; 1992.
Holm S. A simple sequentially rejective multiple test procedure. Scand J Stat. 1979;65–70:98.
Ritz C, Baty F, Streibig JC, Gerhard D. Doseresponse analysis using R. PLoS ONE. 2015;10:1–13.
Quast JP, Schuster D. Protti: Bottomup Proteomics and LiPMS Quality Control and Data Analysis Tools. 2021.
Holst KK, BudtzJørgensen E. Linear latent variable models: The lavapackage. Comput Stat. 2013.
Acknowledgements
We want to thank Jérôme Chopard, Haoues Alout, Mathieu Tiret, Mylène Weill, and Nicole Pasteur for their valuable comments on earlier versions of the script and its outputs. We are indebted to the two anonymous reviewers for their suggestions that helped improve the package. This study was funded by the Agence Nationale de la Recherche, ANR20CE340007.
Funding
Not applicable. Open access funding provided by Uppsala University.
Author information
Authors and Affiliations
Contributions
PM and PL compiled the methodology and statistical approach; PL, PM, NP, and PK wrote the scripts, and PK compiled the functions and the software package. PK, PL, and PM prepared the manuscript. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Karunarathne, P., Pocquet, N., Labbé, P. et al. BioRssay: an R package for analyses of bioassays and probit graphs. Parasites Vectors 15, 35 (2022). https://doi.org/10.1186/s1307102105146x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s1307102105146x