Figures
Abstract
Visceral leishmaniasis (VL) is a neglected tropical disease that is globally distributed and has the potential to cause very serious illness. Prior literature highlights the emergence and spread of VL is influenced by multiple factors, such as socioeconomic status, sanitation levels or animal and human reservoirs. The study aimed to retrospectively investigate the presence and infectiousness of VL in Rio Grande do Norte (RN), Brazil between 2007 and 2020. We applied a hierarchical Bayesian approach to estimate municipality-specific relative risk of VL across space and time. The results show evidence that lower socioeconomic status is connected to higher municipality-specific VL risk. Overall, estimates reveal spatially heterogeneous VL risks in RN, with a high probability that VL risk for municipalities within the West Potiguar mesoregion are more than double the expected VL risk. Additionally, given the data available, results indicate there is a high probability of increasing VL risk in the municipalities of Natal, Patu and Pau dos Ferros. These findings demonstrate opportunities for municipality-specific public health policy interventions and warrant future research on identifying epidemiological drivers in at-risk regions.
Author summary
As a neglected tropical disease (NTD), Leishmaniasis contributes to poor health outcomes, suffering, and death of vulnerable populations. Leishmania infantum or Leishmania donovani is transmitted from animals to humans through the sandfly vector and can be fatal in 5% of the cases even with treatment. Visceral leishmaniasis is a severe form of Leishmaniasis. The purpose of this study was to estimate regions of high Visceral leishmaniasis risk in Rio Grande do Norte, Brazil between 2007 and 2020 to help inform public health initiatives and potential health policy interventions for this disease. Visceral leishmaniasis is a reportable disease and therefore VL cases are documented to the Brazilian Ministry of Health database through passive surveillance. Passive surveillance data can suffer from missing information; individuals may be asymptomatic, or may not seek medical care allowing for diagnosis. When modeling disease risk from passive surveillance data, assuming there is an absence of disease in regions without documented cases can underestimate disease risk. In this paper we estimate disease risk and identify areas with high-risk and increasing infection trends over time while considering data characteristics such as missing information and high correlation of risk factors.
Citation: Hernandez HG, Brown GD, Lima ID, Coutinho JF, Wilson ME, Nascimento ELT, et al. (2023) Hierarchical spatiotemporal modeling of human visceral leishmaniasis in Rio Grande do Norte, Brazil. PLoS Negl Trop Dis 17(4): e0011206. https://doi.org/10.1371/journal.pntd.0011206
Editor: Marilia Sá Carvalho, Fundacao Oswaldo Cruz, BRAZIL
Received: September 5, 2022; Accepted: March 1, 2023; Published: April 3, 2023
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Data Availability: The data and code underlying the results presented in the study are available on ResearchGate (https://www.researchgate.net/project/Infectious-Disease-Modeling-Leishmania-Infection).
Funding: HGH, GDB, JFC, SMBJ, CAP, and JJO were supported under Fogarty International Center of the National Institutes of Health, Award Number R01TW010500 (https://www.fic.nih.gov/). MEW was supported under the Department of Veterans’ Affairs, Department of Research and Development, Award Number I01BX000536 (https://www.research.va.gov/). IDL and ELTN received no specific funding for this work. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Human visceral leishmaniasis (VL) is endemic in Brazil [1–12] and is recognized as a neglected tropical disease by the World Health Organization [13]. In Brazil, VL etiology stems from the protozoan Leishmania infantum, where the domestic dog serves as a sentinel host with the phlebotomine sand fly as the typical vector [14]. In the northeast region of Brazil, VL has historically been viewed as endemic in rural regions [15]. In recent decades the geographical epidemiology of VL has diversified with increasing VL cases occurring in more urban and peri-urban regions due to urbanization across Brazil [16].
The dramatic increase in VL incidence across Northeastern Brazil has led to more research focus on this region. Costa et al. [17] and Jeronimo et al. [18] highlight VL spread to urban areas within the northeastern region of Brazil in the last two decades of the 20th century. Silva et al. [6] reported increasing VL cases from 2012 to 2017 in major cities of the Paraíba state in Brazil, Almeida et al. [19] observed spatial expansion of VL from 2007 to 2017 in Fortaleza, a coastal city within the Ceará state in Brazil, and da Silva Lima et al. [20] noted increasing VL fatality in the last decade in Piauí, Brazil. The state of Alagoas, Brazil also experienced geographic spread of VL from 2007 to 2018, primarily in rural areas [7]. Such trends warrant government awareness and require results from improved statistical modeling to better understand, and inform, the dynamics of VL transmission patterns to guide public health policy.
Several studies have investigated the risk factors for VL susceptibility in Northeastern Brazil by considering socioeconomic, environmental, infrastructure, and demographic information. With the strong connection between social class and VL incidence [21, 22] a common source of covariate information to model socioeconomic constructs in previous work [2, 3, 5] has been a Social Vulnerability Index [23] or the Human Development Index [24]. Vulnerability indices are a blend of several quantitative variables measuring similar constructs. Social vulnerability indices from Costa et al. [23] subjectively weigh multiple Census covariates to construct three vulnerability indices measuring infrastructure, human capital and income. In contrast, the Human Development Index created by the United Nations Development Programme [24], is a univariate index of life expectancy, education, and per capita income. While vulnerability indices may facilitate interpretability for individual constructs, the correlation between the variables that make up these indices is not mitigated. Using correlated variables as covariates in a statistical model, as seen in [2, 3, 5], can lead to issues of multicollinearity, like a loss of precision of parameter estimates which can affect hypothesis test results and model predictions, from which conclusions would be unreliable.
Studies like [1] that do not use vulnerability indices and instead model the effects of correlated variables on the outcome of interest should consider potential issues of multicollinearity. In their study, Cota et al. [1] use generalized additive mixed models (GAMM) to verify the univariate association between exploratory variables and the outcome of interest. They conduct multiple hypothesis tests to identify correlated explanatory variables that can contribute to issues of multicollinearity. When bivariate associations between exploratory variables exceeded 85%, Cota et al. [1] keep the variable with the lowest p-value from the univariate GAMM models. Aside from the inherent presence of ecological fallacy in ecological studies, we note that it may be difficult to control for all correlations between explanatory variables in the statistical model. In this study we aim to highlight a method through which correlation of explanatory variables can be mitigated. In the present study, we address issues of multicollinearity through a factor analysis which results in latent variables for which correlation is mitigated.
Additionally, previous studies [3, 4, 8] analyze incidence of VL across space, using Moran’s I, and over time via segmented or joinpoint regression. Exploratory spatial techniques like Moran’s I can help quantify the presence of spatial correlation. However, unlike regression approaches, these exploratory techniques do not explain or help to predict how the outcome of interest varies between regions based on observed risk factors. Techniques like segmented or joinpoint regression can help explain changes in the outcome based on observed risk characteristics. Still, temporal regression techniques fit separately from spatial models ignore important correlation information between and within regions that can better inform model parameter estimates and predictions. Therefore, in this study, we jointly model spatial and temporal correlations.
Furthermore, utilizing these statistical methods within a classical framework in the presence of missing information, can underestimate risk and produce incorrect spatio-temporal correlation as they require unrealistic assumptions that missing information imply absence of disease. In surveillance data, areas with no documentation of disease could indicate disease absence in that region. This phenomena could also arise from a lack of reporting, especially if cases are present in surrounding areas or are documented later in time. This issue may be addressed within the classical modeling framework were we first could predict missing data via an imputation model and then fit the desired model for inference. However, in this two-step approach the spatial dependence observed and fitted in the desired model would not be used to inform the missing data imputation. Therefore, our modeling approach within a Bayesian framework allows us to jointly model the spatial dependence in the data and address the missing characteristics in the data. The advantage of jointly modeling these two components is that we use spatial information when impute missing values via same model that is used for statistical inference.
The purpose of this manuscript is to evaluate spatio-temporal risk of VL cases between 2007 and 2020 in RN. The specific contributions are two-fold: 1) addressing issues of multicollinearity between observed Census data through Exploratory Factor Analysis (EFA) and 2) jointly modeling observed incidence of VL and imputing missing data. The second contribution is addressed by fitting our model within a Bayesian framework through which imputation efforts can be informed by spatial and temporal information.
Materials and methods
Ethics statement
The study was approved by the Ethical Review Board of the Federal University of Rio Grande do Norte and by CONEP (CAAE 83319618.5.0000.5537).
Study area
The study area is a Northeastern state of Brazil named Rio Grande do Norte (RN). This region is centrally located at 14° S latitude and 51° W longitude with a total area of about 52, 796 km2. Rio Grande do Norte is home to approximately 3.5 million individuals across 167 municipalities.
The RN state can be divided into four meso-regions (and the number of municipalities within each) which are West Potiguar (62), Central Potiguar (37), Agreste Potiguar (43), and East Potiguar (25). The meso regions and the municipalities represented within each are displayed in Fig 1. Along with reporting municipality-specific findings, statistical results are summarized for these meso-regions.
Location and Mesoregions of Rio Grande do Norte. The image on the left highlights the northeastern location of Rio Grande do Norte (in red) within Brazil. The image on the right shows the four Mesoregions in Rio Grande do Norte and municipalities within each region. This figure was created using the geobr package in R [25].
Visceral leishmaniasis data
Visceral leishmaniasis cases in RN between 2007 and 2020 were obtained from passive surveillance conducted by the state of Rio Grande do Norte Health Surveillance Department and the Brazilian Ministry of Health (DATASUS; https://datasus.saude.gov.br/acesso-a-informacao/doencas-e-agravos-de-notificacao-de-2007-em-diante-sinan/). These government health agencies fund VL treatment for those diagnosed with the disease and reported by state agencies. The data analyzed in this study includes patients whose disease status was formally diagnosed via ELISA and/or parasitological confirmation of VL. The data available included the municipality where infection was suspected to have occurred along with the municipality where VL cases were diagnosed. If the municipality of infection was unknown (1042 patients) or municipality of infection was outside of RN (13 patients), the municipality of diagnosis within RN was used as the spatial location in the statistical model. The year attached to each case was the year the Brazilian Ministry of Health was notified of the diagnosed case.
A total of 2,036 VL cases were documented over the fourteen-year study period within 120 of the 167 municipalities in Rio Grande do Norte. Fig 2 shows the yearly rates of VL per 100,000 within each municipality, and Fig 3 features the average rates of VL per 100,000 within each municipality between 2007 and 2020. These figures display the pattern of VL cases across space and over time based on the diagnosed VL cases during the study period. Many municipalities in the West Potiguar mesoregion and some in the East Potiguar mesoregion were observed with large population-standardized VL rates compared to other regions in RN. These municipalities include some of the most populous areas in RN such as Natal, Mossorro, Parnamirim, São Gonçalo do Amarante, Açu, and Extremoz.
Spatio-temporal distribution of human visceral leishmaniasis rates per 100,000 between 2007 and 2020. This figure was created using the geobr package in R [25].
Distribution of average human visceral leishmaniasis rates per 100,000 in Rio Grande do Norte between 2007 and 2020. This figure was created using the geobr package in R [25].
Spatial covariates
Our first research aim was to evaluate VL risk across space and over time in RN. Data representing socioeconomic, environmental and demographic constructs were collected for each municipality from Brazil’s 2010 Census data to quantify regional VL risk throughout RN. Explicit definitions of data collected for the Brazilian Federal Census [26] and the methodology of data collection [27] are provided by the Brazilian Institute of Geography and Statistics.
Socioeconomic status of each municipality was quantified by two covariates: the proportion of economically active individuals and a nominal income distribution (for residents above the age of ten). Economically active status was defined for an individual who was employed during the week of data collection [26]. The eight income categories provided by the Census describe proportion of individuals whose monthly income fell within one of the following nominal classes: no income, up 1/2, 1/2 to 1, 1 to 2, 2 to 5, 5 to 10, 10 to 20, or more than 20 times the monthly minimum wage for 2010 which was five hundred and ten reals (510.00 BRL). The no income category included individuals who received only benefits as their income. The nominal income categories, and distribution of percentage of residents within each municipality that fall within those categories, are displayed in Fig 4. Income brackets were combined to provide a three-tier economic distribution within each municipality to describe those with low income (0 to 1 times the minimum income), middle income (1 to 5 times the minimum income), and upper income (5 or more times the minimum income).
Covariates characterizing municipality-specific environmental factors included land development (urban or rural), livestock production and level of sanitation of homes (Figs 5 and 6). Livestock production was comprised of large livestock (cattle, horse), midsize livestock (swine, goat, sheep) and small livestock (poultry). Census livestock data at a municipality level informs on the percent of each livestock type that every municipality produces relative to the RN total. Livestock was included to incorporate possible animal reservoirs which may influence the distribution and incidence of VL. Numerous mammals such as chickens [28, 29], swine [30], horses [31, 32] and cattle [10] have been documented as blood sources for sandflies in Brazil and thus serve as possible reservoirs. Livestock production for buffalo and quail were also available but were omitted from the analysis due to a lack of spatial heterogeneity. Households with adequate forms of sanitation had water supplied by the general water distribution network, disposal of sanitary sewage by the general sewage system (or septic tank) and direct or indirect garbage collection. Households missing at least one form of sanitation were considered to have a semi-inadequate sanitation status, while households missing all forms of sanitation were labeled as having inadequate sanitation. Covariates measuring the age distributions for each municipality were also included. Age distribution was quantified using the following age brackets: 0–5 yrs, 6–14 yrs, 15–24 yrs, 25–39 yrs, 40–59 yrs, and 60 years or more (Fig 7). Age distribution was included as a covariate given Brazil’s changing demographics of VL incidence increasing in older adults [15, 33].
Distribution of percent of residents that were reported as economically active and illiterate within each municipality as captured by the 2010 Brazilian Federal Census. Additionally shown in this figure is the distribution of percent of homes with inadequate or semi-adequate sanitation statues and the percent of homes that were located in rural or urban settings within each municipality.
Distribution of percentages of each animal. Two municipalities not plotted in this figure, Mossoro and Parnamirim, had the largest poultry production of all municipalities. The percent of poultry production for both was approximately 16%.
The 2010 Brazilian Federal Census age data was quantified as the percent of individuals within each municipality that fell in each age bracket. The figure above shows the distribution of percentages of residents falling within each age bracket of all RN municipalities.
Statistical methods
In this manuscript, we first define the likelihood structure assumed for the observed data from which municipality-specific risk of VL across space and time are estimated. We discuss assumptions on missing data and how our modeling framework works with missing data. Next, we review several modeling structures we considered to account for spatial and temporal heterogeneity across RN. Then, we use an EFA to create interpretable and uncorrelated factor scores that result from common sources of variability in the highly correlated Census data. The resulting EFA factor scores are used as covariates in the model to address the risk of multicollinearity within the statistical model. We justify the need for covariates in the model by fitting models with and without covariate information and comparing model fit by the Deviance Information Criteria (DIC) [34]. The final model combines the EFA results with the best fitting spatial-temporal structure. This resultant model is fit within a hierarchical Bayesian framework where disease risk is estimated in areas where incidence of VL were observed. In areas where incidence of VL was missing, disease risk is predicted based on the observed data. Integrated Nested Laplace Approximation (INLA) methodology was used to obtain approximate Bayesian model inference with this methodology implemented through the R-INLA software [34, 35]. The prior parameters and hyperparameters specification for each model fitted used the default R-INLA specifications.
General structure of the hierarchical Bayesian model.
The study area and period covered fourteen years across 167 municipalities in RN. We define i and t to be the municipality and year indices such that i ∈ {1, 2, …, 167} and t ∈ {1, 2, …, 14}. The Poisson likelihood in Eq 1 was assumed for the visceral leishmaniasis cases in RN in which Yit and ηit represent the number of VL cases and relative risk (RR) for the ith municipality and tth year, respectively [36].
(1)
The focus of this study is to model disease risk by considering municipality risk factor heterogeneity and spatio-temporal patterns as shown in Eq 2. The disease risk parameter (ηit), is modeled as a function (f) of risk factors (xi), their effect on disease risk (β), and a function (g) of a vector of random effects (θit) capturing spatio-temporal patterns. The functional form of functions f and g are discussed in future sections.
(2)
Furthermore, Eit represents the expected number of cases for the ith municipality and tth year in the event the ith municipality’s risk of disease is unaffected spatially or temporally. This yearly area-specific expected quantity can be calculated as a weighted average of the prevalence of the disease (p) and the area’s population that year (nit). The yearly population for each municipality is estimated by the Brazilian Institute of Geography and Statistics (IBGE) while the prevalence of VL can be estimated as the ratio of total VL cases in RN and the overall RN population total. The yearly prevalence estimate would require the observation of all VL cases in RN each year. However, as shown in Figs 2 and 3, VL cases were not documented for all municipalities in RN each year. We avoid making the restrictive assumption that missing information implies absence of disease, as there could be individuals infected with VL that went undiagnosed during the study period.
Therefore, we establish a baseline prevalence estimate using the complete data. Let be the set of location/time indices with observed data such that Yit and nit for
correspond to the number of observed VL cases and the population estimates for municipalities with observed VL cases. Based on these observed values, we can calculate a naive estimate of VL prevalence (
) in RN as a ratio of the total observed VL cases and the total population estimates of municipalities with observed cases. We can also obtain a naive estimate of the yearly expected number of disease cases for location i at year t (Eit). This expected measure can be calculated as a product of the naive prevalence estimate weighted by the location population estimate that year (Eq 3).
(3)
The hierarchical Bayesian model was adopted to allow for flexible modeling of area-specific relative risk in which a linear function of a set of fixed effects and a corresponding set of random effects contribute to the modeling of the risk outcomes. The random effects structure served to explain spatio-temporal patterns of relative risk, whereas the fixed effects serve to explain spatial risk variation potentially driven by municipality-specific risk factors. Furthermore, this modeling framework allows for the prediction of disease risk for areal units with missing data by utilizing the estimated posterior distribution of disease risk parameters. Therefore, given the spatio-temporal nature of VL and many municipalities not reporting VL cases in multiple years, a hierarchical Bayesian model was adopted to estimate and predict visceral leishmaniasis risk in RN between 2007 and 2020 more accurately.
Spatio-temporal structure selection.
Fitting a Bayesian hierarchical model allowed for flexible specifications of the random effects to explain spatio-temporal patterns of VL risk. Shown below are the five (M1–M5) candidate models with fixed effects structure f(xi, β) = β0 and differing random effects structures g(θit) considered.
(M1)
(M2)
(M3)
(M4)
(M5)
Model M1, the base candidate model, was a scaled version of the Besag-York-Mollié (BYM) model that accounts for between and within-municipality spatial variability through a spatial random effect (ϕi) [37]. In the BYM model, the spatial random effect promoting smooth area relative risk estimates is defined as ; the sum of a spatially structured random effect (
) and a unstructured spatial random effect (
). The spatially structured random effect follows a normal prior centered at the average spatial effects for surrounding municipalities and variance weighted by the number of neighboring areas. Therefore, the spatially structured random effect accounts for similarities in disease outcomes between neighboring areas. The unstructured random effect accounts for independent municipality-specific non-spatial heterogeneity and follows a zero-mean normal prior with variance
. A mathematical representation of the priors for the BYM model parameters is specified below where
indexes the municipalities that share boundaries with the ith municipality and
is the number of municipalities that share boundaries with the ith municipality.
The scaled Besag-York-Mollie model (BYM2), discussed in [38], is a reparameterized version of the BYM model where the spatial random effect (ϕi) is a weighted average of the structured and unstructured components. Specifically, where ui and vi are respectively similar to
and
in the BYM model but are standardised to have generalised variance equal to one [34, 35]. In the BYM2 model, τ represents the marginal precision contribution from ui and vi while λ is a mixing parameter representing the percent of marginal variation explained by the spatially structured random component. Scaling of the BYM model allows for a uniform interpretation of the marginal variance across varying spatial structures and enables easier specification of prior distributions for BYM2 model parameters [38]. Default INLA Penalized Complexity (PC) priors for log transformed τ and logit transformed λ hyperparameters were used. PC priors were introduced by [38]. The mathematical expression of PC priors for the BYM2 model is thoroughly outlined in [38].
The second candidate model (M2) adds a global linear time effect (ψ) to explain the yearly trend in VL risk for the state of Rio Grande do Norte between 2007 and 2020. The time effect was given a default normal prior with mean and precision parameter of 0 and 0.001, respectively. Model M3 adds to model M2 by including an area-specific linear time interaction effect (δi), as introduced by [39], to account for the linear difference in risk over time for the ith area relative to the global RN trend. A negative interaction term (δi < 0) indicates the municipality-specific trend that is less steep than the overall RN temporal trend. A positive interaction term (δi > 0) implies a steeper municipality-specific risk trend compared to the overall RN trend. The log transformed area-specific trend was given a log gamma prior with shape parameter of one and rate (or inverse-scale parameter) of 0.00005. The log gamma prior on the log transformed parameter of interest is equivalent to placing a gamma prior with the same shape and rate parameters on the parameter of interest.
The linear effects of time on the log transformed risk of VL, as specified in models M2 and M3, may not be a reasonable assumption. Therefore, models M4 and M5 specified a temporally structured effect using a Gaussian random walk model of order 1 (γt) and an overall unstructured time effect (ωt), respectively, as seen in [40]. The overall time effect had a normal prior centered at zero and a precision hyperparameter that followed a gamma prior with shape parameters of 1 and rate parameter of 0.00005. Model M5 expanded model M4 to include an unstructured space and time interaction parameter (δit) that adjusts for spatio-temporal variation in the data unaccounted for the main space and time effects [40]. Knorr-Held [40] propose other ways in which this interaction can be structured based on various assumptions. The δit parameter was given a normal prior with precision hyperparameter following a gamma prior with default shape and rate parameters of 1 and 0.00005 respectively.
Fixed effects structure.
Risk factors obtained from the 2010 Census jointly account for social, economic and environmental differences between municipalities. Empirical Spearman’s correlation coefficients for social vulnerability indices derived from RN’s 2010 Census data show correlation estimates ranging between 0.33 and 0.55 (S2 Fig). We conduct an EFA to preserve the interpretability of covariate effects on disease risk estimates while mitigating correlation between the Census data. An EFA is a dimension reduction technique that helps uncover underlying relationships between large number of correlated variables [41]. Factor analysis techniques require verification that the observed data is suitably intercorrelated. To assess adequacy of the 2010 IBGE Census data for Exploratory Factor Analysis, Bartlett’s test of Sphericity [42] and Kaiser-Meyer-Olkin (KMO) statistics [43] were jointly considered. In this manuscript, we utilize an EFA using the R package psych [44] to replace the correlated Census information, as seen in S1 Fig, with the resulting factor scores in the statistical model from which we draw inference. The EFA conducted assumed a linear correlation between the Census variables and used a principal factor extraction method to obtain the factor scores. To mitigate correlation between the resulting factor scores and address risk of multicollinearity within the statistical model, a varimax rotation assuming orthogonality between the latent variables was used. Categorical variables describing within-municipality characteristics (i.e. all covariates except livestock variable) provide information regarding 100% of the municipality. Therefore, for a categorical variable, k − 1 categories were included in the factor analysis since the kth category can be described as a linear function of remaining k − 1 categories.
The resulting p factor scores served as the covariates in the model to inform on municipality risk factor heterogeneity. To access adequacy of adding covariates to the spatially and temporally structured candidate models, we additionally fit models M1–M5 with linear combination of the factor scores f(xi, β) = β0 + xiβ. In the covariate structure, xi denotes the 1 × p vector of factor scores for municipality i. For all models fitted with covariates, the fixed effects coefficients (β) were given a normal distribution prior with mean parameter zero and precision of 0.001.
Results
Spatial and temporal model structure
DIC results used to compare candidate models considered are seen in Table 1. Model M1, which assumed no temporal trend on disease risk, had the highest DIC results. Models M2 and M4, which assumed change in risk of VL over time was the same for each municipality, had the next highest DIC results. The remaining two models, Models M3 and M5, had the lowest DIC results. While Model M3 allowed for changes in VL risk over time to vary between municipalities, it assumed changes in VL risk were constant over time, whereas Model M5 allowed for non-constant changes of VL risk each year for each municipality. The added flexibility provided by Model M5 fit the data best indicated as the model with the lowest DIC.
Exploratory factor analysis
Factor analysis tools assume there is a set of latent variables that give rise to the observed set of correlated data. Bartlett’s test using the Census variables resulted in statistically significant test (p − value < 0.001). A KMO statistic of 0.78 was observed for the overall Census variables with all single-item KMOs greater than or equal to 0.69. The joint results for Bartlett’s test of Sphericity and Kaiser-Meyer-Olkin statistics indicate the Census data was adequate for factor analysis.
The number of factors to extract from the EFA analysis were obtained by examining the scree plot which was estimated using parallel analysis. The scree plot suggested extracting three factors representing the latent relationships among the Census variables. Fig 8 shows the predominant relationship (negative or positive) of the Census variables towards each factor. The relationship between Census data and the resulting factors is represented by the EFA factor loadings which are provided in more detail in S1 Table.
Factor 1 was primarily made up of economic, education, and environmental variables. Factor 2’s main contribution was age distribution within municipality, and Factor 3 was predominantly measured by the degree of municipality livestock production relative to the overall production in Rio Grande do Norte. Therefore, in labeling the factors resulting from the EFA analysis, Factor 1 represents the Socioeconomic Status of a municipality, Factor 2 represents the Age Distribution of a municipality and Factor 3 measures Livestock Production of a municipality.
DIC measures for models M1–M5 fitted with the resulting factor scores as covariates can be found in Table 1. Adding covariate information improves model fit for the spatially and temporally structured candidate models based on DIC measures which provides evidence that the covariate information helps to explain variability in disease risk. Model M5 with covariates has the lowest DIC measure of all models fitted. The results for this best fitting model are further dissected in the following sections.
Final model inference
The best fitting model to evaluate risk of disease across Rio Grande do Norte between 2007 and 2020 was a log-linear Poisson hierarchical model (M5). In this model, the log-transformed relative risk structure for municipality i at time t is a function of a linear combination of fixed effects f(xi, β) = (β0 + xiβ) and a linear combination of random effects g(θit) = (ϕi + γt + ωt + δit). In the fixed effects structure, xi = [x1i, x2i, x3i] represents the first, second, and third EFA factor scores for location i while β = [β1, β2, β3] is the vector of coefficients to estimate in addition to the intercept β0 parameter. For the random effects structure, the location-specific spatially structured and unstructured random effects are denoted as ui and vi, respectively. The γt and ωt parameters in the model denote the yearly change in VL risk estimated for all municipalities. Risk trends are allowed to vary each year within a municipality through the δit parameter which increases or decreases the trend each year as necessary based on the observed data.
We estimated posterior means, 95% Highest Posterior Density (HPD) credible intervals, and posterior probabilities which summarize the posterior distributions of parameters of interest. Posterior distributions help quantify our uncertainty about the parameters of interest given the observed data. Positive covariate parameters would result in higher VL risk estimates while negative covariate parameters would lead to lower VL risk estimates.
The visceral leishmaniasis RR estimates after adjusting for the EFA factor score (ηit) were used to answer the first research aim to evaluate disease risk in RN across space and over time. Based on the model structure, a RR equal to one indicates the risk of the average municipality in RN. Relative risks greater than two indicate more than double the risk of VL than what would be expected if disease risk was not spatially or temporally different in RN. A RR more than double what would be expected was deemed to be clinically significant (i.e. high-risk). Fig 9 shows the VL RR across RN over time. Fig 10 displays the exceedance probabilities that the RR estimates surpass a value of two which, by our definition, would be deemed clinically significant. Exceedance probabilities provide strength of evidence towards clinically significant or high-risk regions. In Fig 10, areas with high probability indicate there is strong evidence that these are high-risk regions which could benefit from future interventions and resources to address local disease prevalence. We also include S3 Fig to show exceedance probabilities of the RR estimates greater than one. This additional figure highlights municipalities that, given the data, are likely to have risk of VL greater than what would be expected if disease risk was not spatially or temporally different S3 Fig.
Spatial and Temporal posterior mode estimate of municipality-level relative risk of disease. This figure shows regions where, based on model results, it is likely that the risk of disease is higher than what we deem clinically significant. This figure was created using the geobr package in R [25].
The spatial and temporal posterior probability that a municipality’s relative risk estimate shown in Fig 9 is greater than two. Darker shades of red indicate it is likely that the municipality’s relative risk estimate is clinically significant. This figure was created using the geobr package in R [25].
Relative risk trends by mesoregions for all municipalities can be seen in S4 Fig. For most municipalities, regardless of mesoregion, the RR trends were relatively consistent over the study period. S4 Fig shows few municipalities had RR trends that varied greatly over time. To identify which municipalities had large changes in risk, the variability in yearly percentage change of RR relative to the prior year was calculated. Risk trends for municipalities within the top 5% highest variability in yearly change of risk are shown in Fig 11. This figure allows for a direct comparison of changes in VL RR trends within and between mesoregions for the highest varying municipalities.
This figure shows the most varying municipality-specific RR trends within the four mesoregions in RN. Municipality-specific risk trends were obtained as posterior distribution estimates if VL cases were observed or as predictions from the posterior predictive distributions if VL cases were unobserved for each year. Intervals included in this figure for each trend are 95% HPD credible intervals. The horizontal line represents a relative risk of one.
Table 2 informs on the posterior mean and 95% HPD credible intervals for the EFA factor scores coefficients estimated. The posterior probability that an effect is positive is displayed in the last column of the table to quantify the model’s uncertainty of the associations between the Socioeconomic, Age and Livestock factor scores and disease risk estimates.
Discussion
Municipalities with poor socioeconomic characteristics were reflected through high EFA Socioeconomic factor scores. Characteristics denoting a poor socioeconomic status included high percent of illiteracy, homes with inadequate sanitation, rural surroundings, low-income population, and economically inactive population. Municipalities with opposing characteristics would have lower Socioeconomic factor scores. The posterior mode estimate of the Socioeconomic Factor was positive, further indicating that socioeconomic status was positively related to relative risk of VL. A municipality that only differs by one unit less in the Socioeconomic Factor from another would be estimated to have a lower relative risk of VL (by about 26%) compared to the municipality with the higher Socioeconomic Factor score. It is also estimated that there is a 99.99% probability that there is a positive relationship between poor socioeconomic status and higher risk of VL. The connection between socioeconomic status and risk of VL in Brazil is strongly supported by prior literature [22, 45, 46]. Mediating factors such as sanitation [47, 48] and literacy [4, 11] contribute to risk in the same direction as the present analysis.
A municipality’s contribution to the overall RN Livestock was a characteristic emerging from the EFA that described potential environmental risks of VL. Municipalities that were large producers of sheep, horses, swine, cattle or goats had higher Livestock Production factor scores. There was a negative association estimated for the Livestock factor score and risk of VL. Specifically, a municipality with a one unit increase in livestock production factor score would be estimated to have approximately 20% lower risk of VL compared to a municipality that has a lower production levels. The model estimated a 99.99% probability that there is a negative association between higher livestock productions and lower relative risk of VL. Previous studies have also reported this association of large livestock acting as protective agents against VL, though with varied working hypotheses [49–52]. On one hand, owning large livestock such as cattle may function as a sign of better nutritional status of those residents, thereby providing for overall better immune status and a proxy for overall better health augmenting their immune systems against VL [49–51]. On the other hand, the presence of cattle sheds could promote breeding sites for sandflies [51, 52]. Additionally, it is worth noting that the Livestock Production factor is not directly measuring density of animals and their coexistence with human populations. Instead, this variable could be a proxy for a municipality’s economic productivity representing industrial areas that might be more economically distinct that other areas. This means highly productive areas could be characterized with a higher economic status that might explain the association between livestock production and VL risk estimated by the model. More research is needed to confirm these hypotheses in Northeastern Brazil.
The EFA’s third construct uncovered from the observed Census data described the age distribution within a municipality. Municipalities with high Age Distribution factor scores had a larger percentage of individuals twenty-four years age or younger and larger percentage of households with semi-inadequate sanitation compared to areas with opposite characteristics. The results of the association between the Age distribution factor scores and the risk of VL were inconclusive as the posterior mean of this effect was small and close to zero. Furthermore the probability that the association between the Age distribution factor scores and the risk of VL was positive or negative was approximately the same. Although VL has historically been mostly identified in children, recent studies support that the average age of VL cases diagnosed has increased which may be attributed to an aging Brazilian population or, as recent studies hypothesize, may be due to various factors like improved nutrition, medicare, and education [15, 33]. However, the current results suggest that given the data collected, there is still inconclusive evidence on how the age distribution of a municipality relates to risk of VL infection. As mentioned previously, due to the nature of the data observed, this study relates risk of disease at a municipality level. Therefore, results cannot explain variability of age effects on VL risk at an individual level.
The present study revealed spatially heterogeneous VL risks and demonstrates opportunities for mesoregion and municipality-specific public health policy interventions. Fig 9 shows most municipalities in the southwest West Potiguar mesoreigon have average relative risk estimates ranging between two and six times the expected intensity if there was no spatial or temporal variability in disease outcomes in RN. Between 2007 and 2020, Fig 10 shows municipalities in West Potiguar consistently had higher VL relative risk estimates than municipalities in the Central, Agreste and Eastern mesoregions. Specifically, we see that the southwest municipalities are highly likely to have risk of VL more than double the expectation. West Potiguar borders the states of Ceará and Paraíba, who have also experienced increasing incidence rates of VL over time, respectively [6, 8]. The exceedance probabilities of relative risk greater than two for the Central and Agreste Potiguar regions range between 0% and 80% across time, with a few northern municipalities showing high exceedance probabilities during the earlier years of this study. Whereas the majority of the municipalities in the Eastern Potiguar region exhibited low exceedance probabilities of the RR being greater than two, suggesting the relative risk of VL in this region is likely lower than 2. S3 Fig, further suggests the municipalities in red are high likely to have a RR greater than one.
The top three municipalities whose estimates exhibited an upward trend include Pau Dos Ferros, Patu, and Natal. Specifically, Natal’s trend cyclically increased until 2016 before trending downward. The cyclical nature of Natal’s estimated risk trend may suggest that another peak could occur in the near future. Such finding of VL behavior in Natal agrees with recent research [15, 33], which characterizes Natal as a city that may require public attention and health policy interventions. Cyclical changes in risk estimates were not unique to Natal as other municipalities’ risk trends shared this cyclical characteristic. These changes in risk may stem from cyclical changes in the environment, such as those brought on from natural phenomena like El Niño/La Niña, which may influence sandfly density through climate alterations [12, 53]. Future research is needed to explore the variables that are creating this upward cyclic trend in VL risk. Also observed are Mossoro’s and Açu’s downward risk trends during the study period. Mossoro and Natal are the two most populous municipalities in RN. Further research is warranted to understand their diverging risk trends.
Another noteworthy phenomenon the present model identified is in the coastal metropolis regions of East Potiguar. Natal, Nisia Floresta, and Extremoz displayed high RR of VL within this mesoregion, but despite it’s geographical proximity to these municipalities, Parnamirim’s risk trend was estimated to be lower. Parnamirim’s lower risk indicates the number of VL cases observed in this municipality were lower than what we expected based on the overall prevalence of disease in RN and the municipality’s population over time. Future research to identify the differing epidemiological drivers of VL around Natal is warranted.
Limitations
Visceral leishmaniasis is a complex disease. We do not claim the present set of covariates embodies an exhaustive representation of the epidemiology of visceral leishmaniasis or risk of the disease in RN, or that we have included all of the most relevant covariates that influence a municipality’s regional disease risk. Climate information, such as precipitation or humidity, may be fruitful to incorporate in future investigations as the RN region is affected by cycles of drought which could interfere with sand fly density [54].
It is noteworthy to mention that we treated the EFA factors as fixed covariates, which assumes the factor scores are known and not estimated. This assumption makes the model partially Bayesian as it did not account for uncertainty when estimating these factors. However, we believe this lack of uncertainty is minimal since the factor scores were estimated irrespective of the outcome of interest (VL cases). Additionally, the covariate risk factors used in the EFA were treated as point-in-time observations since 2010 was the only available Census collection within the study time period, at the time of this writing. Incorporating temporal Census covariates could augment future model inference.
Geographical sparsity of observed VL counts was also a limitation; the data did not contain VL cases for a total of 47 municipalities throughout the 14-year study period. In our study, municipalities without documented VL counts were treated as missing data and our model predicted the outcomes for those regions. However, the true missingness mechanisms are unknown, and the proposed approach may still acquire bias.
The location used for each disease case in this study was also a limitation. Depending on the data available, the location of each VL case was either the municipality of infection or the municipality of diagnosis. First, we note that the municipality of diagnosis might not be the municipality of residence for all patients since some individuals might have traveled to larger cities due to limited healthcare services in small/rural cities. Due to this limitation in the observed data, the results in this study might underreport disease risk in rural regions. Secondly, while the municipality of infection can help identify disease reservoirs and therefore inform on the population at risk, municipality of infection was not observed for every participant in the study. When municipality of infection was not observed then the municipality of diagnosis was used. We believe that individuals in the municipality of diagnosis are also at risk since they are exposed to infected individuals that travel to these areas for diagnosis purposes. Therefore, we do not expect that using either locations in the analysis would affect the estimated parameters, inference, or risk maps results since we believe the population at risk can include the populations in either municipality.
Finally, the present study contained positive VL cases which were medically confirmed under passive surveillance. However, VL prevalence relies on disease transmission rates that are interdependent of animal reservoirs, sandfly reservoirs, and human reservoirs, which were not observed for the present analysis. Integrating these data sources in future research is important to better inform more localized and individual risk of disease.
Conclusion
The purpose of this manuscript was to introduce a modeling approach to evaluate disease risk while addressing shortcomings in data collected such as missing information and correlated covariates. The results focus on estimated risk of visceral leishmaniasis across Rio Grande do Norte, Brazil between 2007 and 2020 after adjusting for potential risk factors at a municipality level. We applied a hierarchical Bayesian approach to estimate municipality-specific relative risk of VL across space and time. Our results show that poor socioeconomic status and lower livestock production are related to increases in VL risk in RN. Estimates indicate it is highly likely that many municipalities within the West Potiguar mesoregion of RN have VL risk larger than double what would be expected if there was no spatio-temporal disease heterogeneity. The present observations revealed spatially heterogeneous VL risks and demonstrated opportunities for municipality-specific public health policy interventions. Additionally, our model estimated that the risk of VL in Natal, and possibly in Patu and Pau dos Ferros, is increasing over time which warrants future research to identifying epidemiological drivers in those areas.
Supporting information
S1 Fig. Spearman’s correlation coefficients between 2010 Brazilian Federal Census variables.
https://doi.org/10.1371/journal.pntd.0011206.s001
(TIF)
S2 Fig. Spearman’s correlation coefficients between three Social Vulnerability Indices that were constructed using the 2010 Brazilian Federal Census variables.
https://doi.org/10.1371/journal.pntd.0011206.s002
(TIF)
S3 Fig.
The spatial and temporal posterior probability that a municipality’s relative risk estimate shown in Fig 9 is greater than one. Darker shades of red indicate it is likely the municipality’s observed VL cases were greater than that number expected based on overall prevalence of VL in RN and the municipality’s population size. This figure was created using the geobr package in R [25].
https://doi.org/10.1371/journal.pntd.0011206.s003
(TIF)
S4 Fig.
The trends plotted how municipality-specific posterior mode estimates of RR over time within the four mesoregions in RN. The red represents a relative risk of on.
https://doi.org/10.1371/journal.pntd.0011206.s004
(TIF)
S1 Table. Factor loadings for the three factors estimated in the Exploratory Factor Analysis.
https://doi.org/10.1371/journal.pntd.0011206.s005
(PDF)
References
- 1. Cota G, Erber AC, Schernhammer E, Simões TC. Inequalities of visceral leishmaniasis case-fatality in Brazil: A multilevel modeling considering space, time, individual and contextual factors. PLoS Neglected Tropical Diseases. 2021;15(7):e0009567. pmid:34197454
- 2. Andrade AW, Souza CD, Carmo RF. Temporal and spatial trends in human visceral leishmaniasis in an endemic area in Northeast Brazil and their association with social vulnerability. Transactions of The Royal Society of Tropical Medicine and Hygiene. 2021;.
- 3. Ribeiro CJ, Dos Santos AD, Lima SV, da Silva ER, Ribeiro BV, Duque AM, et al. Space-time risk cluster of visceral leishmaniasis in Brazilian endemic region with high social vulnerability: An ecological time series study. PLoS Neglected Tropical Diseases. 2021;15(1):e0009006. pmid:33465104
- 4. Almeida AS, Ribeiro CJN, Carlini CC, Santos RS, Dos Santos AD, Tavares DS, et al. Spatial and spatiotemporal dynamics of visceral leishmaniasis in an endemic North-eastern region of Brazil. Geospatial Health. 2020;15(2).
- 5. de Freitas Rocha AT, Mira de Espindola G, Araujo Soares MR, de Ribamar de Sousa Rocha J, Nery Costa CH. Visceral leishmaniasis and vulnerability conditions in an endemic urban area of Northeastern Brazil. Transactions of The Royal Society of Tropical Medicine and Hygiene. 2018;112(7):317–325.
- 6. Silva AB, Coêlho HFC, de Sousa Freitas FI, Mota CdAX, da Silva SV Junior, Lima CMBL. Temporal distribution of visceral leishmaniasis in Paraíba state, northeastern brazil, by geographic region, 2012-2017. Research, Society and Development. 2021;10(6):e9410615575–e9410615575.
- 7. Braz BMdA, Silva RBS, Lins SC, Silva DRX, Ramalho WM, Melo MAd. Demographic and spatial study of visceral leishmaniasis in the state of Alagoas, Brazil, during 2007-2018. Revista da Sociedade Brasileira de Medicina Tropical. 2021;54. pmid:33605380
- 8. Cavalcante KKdS, Moreno JdO, Nzundu Boigny R, Cavalcante FRA, Florêncio CMGD, Alencar CH. Risk areas for Human Visceral Leishmaniosis 2007–2018: A temporal and spatial analysis in Ceará, Brazil. Zoonoses and Public Health. 2021;. pmid:34128330
- 9. Abrantes TR, Werneck GL, Almeida ASd, Figueiredo FB. Environmental factors associated with canine visceral leishmaniasis in an area with recent introduction of the disease in the State of Rio de Janeiro, Brazil. Cadernos de Saúde Pública. 2018;34. pmid:29412315
- 10. Valero NNH, Prist P, Uriarte M. Environmental and socioeconomic risk factors for visceral and cutaneous leishmaniasis in São Paulo, Brazil. Science of The Total Environment. 2021;797:148960. pmid:34303257
- 11. Falcão de Oliveira E, Oliveira AGd, Arruda CCPd, Fernandes WdS, Medeiros MJd. Spatio-temporal modeling of visceral leishmaniasis in Midwest Brazil: An ecological study of 18-years data (2001–2018). Plos One. 2020;15(10):e0240218. pmid:33007033
- 12. Silva Neto ABd, Oliveira EFd, Encina CCC, Figueiredo HRd, Paranhos Filho AC, Oliveira AGd. Effects of El Niño-Southern oscillation on human visceral leishmaniasis in the Brazilian State of Mato Grosso do Sul. Memórias do Instituto Oswaldo Cruz. 2020;115. pmid:32130366
- 13.
WHO. Control of the leishmaniases: report of a meeting of the WHO Expert Commitee on the Control of Leishmaniases, Geneva, 22-26 March 2010. World Health Organization; 2010.
- 14. Toepp AJ, Petersen CA. The balancing act: Immunology of leishmaniosis. Research in Veterinary Science. 2020;130:19–25. pmid:32109759
- 15. Lima AL, de Lima ID, Coutinho JF, de Sousa UP, Rodrigues MA, Wilson ME, et al. Changing epidemiology of visceral leishmaniasis in northeastern Brazil: a 25-year follow-up of an urban outbreak. Transactions of the Royal Society of Tropical Medicine and Hygiene. 2017;111(10):440–447. pmid:29394411
- 16. Conti RV, Lane VFM, Montebello L, Pinto VL Junior, et al. Visceral leishmaniasis epidemiologic evolution in timeframes, based on demographic changes and scientific achievements in Brazil. 2016;.
- 17. Costa C, Pereira HF, Araújo MV. Visceral leishmaniasis epidemic in the State of Piaui, Brazil, 1980-1986. Revista de saúde pública. 1990;24(5):361–372. pmid:2101528
- 18. Jeronimo SM, Oliveira RM, Mackay S, Costa RM, Sweet J, Nascimento ET, et al. An urban outbreak of visceral leishmaniasis in Natal, Brazil. Transactions of the Royal Society of Tropical Medicine and Hygiene. 1994;88(4):386–388. pmid:7570812
- 19. Almeida CP, Cavalcante FRA, Moreno JdO, Florêncio CMGD, Cavalcante KKdS, Alencar CH. Visceral Leishmaniasis: temporal and spatial distribution in Fortaleza, Ceará State, Brazil, 2007-2017. Epidemiologia e Serviços de Saúde. 2020;29. pmid:33295587
- 20. da Silva Lima MAR, Freitas MBP, de Jesus Sousa JM, Correia ASL, de Sousa Baldoino AC, da Silva Alves D, et al. Série histórica da morbimortalidade por leishmaniose no Estado do Piauí, Brasil, de 2010 a 2020. Research, Society and Development. 2021;10(7):e15610716264–e15610716264.
- 21. Okwor I, Uzonna J. Social and economic burden of human leishmaniasis. The American Journal of Tropical Medicine and Hygiene. 2016;94(3):489. pmid:26787156
- 22. Werneck G, Braga E. Socioeconomic factors predict the increase of incidence rates of visceral leishmaniasis in highly endemic areas in Brazil. International Journal of Infectious Diseases. 2019;79:131.
- 23. Costa MA, dos Santos MPG, Marguti B, Pirani N, Pinto CVdS, Curi RLC, et al. Vulnerabilidade Social no Brasil: conceitos, métodos e primeiros resultados para municípios e regiões metropolitanas brasileiras. Texto para Discussão; 2018.
- 24.
UNDP. Human Development Report 2010. The Real Wealth of Nations: Pathways to Human Development; 2010.
- 25.
Pereira R, Gonçalves C, et al. geobr: Loads Shapefiles of Official Spatial Data Sets of Brazil. 2019. Available: GitHub repository-https://github.com/ipeaGIT/geobr;.
- 26.
IBGE. Estatísticas de Gênero: Notas técnicas; 2014. Available from: www.ibge.gov.br/apps/snig/v1/notas_metodologicas.html?loc=0.
- 27.
IBGE. Metodologia do censo demogáfico 2010; 2013.
- 28. de Sousa Ferreira T, Timbo RV, Minuzzi-Souza TT, de Almeida Rocha D, Neiva M, de Albuquerque Ribeiro J, et al. High molecular prevalence of Leishmania in phlebotomine sand flies fed on chicken blood in Brazil. Veterinary Parasitology. 2018;259:80–84. pmid:30056989
- 29. Guimaraes-e Silva AS, Silva SdO, Ribeiro da Silva RC, Pinheiro VCS, Rebêlo JMM, Melo MN. Leishmania infection and blood food sources of phlebotomines in an area of Brazil endemic for visceral and tegumentary leishmaniasis. PLoS One. 2017;12(8):e0179052. pmid:28837565
- 30. Fonteles RS, Pereira Filho AA, Moraes JLP, Pereira SRF, Rodrigues BL, Rebêlo JMM. Detection of Leishmania DNA and blood meal identification in sand flies (Diptera: Psychodidae) from Lençois Maranhenses national park region, Brazil. Journal of Medical Entomology. 2018;55(2):445–451. pmid:29281057
- 31. Pradella GD, Duarte CA, Escobar TA, Zuravski L, Góss GC, Skupien JA, et al. Risk and protective factors of Leishmaniasis in the rural area of the western border region of Rio Grande do Sul, Brazil. BMC Veterinary Research. 2021;17(1):1–8.
- 32. Pradella GD, Escobar TA, Duarte CA, Lübeck I, Góss GC, Lagreca LFJ, et al. Identification of Leishmania spp. in horses and a dog from rural areas of Uruguaiana, Rio Grande do Sul, Brazil. Semina: Ciências Agrárias. 2020;41(6):2687–2694.
- 33. Lima ID, Lima AL, Mendes-Aguiar CdO, Coutinho JF, Wilson ME, Pearson RD, et al. Changing demographics of visceral leishmaniasis in northeast Brazil: Lessons for the future. PLoS Neglected Tropical Diseases. 2018;12(3):e0006164. pmid:29509765
- 34. Rue H, Martino S, Chopin N. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. Journal of the Royal Statistical Society: Series b (Statistical Methodology). 2009;71(2):319–392.
- 35. Martins TG, Simpson D, Lindgren F, Rue H. Bayesian computing with INLA: new features. Computational Statistics & Data Analysis. 2013;67:68–83.
- 36. Clayton D, Kaldor J. Empirical Bayes estimates of age-standardized relative risks for use in disease mapping. Biometrics. 1987; p. 671–681. pmid:3663823
- 37. Besag J, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics. Annals of the Institute of Statistical Mathematics. 1991;43(1):1–20.
- 38. Simpson D, Rue H, Riebler A, Martins TG, Sørbye SH. Penalising model component complexity: A principled, practical approach to constructing priors. Statistical science. 2017;32(1):1–28.
- 39. Bernardinelli L, Clayton D, Pascutto C, Montomoli C, Ghislandi M, Songini M. Bayesian analysis of space—time variation in disease risk. Statistics in Medicine. 1995;14(21-22):2433–2443. pmid:8711279
- 40. Knorr-Held L. Bayesian modelling of inseparable space-time variation in disease risk. Statistics in Medicine. 2000;19(17-18):2555–2567. pmid:10960871
- 41.
Fabrigar LR, Wegener DT. Exploratory factor analysis. Oxford University Press; 2011.
- 42. Bartlett MS. A further note on the multiplying factors for various chi-square approximations in factor analysis. Journal of the Royal Statistical Society: Series B. 1954;16:296–98.
- 43. Kaiser HF. An index of factorial simplicity. Psychometrika. 1974;39(1):31–36.
- 44.
Revelle W. psych: Procedures for Psychological, Psychometric, and Personality Research; 2022. Available from: https://CRAN.R-project.org/package=psych.
- 45. Alvar J, Yactayo S, Bern C. Leishmaniasis and poverty. Trends in Parasitology. 2006;22(12):552–557. pmid:17023215
- 46. Werneck G, Costa C, Walker A, David J, Wand M, Maguire J. Multilevel modelling of the incidence of visceral leishmaniasis in Teresina, Brazil. Epidemiology & Infection. 2007;135(2):195–201. pmid:16824254
- 47. Costa C, Werneck G, Rodrigues L, Santos M, Araújo I, Moura L, et al. Household structure and urban services: neglected targets in the control of visceral leishmaniasis. Annals of Tropical Medicine & Parasitology. 2005;99(3):229–236.
- 48. Cerbino Neto J, Werneck GL, Costa CHN. Factors associated with the incidence of urban visceral leishmaniasis: an ecological study in Teresina, Piauí State, Brazil. Cadernos de Saúde Pública. 2009;25:1543–1551. pmid:19578575
- 49. Anstead GM, Chandrasekar B, Zhao W, Yang J, Perez LE, Melby PC. Malnutrition alters the innate immune response and increases early visceralization following Leishmania donovani infection. Infection and Immunity. 2001;69(8):4709–4718. pmid:11447142
- 50. Bern C, Haque R, Chowdhury R, Ali M, Kurkjian KM, Vaz L, et al. The epidemiology of visceral leishmaniasis and asymptomatic leishmanial infection in a highly endemic Bangladeshi village. The American Journal of Tropical Medicine and Hygiene. 2007;76(5):909–914. pmid:17488915
- 51. Bern C, Courtenay O, Alvar J. Of cattle, sand flies and men: a systematic review of risk factor analyses for South Asian visceral leishmaniasis and implications for elimination. PLoS Neglected Tropical Diseases. 2010;4(2):e599. pmid:20161727
- 52. Afonso MMdS, Duarte R, Miranda JC, Caranha L, Rangel EF. Studies on the feeding habits of Lutzomyia (Lutzomyia) longipalpis (Lutz & Neiva, 1912)(Diptera: Psychodidae: Phlebotominae) populations from endemic areas of American visceral leishmaniasis in northeastern Brazil. Journal of Tropical Medicine. 2012;2012. pmid:22315621
- 53. Franke CR, Ziller M, Staubach C, Latif M. Impact of El Niño/southern oscillation on visceral Leishmaniasis, Brazil. Emerging Infectious Diseases. 2002;8(9):914. pmid:12194766
- 54. Mendes CS, Coelho AB, Feres JG, deSouza EC, Antonio daCunha D. The impact of climate change on leishmaniasis in Brazil. Ciencia & Saude Coletiva. 2016;21(1). pmid:26816183