Figures
Abstract
Background
Spatial data are often aggregated by area to protect the confidentiality of individuals and aid the calculation of pertinent risks and rates. However, the analysis of spatially aggregated data is susceptible to the modifiable areal unit problem (MAUP), which arises when inference varies with boundary or aggregation changes. While the impact of the MAUP has been examined previously, typically these studies have focused on well-populated areas. Understanding how the MAUP behaves when data are sparse is particularly important for countries with less populated areas, such as Australia. This study aims to assess different geographical regions’ vulnerability to the MAUP when data are relatively sparse to inform researchers’ choice of aggregation level for fitting spatial models.
Methods
To understand the impact of the MAUP in Queensland, Australia, the present study investigates inference from simulated lung cancer incidence data using the five levels of spatial aggregation defined by the Australian Statistical Geography Standard. To this end, Bayesian spatial BYM models with and without covariates were fitted.
Results and conclusion
The MAUP impacted inference in the analysis of cancer counts for data aggregated to coarsest areal structures. However, area structures with moderate resolution were not greatly impacted by the MAUP, and offer advantages in terms of data sparsity, computational intensity and availability of data sets.
Citation: Jahan F, Haque S, Hogg J, Price A, Hassan C, Areed W, et al. (2025) Assessing the influence of the modifiable areal unit problem on Bayesian disease mapping in Queensland, Australia. PLoS ONE 20(1): e0313079. https://doi.org/10.1371/journal.pone.0313079
Editor: T. Ganesh Kumar, Galgotias University, INDIA
Received: April 5, 2024; Accepted: October 17, 2024; Published: January 28, 2025
Copyright: © 2025 Jahan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Simulated lung cancer data for Queensland used in the research are made available in the form of supplmentary information.
Funding: QUT Centre for Data Science.
Competing interests: NO authors have competing interests.
Introduction
It is well known that health outcomes vary by residential location. Spatial modelling of health data (e.g. disease mapping) can provide critical insights into the geographic patterns of and the relationships between health outcomes and their determinants. Aggregating spatial data by area is a popular practice in disease mapping as it helps: (1) identify clustering of disease, (2) provide insights into the spatial variability of disease burden, and (3) identify health disparities across different geographic regions [1]. Spatial aggregation can also help protect confidentiality and aid in calculation of pertinent rates and risks. But aggregated spatial data are susceptible to the modifiable areal unit problem (MAUP), where the choice of spatial units used to aggregate the data can significantly impact the analysis results.
The term, MAUP, was used for the first time by Openshaw and Taylor [2] showing the systematic variation in correlation values using different boundary systems for analysis. The MAUP may emerge from two different mechanisms [3]: the zoning effect, where the total number of areas is kept constant but boundaries are redefined; and the scaling effect, where the spatial resolution or aggregation level is altered leading to changes in the total number of areas. Analysing datasets from the same population using different zonations or scales may lead to inconsistent results, which is the essence of the MAUP [3]. Multiple cancer studies have demonstrated that different methods of aggregating health data led to varying interpretations of disease clusters (e.g. [4, 5]), with different zoning schemes and scales of aggregation resulting in different clusters being identified and impacting conclusions about high-risk areas.
Some have argued that aggregation beyond the minimal level yields untrustworthy results [6], while others believe there can be equally valid but different conclusions from analysing data at different aggregation levels [7]. Understanding the MAUP is crucial for analysis of spatial data at the small area level [8], since there are still several unresolved issues [9].
Ongoing research is examining how the MAUP affects various types of data and different geographical structures [7, 10–13], however, Australia’s geography is unique due to its vast geographical size, diverse ecological systems, and highly heterogeneous population distribution. As a result, the MAUP may have different implications for health data analysis in Australia compared to other countries. To date, there are only a handful of published works [6, 11, 14] investigating the impact of the MAUP in the Australian context.
In Australia, MAUP studies have modelled emergency department [11] or hospital data [14] using the popular Bayesian Besag, York and Mollié (BYM) model [15] in the city of Perth, Western Australia. There were 3890 SA1s and 164 SA2s in Perth in 2011, considered in this study, with median populations of 400 (for statistical areas level 1 (SA1s)), and 10000 (for larger SA2s). These studies have proposed various solutions to the MAUP, including using an overlay aggregation method for disease mapping by incorporating information from two nested aggregate levels [14], or minimising the influence of the MAUP through only reporting results at very high resolution, such as SA1s [11]. However, there are far greater numbers of people who present to hospital emergency departments or are admitted to hospitals than experience less common diseases or conditions.
Internationally, there has been some examination of MAUP for less common diseases. In the United States MAUP was evaluated when modelling breast cancer incidence in Indiana State Cancer Registry from 2010 to 2015, but even at the smallest levels examined, areas had between 600 to 3000 residents [16]. Their suggestion to avoid MAUP was to disaggregate counts to approximate individual-point locations, as occurred for a New Hampshire study on birth defects [17]. This used Restricted and Controlled Monte Carlo (RCMC) to disaggregate area-level counts by randomly placing individuals in pixels within the original area, with a higher probability of being placed where the population was higher. Disadvantages included running multiple times to account for the inherent uncertainty in placement, and then being restricted to using analyses designed for individual data.
Investigating the impact of the MAUP in regions with sparse data is crucial for accurate and reliable spatial analyses. In countries like Australia, where large areas have low population densities, this investigation is particularly important to inform policies and decisions that are equitable, effective, and tailored to the unique characteristics of these regions. Understanding the MAUP ensures that data-driven decisions truly reflect the realities of these areas, leading to better outcomes for their inhabitants and environments.
Given the limited suggestions offered to date, there remains a significant opportunity to elucidate the influence of the MAUP within sparse data contexts. The impact of the MAUP has not previously been explored on outcomes with low counts and very sparse populations, such as cancer within rural and remote Australia. The present study aims to fill this research gap by exploring the implications of Bayesian spatial modelling applied to sparse data aggregated at diverse boundary levels across an entire state in Australia.
In this paper we explore the impact of the MAUP using Bayesian spatial models applied to synthetic-style cancer data, which has lower overall counts than previous studies, across different levels of aggregation in Queensland, Australia. We consider the impact on both spatial disease patterns and covariate inference.
Materials and methods
Spatial structures
The Australian Statistical Geography Standard (ASGS) 2016, developed by the Australian Bureau of Statistics, defines a hierarchy of area structures with five levels of resolution. The area structure with the finest resolution is termed “mesh blocks”, and the state of Queensland has 67,047 mesh blocks with a median population of 82. The remaining levels of the hierarchy are termed “Statistical Areas” (SA) and the resolution decreases from SA1 (n = 11,507 areas, median population of 401) to SA4 (n = 19, median population of 198,975), excluding the zero population areas. The hierarchy is nested, so that each mesh block lies wholly within a single SA1, each SA1 lies within a single SA2 and so on.
Data
Simulated lung cancer incidence data for Queensland, Australia were provided by Cancer Council Queensland and reflects the distribution for lung cancer incidence over a ten year period, 2005–2014 [18]. The lung cancer incidence (number of cases) were simulated at the SA2 level, and then aggregated to SA3 and SA4 levels, and dis-aggregated to SA1 and mesh block levels using publicly available geographical correspondence files [19]. The correspondence files enable more accurate aggregations and dis-aggregations by accounting for the total populations of mesh and SA levels [20]. Counts and populations at each level of aggregation were calculated by adding the counts and populations for the respective areas. At each level of aggregation, indirectly-standardised incidence rates for area i (SIRobs,i) were calculated as the ratio of the observed counts (yi) divided by the expected counts (Ei), based on the state-wide observed counts in age group k (yk), and the population in age group k for area i (popik) and state-wide (popk), as follows.
Fig 1 shows the simulated cancer counts for each of the different aggregation levels in Queensland, followed by the observed standardised incidence rates in Fig 2. Disease counts are not a good representation of disease risk, since they do not account for the total population in the area. Hence the SIR, which is the ratio of observed to expected incidence cases, is used to summarise the underlying pattern of lung cancer incidence in the simulated Queensland data. Table 1 presents the descriptive statistics of the data.
Note that the maps for each aggregation level have different colour scales to reflect how the average counts per area increases as aggregation occurs.
Socio-Economic Indexes for Areas (SEIFA).
Lung cancer is known to have a strong association with socioeconomic factors, with higher incidence among areas with greater socioeconomic disadvantage [21]. To study the impact of the MAUP on inference at different levels of spatial aggregation, we included a covariate: area-level socioeconomic disadvantage. We used the Australian Bureau of Statistics’ Index of Relative Socioeconomic Disadvantage (IRSD), categorised into quintiles, where a region in the first quintile is amongst the most disadvantaged regions and a region in the fifth quintile is amongst the least disadvantaged regions [22]. The IRSD scores and quintiles are publicly available at the SA1 and SA2 level, and thus, SA2-level IRSD scores were disaggregated to mesh block and aggregated to higher regions (SA3 and SA4) following the methods recommended by the Australian Bureau of Statistics [23], and formed into quintiles.
Ethical approval.
Only simulated data were used in this manuscript, lung cancer data was simulated by Cancer Council Queensland following the appropriate ethical consideration [18]. This was approved by the Data Custodians of the original data. The other datasets used in this manuscript are the geographical boundary levels, population counts which are publicly available via Australian Bureau of Statistics website [24].
Statistical methods
The impact of the MAUP on varying area levels was assessed by investigating the underlying spatial autocorrelation in observed data and model residuals. We also compared the posterior summaries of the parameters of interest from the Bayesian spatial models fitted to five different aggregate levels (mesh blocks to SA4 level).
Spatial autocorrelation.
When spatial autocorrelation is present, analyses conducted need to be able to adequately incorporate it [25]. While there are multiple indices and tests available to detect the presence of spatial autocorrelation, in this study we use Moran’s I, an inferential statistic used to measure spatial autocorrelation [26], in two ways. First, to assess if the presence of spatial autocorrelation varies among different aggregations of raw data. Second, to check if the model residuals have any remaining spatial autocorrelation. Spatial models are fitted using a spatially structured error component to accommodate the spatial autocorrelation in the observed data. However, there could be well fitting models that provide good predictions but still have spatial autocorrelation indicated by significant Moran’s I on model residuals [27].
Bayesian spatial model: BYM model.
The observed cancer counts from different small area aggregations are modelled using a very well-known Bayesian spatial modelling framework, a BYM model [15]. The BYM model is a type of generic three level Bayesian hierarchical model, where at the first stage the observed disease counts are modelled using an appropriate likelihood, the second stage models the spatial association by specifying appropriate structures and the third stage specifies the hyperprior distributions [1].
The BYM model incorporates extra variation at the second stage via two random effect terms: a) a spatially structured component and b) a spatially unstructured component. Inclusion of both spatially structured and unstructured random effects enable the disease rates to be smoothed at both global and local level [28]. For the spatially structured term, the most commonly used approach for areal spatial data is the Gaussian Markov random field (GMRF) model which conditions area i within its neighbourhood [29, 30], particularly in the form of the conditional autoregressive (CAR) prior. The detailed BYM model used is presented below:
Let Yi be the observed lung cancer counts, which can be expressed under the general disease mapping framework using a Poisson likelihood [1]:
where Ei is the expected cancer counts in each area i. The present study will have different values of N for different boundary levels (Table 1). The log-relative risk, denoted μi, is often expressed as a regression equation:
where the intercept α denotes an overall fixed effect, β is the covariate effects associated with the vector of covariates xi relating to area i and ψi are the spatial random effects. In the context of this study, xi represents the SEIFA IRSD quintiles for each area, where the first quintile (most disadvantaged) is considered as the reference category and β is hence a 4 × 1 vector representing the covariate effect of SEIFA IRSD quintiles 2, 3, 4, and 5. The Gaussian priors,
and
are used for α and β. The value of hyperparameters
and
are usually chosen to be a large number. In the present study, we have used the default priors and parameters provided by the CARBayes package (
) [31]. The random effect ψi = ui+vi, where ui is a spatial random effect a CAR prior structure and vi is the unstructured random effect. The conditional distribution of each ui can be expressed as:
where wij are the weights defining the relationship between area i and its neighbours such that:
The prior for the unstructured random component vi is typically an independent normal distribution,
The hyperpriors placed on the variance terms and
are Inverse-Gamma(a, b) with hyperparameters a = 1 (shape) and b = 0.01 (scale) [31].
Implementation.
All the analyses in this study were implemented using R [32]. In total, ten BYM models were fitted covering every combination of aggregation level (mesh block, SA1, SA2, SA3 and SA4 level) and model specification (with and without covariates). Fully Bayesian inference was conducted via Markov chain Monte Carlo in the R package CARBayes version 5.2.5 [31]. For a single chain, we used 1,500,000 total iterations, discarding the first 500,000 as burnin. The posterior draws were thinned by 100 to reduce autocorrelation, leaving 10,000 iterations. Model convergence was checked through Geweke diagnostics [33]. The spatial autocorrelation of the observed data and model residuals were checked using Moran’s I with the R package spdep version 1.2–8 [34].
The associated R code for model implementation and calculation of spatial autocorrelation indices are available in the Github repository: https://github.com/Farzana-Jahan/MAUP.git.
Results
The simulated lung cancer data had a total of 20,700 lung cancer cases, with the median number per area ranging from 0.27 in meshblocks to 972 in SA4s (Table 1). There was large variation in observed and modelled SIRs across the state, with more remote western regions typically having higher rates, and the same area could have very different raw and modelled SIRs at different levels of aggregation (Figs 2–4). High spatial autocorrelation was present in the observed data at the mesh block, SA1 and SA2 levels, but not higher levels of aggregation (Table 2). After modelling, regardless of whether the socioeconomic variable was included, the residuals still showed high positive spatial autocorrelation (Table 2), but this likely could be attenuated by including additional covariates. However, the models for each level of aggregation had predominately spatially structured smoothing occurring, as demonstrated through the fraction of spatial variation, and this was slightly attenuated after including socioeconomic details for SA1 to SA3, and markedly for SA4 (Tables 3 and 4).
Differences were found in the estimates for socioeconomic coefficients depending on the level of aggregation. For instance, β4 (least disadvantaged) was essentially 0 and non-significant at the mesh block level, while all other aggregations indicated significantly negative values (Table 4). This meant that for SA1 and above, least disadvantaged areas had lower lung cancer diagnosis rates compared to most disadvantaged areas (the reference category). In contrast, estimates for β2 and β3 had negative associations at higher resolution that turned to insignificance for SA3 and SA4s (Table 4). These differences could be a result of the MAUP. However, apart from meshblocks, each level of aggregation showed a consistent socioeconomic gradient from the modelled results, with increasingly negative coefficients as socioeconomic disadvantage diminished.
It is evident from the posterior summaries that inference changes with different aggregation of areas across Queensland. For models both with and without covariates, the width of the credible intervals for regression parameters (α, βi), and variance parameters ( and
), were smaller for lower levels of aggregation (SA1 and SA2) and wider for the larger boundaries (SA3 and SA4) (Tables 3 and 4 and Figs 5 and 6), except for inference using the mesh block level data.
Discussion and conclusion
The present study sought to understand the impact of the MAUP in spatial analysis of sparse disease data across different administrative boundary levels when using a popular Bayesian spatial model in a geographically challenging environment. We found important differences in all aspects: spatial patterns, inference from estimates and coefficient results.
One proposed method to minimising the MAUP is through using high spatial resolution [11]. However, we found that meshblocks, which typically represent 30 to 60 dwellings [35], were too sparse for a condition as rare as cancer. Estimates for model parameters at the meshblock level often greatly differed from all other aggregation levels, and the large number of areas made it computationally intensive. While the next aggregation level of SA1s performed better, it is extremely difficult to obtain health data with location either provided as an SA1, or with the detailed street address needed to assign an SA1. Given SA2s tend to be the lowest realistic area size possible in health datasets, the large differences in modelled covariate coefficients between SA1s and SA2s and different spatial patterns amplifies the importance of acknowledging the MAUP when conducting spatial analyses.
While modeling at the mesh block level can increase the sample size (i.e., the number of areas) and reduce heterogeneity within areas, it can also lead to situations where many areas have a small or zero number of cases or population, resulting in a loss of information and the potential for oversmoothing. This aligns with the findings in Fontanet et al. [16] where the smallest level failed to provide meaningful inference due to smaller populations and counts. Kok et al. [11] also found this to be the case when analysing hospitalisation rates for foot-related issues among the Indigenous population of Australia. Our findings, in conjunction with theirs, suggest that while aiming for the highest level of resolution may be optimal to reduce the biases of the MAUP, using mesh blocks can introduce other problems such as small numbers, zero populations, and computational inefficiencies. In small-area health studies, particularly those involving rare diseases or small populations, sparse data can lead to highly imprecise rate estimates. This imprecision is amplified by the MAUP, causing substantial variability in the identification of disease clusters based on different spatial units [4, 36]. As a result, health interventions and resource allocations might be misdirected due to the unreliability of the spatial analysis.
In addition, advantages of using the well-defined boundary levels of SA1 and SA2 also enables easy incorporation of additional demographic and socio-economic data into disease models [37]. Detailed spatial data at these levels can often be more effectively communicated to stakeholders, policymakers, and the public, facilitating better understanding and engagement.
With higher aggregations, we found that the spatial autocorrelation of the data is no longer significant (for SA3 and SA4 levels), so use of spatial modelling/smoothing may not be required. But since SA1 and SA2 level data has significant spatial autocorrelation, analysis of cancer incidence data at these levels should be performed using a model that accounts for spatial autocorrelation.
The study has several limitations arising from data and methodology restrictions. Data constraints required SA1 and meshblock level data to be created from the SA2 level. The high level of heterogeneity within larger areas such as SA3s and above means that area-level socioeconomic variables are not released, and this may have impacted results, despite using recommended methodology to create these. However, this is also a feature of the MAUP: both the outcome and its covariates are impacted by aggregation. Some models did not fit well, considering the large amount of spatial correlation remaining in the residuals. As inadequate adjustment for spatial autocorrelation can bias fixed effects, in practical applications researchers are encouraged to explore different spatial structures and more useful covariates, particularly spatially correlated covariates. Our focus was on the commonly used Australian areal structures (Mesh block, SA1, SA2, SA3 and SA4), but rezoning boundaries (while keeping land areas consistent) can also identify the MAUP. We were interested in seeing how sparse data performed in the challenging Australian context, but our approach could be repeated considering other health conditions of varying prevalence, or including all of Australia.
In conclusion, as a result of this work we propose an alternative narrative regarding the optimal aggregation level for spatial modelling. Although it is common to analyse spatial data at the lowest level of aggregation possible, we suggest that a natural balance is instead preferable for inference, modelling and most importantly, reducing the impact of the MAUP. In the context of Australia, we found that although analysing cancer incidence at the mesh block level (lowest aggregation) alleviates the concerns of the MAUP, is not advisable due to the high computational burden and extremely noisy and sparse data. Moreover, when using higher aggregation levels such as SA3 and SA4, uncertainty increases, making it more difficult to identify useful covariates as a result of the MAUP. Thus, for the lung cancer data used in this work, we recommend modelling at the SA1 or SA2 level. Due to the sparsity inherent in disaggregating cancer data, we hypothesise that a similar conclusion may be drawn for other cancers and other countries with areas of low population density.
References
- 1. Best N, Richardson S, Thompson A. A comparison of Bayesian spatial models for disease mapping. Statistical methods in medical research. 2005;14(1):35–59. pmid:15690999
- 2. Openshaw S. The modifiable areal unit problem. Concepts and techniques in modern geography. 1984;38, Norwich:GeoBooks.
- 3. Wong DW. The modifiable areal unit problem (MAUP). WorldMinds: geographical perspectives on 100 problems: commemorating the 100th anniversary of the association of American geographers 1904-2004. 2004; p. 571–575.
- 4. Fotheringham AS, Wong DW. The modifiable areal unit problem in multivariate statistical analysis. Environment and planning A. 1991;23(7):1025–1044.
- 5. Gregorio DI, DeChello LM, Samociuk H, Kulldorff M. Lumping or splitting: seeking the preferred areal unit for health geography studies. International Journal of Health Geographics. 2005;4:1–10. pmid:15788100
- 6. Tuson M, Yap M, Kok MR, Murray K, Turlach B, Whyatt D. Incorporating geography into a new generalized theoretical and statistical framework addressing the modifiable areal unit problem. International journal of health geographics. 2019;18(1):1–15. pmid:30917821
- 7. Buzzelli M. Modifiable areal unit problem. International encyclopedia of human geography. 2020; p. 169.
- 8. Nelson JK, Brewer CA. Evaluating data stability in aggregation structures across spatial scales: revisiting the modifiable areal unit problem. Cartography and Geographic Information Science. 2017;44(1):35–50.
- 9.
Manley D. Scale, Aggregation, and the Modifiable Areal Unit Problem. In: Handbook of regional science. Springer; 2013. p. 1157–1171.
- 10. Marí-Dell’Olmo M, Oliveras L, Vergara-Hernández C, Artazcoz L, Borrell C, Gotsens M, et al. Geographical inequalities in energy poverty in a Mediterranean city: Using small-area Bayesian spatial models. Energy Reports. 2022;8:1249–1259.
- 11. Kok MR, Tuson M, Yap M, Turlach B, Boruff B, Vickery A, et al. Impact of the modifiable areal unit problem in assessing determinants of emergency department demand. Emergency Medicine Australasia. 2021;33(5):794–802. pmid:33517585
- 12. Dapena AD, Vázquez EF, Morollón FR. The role of spatial scale in regional convergence: the effect of MAUP in the estimation of β-convergence equations. The Annals of Regional Science. 2016;56(2):473–489.
- 13. Jeffery C, Ozonoff A, Pagano M. The effect of spatial aggregation on performance when mapping a risk of disease. International journal of health geographics. 2014;13(1):1–9. pmid:24625068
- 14. Tuson M, Yap M, Kok MR, Boruff B, Murray K, Vickery A, et al. Overcoming inefficiencies arising due to the impact of the modifiable areal unit problem on single-aggregation disease maps. International journal of health geographics. 2020;19(1):1–18.
- 15. Besag J, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics. Annals of the Institute of Statistical Mathematics. 1991;43(1):1–20.
- 16. Fontanet CP, Carlos H, Weiss JE, Diaz MCG, Shi X, Onega T, et al. Evaluating Geographic Health Disparities in Cancer Care: Example of the Modifiable Areal Unit Problem. Annals of Surgical Oncology. 2023;30(12):6987–6989. pmid:37658267
- 17. Shi X, Miller S, Mwenda K, Onda A, Rees J, Onega T, et al. Mapping disease at an approximated individual level using aggregate data: a case study of mapping New Hampshire birth defects. International Journal of Environmental Research and Public Health. 2013;10(9):4161–4174. pmid:24018838
- 18.
Cancer Council Queensland. Simulated Lung cancer data for Queensland; 2015. Email: Statistics@cancerqld.org.au.
- 19.
ABS. Australian statistical geography standard (ASGS); 2016. Available from: https://www.abs.gov.au/AUSSTATS/abs@.nsf/allprimarymainfeatures/871A7FF33DF471FBCA257801000DCD5F?opendocument.
- 20.
ABS. Main Features-Overview; 2016. Available from: https://www.abs.gov.au/ausstats/abs@.nsf/Lookup/by%20Subject/1270.0.55.001~July%202016~Main%20Features~Overview~1.
- 21. Wong MC, Lao XQ, Ho KF, Goggins WB, Tse SL. Incidence and mortality of lung cancer: global trends and association with socioeconomic status. Scientific reports. 2017;7(1):14300. pmid:29085026
- 22.
ABS. Socio-Economic Indexes for Areas; 2016. Available from: https://www.abs.gov.au/ausstats/abs@.nsf/mf/2033.0.55.001.
- 23.
ABS. 2033.0.55.001—Census of Population and Housing: Socio-Economic Indexes for Areas (SEIFA), Australia, 2016 —.nsf; 2016. https://www.abs.gov.au/AUSSTATS/abs@.nsf/DetailsPage/2033.0.55.0012016.
- 24.
ABS. Australian Bureau of Statistics; 2016. Available from: https://www.abs.gov.au/.
- 25.
Cliff A, Ord J. The Problem of spatial autocorrelation. Papers in Regional Science 1, Studies in Regional Science, 25-55, edited by A. J Scott; 1969.
- 26.
Tiefelsdorf M. Modelling spatial processes: the identification and analysis of spatial relationships in regression residuals by means of Moran’s I. Springer; 2000.
- 27. Kim D. Predicting the magnitude of residual spatial autocorrelation in geographical ecology. Ecography. 2021;44(7):1121–1130.
- 28. Sue Bell B, Broemeling LD. A Bayesian analysis for spatial processes with application to disease mapping. Statistics in Medicine. 2000;19(7):957–974.
- 29. CARLIN BP, Xia H. Assessing environmental justice using Bayesian hierarchical models: two case studies. Journal of Exposure Analysis & Environmental Epidemiology. 1999;9(1). pmid:10189628
- 30. Escaramís G, Carrasco JL, Ascaso C. Detection of significant disease risks using a spatial conditional autoregressive model. Biometrics. 2008;64(4):1043–1053. pmid:18261162
- 31. Lee D. CARBayes: an R package for Bayesian spatial modeling with conditional autoregressive priors. Journal of Statistical Software. 2013;55(13):1–24.
- 32.
R Core Team. R: A Language and Environment for Statistical Computing; 2016. Available from: https://www.R-project.org/.
- 33.
Lam P. Convergence diagnostics. Lecture presented at Government. 2002;.
- 34.
Bivand R, Altman M, Anselin L, Assunção R, Berke O, Bernat A, et al. Package ‘spdep’; 2015. Available from: https://cran.r-project.org/web/packages/spdep/index.html.
- 35.
ABS. Mesh Blocks; Jul 2021—Jun 2026. Available from: https://www.abs.gov.au/statistics/standards/australian-statistical-geography-standard-asgs-edition-3/jul2021-jun2026/main-structure-and-greater-capital-city-statistical-areas/mesh-blocks.
- 36. Amrhein CG. Searching for the elusive aggregation effect: evidence from statistical simulations. Environment and planning A. 1995;27(1):105–119.
- 37. Kang SY, McGree J, Mengersen K. The choice of spatial scales and spatial smoothness priors for various spatial patterns. Spatial and Spatio-temporal Epidemiology. 2014;10:11–26. pmid:25113587