Skip to main content
Advertisement
  • Loading metrics

A framework and pilot study for assessing usability of flood data portals for interdisciplinary research

  • Breanne K. Langlois ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Supervision, Writing – original draft

    breelanglois@gmail.com

    Affiliation Friedman School of Nutrition Science and Policy, Tufts University, Boston, Massachusetts, United States of America

  • Aliya Magnuson,

    Roles Formal analysis, Writing – original draft, Writing – review & editing

    Affiliation School of Arts and Sciences, Tufts University, Medford, Massachusetts, United States of America

  • Timothy Griffin,

    Roles Funding acquisition, Supervision, Writing – review & editing

    Affiliation Friedman School of Nutrition Science and Policy, Tufts University, Boston, Massachusetts, United States of America

  • Erin Coughlan de Perez,

    Roles Supervision, Writing – review & editing

    Affiliations Friedman School of Nutrition Science and Policy, Tufts University, Boston, Massachusetts, United States of America, Red Cross Red Crescent Climate Centre, The Hague, The Netherlands

  • Elena Naumova,

    Roles Funding acquisition, Supervision, Writing – review & editing

    Affiliation Friedman School of Nutrition Science and Policy, Tufts University, Boston, Massachusetts, United States of America

  • Magaly Koch

    Roles Funding acquisition, Methodology, Supervision, Writing – review & editing

    Affiliation Center for Remote Sensing, Boston University, Boston, Massachusetts, United States of America

Abstract

There is a lack of datasets to study the climate and human outcomes nexus. There are many flood data portals due to recent improvements in flood identification using satellites, providing opportunities to study the human impacts. The development of these portals is rapid and there is currently no standard for evaluating their usability for interdisciplinary research. This paper addresses this important data gap. We put forth a usability framework that includes data availability, approaches to flood identification, alignment, velocity, variety, and user feasibility aspects. We piloted it through an in-depth review and user survey of NASA Worldview (NW), Global Flood Awareness System (GloFAS), Global Flood Monitoring System (GFMS), Global Surface Water Explorer (GSWE), and Dartmouth Flood Observatory (DFO) GSWE and GloFAS were rated most favorably. Respondents had discrepancies in their opinions on the clarity of the goals and platform accessibility for GFMS, DFO, and NW, and in data and visualization quality for all portals. Historical data and measures of flood recurrence and other characteristics are needed. Flood data products should be provided in multiple formats, aggregated by sub-national boundaries, with mechanisms that delineate incomplete or unreliable data. Flood data portals should include interdisciplinary research as part of their mission. Their longevity and maintenance should be secured to preserve these important data sources for future research. This framework can be adapted and used to enable interdisciplinary spatial and survey data linkages.

Introduction

The frequency and duration of flood events is increasing globally, presenting threats to public health and nutrition through vector borne diseases, water contamination, and agricultural losses [110]. Flooding due to river overflow, coastal seawater intrusion, or flash floods are all impacted by climate factors such as extreme precipitation and sea level rise [1113]. Other factors such as urbanization, land use, mitigation infrastructures, and socioeconomic conditions also impact flood risk [14, 15]. With a growing global population exposed, floods are among the most impactful environmental hazards [6, 16, 17]. The destructive nature of floods warrants data for situational awareness and disaster planning efforts, and research of the human impacts.

Currently, datasets to study the impacts of environmental changes on food security, nutrition, and health outcomes are lacking. Spatial and survey data linkages–the process of assembling information from these different data sources into one dataset–are needed. Remotely sensed imagery from optical and synthetic-aperture radar sensors are instrumental for this purpose [13, 18, 19]. After decades of Earth observing satellite missions, abundant data are now available to the public for flood identification, monitoring, forecasting, and assessment of long-term environmental changes. These data present an opportunity for researchers to study the human impacts of disasters and environmental changes. The challenge, however, is integrating geospatial and population data that traditionally do not speak to each other. There is a heavy user burden for researchers outside the earth sciences to process geospatial data products into formats that can be linked to population surveys (e.g., extracting raster values into polygon attributes and aggregating data by administrative boundaries). With many flood data portals and products available, it is unclear which ones are best suited for use with population datasets.

To address this data gap, we propose a usability framework applied to flood data portals and pilot it among interdisciplinary graduate students with backgrounds predominantly in public health and nutrition who are potential end-users of these portals. We derived the framework from metrics defined in a prior paper for assessing modern dashboards using principles of evidence, efficiency, emphasis, and ethics. An important component of this framework is the end-user, or external perspective [20]. Other papers have evaluated data usability within public health and disaster relief, but interdisciplinary perspectives are lacking [2123]. Findings from this pilot study may be of interest to developers of these portals to improve their data products. This framework can be adapted and may be useful for researchers interested in spatial and survey data linkages. We hope to raise awareness of these data sources to facilitate this critical interdisciplinary research.

Overview of remote sensing and flood mapping

Thirty meter resolution imaging (once considered high resolution, but currently considered moderate resolution) provided by optical sensors aboard Landsat satellites has been valuable to monitor seasonality and long-term changes in surface water over four decades [24]. With the launch of NASA’s Terra (1999) and Aqua (2002) satellites, daily tracking of wide areas (250 m resolution) of the Earth’s surface water has been made possible by the optical sensor MODIS (moderate-resolution imaging spectroradiometer) [6, 2527]. Synthetic aperture radar sensors (SAR) such as those aboard Sentinel-1 satellites, operated by the Copernicus program, have the advantage of penetrating cloud cover [19, 27, 28]. Multi-satellite precipitation data products from NASA’s Global Precipitation Measurement Mission are used in flood detection models [29].

Identifications and monitoring of floods and long-term changes remains challenging, despite availability of data provided by satellites [24]. Landsat imagery is limited by a wide temporal coverage (8 days), however, this has been recently resolved by a new product that combines Landsat and Sentinel-2 images to derive daily surface water extent maps with global coverage [24, 28]. While MODIS sensors can monitor large floods daily, it cannot detect floods where there is persistent cloud cover, complex terrain such as urban environments, or smaller flash floods [6]. Sentinel-1’s SAR sensors are limited in flood detection capability in the presence of wind or vegetation as the contrast between flooded and non-flooded areas is reduced [19, 30, 31]. Due to the temporal cycle of Sentinel-1 satellites (2 to 3 days at best), SAR flood detection is more accurate for longer duration events rather than flash floods [19, 30, 31]. Several indices derived from satellite imagery, such as NDWI (Normalized Difference Water Index), WI (Water Index), RSWIR (Red and Short Wave Infra-Red), or GSWIR (Green and Short Wave Infra-Red), have been used for flood mapping, but each have limitations (e.g. tendency to classify cloud shadow as water) [32, 33]. NDWI, for example, tends to underestimate inundated areas while GSWIR, in contrast, tends to overestimate [32].

In recent years many websites and data archives containing geospatial flood data products (i.e. flood data portals) have come into existence, each with their own mission and purpose [6, 24, 29, 3438]. These portals use various sources of information from satellite imagery, government, or media sources to provide information or generate usable products about floods. Some portals such as USDA’s Global Agricultural & Disaster Assessment System are interactive web map viewers that allow users to visualize floods without any special software [24, 3840]. Some offer data products in different formats [25, 29, 37, 38, 41, 42]. The Dartmouth Flood Observatory, for example, provides a downloadable database of flood events [6, 25, 40]. Others, such as the Global Surface Water Explorer, provide high resolution map products of surface water changes [24].

Interdisciplinary example: Quantifying flood in studies examining the impact on nutrition and food security

Measurement of flood varies widely within the literature on nutrition and food security. One challenge is measuring flood at the community or household level. Another measurement challenge is distinguishing the type of flood–e.g. river, pluvial, coastal, or multiform floods–which can have different impacts [43, 44]. Many studies that have examined nutrition and food security impacts of floods have focused on single flood disasters, comparing pre/post or flooded and non-flood areas [4556]. Some studies have measured flood through self-report using surveys, having the unique advantage of directly identifying flood-affected households but raising the risk of bias due to potential overreporting of flood exposure [46, 57, 58]. Others have used direct measures from satellite or other spatial data, aggregated at various levels [10, 5961]. These studies typically do not differentiate the type of flood or main cause. Few studies in this body of literature have linked spatial data to household surveys. One recent study examined the effect of flooding on food security across broad regional, country, and all-continent scales in Africa and found both positive and negative impacts of floods and their related meteorological conditions. This study highlighted the important data gaps for assessment of flooding and food security outcomes across different spatial and temporal aggregations [44].

Linking spatial and survey data is challenging due to usability aspects. While flood data portals classify whether an area is flooded, with varying approaches, the differing spatial and temporal attributes make it difficult to link this information to surveys which include community, sub-district, or district level boundaries. In contexts where flooding is recurrent and commonplace, such a measure could deepen understanding of the impacts and how to respond. The heterogeneity in the literature makes it difficult to compare studies across contexts to determine, broadly, the human impacts of floods. But also, this heterogeneity makes it difficult to compare across studies within the same context. Currently there is no universal definition or classification system of flood for use in disaster relief efforts or research.

Goals of this paper

We share a usability framework for flood data portals for research on human outcomes that incorporates the end user perspective. To showcase this framework, we conducted a pilot study using NASA Worldview (NW), Global Surface Water Explorer (GSWE), Global Flood Awareness System (GloFAS), Global Flood Monitoring System (GFMS), and Dartmouth Flood Observatory (DFO). This framework can be adapted and aids researchers wanting to link spatial and survey data to study how natural disasters and environmental changes are impacting human outcomes, e.g., health, economics, migration, food security, or nutrition.

Flood data portals included in this pilot study

All flood data portals included in this review are publicly accessible. This is not an exhaustive list or systematic review of all flood monitoring or flood data portals in existence, of which there are many. We selected five commonly used flood data portals that had global coverage based on a literature review and exploratory search for data in consultation with an expert in the application of Remote Sensing and Geographic Information Systems. Our criteria were that the data were freely and publicly available. We included:

  • NASA Worldview (NW): NW is a large satellite data archive with both an interactive, dynamic (spatiotemporal) map viewer and data repository offering download of multiple spatial data layers. A few of the many data products it offers are related to floods or surface water extent [37]. Hosted by the United States government’s National Aeronautics and Space Administration (NASA) and built by the Earth Science Data and Information System Project, NW includes satellite derived data from all aspects of the earth system and aims to support monitoring of natural hazards by providing access to current, daily satellite imagery data [37].
  • Global Surface Water Explorer (GSWE): GSWE is an interactive, dynamic map viewer allowing visualization of long-term changes in the Earth’s surface water over 37 years [24, 42]. GSWE displays datasets comprising the Global Surface Water collection (since 1984) produced using Landsat satellite imagery. While these data are continuously being updated with no end date, the data products are made available periodically. Currently, the 1984–2021 dataset is available. Data are accessible via several download options provided through a separate data access page. The European Commission’s Joint Research Centre developed this portal and corresponding datasets under the Copernicus Program to support a range of applications such as water resource management, conservation, and food security [42]. While not specific to flood events, this portal provides information on the extent and change of surface water including development of new permanent water bodies, e.g. from seasonal to permanent inundated areas.
  • Global Flood Awareness System (GloFAS): GloFAS provides an interactive map viewer to visualize current flood forecasts and detailed information about how to access the underlying data [35, 41]. Data are derived from satellite imagery and are accessible through the Copernicus Climate Data Store and can also be downloaded through the map viewer by creating an account. Implemented by the European Commission and Copernicus Program, the Global Flood Awareness System aims to support flood disaster preparedness efforts through global flood monitoring and forecasting of large river basins [41]. This is a component of the Copernicus Emergency Management Service.
  • Global Flood Monitoring System (GFMS): GFMS is an experimental, quasi-global flood monitoring system using real-time, multi-satellite precipitation data to create a gridded flood model that users can view on 1/8th degree resolution maps [29, 62]. GFMS uses TRMM (Tropical Rainfall Measuring Mission) multi-satellite precipitation analysis and global precipitation measurement (GPM) integrated multi-satellite retrievals for GPM (IMERG) to create the flood model which is based on land surface and river tracing routing models [29]. The maps allow users to change the different parameters and view time series of regional areas. The portal provides several variables that can be viewed, but the only data available for download is flood intensity (in depth) above threshold. This portal is developed and managed by the University of Maryland and funded by NASA.
  • Dartmouth Flood Observatory (DFO): DFO is a resource of information about large flood events worldwide, provided through a range of dynamic maps and visualizations and a downloadable historical archive developed from media reports and other sources [25]. DFO originated at Dartmouth College in 1993 and then transferred to the University of Colorado in 2010 where it is hosted currently. DFO is often the first to map major flood events. Its mission is to support research and other humanitarian and water management applications.

Methods

Ethics statement

Ethical approval for the external user survey was obtained from the Tufts Social, Behavioral & Educational Research Institutional Review Board. The study met criteria for exempt status. All participants were informed of the study by reviewing a consent script on the first page of the webform and selecting “I consent to participate” to begin the survey.

Usability framework

Our framework incorporated data usability aspects as well as user feasibility. The data usability component included an in-depth review of the portals for data accessibility and availability, approaches to flood identification through the data sources and flood detection methods, and data quality through alignment, velocity, and variety (Table 1).

thumbnail
Table 1. Usability characteristics for flood data portals.

https://doi.org/10.1371/journal.pclm.0000511.t001

The user feasibility component included a survey of graduate students reviewing and rating each of the portals for several criteria. The survey captured participants’ opinions about the portals’ goals and scope, data and visualization quality, and platform accessibility from an external user perspective. We developed the survey using metrics adapted from a previous study which were based on principles of evidence, efficiency, emphasis, and ethics (Table 2) [20]. Under the evidence principle are metrics assessing the quality of a portal’s data products and whether the data is presented in line with the goals and objectives. Under the efficiency principle are metrics assessing the content organization and visual properties of a portal. Under the emphasis principle are metrics assessing a portal’s “user-friendliness” and ability to recognize the needs of a target audience and tailor the content to their perceptions, abilities, and interests. Finally, the ethics principle seeks to evaluate whether best practices are applied.

thumbnail
Table 2. Assessment metrics for external user survey of flood data portals derived from principles of evidence, efficiency, emphasis, and ethics1.

https://doi.org/10.1371/journal.pclm.0000511.t002

From the above principles, Zhou et al. designed metrics to assess functionality and operation features of nutrition dashboards, which communicate nutrition data through interactive visualizations (Table 2) [20]. While these metrics were designed mainly for the evaluation of visualization dashboards, they can be applied more broadly to other types of data platforms. The functionality and operation aspects aligned with our purposes related to the practicality of flood data portals and provided the opportunity and structure to develop a survey instrument to capture end-users’ opinions in a quantifiable way, which is critical for making recommendations to improve the portals. We developed the survey by reviewing the criteria from Zhou et al. and tailoring it to the application of flood data portals.

Pilot study

From February to March 2022, we explored the flood data portals in depth and completed a structured form containing the usability characteristics defined in Table 1 (see Appendix for data gathering form). The forms were completed with as much detail as possible that could be obtained from the websites and associated articles. We summarized the information in a table.

From May to September 2022, we recruited graduate students from Tufts University and Boston University to review each portal and complete a webform survey (see Appendix). For each portal, participants were asked to spend no more than 20 minutes exploring the portal and rating the assessment metrics using 5-point Likert responses. The survey was created and conducted using the Kobo Toolbox platform.

We used Stata 17 (StataCorp LLC, College Station, TX) for analysis of the external user assessment survey. Survey responses were recorded using a 5-point Likert scale defined as: Strongly Disagree, Disagree, Agree, Strongly Agree, Not clear/ Difficult to assess. For each of the 25 metrics in the survey, we calculated percent agreement (i.e. rated either ‘Strongly Agree’ or ‘Agree’) for each portal. Then, we assessed interrater agreement using a generalization of weighted Kappa allowing for the situation of more than 2 raters, more than 2 ratings, and an incomplete design. We calculated the Kappa statistic for each flood data portal and interpreted the level of agreement as 0 to .20 = slight, .21 to .40 = fair, .41 to .60 = moderate, .61 to .80 = substantial, and .81 to 1 = perfect. Survey respondents were also asked to leave notes about each of the portals they reviewed. We summarized common themes from these notes inductively.

Results

Usability characteristics

Table 3 shows the suite of flood-related data products offered by the flood data portals. A comparative summary of the usability characteristics across the flood data portals is presented in Table 4.

thumbnail
Table 3. Summary of flood-related data products provided by the portals in this pilot study.

https://doi.org/10.1371/journal.pclm.0000511.t003

thumbnail
Table 4. Usability characteristics of flood-monitoring data portals in this pilot study.

https://doi.org/10.1371/journal.pclm.0000511.t004

Data accessibility and availability

All five portals in this review are in the public domain and free to users. All are currently maintained but it was unclear whether they receive long-term funding. The aims and missions varied and included flood monitoring (NW, GloFAS), flood forecasting (GloFAS), disaster preparedness (GloFAS), humanitarian (DFO), water-management decision-making and applications (DFO, GSWE). Only DFO specified a research purpose. The GFMS is an experimental system and did not specify an aim or mission.

All portals make their data products accessible directly from the site but NW and GloFAS require login. The portals offer 18 different flood-related data products (Table 3). Most provide information about flood occurrences from satellite imagery. For example, the MODIS NRT Global Flood Product from NW is a daily global map of flooding derived from Terra and Aqua satellite imagery. The collection of Global Surface Water datasets from GSWE capture changes in surface water occurrence at 30m resolution (previously considered high, but currently considered moderate resolution) from Landsat imagery since 1984. While not specifically measuring flood events, GSWE datasets can be used to identify flooded areas. GloFAS data products measure ongoing and upcoming river floods from Sentinel-1 imagery. While the other portals offer multiple data products, GFMS offers only 1, flood intensity (in depth), derived from multi-satellite precipitation data. DFO data provide information on occurrence of large flood events, compiled mostly from media sources.

In addition to data on flood occurrence, NW and GSWE capture recurrence and NW, GloFAS, GFMS, and DFO provide data on other characteristics of floods (Table 3). The Flood Hazard: Frequency and Distribution gridded data from NW measures frequency of floods between 1985–2003 using DFO data. The Recurrence dataset from GSWE measures inter-annual frequency of surface water between 1984 to present time, capturing areas that were episodically flooded. NW offers Mortality Risk and Economic Risk Flood Hazard layers, providing information on mortality and economic losses. Both are derived from the International Emergency Events Database (EM-DAT). GloFAS flood monitoring has data layers of the affected population and affected land use. GFMS provides data on flood depth. DFO provides the most information about floods in its Global Archive of flood events including main cause, area affected, detailed locations, names of rivers, duration, known number of deaths and displaced, and damages in USD. None of the portals’ data products distinguish or classify the type of flood, but DFO does provide the main cause and a measure of flood severity. GloFAS data only includes river floods.

Most of the data products are geospatial, provided in raster or vector data formats. Only DFO provides a non-geospatial format–an Excel sheet of flood events.

Approaches to flood identification and their potential errors

The portals had different approaches to flood identification and minimizing type 1 (false positive) or type 2 (false negative) errors by: compositing (NW), ensemble of different flood detection algorithms (GloFAS), classification over an extended time (GSWE), or combining land surface and river runoff models (GFMS).

NW classifies floods via the MODIS Near Real Time Global Flood Product and the Flood Hazard: Frequency and Distribution layer. The Global Flood Product is derived from the MODIS (Moderate Resolution Imaging Spectroradiometer) sensor datasets from both Terra and Aqua satellites. Flood identification is based on time compositing water detections over 3 periods (1, 2, and 3-day) and differentiating them from surface water using a reference mask [63]. If a water detection does not match the reference layer, it is classified as flood. The compositing approach minimizes false flood detections by combining multiple observations, thus reducing error from clouds. For each period, all observed water detections are combined, and pixels are classified as water if they exceed thresholds of 1, 2, or 3 observations based on the periods, respectively. While this approach minimizes error, the Global Flood Product still has limitations and false detections cannot be entirely removed, especially if cloud cover is persistent over multiple observations [63]. False-negative detections may result from obscuration from buildings, canopy, or clouds, while false-positives may still result from cloud or terrain shadow or dark rock or soil. Users are encouraged to compare flood maps to reflectance imagery to determine potential sources of false-positives or false-negatives [63]. Another potential source of error is the reference mask used to differentiate surface water from floods, which is outdated and planned for improvement [63].

The Flood Hazard: Frequency and Distribution layer from NW is derived from DFO data (discussed further below).

GSWE uses high resolution Landsat satellite imagery to classify changes in surface water from 1984–2021 [24]. This portal differs from the others in its goal to detect and classify surface water, not flood events, over multiple decades. Challenges include variations in measurement across the Landsat sensors (TM, ETM+, OLI) and in the conditions in which they’re observed [24]. False water detection may be caused by shadows (e.g. from buildings, terrain, or cloud), or lava flow. GSWE applies a combination of expert systems, visual analytics, and evidential reasoning techniques to classify pixels and prevent water classification errors [24]. This is followed by manual visual inspection and cleaning of the maps to remove false detections, and further validation to handle remaining errors [24]. An important feature of GSWE data is its classification over an extended period. Because classifications are made over time, the temporal behavior of the pixels is also used in determining their likelihood of being water, thus strengthening the accuracy of the classification.

GloFAS captures river flooding using Sentinel-1 SAR satellite data and a combination of 3 separate flood detection algorithms [64]. A final ensemble algorithm generates the final flood product based on a consensus of the 3 flood detection algorithms [64]. Like NW, a reference water mask is used to differentiate flooding from permanent surface water [64]. The ensemble approach helps prevent flood classification errors attributed to any one of the flood detection algorithms. However, classification errors are possible due to limitations inherent with SAR data. False positives can occur from land characteristics such as very dry soil, frozen ground, wet snow, or impervious surfaces or from radar shadowing [64]. False negatives can occur from urbanization or dense vegetation or strong, heavy winds and rainfall [64]. To address these classification challenges, GloFAS produces exclusion mask (for effects due to ground surface characteristics) and advisory flag (for effects due to weather conditions) layers along with their flood product layers [64]. Users can apply these layers to determine which pixels to disregard.

GFMS uses a real-time multi-satellite precipitation data product from NASA’s Global Precipitation Measurement Mission to identify floods using a dual land surface with river runoff model–the Dominant River Tracing-Routing Integrated with Variable Infiltration Capacity Environment (DRIVE) model [29]. Flood is defined when the routed runoff is greater than the flood threshold of that grid cell. Precipitation biases and the size and duration of flood events can impact GFMS flood detection capability [29]. False positives can occur in areas with dams or related to winter surface features in higher latitudes. False negatives can occur due to underestimation of orographic rainfall (due to hills or mountains). Generally, GFMS has improved flood detection performance with longer, larger floods.

Unlike the other portals, the DFO historical archive is based mainly on media sources (news reports, governmental and international relief agency websites). DFO captures large flood events spanning large geographic extents which are mostly hand drawn based on where the media reported flooding. In some instances, the archive might also have satellite observations to determine the flood extents. While it is a good source for identifying where large flood events occur globally, it is not useful for identifying which specific locations or communities are affected, making it prone to type 1 error. The differences in news coverage by country and region may impact the accuracy of information about a particular flood event.

Alignment, velocity, variety

We assessed data quality through alignment, velocity, and variety (Table 2). All portals provide data products structured by spatiotemporal attributes, but only DFO provides a dataset of flood events. All provide daily or Near Real Time data except for GSWE which classifies the whole timespan. DFO is the most important source of historical data about flood events from the 1980s. NW and GSWE also provide historical data products from the 1980s, but the products available through NW are derived from DFO data and EM-DAT. While GSWE has historical data, it captures changes over time and is not specific to flood events. GSWE and GloFAS provide the highest resolution data products, but only DFO provides detailed locations including names of rivers and towns.

All portals update their data products regularly. GFMS, NW, GloFAS, GFMS, and DFO (River and Reservoir Watch), provide daily updates. DFO may have some poor or missing data in the 1990s. The flood data products provided by NW cover the 1985–2003 and January 2022-present timespans, but there is a gap between 2003 to January 2022.

External user feasibility assessment

We present an exploratory descriptive and qualitative assessment of graduate student raters from the external user assessment survey. Eight raters participated in the survey, with 6 completing ratings for all 5 portals. Fig 1 shows the percent agreement. According to raters’ opinions, GSWE and GloFAS were ranked most favorably for their usability on average (73% average agreement), while DFO and NW were ranked least favorably (39% and 37%, respectively) and GFMS was in the middle (58%). Fig 1 shows the breakdown by the assessment metrics and areas where the portals can improve their usability. For goals and scope, raters agreed for GSWE, GloFAS, and GFMS, but did not seem to agree for DFO and NW. Ratings for data and visualization quality were mixed across the portals, with raters agreeing on different aspects for each portal. For platform accessibility, raters agreed for GSWE, but less so for the other portals. For contacts, communication, and conflicts of interest, raters agreed for GFMS, GloFAS, and GSWE and did not seem to agree for DFO and NW. Inter-rater reliability for each of the portals ranged from slight to fair. When examined across all 5 rating categories and across 2 rating categories (either agree or disagree), respectively, Kappa coefficients were 0.06 and 0.40 for GSWE, 0.03 and 0.16 for GloFAS, 0.25 and 0.40 for GFMS, 0.05 and 0.08 for DFO, and 0.08 and 0.11 for NW.

thumbnail
Fig 1. Percent agreement among raters of external user assessment of flood portals in this pilot study.

https://doi.org/10.1371/journal.pclm.0000511.g001

Notes provided useful feedback about the portals. For DFO, comments indicated the difficulty of navigating the website. E.g.,

The website is not intuitive and difficult to navigate.”

There is a wealth of useful data on this portal, however, the site is not user-friendly.

For GFMS, the ease of navigating the portal was commonly described. E.g.,

This website was easy to navigate.”

This portal is well organized and provides easy to-use instructions.

But improvements could be made to the visualization tools, e.g.:

The visual presentation and design leaves somewhat to be desired—there are missing spaces and grammatical errors, and the formatting changes throughout the webpage.”

“Some data links were dead, and some visualization options resulted in blank white areas without any stated reason as to why data could not be produced.

Comments about GloFAS were positive and indicated that it was clear and user-friendly, e.g.:

This website is the most clearly laid out and the outputs are all described well.”

Comments about GSWE were mixed. Comments indicated the site was well-designed. One rater felt the data download process was difficult, while another commented on the difficulty of the graphs:

You are able to explore the data but downloading it is a bit more complicated. The website is well designed though and easy to use!”

The tool to explore global surface water was impressivehowever, either I was not able to properly understand the graphs, or the graphs weren’t working properly.

Finally, comments about NW were focused on difficulty of accessing and downloading flood-specific data, e.g.:

While this tool is visually impressive, I couldn’t even find a way to view flooding or surface water after several minutes of exploration.”

Discussion

We addressed an important data gap in interdisciplinary research of how climate events are impacting human outcomes. We provided a usability framework for flood data portals, which previously did not exist. We piloted this framework using 5 flood data portals, which resulted in a detailed comparison that is useful to researchers and host organizations seeking to improve these important and valuable data products. Our framework can be adapted to other climate data products. We did not evaluate the correctness of the flood detection algorithms in our framework, as it depends largely on the intended use and our focus was on the end-users of these data products. We found considerable heterogeneity in the data products and information provided, and identified areas where these portals could improve. Overall, these portals should include research within their scope and provide products that facilitate interdisciplinary research.

There are several case examples of how data from these portals have been used in prior research (Table 5). DFO and GloFAS have been used more frequently than the other portals. DFO has had a broader range of applications in the literature including spatiotemporal variability of floods, impacts on mortality, displacement, and economy, while GloFAS has been used mainly for early warning systems [6573]. Several studies have linked DFO to other datasets [70, 72, 74]. It appears GFMS, NW, and GSWE have been used less frequently for research. Few studies seem to use GSWE, and these have focused on changes in delta morphology and coastal morphology over time due to human activity [75, 76]. We are not aware of any studies using GSWE variables to measure flooding. These case examples point to the need for improvements to usability so these important data products can be used more efficiently in longitudinal studies to examine human impacts of flooding.

Historical data products that span several decades are needed for longitudinal studies examining impacts of flooding on health, nutrition, or other population outcomes. A few of the portals provided historical data products, with DFO providing an important historical archive of floods since the 1980s. Historical data from NW was derived from DFO data and from EM-DAT but is not ongoing. GSWE also provided important historical data taken from satellite imagery, but it captured the overall change over time. Data products that capture flood events over a long period of time are valuable for population studies, though deriving such products retrospectively is difficult due to quality of data from earlier decades.

Most data captures flood occurrences, but measures of flood recurrence are needed. Only GSWE and NW provided this type of information. GSWE provides a ‘recurrence’ layer, which captures information about the inter-annual behavior showing areas that are inundated regularly versus those that are flooded episodically [42]. NW captures flood recurrence through its ‘Flood Hazard: Frequency and Distribution’ layer, providing information about flood frequency using data from the DFO historical archive [37]. Additionally, a planned future improvement to the NW Global Flood Product is to include a ‘recurring flood’ layer to identify regularly flooded areas [63]. Incorporating other information about floods is also important, such as size, duration, depth, or damages. Only DFO and NW provided other information about floods, including the human and economic impacts. Other classifications of flood exist that incorporate potential damages [13, 25, 77]. For example, the US National Weather Service uses a 3-level classification system based on the level of damage caused [13]. If flood data portals can extend beyond flood monitoring, to incorporating these other elements, the usability of their products will improve greatly.

Spatial and survey data linkages are challenging and not intuitive for researchers in public health and nutrition related fields. All the data products in this review require extra steps to be linked to surveys. None provide aggregations by administrative boundaries, which could facilitate this process. Having multiple data formats available that are accessible to a broad range of researchers could also facilitate this process. High resolution data products are important for data linkages, as aggregating up to community, sub-district, or district levels is more advantageous than downscaling coarse data. A deeper exploration of data integration methods is needed, including their challenges, benefits, and disadvantages. This is a future direction.

Survey respondents highlighted areas where the portals could improve or provide better clarification. DFO and NW could improve the clarity of their goals and scope. All portals could improve with respect to classifying flood types, providing more information about floods, and measuring recurrence. Sub-national and sub-yearly classifications are also needed to facilitate data linkages. GSWE was the most accessible platform, with raters agreeing on some aspects for GloFAS and improvements to user-friendliness, outdated links, tutorials, and data accessibility and formats could be made to the other portals. Taken together, this external user review can help these portals determine where they could improve either in terms of their products or the clarity of their websites. It has wider application to other types of data portals or environmental hazards. Addressing scalability is another future direction.

The low inter-rater agreement could reflect either the variability of the raters or of the portals themselves. The raters in this review were graduate students who were familiar with spatial and survey data, although some had more expertise than others. The low agreement was more likely due to the variability of the portals, indicating a need for common standards to improve their usability. Some criteria may have been easier to judge than others. Regardless, the percent agreement still provides valid and important information. A larger sample size would yield more precision.

We did not conduct an extensive review of all flood or disaster data portals in existence, of which there are many. We highlighted major aspects of 5 common flood data portals that we identified. However, it is possible that other flood data portals not included in this review have different usability aspects. While not specific to other flood data portals, our findings may provide general guidance on how to improve these important data sources. The portals included in this review–NW, GloFAS, GFMS, GSWE, DFO–may take specific recommendations from these findings.

Flood data portals should include research as part of their purpose and mission and should incorporate the interdisciplinary user perspective. Usability of the data products could be improved by providing sub-national aggregations, different data formats such as statistical datasets in Stata or R format, and mechanisms that clearly indicate where data may be incomplete or unreliable. Historical data products that span decades can provide an opportunity for longitudinal research of the human impacts of climate changes and extreme events. Measures of flood recurrence, not just occurrence, are needed. Data on other characteristics of flooding such as type of flood, depth, duration, and damages are also needed. Finally, the longevity and maintenance of these data portals should be secured through long-term funding for future generations and research.

Acknowledgments

The authors would like to acknowledge and thank Ryan Simpson, Alex Carroll, Tyler Stotland, and Nilakshi Chatterji for their prior work with Tufts Institute of the Environment, Environmental Research Fellowship which inspired this paper. Thank you to Leah Beaulac and Katherine Berry–alumni of the Tufts University Friedman School of Nutrition Science and Policy–for their support on this project. Lastly, we thank the graduate students from Tufts University and Boston University for sharing their opinions which were described in this paper.

References

  1. 1. The Human Cost of Weather-Related Disasters 1995–2015. 2015.
  2. 2. Food and Agriculture Organization of the United Nations. Agriculture on the Proving Grounds: Damage and loss. [cited 21 Nov 2021]. Available: https://www.fao.org/resources/digital-reports/disasters-in-agriculture/en/
  3. 3. Salm L, Nisbett N, Cramer L, Gillespie S, Thornton P. How climate change interacts with inequity to affect nutrition. Wiley Interdiscip Rev Clim Chang. 2021;12: e696.
  4. 4. Food and Agriculture Organization of the United Nations. The impact of disasters and crises on agriculture and food security: 2021. Rome, Italy; 2021.
  5. 5. Najibi N, Devineni N. Recent trends in the frequency and duration of global floods. Earth Syst Dyn. 2018;9: 757–783.
  6. 6. Tellman B, Sullivan JA, Kuhn C, Kettner AJ, Doyle CS, Brakenridge GR, et al. Satellite imaging reveals increased proportion of population exposed to floods. Nat 2021 5967870. 2021;596: 80–86. pmid:34349288
  7. 7. Alfieri L, Bisselink B, Dottori F, Naumann G, de Roo A, Salamon P, et al. Global projections of river flood risk in a warmer world. Earth’s Futur. 2017;5: 171–182.
  8. 8. Muh Aris Marfai B. GIS Modelling of River and Tidal Flood Hazards in a Waterfront City. International Institute for Geo-Information Science and Earth Observation. 2003.
  9. 9. Widiarto LA, Kingma N. Agricultural Loss Caused by 2007 Sidoharjo’s Flood and its House-hold Impact. Indones J Geogr. 2014;46: 156.
  10. 10. Pacetti T, Caporali E, Rulli MC. Floods and food security: A method to estimate the effect of inundation on crops availability. Adv Water Resour. 2017;110: 494–504. http://dx.doi.org/10.1016/j.advwatres.2017.06.019
  11. 11. Brunner MI, Swain DL, Wood RR, Willkofer F, Done JM, Gilleland E, et al. An extremeness threshold determines the regional response of floods to changes in rainfall extremes. Commun Earth Environ 2021 21. 2021;2: 1–11.
  12. 12. Donat MG, Lowry AL, Alexander L V., O’Gorman PA, Maher N. More extreme precipitation in the world’s dry and wet regions. Nat Clim Chang 2016 65. 2016;6: 508–513.
  13. 13. Klemas V. Remote Sensing of Floods and Flood-Prone Areas: An Overview. J Coast Res. 2015;314: 1005–1013.
  14. 14. Liu X, Huang Y, Xu X, Li X, Li X, Ciais P, et al. High-spatiotemporal-resolution mapping of global urban change from 1985 to 2015. Nat Sustain 2020 37. 2020;3: 564–570.
  15. 15. Jongman B, Winsemius HC, Aerts JCJH, Coughlan De Perez E, Van Aalst MK, Kron W, et al. Declining vulnerability to river floods and the global benefits of adaptation. Proc Natl Acad Sci USA. 2015;112: E2271–E2280. pmid:25902499
  16. 16. Ceola S, Laio F, Montanari A. Satellite nighttime lights reveal increasing human exposure to floods worldwide. Geophys Res Lett. 2014;41: 7184–7190.
  17. 17. Jongman B, Ward PJ, Aerts JCJH. Global exposure to river and coastal flooding: long term trends and changes. Glob Environ Chang. 2012;22: 823–835.
  18. 18. Serpico SB, Dellepiane S, Boni G, Moser G, Angiati E, Rudari R. Information extraction from remote sensing images for flood monitoring and damage evaluation. Proc IEEE. 2012;100: 2946–2970.
  19. 19. Mateo-Garcia G, Veitch-Michaelis J, Smith L, Oprea SV, Schumann G, Gal Y, et al. Towards global flood mapping onboard low cost satellites with machine learning. Sci Reports 2021 111. 2021;11: 1–12. pmid:33790368
  20. 20. Zhou B, Liang S, Monahan KM, Singh GM, Simpson RB, Reedy J, et al. Food and Nutrition Systems Dashboards: A Systematic Review. Adv Nutr. 2019;10: S308–S319. pmid:35254406
  21. 21. Sweta LO, Bijker W. Methodology for assessing the usability of earth observation-based data for disaster management. Nat Hazards. 2013;65: 167–199.
  22. 22. Simpson RB, Gottlieb J, Zhou B, Hartwick MA, Naumova EN. Completeness of open access FluNet influenza surveillance data for Pan-America in 2005–2019. Sci Reports 2021 111. 2021;11: 1–17. pmid:33437025
  23. 23. Zhang Y, Simpson RB, Sallade LE, Sanchez E, Monahan KM, Naumova EN. Evaluating Completeness of Foodborne Outbreak Reporting in the United States, 1998–2019. Int J Environ Res Public Heal 2022, Vol 19, Page 2898. 2022;19: 2898. pmid:35270590
  24. 24. Pekel JF, Cottam A, Gorelick N, Belward AS. High-resolution mapping of global surface water and its long-term changes. Nature. 2016;540: 418–422. pmid:27926733
  25. 25. The Flood Observatory. [cited 28 Jan 2021]. Available: https://floodobservatory.colorado.edu/index.html
  26. 26. Brakenridge GR. For Research, Humanitarian, and Water Management Applications.
  27. 27. MODIS Web. [cited 28 Apr 2022]. Available: https://modis.gsfc.nasa.gov/about/
  28. 28. OPERA. OPERA Dynamic Surface Water Extent from Harmonized Landsat Sentinel-2 Provisional Product. NASA Physical Oceanography Distributed Active Archive Center; 2023. https://doi.org/10.5067/OPDSW-PL3V1
  29. 29. Wu H, Adler RF, Tian Y, Huffman GJ, Li H, Wang J. Real-time global flood estimation using satellite-based precipitation and a coupled land surface and routing model. Water Resour Res. 2014;50: 2693–2717.
  30. 30. Twele A, Cao W, Plank S, Martinis S. Sentinel-1-based flood mapping: a fully automated processing chain. http://dx.doi.org/101080/0143116120161192304. 2016;37: 2990–3004. https://doi.org/10.1080/01431161.2016.1192304
  31. 31. Tarpanelli A, Mondini AC, Camici S. Effectiveness of Sentinel-1 and Sentinel-2 for flood detection assessment in Europe. Nat Hazards Earth Syst Sci. 2022;22: 2473–2489.
  32. 32. Memon AA, Muhammad S, Rahman S, Haq M. Flood monitoring and damage assessment using water indices: A case study of Pakistan flood-2012. Egypt J Remote Sens Sp Sci. 2015;18: 99–106.
  33. 33. Sanyal J, Lu XX. Application of Remote Sensing in Flood Management with Special Reference to Monsoon Asia: A Review. Nat Hazards 2004 332. 2004;33: 283–301. :NHAZ.0000037035.65105.95
  34. 34. The Flood Observatory. [cited 7 Mar 2022]. Available: https://floodobservatory.colorado.edu/index.html
  35. 35. Hirpa FA, Salamon P, Beck HE, Lorini V, Alfieri L, Zsoter E, et al. Calibration of the Global Flood Awareness System (GloFAS) using daily streamflow data. J Hydrol. 2018;566: 595–606. pmid:32226131
  36. 36. EOSDIS Worldview. [cited 26 Apr 2022]. Available: https://worldview.earthdata.nasa.gov/?v=-206.31832957697642,-46.546875,5.787079576976424,53.015625&t=2022-04-26-T23%3A12%3A34Z
  37. 37. Worldview | Earthdata. [cited 4 May 2022]. Available: https://earthdata.nasa.gov/worldview
  38. 38. UNU-INWEH Flood Mapping Tool. [cited 26 Apr 2022]. Available: https://floodmapping.inweh.unu.edu/
  39. 39. GADAS. [cited 26 Apr 2022]. Available: https://geo.fas.usda.gov/GADAS/index.html
  40. 40. Global Flood Database. [cited 29 Apr 2022]. Available: https://global-flood-database.cloudtostreet.ai/
  41. 41. Global Flood Awareness System–global ensemble streamflow forecasting and flood forecasting. [cited 5 May 2022]. Available: https://www.globalfloods.eu/
  42. 42. Global Surface Water Explorer. [cited 5 May 2022]. Available: https://global-surface-water.appspot.com/#features
  43. 43. Kruczkiewicz A, Cian F, Monasterolo I, Di Baldassarre G, Caldas A, Royz M, et al. Multiform flood risk in a rapidly changing world: what we do not do, what we should and why it matters. Environ Res Lett. 2022;17: 081001.
  44. 44. Reed C, Anderson W, Kruczkiewicz A, Nakamura J, Gallo D, Seager R, et al. The impact of flooding on food security across Africa. Proc Natl Acad Sci U S A. 2022;119: e2119399119. pmid:36252045
  45. 45. Health Effects of Flooding in Rural Bangladesh on JSTOR. [cited 30 Apr 2022]. Available: https://www.jstor.org/stable/23214183
  46. 46. Oskorouchi HR, Sousa-Poza A. Floods, food security, and coping strategies: Evidence from Afghanistan. Agric Econ (United Kingdom). 2021;52: 123–140.
  47. 47. Sherman M, Ford J, Llanos-Cuentas A, Valdivia MJ. Food system vulnerability amidst the extreme 2010–2011 flooding in the Peruvian Amazon: a case study from the Ucayali region. Food Secur. 2016;8: 551–570.
  48. 48. Salvucci V, Santos R. Vulnerability to Natural Shocks: Assessing the Short-Term Impact on Consumption and Poverty of the 2015 Flood in Mozambique. Ecol Econ. 2020;176.
  49. 49. Cutter SL. The Perilous Nature of Food Supplies: Natural Hazards, Social Vulnerability, and Disaster Resilience. http://dx.doi.org/101080/0013915720171252603. 2016;59: 5–15.
  50. 50. Alias NA, Nor NM. Measurement needs of food security during flood disaster in Bera, Pahang. Curr Res Nutr Food Sci. 2018;6: 83–88.
  51. 51. Akukwe TI, Oluoko-Odingo AA, Krhoda GO. Do floods affect food security? A before-and-after comparative study of flood-affected households’ food security status in South-Eastern Nigeria. Bull Geogr Socio-economic Ser. 2020;47: 115–131.
  52. 52. Parvez M, Islam MR, Dey NC. Household food insecurity after the early monsoon flash flood of 2017 among wetland (Haor) communities of northeastern Bangladesh: a cross-sectional study. Food Energy Secur. 2022;11: e326.
  53. 53. Doocy S, Leidman E, Aung T, Kirsch T. Household economic and food security after the 2010 Pakistan floods. Food Nutr Bull. 2013;34: 95–103. pmid:23767284
  54. 54. Rodriguez-Llanes JM, Ranjan-Dash S, Mukhopadhyay A, Guha-Sapir D. Flood-Exposure is Associated with Higher Prevalence of Child Undernutrition in Rural Eastern India. Int J Environ Res Public Health. 2016;13: 210. pmid:26861372
  55. 55. del Ninno C, Lundberg M. Treading water. The long-term impact of the 1998 flood on nutrition in Bangladesh. Econ Hum Biol. 2005;3: 67–96. pmid:15722263
  56. 56. Rodriguez-Llanes JM, Ranjan-Dash S, Degomme O, Mukhopadhyay A, Guha-Sapir D. Child malnutrition and recurrent flooding in rural eastern India: a community-based survey. BMJ Open. 2011;1: e000109. pmid:22080535
  57. 57. Sam AS, Abbas A, Surendran Padmaja S, Raghavan Sathyan A, Vijayan D, Kächele H, et al. Flood vulnerability and food security in eastern India: A threat to the achievement of the Sustainable Development Goals. Int J Disaster Risk Reduct. 2021;66: 102589.
  58. 58. Ahmad D, Afzal M. Flood hazards, human displacement and food insecurity in rural riverine areas of Punjab, Pakistan: policy implications. Environ Sci Pollut Res. 2021;28: 10125–10139. pmid:33164109
  59. 59. Riadi B. Strategy to Maintain Food Security in the Area of Flood Hazard in Karawang Regency. IOP Conf Ser Earth Environ Sci. 2018;165: 12029.
  60. 60. Rosalia ACT, Suryanto , Hakim L. Spatial analysis of the impact of flood and drought on food security index. Nat Environ Pollut Technol. 2021;20: 721–727.
  61. 61. Rahman MS, Di L. A systematic review on case studies of remote-sensing-based flood crop loss assessment. Agric. 2020;10.
  62. 62. Global Flood Monitoring. [cited 20 May 2023]. Available: http://flood.umd.edu/
  63. 63. Slayback D (NASA). MODIS NRT Global Flood Product. Greenbelt, MD; Available: https://www.earthdata.nasa.gov/global-flood-product
  64. 64. GFM Product User Manual. Available: https://extwiki.eodc.eu/GFM/PUM
  65. 65. Passerotti G, Massazza G, Pezzoli A, Bigi V, Zsótér E, Rosso M. Hydrological Model Application in the Sirba River: Early Warning System and GloFAS Improvements. Water 2020. 2020;12: 620.
  66. 66. Alfieri L, Burek P, Dutra E, Krzeminski B, Muraro D, Thielen J, et al. GloFAS–global ensemble streamflow forecasting and flood early warning. 17: 1161–1175.
  67. 67. Moraes MAE, Neto LA dos S, Pimentel A, Brown VLR. GloFAS as a Flood Alert System in Acre Civil Defense.
  68. 68. Chen A, Giese M, Chen D. Flood impact on Mainland Southeast Asia between 1985 and 2018—The role of tropical cyclones. J Flood Risk Manag. 2020;13: e12598.
  69. 69. Kundzewicz ZW, Pińskwar I, Brakenridge GR. Large floods in Europe, 1985–2009. Hydrol Sci J. 2013;58: 1–7.
  70. 70. Cunado J, Ferreira S. The Macroeconomic Impacts of Natural Disasters: The Case of Floods. Land Econ. 2014;90: 149–168.
  71. 71. Kundzewicz ZW, Pin’skwar I, Brakenridge GR. Changes in river flood hazard in Europe: a review. Hydrol Res. 2018;49: 294–302.
  72. 72. Hu P, Zhang Q, Shi P, Chen B, Fang J. Flood-induced mortality across the globe: Spatiotemporal pattern and influencing factors. Sci Total Environ. 2018;643: 171–182. pmid:29936160
  73. 73. Halgamuge MN, Nirmalathas A. Analysis of large flood events: Based on flood data during 1985–2016 in Australia and India. Int J Disaster Risk Reduct. 2017;24: 1–11.
  74. 74. Ferreira S, Ghimire R. Forest cover, socioeconomics, and reported flood frequency in developing countries. Water Resour Res. 2012;48: 8529.
  75. 75. Mentaschi L, Vousdoukas MI, Pekel JF, Voukouvalas E, Feyen L. Global long-term observations of coastal erosion and accretion. Sci Rep. 2018;8: 1–11. pmid:30150698
  76. 76. Nienhuis JH, Ashton AD, Edmonds DA, Hoitink AJF, Kettner AJ, Rowland JC, et al. Global-scale human impact on delta morphology has led to net land area gain. Nature. 2020;577: 514–518. pmid:31969725
  77. 77. Lassa JA. Emerging “Agricultural Involution” in Indonesia: Impact of Natural Hazards and Climate Extremes on Agricultural Crops and Food System. 2012.