Malcomb, Weaver and Krakowka (2014) published one of the first sub-national geographic climate change vulnerability models for a developing country (1.4). The authors intended for the study to be replicable across space (other African countries with similar data available) (7.1), time (when new survey data is published) (4.5 and 7.1), and vulnerability stimuli (7.1). The study’s social impacts are to address extreme vulnerability to climate change (1.3) and assisting in the allocation and evaluation of foreign aid (1.2). The methodology was designed to be “transparent and easily replicable” (2.1) in its use of “locally derived indicators and granular data” (2.1). The study was designed to address critiques of vulnerability models aimed at their uncertainty and sensitivity due to problems of scale and spatial aggregation, normative and subjective modelling decisions, and data availability, and challenges in model comparability (2.1). The model uses household adaptive capacity data from the United States Agency for International Development (USAID) Demographic and Health Surveys (DHS) (1.4 and 4.1) available in 44 African countries (7.1), livelihood sensitivity data from the USAID / Famine Early Warning Systems Network (FEWSnet) livelihood zones baseline surveys available in 23 African countries (3.6), and global physical exposure data from the United Nations Environment Programme (UNEP) Global Risk Data Platform.
This replication study is motivated by three factors. First, there is an urgent need to evaluate the reproducibility of research in human-environment and geographical sciences (HEGS) and to establish protocols and infrastructure for conducting and publishing reproduction/replication studies and reproducible research in HEGS. Second, a fully reproducible publication can be more readily replicated in new geographic, temporal, and thematic contexts, and tested for uncertainty due to data constraints and subjective modelling decisions. Third, climate change is causing increasingly severe in Africa. Improving the reproducibility and replicability of climate vulnerability research will hopefully enhance the potential for research to inform policy and reduce harm caused by climate change.
Malcomb et al (2014) produce two models of interest for Malawi. Figure 4, labelled “Malawi Household Resilience”, visualizes the average adaptive capacity score of households in each traditional authority. Figure 5, labelled “Malawi Composite Vulnerability Index”, visualizes vulnerability scores by locations (cells) in a continuous raster grid. In this study, we will attempt to identically reproduce figure 4 (adaptive capacity by traditional authority) and figure 5 (vulnerability grid) using The R Project for Statistical Computing and the same data sources cited in the original publication. We will visually compare our resulting reproduction figures with the original figures. Comparison will be aided by digitizing and joining the original figure results to the reproduction results for each model, and then calculating any differences between them. Differences will be visualized with thematic maps for both models, a confusion matrix for figure 4 (adaptive capacity by traditional authority), and a scatterplot for figure 5 (vulnerability grid). An exact reproduction should produce exact replicas of the rank order of traditional authorities by adaptive capacity and grid cells by vulnerability. We will test this with the Spearman’s Rho Correlation Coefficient, expecting values of 1 for perfect correlation.
The original study is a descriptive geographic multi-criteria analysis based on local expert opinion, and therefore has no testable hypotheses or effects.
The replication study data and code will be made available in a GitHub repository to the greatest extent that licensing and file sizes permit. The repository will be made public at github.com/HEGSRR/RPr-Malcomb-2014
Malcomb, D. W., E. A. Weaver, and A. R. Krakowka. 2014. Vulnerability modeling for sub-Saharan Africa: An operationalized approach in Malawi. Applied Geography 48:17–30. DOI:[10.1016/j.apgeog.2014.01.004](https://doi.org/10.1016/j.apgeog.2014.01.004).
Reproducibility, Vulnerability, GIS, Climate Change, Africa
The reproduction study design will first implement the original study as closely as possible to reproduce the 2010 Household Resilience map (F4) and Malawi Vulnerability Map (F5). Our two confirmatory hypotheses are that we will be able to independently reproduce results for both maps.
The working hypotheses are therefore:
H1: There is no perfect positive correlation between Malcomb et al’s ranking of traditional authorities by household resilience and our reproduction study’s ranking of traditional authorities by household resilience.
H2: There is no perfect positive between Malcomb et al’s ranking of locations by climate vulnerability and our reproduction study’s ranking of locations by climate vulnerability.
We will evaluate each of these hypotheses using a Spearman’s Rho Correlation. A failure to reject these hypotheses would indicate that our results do not exactly match those of the original authors. A positive correlation approaching 1 would indicate a partial reproduction
The original study is observational and descriptive, with no hypotheses or effect sizes. The study is a multi-criteria analysis using geographic information systems (GIS) to implement a hierarchical geographic model of climate change vulnerability model in Malawi.
The spatial extent of the study was the country of Malawi. The spatial scale of the study was the third administrative level (traditional authorities) and a raster grid of unknown spatial resolution. The temporal extent of the study was explicitly 2004—2010 (4.5), but the contains secondary data collected earlier (3.6 and F5).
The original study included over 70 qualitative semi-structured interviews with officials from development agencies, government, and non-governmental organizations (1.4). 47 specific interviewee titles are listed in a “full list of the interviews” (A1), leaving uncertainty around the remaining 23 (or more) interviewees. The study included 11 village focus groups (1.4). The interviews and focus groups were selected to focus on villages and organizations connected at different levels of organization to “externally designed adaptation projects” (3.2). We presume that the interviewees and focus groups were selected based upon their association with externally designed and funded climate change adaptation projects, which are neither equally nor justly distributed (Barrett 2014). No minimum sample size or stopping criteria were defined.
The original authors used exclusively secondary data for the vulnerability models and did not subset or sample from the secondary data. The published results based on DHS surveys include data 203 traditional authorities in 2010 (F4), whereas the authors state that there are more than 250 populated traditional authorities in Malawi (4.4). The 2010 household resilience data is based upon 24,850 DHS household surveys (5.2). Furthermore, “not every traditional authority had surveys conducted within its administrative boundaries” (5.2). This suggests possible spatial sampling problems in the DHS survey data in the context of its application in this study. The authors’ rationales for using the third level administrative units of traditional authorities include matching the political level at which many projects are planned and assessed, and identifying hotspots of vulnerability that are lost in the aggregation to second level districts (4.4). The original paper does not contain any further detail on the spatial sampling or distribution of DHS surveys vis a vis traditional authorities.
The model themes, indicators, and weights were selected based upon 70 interviews and 11 village focus groups from field trips to Malawi in March and August of 2011 (1.4, 4.2 and A1). Themes and indicators were also contextualized in literature (3.3 through 3.7) and adjusted based on redundancy and representativeness across the country (4.3). The model and weights were adjusted through “several iterations of the model using alternative weighting schemes” (4.3) to produce a “final product that reflects Malawi’s contextual and perceptual vulnerability” (4.3).
Each theme was constructed of indicators from a single data provider: adaptive capacity is measured with USAID DHS surveys, livelihood sensitivity is measured with FEWSnet/Malawi Vulnerability Assessment Committee (MVAC) livelihood zones baseline data, and physical exposure is measured with UNEP Global Risk Data Platform data (T1 and T2). Although the authors emphasize a grounded local evidence-based selection of indicators and weights (2.1, 4.2, 5.1 and 7.1), other evidence in the publication suggests a model design based on a more pragmatic combination of factors including expert local opinion, deductive theory, and the availability and characteristics of data.
The study did not use any randomization.
The original study was conducted using STATA™ (4.4) and ArcGIS™ (4.6, F3 and F4) with unspecified software versions, by 2012 according to creation dates on map figures (F3, F4 and F5).
The study was originally conducted using ArcGIS and unspecified statistical software. This reproduction study uses R, including the rdhs package for DHS survey data, the sf package for vector analysis, the stars package for raster analysis, and the tmap package for cartography.
# set up default knitr parameters
knitr::opts_chunk$set(
  echo = FALSE,
  fig.width = 8,
  fig.path = paste0(here("results", "figures"), "/")
)
# these values allow you to access private and public raw data more efficiently
private_r <- here("data", "raw", "private")
public_r <- here("data", "raw", "public")
public_d <- here("data", "derived", "public")
scratch <- here("data", "scratch")Major lakes were downloaded from MASDAP, the Malawi Spatial Data Platform.
Dissolve lakes into a single multi-part feature with one field
EA containing the value Lake.
Livelihood zones geographic data may be downloaded from the FEWS NET Data Center at https://fews.net/fews-data/335.
Livelihood sensitivity data is derived from household economic analysis (HEA) baseline surveys of livelihood zones created by MVAC in collaboration with USAID and FEWSnet (3.6). Livelihood zones are distinct from traditional authorities (5.6). They are “geographic areas where populations share characteristics of farming practices, labor, and environmental coping strategies” (3.6). Eleven zones were surveyed in 2003 (3.6). An MVAC 2005 report on livelihood zones appears in the references with an expired URL (R).
Livelihood sensitivity is measured with the following variables from FEWSnet livelihood zone data.
Livelihood zones attribute data was provided by FEWS NET in the form
of one three spreadsheets describing typical livelihood profiles for
each zone, with one sheet for poor households, one for
middle income households, and one for rich
households. This data was based on focus groups with stakeholders in
each livelihood zone. The authors have summarized the individual
poor household spreadsheets into one comprehensive table of
variables relevant to the study.
In order to prepare geographic livelihood zone data for analysis,
geometry errors are fixed, national parks are removed, and the
coordinate reference system is transformed to EPSG:4326 (WGS 1984)
geographic coordinates. Livelihood zone attribute data is then joined to
the geographic data by livelihood zone code LZCODE.
Physical exposure data is derived from the United Nations Environment Programme (UNEP) Global Risk Data Platform (1.4) as global (3.7) continuous raster data (5.6). The climate vulnerability map also cites the Dartmouth Flood Observatory (1999-2007) (F5). According to the references to Peduzzi (2011, 2012), the data for flood risk and drought exposure is available from UNEP/DEWA/GRID-Europe at preview.grid.unep.ch/. The drought risk data is based on “a global monthly gridded precipitation dataset obtained from the Climatic Research Unit (University of East Anglia)” and “a global Standardized Precipitation Index based on Brad Lyon (IRI, Columbia University) methodology” (3.7).
Physical exposure is measured with the following two indicators.
The UNEP Global Risk Data Platform used for this research is no longer available online. The data is provided with the research compendium.
Household adaptive capacity data is derived from USAID DHS Surveys conducted in 2004 and 2010 (1.4). Readers are referred to the DHS website for an “explanation on using survey data with GPS information” (4.4). The website, www.measuredhs.com, is provided in the references, and forwards to dhsprogram.com. There were 24,850 household surveys in 2010 (5.2), providing data for 203 traditional authorities (F3).
Adaptive capacity is composed of assets and access with the following DHS survey variables.
Geographic USAID Demographic and Health Survey (DHS) data requires pre-approved access clearance and login credentials from the DHS Program. For this reproduction study, the following procedure was used to gain access:
The rdhs package can be used to download the data,
provided a login email and project name via console and password via
pop-up dialogue.
Download the Malawi 2010 survey data and geographic points.
Load tabular data of household surveys
Load geographic data of household survey clusters. Some household survey points are erroneously placed at the WGS 1984 coordinate reference system origin (Equator and Prime Meridian).
In order to simultaneously maximize reproducibility while avoiding direct redistribution of DHS GPS data, we spatially join the GPS data to the Traditional Authority enumeration areas. Adaptive capacity is ultimately mapped by traditional authority, but the data comes from household-level surveys. Surveys are grouped into clusters with one geographic point. Therefore, the traditional authority to which each survey will be assigned must be spatially joined to the cluster point, and then joined by attribute to the household survey. The adaptive capacity calculation at the household level also requires urban/rural status, which is stored in the cluster.
Many household surveys contain inconclusive answers (e.g. “I don’t know”) or are missing data for survey questions used in the adaptive capacity calculation. The livestock variable will be calculated as a sum of four livestock types, so we remove any household with uncertain answers about any of the livestock types and remove households with missing data for all livestock types. Households with answers about some livestock types and missing data for others are still included in the data.
We remove incomplete household surveys.
Some of the authors had already examined the data and attempted a reproduction study prior to writing the preregistered analysis plan.
The spatial extent of the study was the country of Malawi (OSM relation 195290), excluding large bodies of water, national parks or similarly reserved land, and areas missing data (4.5). 203 traditional authority areas were included in the original study (F4).
The authors suggest that the scale of the phenomena of vulnerability dynamics in the context of climate change is at the household level (1.4, 2.2, 3.1 and 4.4). The authors use the third administrative level (traditional authorities) as the spatial scale and units of analysis of household resilience (4.4 and F4). The spatial support for the final analysis of climate vulnerability is a raster grid (4.6, F5) with unknown spatial resolution—appearing finer than the size of traditional authorities and the smallest unit on the scale bar, which is 12.5 kilometers (F5). We presume that the spatial resolution may be identical to at least one of the gridded physical exposure raster inputs.
Edge effects and neighboring countries will not be addressed in the analysis (4.2). The spatial analysis techniques in this study are not sensitive to edge effects.
The analysis does not include creation of any spatial subgroups and does not measure or account for any spatial autocorrelation, spatial heterogeneity, or spatial anistropies.
The replication study will focus on reproducing 2010 household resilience (F4) and climate vulnerability (F5), excluding the 2004 household resilience analysis (F3). The aim of this reproduction is to produce results identical to the original study. Therefore, we will not collect new interview or focus group data. Additionally, qualitative interview and focus group data was not provided with the original study. Therefore, we will not attempt to reinterpret any qualitative data or determine new themes, indicators or weights for the models. The reproduction study will use the indicators and weights as they are described in the original study.
The replication study will use a different software environment, using replicable open source software over proprietary software. Specifically, the study will be completed using The R Project for Statistical Computing version 3.6.1 or later using RStudio version 1.3.1 or later, and the research will be completed in full on both Windows 10 and MacOS operating systems. A complete list of required R packages is not known at the time of preregistration, but will be reported with the final publication.
The study will attempt to reproduce the original methods exactly, but
some differences may be inevitable due to ambiguous or conflicting
information in the original article. We will plan to make the following
reasonable decisions, which may differ from the authors’ intentions: 1.
Figure 4 represents adaptive capacity, composed of assets and access. 1.
Adaptive capacity scores will be calculated for each household, and then
household scores will be spatially joined by traditional authority and
averaged. 1. Figure 5 represents vulnerability, composed of adaptive
capacity, livelihood sensitivity, and physical exposure. 1. Every
indicator will be rescaled to a 0 to 4 scale using the formula:
percent rank * 4. This method is a compromise from the
uncertainty caused by a 0 to 5 scale, quintiles, and nominal indicators.
1. High ranks (4) will be assigned to better and safer conditions for
each indicator. 1. Weighted aggregation will be formulated so that the
aggregate scores have a theoretical minimum of 0 and maximum of the
assigned percentage for the thematic concept. - Assets = ([land] * 0.06
+ [livestock units] * 0.04 + [wealth] * 0.04 + [number sick] * 0.03 +
[orphans] * 0.03) * 25 - Access = ([water] * 0.04 + [cell phone] * 0.04
+ [radio] * 0.03 + [electricity] * 0.03 + [cooking fuel] * 0.02 +
[urban/rural] * 0.02 + [female household] * 0.02) * 25 - Livelihood
sensitivity = ([subsistence food] * 0.06 + [wage income] * 0.06 + [cash
crop income] * 0.04 + [disaster coping] * 0.04) * 25 - Physical exposure
= (flood risk * 0.2 + drought exposure * 0.2) * 50 1. Each
thematic indicator will be rasterized or resampled to the UNEP/GRID data
input most closely resembling the spatial resolution of figure 5. 1.
Vulnerability will be calculated so that the aggregate scores have a
theoretical minimum of 0 and maximum of 100. This is achieved by
inverting physical exposure. - Vulnerability = Assets + Access +
Livelihood sensitivity + (40 - Physical Exposure) 1. Any traditional
authority missing adaptive capacity data from DHS surveys will be
removed / masked from the final vulnerability analysis.
The authors use the terms and definitions of “adaptive capacity”, “vulnerability” and “resilience” inconsistently in the paper. Depending on context, both “vulnerability” and “resilience” may refer only to the assets and access portion of the model or to the full model inclusive of adaptive capacity (assets and access), livelihood sensitivity, and exposure. In some sections and formulas, “adaptive capacity” is used, while in others it appears as its components (assets and access) or even as “household resilience”. The data mapped in figures 3 and 4 are described in different sections of the original paper as resilience, adaptive capacity, and/or vulnerability. The data mapped in figure 5 are described in different sections of the article as vulnerability or resilience. Confounding matters further, the authors’ definition for resilience is typical for resilience theory, but the formula for resilience is more typical of the well-known Intergovernmental Panel on Climate Change (IPCC) operationalization of “vulnerability” (Gallopín 2006, Smit and Wandel 2006).
Adaptive capacity is analyzed in the spatial units of traditional authorities (4.4, 5.2, F3 and F4). This aggregation from households to traditional authorities is accomplished with a spatial join (5.2) with an average of the individual household resilience scores (5.2, 6.3, F3 and F4), and classified into four classes with the Natural Breaks Jenks classification method (F3 and F4). In the methodology section, the authors state that “DHS Indicators were disaggregated to the village level” (4.4). Since the DHS data is described as using the household level of aggregation, it is not clear how or why the data would be “disaggregated” to the village level.
Vulnerability is analyzed in the spatial units of gridded cells (F5). Each theme was converted to the raster grid data format (4.6). The paper does not specify the parameters for raster conversion, including the relationship between vector polygons and raster grid cells or the spatial resolution of the grid cells. Presumably, only adaptive capacity and livelihood sensitivity were converted, since physical exposure is already in a raster grid format. The paper does not specify if or how any geographic transformation is required for the biophysical risk grids from the UNEP Global Risk Data Platform. It may be possible to infer the resolution and methods from close inspection of the final climate vulnerability map (F5).
In our reproduction study, the data visualized in figure 4 of the original study and referred to as 40% Adaptive Capacity in table 2 was referred to as adaptive capacity while the data visualized in figure 5 of the original study and referred to as 100% Household Resilience in table 2 was referred to as vulnerability.
For the purposes of the reproduction study, the data visualized in figure 4 of the original study and referred to as ‘40% Adaptive Capacity’ in table 2 will be referred to as adaptive capacity while the data visualized in figure 5 of the original study and referred to as 100% Household Resilience in table 2 will be referred to as vulnerability.
The variables for adaptive capacity are aggregated into thematic concepts and referenced in the original paper as outlined below:
Calculate percent rank for each component of household adaptive capacity. We had to make many assumptions about calculating individual components, e.g. about how to aggregate different forms of livestock, and which values to invert such that high numbers correspond to low capacity (e.g. number of orphans or sick members of the household). Rescaling to a quintile rank as described in the original study is unclear, especially considering the number of discrete or even binary inputs. We have made a judgement call to do this by calculating percent rank and multiplying by 4, producing a theoretical domain of 0 to 4 similar to that of quintiles.
Calculate household-level adaptive capacity scores based on original
study Table 2 weights. The indicators have already been rescaled to a
possible domain of 0 to 4, and the weights sum
to 0.4, giving a possible domain of adaptive capacity
scores from 0.0 to 1.6.
Summary statistics of adaptive capacity and its components at the household level.
Join adaptive capacity data to geographic TAs and rescale in attempt to match original publication. The original publication figure 4 shows ranges from 11.48 to 25.77, but after rescaling indicators to domains of 0 to 4 and multiplying by percentages in table 2 (which sum to 0.4), the theoretical domain is only 0 to 1.6. We might suppose that the authors had rescaled adaptive capacity to a possible domain of 0 to 40 in accordance with the 40% weight of adaptive capacity in the overall vulnerability model. Therefore, we may multiply our possible domain of 0 to 1.6 by 25 to achieve a possible domain of 0 to 40.
| rpac_unscaled | rpac | |
|---|---|---|
| nbr.val | 215.00 | 215.00 | 
| nbr.na | 24.00 | 24.00 | 
| min | 0.30 | 7.41 | 
| max | 0.68 | 16.90 | 
| range | 0.38 | 9.48 | 
| median | 0.43 | 10.66 | 
| mean | 0.44 | 10.99 | 
| std.dev | 0.07 | 1.80 | 
The original publication uses the Jenks Natural Breaks method to classify the data.
| rpac_class | n | 
|---|---|
| 1 | 67 | 
| 2 | 80 | 
| 3 | 53 | 
| 4 | 15 | 
| NA | 24 | 
Map reproduction results for comparison to figure 4.
In order to test the adaptive capacity results, we will georeference the original figure 4 map using the QGIS3 georeferencer plugin. Using a vector dataset of traditional authorities and the georeferenced map, we will then use zonal statistics to extract the average brightness values, (which represent four classes of adaptive capacity) for each traditional authority. We will use an interior buffer of the traditional authority polygons, optimized in order to avoid summarizing border symbol in zonal statistics while capturing as much of the choropleth color symbol as possible. After inspecting a histogram of the mean brightness values, we will reclassify the values as closely to the four classes on the original figure 4 as possible and then manually adjust the attribute values for any misclassified traditional authorities. We will compare original and reproduction household resilience results by creating a confusion matrix, calculating the Spearman’s Rho correlation coefficient (expecting a value of 1 for perfect positive correlation), and creating a thematic map of the difference between the original results and replication results.
Ordinal data from figure 4 was digitized in QGIS with the following procedure:
pdf file using
Adobe Acrobat Pro.png file with pixel
dimensions 1982 by 2811ta_v.gpkg using WGS 84 geographic coordinates (epsg:4326).
Use linear georeferencing with points in
metadata\malcomb_fig4.png.pointsta_v to UTM 36S epsg:32736:
ta_v_fig4.gpkg:utm36s.-600m:
ta_v_fig4.gpkg:utm36s.ta_v_fig4.gpkg:buffer_wgs84.ta_v_fig4.gpkg:r,
ta_v_fig4.gpkg:rb and ta_v_fig4.gpkg:rbgta_v layer by
the ID_2 attribute:
ta_v_fig4.gpkg:ta_v_fig4orac (original
adaptive capacity) using the field calculator and CASE
statements, choosing break points that classify most traditional
authorities correctly.orac attribute
for any mis-classified area.ta_v_fig4.gpkg:fig4_errors.
Other areas are coded as follows:| code | description | 
|---|---|
| -3 | polygon too small to discern color or pattern fill | 
| -2 | white fill not matching any legend item | 
| -1 | pattern fill for “missing DHS data” | 
| 1 | lowest adaptive capacity | 
| 2 | … | 
| 3 | … | 
| 4 | highest adaptive capacity | 
Load digitized figure 4 data and display counts of results. Convert
all forms of missing data to NA to be excluded from mapping
and statistics. Join original figure 4 adaptive capacity results to
ta_v.
| orac | n | 
|---|---|
| -3 | 3 | 
| -2 | 30 | 
| -1 | 3 | 
| 1 | 38 | 
| 2 | 56 | 
| 3 | 72 | 
| 4 | 37 | 
Map original figure 4.
Calculate and map difference between the two maps.
The comparison of Figure 4 in the reproduction study and the original
study reflects some areas in Northern and Southern Malawi where the
reproduction study has a lower adaptive capacity than the original study
(resulting in -2). Most areas reflect differences of 0 to -1, implying
that our study calculated slightly lower adaptive capacity of Malawi
than the original study.
##    
##      1  2  3  4
##   1 34 27  6  0
##   2  4 26 44  5
##   3  0  0 19 29
##   4  0  0  0  3## 
##  Spearman's rank correlation rho
## 
## data:  ta_v$rpac_class and ta_v$orac
## S = 268637, p-value < 2.2e-16
## alternative hypothesis: true rho is greater than 0
## sample estimates:
##       rho 
## 0.7891711Create bounding box representing the spatial extent of Malawi. Create
a raster grid frame matching the extent of the bounding box and the
spatial resolution of the drought exposure raster, which is
0.041667 decimal degrees. Although the flood risk raster
has a coarser spatial resolution, visual inspection of the original
figure 5 suggests that the finer spatial resolution of drought exposure
was used for the original analysis.
Convert adaptive capacity to raster grid.
Clip and warp drought exposure to match our extent and spatial resolution.
Create a mask with the adaptive capacity results so that lakes,
conservation areas, and traditional authorities with no data will not
skew the classification / rescaling of drought exposure. Apply this mask
to drought exposure. Masking is our own decision based on intuition: it
is not specified in the original publication.
Classify drought exposure into quintile classes (0 to 4) Then rescale to 20% by multiplying by 4.
Clip and warp flood risk to match our extent and spatial resolution.
Mask and rescale flood. Since flood is already on scale from 0 to 4,
simply multiply by 5 to achieve the 20% weight.
Calculate livelihood sensitivity indicators from FEWSnet livelihood zone baseline profiles of poor households according to table 2.
Rescale livelihood sensitivity indicators into quantiles.
##              pctOwnCrop pctIncWage pctIncCashCrops pctDisasterCope ownCrop
## nbr.val            18.0       18.0            18.0            18.0    18.0
## nbr.null            0.0        0.0            13.0             1.0     1.0
## nbr.na              0.0        0.0             0.0             0.0     0.0
## min                29.4        9.7             0.0             0.0     0.0
## max                88.0       50.3            75.1            71.9     4.0
## range              58.6       40.6            75.1            71.9     4.0
## sum              1059.3      489.6           171.8           236.5    36.0
## median             55.0       24.7             0.0             8.8     2.0
## mean               58.9       27.2             9.5            13.1     2.0
## SE.mean             3.1        2.6             5.3             3.7     0.3
## CI.mean.0.95        6.6        5.5            11.2             7.9     0.6
## var               176.8      121.3           507.8           251.0     1.6
## std.dev            13.3       11.0            22.5            15.8     1.3
## coef.var            0.2        0.4             2.4             1.2     0.6
##              wageIncome cashCropIncome disasterCope
## nbr.val            18.0           18.0         18.0
## nbr.null            1.0            1.0          1.0
## nbr.na              0.0            0.0          0.0
## min                 0.0            0.0          0.0
## max                 4.0            1.2          4.0
## range               4.0            1.2          4.0
## sum                36.0           17.6         36.0
## median              2.0            1.2          2.0
## mean                2.0            1.0          2.0
## SE.mean             0.3            0.1          0.3
## CI.mean.0.95        0.6            0.2          0.6
## var                 1.6            0.1          1.6
## std.dev             1.3            0.4          1.3
## coef.var            0.6            0.4          0.6Calculate aggregate livelihood sensitivity score
##              sensitivity
## nbr.val            18.00
## nbr.null            0.00
## nbr.na              0.00
## min                 5.88
## max                14.00
## range               8.12
## sum               161.65
## median              8.65
## mean                8.98
## SE.mean             0.56
## CI.mean.0.95        1.18
## var                 5.65
## std.dev             2.38
## coef.var            0.26Convert livelihood sensitivity into raster grid
Calculate an aggregated vulnerability score by adding low adaptive capacity (invert adaptive capacity by subtracting from the maximum score of 40), livelihood sensitivity, drought exposure, and flood risk.
\[ Vulnerability = (40 - Adaptive Capacity) + Livelihood Sensitivity + Drought Exposure + Flood Risk \]
The additive vulnerability scores are between 33.08 and 82.94. The map
reflects medium to high vulnerability in most areas (ranging around the
50-70 area).
Calculate an aggregated vulnerability score by multiplying low adaptive capacity (invert adaptive capacity by subtracting from the maximum score of 40), livelihood sensitivity, and drought exposure added to flood risk.
\[
Vulnerability = (40 - Adaptive Capacity) * Livelihood Sensitivity *
(Drought Exposure + Flood Risk)
\]
The geometric vulnerability scores are between 0 and 16,209.26. The high
areas of vulnerability that can be identified are in the north and south
of Malawi, whereas the low areas of vulnerability that can be identified
are in central Malawi.
In order to compare the Malawi vulnerability results, we will georeference the original figure 5 map using QGIS georeferencer plugin. We will vectorize the UNEP-Grid raster input most closely matching the published map and summarize the red, green, and blue brightness values of the original map using zonal statistics. We will add the green and blue brightness values together to convert the original color ramp into a linear scale of continuous values. We will compare original and reproduction Malawi vulnerability results by creating a scatterplot, Spearman’s Rho correlation coefficient (expecting a value near 1 for perfect positive correlation), and thematic map of the difference between the original results and replication results.
Comparing the reproduction of figure 5 with the original figure 5 requires first digitizing the original figure 5 (unclassified choropleth map with yellow to red gradient) in QGIS as follows:
pdf file using Adobe Acrobat Pro.png file with pixel
dimensions 1949 by 2811ta_v.gpkg using WGS 84 geographic coordinates (epsg:4326).
Use linear georeferencing with points in ...ta_capacity.tif raster to vector polygonszonal statisticsgeoref_bg.gpkg.To approximate data values from the yellow to red gradient of the original map, the blue and green bands are then added, inverted, and rescaled to a range from 0 to 100.
## 
##  Spearman's rank correlation rho
## 
## data:  vulnerability_p$orv and vulnerability_p$rpv
## S = 7087504387, p-value < 2.2e-16
## alternative hypothesis: true rho is not equal to 0
## sample estimates:
##       rho 
## 0.1974578Map differences in Figure 5
In conclusion, our replication study aimed to reproduce the 2010 household resilience (F4) and climate vulnerability (F5) assessment in Malawi. We clarified definitions of “adaptive capacity” and “vulnerability” in our reproduction study as the original study vaguely outlined the term definitions. The replication process involved various components, including recalculating adaptive capacity scores, rescaling indicators, and exploring different methods for aggregating vulnerability scores. While our study was generally successful in replicating several aspects of the original analysis, we encountered spots in Malawi that reflected different adaptive capacity. We attribute this to limitations in data availability and a lack of clarity regarding the original study’s decisions. These differences, particularly in the treatment of geometry errors and missing data, could not be definitely determined from the original study.
One aspect to think about in our study is the comparison map of Figure 4. The map compared adaptive capacity in our study vs the original study, and most areas reflect differences of 0 to -1, implying that our study calculated slightly lower adaptive capacity of Malawi than the original study. This may be due to methodological differences as there were minor variations to the original study.
Another important aspect of our study was the comparison of the additive and geometric aggregation methods for vulnerability assessment. The original study used the additive method, which resulted in high vulnerability scores for most areas in Malawi. This made it challenging to identify regions with a vulnerability score of 0. Implementing a geometric aggregation method, on the other hand, allowed for a broader range of vulnerability scores that included 0 and went up to 16,209. Using the geometric aggregation method resulted in more areas with lower vulnerability as seen in Figure 6. The difference was particularly evident when examining the maps in Figure 5 and 6. The additive method identified more regions in Malawi as highly vulnerable, whereas the geometric method predominantly highlighted the most vulnerable areas. Geometric aggregation acknowledges the interactions between indicators and provides a more nuanced perspective on vulnerability (Tate 2012).
Our study emphasizes the importance of clarity and transparency in reporting methodologies and data sources, as discrepancies in data availability and treatment can lead to variations in results. Additionally, the choice of aggregation method for vulnerability assessment is critical, as it can significantly impact the identification of vulnerable areas and influence resource allocation for climate adaptation initiatives.
However, our study is not without limitations. It is based on a limited population interview sample from Malawi, and it relies on private DHS GPS data that does not permit direct redistribution, making it challenging for others to reproduce our work. Therefore, future research in this field should prioritize access to data, transparent reporting, and the exploration of alternative aggregation methods to enhance the accuracy and utility of climate vulnerability assessments in Malawi and similar regions. These efforts are vital to inform effective climate resilience strategies and foster sustainable development in the face of climate change.
This report and its preregistration were written after already attempting the reproduction study, including acquisition and analysis of all of the secondary data sources required. However, the preregistered analysis plan was written as if we had no prior knowledge of the data other than what is documented in the study. Holler has previously reviewed and compared other climate vulnerability models for Malawi, and conducted a scoping study in the Lilongwe and Mangochi districts of Malawi in 2015, including meeting with the Regional Centre for Mapping of Resources for Development (RCMRD) consultants who created the Malawi Hazards and Vulnerability Atlas (2015).
Malcomb, D. W., E. A. Weaver, and A. R. Krakowka. 2014. Vulnerability modeling for sub-Saharan Africa: An operationalized approach in Malawi. Applied Geography 48:17–30. DOI:[10.1016/j.apgeog.2014.01.004](https://doi.org/10.1016/j.apgeog.2014.01.004).
Barrett, S. 2014. Subnational Climate Justice? Adaptation Finance Distribution and Climate Vulnerability. World Development 58:130–142. DOI: 10.1016/j.worlddev.2014.01.014. Gallopín, G. C. 2006. Linkages Between Vulnerability, Resilience, and Adaptive Capacity. Global Environmental Change 16 (3):293–303. DOI: 10.1016/j.gloenvcha.2006.02.004. Rufat, S., E. Tate, C. G. Burton, and A. S. Maroof. 2015. Social vulnerability to floods: Review of case studies and implications for measurement. International Journal of Disaster Risk Reduction 14:470–486. DOI: 10.1016/j.ijdrr.2015.09.013. Smit, B., and J. Wandel. 2006. Adaptation, adaptive capacity and vulnerability. Global Environmental Change 16 (3):282–292. DOI: 10.1016/j.gloenvcha.2006.03.008.