View on GitHub View Portfolio

Unveiling the Mysteries of Coral Bleaching

A Bayesian Approach to Understanding Causality

2024-05-06

Motivation

Coral bleaching occurs when corals experience stress due to changes in environmental conditions such as temperature, light, or nutrient levels. This stress leads corals to expel their symbiotic algae, resulting in the loss of their coloration and, in severe cases, coral death.

Several factors contribute to coral bleaching, including rising sea temperatures, sea-level changes, and ocean acidification, all of which are consequences of climate change. Understanding the key environmental drivers of bleaching is critical for conservation efforts.

The objective of this study is to identify and quantify the impact of key environmental covariates on coral bleaching while also assessing how bleaching has changed over time across different locations in Florida. Using spatiotemporal modeling, we analyze trends in coral bleaching by incorporating both spatial variation (reef locations) and temporal patterns (yearly changes) within a Bayesian regression framework.

Data

This study utilizes a dataset with 2,394 observations collected between 2005 and 2016 sourced from the Florida Reef Resilience Program. The data includes various environmental and spatial covariates hypothesized to influence coral bleaching. The response variable, Percent Bleaching, measures the proportion of coral affected in each transect. Below is a list of key environmental and geographic covariates that may contribute to bleaching events:

Table 1: Environmental and Geographic Covariates
Covariate Description Units
Date_Year Year of observation
City_Town_Name Categorical variable representing the specific city or town
Lat Latitude of the coral reef transect degrees
Lon Longitude of the coral reef transect degrees
Distance_to_Shore Distance from the reef to the shoreline (km) km
Exposure Level of wave exposure (e.g., sheltered, exposed)
Turbidity Water clarity, with higher values indicating more suspended particles NTU
Cyclone_Frequency Number of cyclones affecting the area per year r
Depth_m Depth of the coral reef (meters) meters
Windspeed Average wind speed (m/s) m/s
ClimSST Climatological sea surface temperature (°C) °C
SSTA Sea surface temperature anomaly (°C) °C
SSTA_DHW Degree heating weeks derived from SSTA
TSA Thermal stress anomaly (°C) °C
TSA_DHW Degree heating weeks derived from TSA

Percent Bleaching Distribution

The Percent Bleaching data exhibits a right-skewed distribution (Figure 1), with a substantial number of observations reporting 0% bleaching.

Figure 1: Percent Bleaching Density

Figure 1: Percent Bleaching Density

Spatial Structure

Coral bleaching observations were geographically distributed across Florida’s reef systems (Figure 2). Mapping Percent Bleaching reveals spatial clustering, with certain areas experiencing more severe bleaching than others.

Figure 2: Spatial Distribution of Coral Bleaching

Figure 2: Spatial Distribution of Coral Bleaching

Temporal Structure

The dataset spans 2005 to 2016, providing an opportunity to analyze bleaching trends over time (Figure 3). Boxplots of Percent Bleaching over the years, categorized by City_Town_Name, reveal distinct temporal patterns across locations.

Figure 3: Temporal Trends in Coral Bleaching (Boxplots)

Figure 3: Temporal Trends in Coral Bleaching (Boxplots)

Model Description

Model Specification

To model the proportion of coral bleaching $Y_i$ for $i = 1, …, 2394$, we use a Bayesian Beta regression with a logit link function:

\[Y_i \sim \text{Beta}(\mu_i \phi, (1-\mu_i) \phi)\]

where $\mu_i$ is the mean bleaching percentage, and $\phi$ is the precision parameter. Various models for the mean structure were examined and defined as:

$$ \begin{aligned} \textbf{Model 1}: \text{logit}(\mu_i) &= \text{Date_Year}_i\beta_1 + \text{Lat}_i\beta_2 + \text{Lon}_i\beta_3 + \sum_{p} X_{ip}\beta_p \\ \textbf{Model 2}: \text{logit}(\mu_i) &= \text{Date_Year}_i\beta_1 + g(\text{Lat}, \text{Lon}) + \sum_{p} X_{ip}\beta_p \\ \textbf{Model 3}: \text{logit}(\mu_i) &= f(\text{Date_Year}) + \text{Lat}_i\beta_2 + \text{Lon}_i\beta_3 + \sum_{p} X_{ip}\beta_p \\ \textbf{Model 4}: \text{logit}(\mu_i) &= f(\text{Date_Year}) + g(\text{Lat}, \text{Lon}) + \sum_{p} X_{ip}\beta_p \\ \textbf{Model 5}: \text{logit}(\mu_i) &= f_{\text{City_Town_Name}}(\text{Date_Year}) + \text{Lat}_i\beta_2 + \text{Lon}_i\beta_3 + \sum_{p} X_{ip}\beta_p \\ \textbf{Model 6}: \text{logit}(\mu_i) &= f_{\text{City_Town_Name}}(\text{Date_Year}) + g(\text{Lat}, \text{Lon}) + \sum_{p} X_{ip}\beta_p \\ \end{aligned} $$

where:

\[f_\text{City_Town_Name}(\text{Date_Year}) \sim \mathcal{GP} (0, (k_c(t_i, t_j))_{i,j = 1}^n) \\\]

where the covariance function for each city $c$ is:

\[k_c(t_i, t_j) = \sigma_c^2 \exp\left( -\frac{||t_i - t_j||^2}{2 \rho_c^2} \right)\]

with:

Tensor-Product Spline for Spatial Variation

\[g(\text{Lat}, \text{Lon}) = \sum_{k_1} \sum_{k_2} \beta_{k_1 k_2} B_{k_1}(\text{Lat}) B_{k_2}(\text{Lon})\]

where:

Fixed Effects

\[\sum_{p} X_{ip} \beta_p\]

where:

Prior Specification

This model accounts for both spatial and temporal dependencies, allowing for flexible trend estimation.

Data Preprocessing

Before fitting the model, we applied several preprocessing steps:

Model Comparison

We tested the 6 models abive to evaluate different approaches for capturing spatiotemporal variation in coral bleaching. The candidate models included:

After running convergence checks, the final model was selected using Leave-One-Out Cross-Validation (LOO-CV), ensuring it provided the best balance between fit and complexity.

Table 2: Model Comparison (LOO-CV)
Model Temporal Structure Spatial Structure elpd_diff_loo1 se_diff_loo2 p_loo3 looic4 se_looic5
Model 6 City-Specific GP Tensor Smooth 0.00 0.00 80.47 −5711.89 134.45
Model 5 City-Specific GP Linear −16.19 6.26 75.56 −5679.50 133.96
Model 4 Global GP Tensor Smooth −93.13 20.50 43.11 −5525.64 133.72
Model 3 Global GP Linear −124.21 22.02 33.40 −5463.46 131.95
Model 2 Linear Tensor Smooth −460.63 33.49 24.05 −4790.64 120.73
Model 1 Linear Linear −491.58 33.90 18.32 −4728.73 118.72
1 Difference in Expected Log pointwise Predictive Density for a new dataset
2 Standard Error of component-wise elpd_diff_loo between two models
3 Effective number of parameters
4 Leave-one-out Information Criteria
5 Standard Error of looic

Selected Model

Model 6 emerged as the best-performing model in the comparison based on Leave-One-Out Information Criterion (LOOIC) and expected log predictive density (ELPD). It achieved the lowest LOOIC and the highest ELPD, indicating superior predictive accuracy while effectively balancing model complexity.

A key advantage of Model 6 was its flexible structure, incorporating:

Compared to alternative models, Model 6 provided the best trade-off between fit and generalizability, avoiding overfitting while preserving essential temporal and spatial dependencies. However, some covariates exhibited credible intervals overlapping zero, suggesting they might not contribute meaningfully. To enhance interpretability and model efficiency, we performed an iterative variable selection process, systematically removing weak predictors and reassessing model performance.

Model Refinement and Variable Selection

To improve model parsimony and predictive performance, an iterative refinement process was conducted to remove covariates that did not contribute significantly to the model. The refinement process followed these steps:

  1. Identify Non-Significant Covariates

    • Variables whose 95% credible intervals contained zero were considered weak contributors.
  2. Iterative Variable Removal & Refitting

    • The least significant covariate was removed from the model.
    • The model was then refit without that covariate to assess its impact.
  3. Evaluate Model Fit via Bayes Factor & MAE

    • Bayes Factor (BF) Comparison: The refined model was compared to the previous iteration using bayes_factor(). If BF > 10, the new model was preferred.
    • LOOIC: The reliability of how the refined model generalizes to new data was estimated. If LOOIC was lower, the new model was retained.
    • Mean Absolute Error (MAE): The predictive performance was evaluated using the PPD from refined model compared to observed Percent Bleaching to check model improvement/degradation. If MAE improved or remained stable, the new model was retained.
  4. Repeat Until No Further Improvement

    • This process continued until all remaining covariates contributed meaningfully, ensuring the final model was both interpretable and robust.

Through this process, unnecessary covariates were systematically removed, leading to a final optimized model that retained only the most relevant predictors while maintaining strong predictive accuracy.

Table 3: Model Refinement Results (BF, LOOIC, and MAE)
Prior Model Refined Model Covariate Removed BF LOOIC MAE
Model 6 −5714.0 0.0758
Model 6 Model 7 ClimSST 92.0 −5714.2 0.0759
Model 7 Model 8 Exposure 89.4 −5714.3 0.0759
Model 8 Model 9 Depth_m 5439.5 −5716.0 0.0758
Model 9 Model 10 Cyclone_Frequency 14.1 −5714.1 0.0760

Final Model

Through iterative model comparison, Model 9 was selected as the best-performing model, showing improvements over Model 6 in terms of fit and interpretability. This selection process involved removing non-significant covariates one at a time while assessing model performance metrics. Importantly, the smoothing parameters remained unchanged throughout this refinement process, ensuring consistency in spatial and temporal trends. The final model captures essential environmental and climatic predictors, balancing complexity and generalizability.

Model 9 includes key predictors such as Distance to Shore, Turbidity, Cyclone Frequency, Windspeed, Sea Surface Temperature Anomalies (SSTA), Thermal Stress Anomaly (TSA), and Degree Heating Weeks derived from TSA (TSA_DHW). These covariates were retained based on their statistical significance and their ecological relevance to coral bleaching dynamics. The refined model structure provides a robust framework for understanding and predicting bleaching patterns, facilitating targeted conservation efforts.

Goodness of Fit

A key aspect of evaluating the selected model’s reliability is examining its ability to replicate observed data patterns. Posterior predictive checks provide a direct way to assess the extent to which simulations from the model align with the actual observed data.

Posterior Predictive Checks

To evaluate the model’s fit, we conducted posterior predictive checks (PPCs), which compare the observed data to simulated draws from the posterior predictive distribution. The following visualizations assess whether the model-generated data resemble the observed coral bleaching percentages.

Distribution Overlay

Figure 4 presents an overlay of the posterior predictive distribution (PPD) against the observed bleaching percentages. The solid black line represents the observed data ($y$), while the blue-shaded posterior simulations ($y_{rep}$) provide an indication of model uncertainty. The strong alignment between the observed and predicted densities suggests that the model successfully captures the overall distribution of coral bleaching percentages.

Figure 4: Posterior Predictive Distribution vs Observed Data

Figure 4: Posterior Predictive Distribution vs Observed Data

Distributional Statistics

The set of plots in Figure 5 evaluates how well the model reproduces key summary statistics of the observed data, including:

Each histogram represents the distribution of these statistics across 8000 posterior simulations, with the vertical black line indicating the observed statistic. The Bayesian p-values assess whether the observed value is typical less than the posterior predictive distribution values. Values close to 0.5 suggest a good fit, while values near 0 or 1 may indicate potential discrepancies.

Overall, these diagnostics confirm that the final model provides a reasonable approximation of the observed data, supporting its validity for inference and prediction.

Figure 5: Posterior Predictive Checks for Summary Statistics

Figure 5: Posterior Predictive Checks for Summary Statistics

Model Results

After validating the model’s performance through posterior predictive checks, we now examine the key results. This section explores the significance of model predictors, the temporal trends in bleaching events, and the spatial distribution of bleaching risk across Florida’s coral reefs.

Variable Importance

The table below presents the estimated fixed effects from the final Bayesian Beta regression model. Each coefficient represents the effect of a predictor on the proportion of coral bleaching. The interpretation of key predictors is as follows:

Table 4: Estimated Fixed Effects from Bayesian Beta Regression
Parameter β1 SD(β)2 95% CI3
Intercept -1.5876 0.2392 (-2.0547, -1.1155)
Distance_to_Shore 0.1005 0.0339 (0.0338, 0.1660)
Turbidity -0.0784 0.0247 (-0.1270, -0.0295)
Cyclone_Frequency -0.0524 0.0260 (-0.1034, -0.0016)
Windspeed -0.0466 0.0214 (-0.0888, -0.0042)
SSTA -0.0573 0.0267 (-0.1104, -0.0047)
TSA 0.1313 0.0299 (0.0725, 0.1917)
TSA_DHW 0.0885 0.0232 (0.0425, 0.1340)
1 Parameter estimate
2 Standard Deviation of parameter estimate
3 95% Credible Interval of parameter estimate

Key Observations:

Temporal Effects

To evaluate how bleaching trends evolve over time, we analyze posterior estimates of the temporal effect from the GP component.

The plot below illustrates the estimated temporal variation in bleaching probability across five Florida counties from 2005 to 2016:

Figure 6: Temporal Effects by County (Smoothed Trends)

Figure 6: Temporal Effects by County (Smoothed Trends)

Key Findings:

This faceted plot illustrates the modeled county-specific temporal variation in bleaching probability, capturing how bleaching risk fluctuates over time in different locations.

To provide a broader comparison of modeled bleaching trends across counties, the following plot presents an overlay of the estimated temporal effects without faceting:

Figure 7: Overlaid Temporal Trends in Percent Bleaching

Figure 7: Overlaid Temporal Trends in Percent Bleaching

Key Observations:

While this plot removes individual county facets, it retains the key modeled trends and provides a clearer comparative perspective on bleaching severity across regions.

Spatial Effects

The spatial effects plot provides insights into regional differences in bleaching susceptibility. The spatial random effect was modeled using a tensor-product spline, capturing localized variations beyond the fixed effects.

Figure 8: Spatial Effects of Coral Bleaching (Modeled Estimates)

Figure 8: Spatial Effects of Coral Bleaching (Modeled Estimates)

Key Findings:

Discussion

This study applied a spatiotemporal modeling approach to assess coral bleaching trends across Florida’s reef systems, capturing both geographic variation and temporal changes. The results demonstrate that prolonged TSA and TSA_DHW as well as Distance to Shore are the strongest predictors of bleaching, with additional associations observed for SSTA, turbidity, cyclone frequency, and wind speed. While thermal stress is well-documented as a primary driver of bleaching, this analysis suggests that local environmental conditions, such as water quality and storm activity, may influence bleaching severity in complex ways.

By incorporating both spatial and temporal variation, the model identifies region-specific and time-dependent patterns of bleaching risk, reinforcing the importance of localized conservation strategies. These findings emphasize the need for continued monitoring and adaptive management that considers both large-scale climate stressors and site-specific environmental factors.

Limitations

While this study provides valuable insights, several limitations should be considered:

Future Directions

While this study leveraged the most recent available FRRP data (2005–2016), future research could benefit from new data collection to assess whether the observed trends persist under current climate conditions. Additionally, further exploration of existing datasets could provide deeper insights into bleaching patterns by incorporating complementary environmental variables or alternative modeling approaches.

Potential areas for methodological refinement include:

By focusing on these methodological improvements, future research can build upon this study’s findings to further improve bleaching risk assessments and conservation planning.