Ramkumar Yaragarla
I Hug Trees
Published: 2025-12-04
Satellite preview

Sentinel-2 satellite preview of Vancouver-region, captured on 2025-10-14 from an altitude of approximately 786 km.

Statement of Problem

Rapid urbanization and climate change are placing unprecedented stress on cities worldwide. As built-up areas expand and green spaces decline, urban heat islands intensify, air quality deteriorates, and residents face increased health risks. Understanding the spatial distribution of vegetation and development is essential for evidence-based urban planning, climate adaptation, and public health protection. This analysis quantifies the current balance between green cover and built-up surfaces in Vancouver-region, identifies thermal vulnerability hotspots, and provides actionable recommendations for enhancing urban forestry and green infrastructure.

1. Urban Forestry Assessment: Current Vegetation & Built-Up Status

1.1 Vegetation Distribution (NDVI Analysis)

The Normalized Difference Vegetation Index (NDVI) statistics for the Vancouver-region reveal a mean NDVI value of 0.099, indicating a relatively sparse green cover across the area. The median NDVI value of 0.045 further suggests that more than half of the region has low vegetation density. The standard deviation of 0.136 points to a significant variability in vegetation cover. The minimum NDVI value is -1.0, and the maximum is 1.0, which shows a wide range of vegetation health and density. The 10th percentile (p10) is -0.006, and the 90th percentile (p90) is 0.315, highlighting the skewed distribution where most areas have low to moderate vegetation cover. Spatially, the NDVI color and greyscale maps (available at NDVI color map and NDVI greyscale) show that green cover is unevenly distributed, with pockets of higher vegetation density interspersed among predominantly low-density areas.

1.2 Built-Up Intensity (NDBI Analysis)

The Normalized Difference Built-Up Index (NDBI) statistics indicate a mean value of -0.017, suggesting a relatively low built-up intensity in the Vancouver-region. The median NDBI value of 0.003 and a standard deviation of 0.099 show a moderate spread in urbanization levels. The minimum NDBI value is -0.465, and the maximum is 1.0, pointing to areas with significant built-up intensity. The 10th percentile (p10) is -0.162, and the 90th percentile (p90) is 0.090, indicating that most of the region has low to moderate built-up density. The NDBI maps (NDBI color map and NDBI greyscale) highlight specific zones with higher built-up intensity, primarily around urban centers and along major transportation corridors.

NDVI color

NDVI Color Visualization

NDVI greyscale

NDVI Greyscale (Index Values)

NDBI color

NDBI Color Visualization

NDBI greyscale

NDBI Greyscale (Index Values)

Additional Vegetation & Water Indices

Beyond NDVI and NDBI, three complementary indices provide deeper insights into vegetation health and water presence:

EVI (Enhanced Vegetation Index)

EVI improves on NDVI by correcting for atmospheric conditions and canopy background noise. It is more sensitive in areas with dense vegetation, making it useful for monitoring forest health and identifying vegetation stress that NDVI might miss.

EVI color

EVI Color Visualization

EVI greyscale

EVI Greyscale (Index Values)

MNDWI (Modified Normalized Difference Water Index)

MNDWI enhances water body detection by using green and shortwave infrared bands. It is more effective than NDWI at separating water features from built-up areas, making it ideal for mapping urban water bodies like lakes, rivers, and reservoirs.

MNDWI color

MNDWI Color Visualization

MNDWI greyscale

MNDWI Greyscale (Index Values)

NDMI (Normalized Difference Moisture Index)

NDMI measures vegetation water content by comparing near-infrared and shortwave infrared reflectance. Higher values indicate well-hydrated vegetation, while lower values suggest drought stress. This index is valuable for assessing irrigation effectiveness and identifying areas at risk of vegetation die-off.

NDMI color

NDMI Color Visualization

NDMI greyscale

NDMI Greyscale (Index Values)

2. Vegetation-Development Balance Analysis

2.1 NDVI-NDBI Difference Analysis

The NDVI-NDBI difference analysis reveals a mean difference of 0.117, indicating a slight dominance of vegetation over built-up areas in the Vancouver-region. The bins data shows that 89.15% of the region falls within the range of -0.139 to 0.480, suggesting a balance between vegetation and built-up areas. Positive values (0.480 to 1.100) account for 9.94% of the region, indicating areas where vegetation significantly dominates. Negative values (-0.760 to -0.139) cover 0.90% of the region, showing areas where built-up intensity is higher. The spatial patterns, as shown in the difference map (NDVI-NDBI difference map) and legend (Difference legend), reveal that most of the region maintains a moderate balance, with some areas showing a clear vegetation advantage and others indicating higher built-up intensity.

2.2 Overlay Interpretation

The combined overlay image (Combined overlay) provides a comprehensive view of how NDVI and NDBI interact across the Vancouver-region. Patterns emerge where high NDVI values (green areas) overlap with low NDBI values (less built-up), indicating healthy vegetation in less urbanized zones. Conversely, areas with low NDVI and high NDBI values show urban centers with minimal green cover. This overlay helps identify zones where interventions could enhance green cover or manage urban growth more sustainably.

NDVI-NDBI difference

NDVI-NDBI Difference Map (Green = vegetation-dominated; Red = built-up-dominated)

Legend
Combined overlay

Combined NDVI-NDBI Overlay

3. Urban Heat Island Risk & Thermal Vulnerability

3.1 Heat Risk Distribution

The Urban Heat Index (UHI) bins analysis identifies high-risk zones where built-up intensity exceeds vegetation cover. The mean UHI value is -0.117, with 9.24% of the region falling into the highest risk category (-1.100 to -0.480), indicating areas with significant urban heat island effects. Moderate risk zones (-0.480 to 0.140) cover 89.15% of the region, suggesting a widespread but less severe heat risk. Low-risk zones (0.140 to 0.760) account for 0.90% of the region, indicating areas where vegetation effectively mitigates heat. Spatially, the heat index map (Heat index map) and legend (Heat index legend) show that high-risk areas are primarily concentrated in urban centers, posing significant public health risks due to elevated temperatures.

3.2 Key Findings & Implications

  • Total green cover percentage: 10.9%
  • Healthy vegetation percentage: 2.06%
  • Vegetation health score: 0.421
  • Green infrastructure score: 0.216
Urban heat index

Urban Heat Index Map (Red = high risk zones)

Legend

4. Strategic Recommendations for Green Infrastructure

4.1 What's Working Well

  • Areas with strong green cover, particularly in parks and protected natural zones, contribute to maintaining ecological balance.
  • Successful cooling corridors along rivers and greenways help mitigate urban heat island effects.

4.2 Critical Challenges

  • High heat risk zones without adequate vegetation, particularly in densely built-up urban centers, require immediate attention.
  • Declining green cover in specific areas, as indicated by the low healthy vegetation percentage (2.06%), needs targeted reforestation efforts.

4.3 Evidence-Based Recommendations

  1. Implement urban greening projects in high heat risk zones to enhance vegetation cover and reduce urban heat island effects.
  2. Promote the development of cooling corridors connecting green spaces to improve air circulation and lower temperatures in urban areas.
  3. Encourage community-led initiatives for tree planting and maintenance to increase the healthy vegetation percentage and overall green cover.

Analysis Coverage Area

The interactive map below shows the exact geographical bounds of this satellite analysis. The colored overlay represents the NDVI coverage area overlaid on OpenStreetMap. You can zoom and pan to explore how the analysis boundaries align with streets, neighborhoods, and landmarks in Vancouver-region.

Interactive map: Use mouse/touch to zoom and pan. The overlay shows the satellite image bounds used for NDVI/NDBI calculations.

Note: The analysis boundaries may extend beyond administrative city limits as they represent the satellite image crop captured on 2025-10-14. This ensures complete coverage of the urban area and surrounding regions for comprehensive vegetation and heat risk assessment.

Why Urban Green Cover Matters

Urban forests and green spaces are critical infrastructure for healthy, livable cities. Trees and vegetation reduce air pollution, lower urban temperatures through shade and evapotranspiration, manage stormwater, support biodiversity, and improve mental health and well-being. As cities grow denser and climate change intensifies heat events, monitoring and protecting urban green cover becomes essential for public health and environmental resilience.

Satellite remote sensing provides objective, repeatable measurements of vegetation health and urban development patterns across entire metropolitan areas. The Normalized Difference Vegetation Index (NDVI) quantifies photosynthetic activity and green cover, while the Normalized Difference Built-up Index (NDBI) identifies impervious surfaces and development intensity. Together, these indices reveal the changing balance between nature and urban growth—and highlight where intervention is most needed.

Understanding NDVI and NDBI

NDVI (Normalized Difference Vegetation Index) measures the difference between near-infrared light (strongly reflected by healthy vegetation) and red light (absorbed by chlorophyll). Values range from -1 to +1, with higher values indicating denser, healthier vegetation. Typical ranges: 0.6-0.9 = dense forest; 0.3-0.6 = moderate vegetation; 0.1-0.3 = sparse vegetation; <0.1 = bare soil or built-up areas.

NDBI (Normalized Difference Built-up Index) uses shortwave infrared and near-infrared bands to identify constructed surfaces. Positive values indicate built-up areas (roads, buildings, concrete), while negative values suggest natural or vegetated land. When NDBI exceeds NDVI in an area, it signals high development intensity and elevated urban heat island risk.

README Note

Our Mission: We want this research dataset brief to be Simple, Authentic, and Repeatable.

1. Title of the Dataset

Urban Green Cover and Heat Risk Assessment

2. What This Dataset Is About

This dataset was created to help anyone understand how vegetation and green cover in this region is changing over time. It brings together processed satellite data, simple calculations, and a few observations that give the bigger picture.

3. Why This Dataset Matters

At I Hug Trees, we believe that Geospatial Satellite imagery and processed data should be accessible to everyone. While grounded in scientific principles, outputs are presented in accessible formats so that technical imagery and calculations resonate with ordinary people and communities. Because only when it is relatable, can it tell clear stories about our greenery and urban life: shaping how we live, how we breathe, and how we cope with rising heat.

This dataset tries to make that easier. Whether you are a researcher, policy maker, student or just curious about the environment, these numbers and images help you see trends that are not obvious at first glance.

4. Source of the Data

  • Satellite: Sentinel-2
  • Provider: Microsoft Planetary Computer
  • Acquisition Window: Past 60 days (filtered by cloud cover)
  • Cloud Cover Threshold: < 30%
  • Initial Tile Discovery: Copernicus Data Space Ecosystem browser

5. How the Data Was Processed

Basic Preprocessing:

It is important to get the right tile from the Sentinel-2 database for imagery processing. The browser feature from the Copernicus Data Space Ecosystem helped us identify the correct tile, its bounds, and the subset coordinates needed for extraction. It is always essential to double-check the preview image to verify that the fetched tile truly corresponds to the target region.

Next, we used an AWS Lambda environment for the bounds-discovery phase. This step involved fetching band data from the Microsoft Planetary Computer for dates where cloud cover was below 30% within the past 60 days. Once the acquisition date met this condition, we moved to AWS EC2 server scripts written in Python to download raw band data and process them into COGs (Cloud Optimised GeoTIFFs) along with additional indices.

These processed COGs and index outputs are then used for image displays, NDVI interpretation, and HTML digest features published on our platform. All indices raw value outputs are in JSON files for easy repeatable processing. We currently run this workflow on a quarterly schedule for each identified region.

Cloud Masking:

This dataset uses Sentinel-2's Scene Classification Layer (SCL) to remove pixels affected by clouds, shadows, haze, and saturation. Only surface-clear classes are kept, ensuring that the vegetation indices are calculated from clean, reliable pixels. The masking is applied at the pixel level, meaning every index (NDVI, EVI, etc.) is computed only on valid areas after stripping away noisy regions. This results in more trustworthy COGs, cleaner previews, and more meaningful temporal comparisons.

Calculation Formulas Used:

  • NDVI = (NIR − Red) ÷ (NIR + Red)
  • EVI = 2.5 × (NIR − Red) ÷ (NIR + 6×Red − 7.5×Blue + 1)
  • NDWI = (Green − NIR) ÷ (Green + NIR)
  • NDBI = (SWIR1 − NIR) ÷ (SWIR1 + NIR)

Tools Used:

Data Source: Microsoft Planetary Computer (Sentinel-2 L2A), rasterio, GDAL
Computation: Python 3, NumPy, SciPy, Pillow, Rasterio, rio-cogeo
AWS Pipeline: Lambda (triggers), EC2 (processing), S3 (storage), Bedrock (AI summaries)
Mapping: Leaflet.js, tile layers served from S3
Automation: Python boto3, Cron (EC2 quarterly jobs)

6. File Contents

The dataset includes metadata.json with satellite tile information, various index outputs (NDVI, EVI, NDWI, NDBI, MNDWI, NDMI), and statistical summaries. Please find the detailed list of files available for download in the Download Data & Maps section below.

7. How to Use This Dataset

You can explore the NDVI trends, plug the json file into your favourite tool, build visualisations, or compare it with earlier datasets. It's created to be flexible.

8. Leveraging AI

AI helped speed up some parts of the work, like spotting unusual patterns, creating brief insights, and checking for inconsistencies. All metadata and bin-statistics JSON files were loaded and parsed into structured dictionaries, ensuring the AI receives clean, context-rich inputs for stable summarisation and interpretation.

9. Limitations & Things to Keep in Mind

  • Cloud cover may affect accuracy
  • NDVI has known limitations
  • Spatial resolution is 10 meters
  • Some patterns may need ground truth validation

10. License / Permissions

Please refer to the How to Cite This Analysis section below for citation guidelines. This dataset is licensed under Creative Commons Attribution 4.0 International (CC BY 4.0).

11. Contact

For any questions, collaborations, or clarifications, feel free to reach out at: nature@ihugtrees.org

Data & Methods

Data Sources

  • Satellite: Sentinel-2C Level-2A (atmospherically corrected)
  • Provider: Microsoft Planetary Computer
  • Observation Date: 2025-10-14
  • Cloud Cover: 0.001693%
  • Spatial Resolution: 10 meters (NDVI), 20 meters (NDBI, resampled to 10m)

Index Calculations

  • NDVI = (NIR - Red) / (NIR + Red) using Bands 8 and 4
  • NDBI = (SWIR1 - NIR) / (SWIR1 + NIR) using Bands 11 and 8
  • Difference Index = NDVI - NDBI (positive = vegetation-dominated; negative = built-up-dominated)
  • Urban Heat Index = Composite metric flagging areas where NDBI > NDVI (high heat risk)

Processing Workflow

Images were processed using Python with the pystac-client and rasterio libraries. Cloud masking was applied using the QA60 scene classification layer. Statistics were computed using rasterio zonal statistics and exported as JSON for analysis. All geospatial outputs are provided as Cloud-Optimized GeoTIFFs (COGs) for efficient web access and GIS integration.

Limitations

  • Analysis represents a single-day snapshot; seasonal and temporal trends require time-series analysis
  • Cloud cover and atmospheric conditions affect image quality
  • 10-meter resolution may not capture individual trees or small green spaces
  • Study area boundaries reflect the satellite image crop, not administrative city limits
  • Heat risk is inferred from spectral indices; ground validation recommended for mitigation planning

Download Data & Maps

Images & Visualizations

Geospatial Data (Cloud-Optimized GeoTIFFs)

Statistical Data

How to Cite This Analysis

Recommended Citation

Yaragarla, R. (2025). Urban Green Cover & Heat Risk Assessment: Vancouver-region. I Hug Trees. Retrieved from https://ihugtrees.org/data-analytics/sentinel-ndvi/Vancouver-region/2025/12/04/digest.html

Satellite data: Copernicus Sentinel-2 (ESA), processed via Microsoft Planetary Computer.

BibTeX Entry

@misc{ihugtrees_urban_vancouver-region_2025,
  author = {Yaragarla, Ramkumar},
  title = {Urban Green Cover \& Heat Risk Assessment: Vancouver-region},
  year = {2025},
  publisher = {I Hug Trees},
  url = {https://ihugtrees.org/data-analytics/sentinel-ndvi/Vancouver-region/2025/12/04/digest.html},
  note = {Satellite data: Copernicus Sentinel-2}
}
        

License

This analysis and associated datasets are licensed under Creative Commons Attribution 4.0 International (CC BY 4.0). You are free to share and adapt this work with appropriate attribution.

Planetary Computer Citation

If using Microsoft Planetary Computer data, please cite: microsoft/PlanetaryComputer (2022)

Further Reading