Mapping Housing Stock Characteristics from Drone Images for Climate Resilience in the Caribbean

Isabelle Tingzon, Nuala Margaret Cowan, and Pierre Chrzanowski
The World Bank Group, GFDRR
{tisabelle, ncowan, pchrzanowski}@worldbank.org

Abstract

Comprehensive information on housing stock is crucial for climate adaptation initiatives aiming to reduce the adverse impacts of climate-extreme hazards in high-risk regions like the Caribbean. In this study, we propose a workflow for rapidly generating critical baseline housing stock data using very high-resolution drone images and deep learning techniques. Specifically, our work leverages the Segment Anything Model and convolutional neural networks for the automated generation of building footprints and roof classification maps. By strengthening local capacity within government agencies to leverage AI and Earth Observation-based solutions, this work seeks to improve the climate resilience of the housing sector in small island developing states in the Caribbean.

1 Introduction

The Caribbean is among the world’s most climate-vulnerable regions due to the prevalence and intensity of extreme climate hazards such as storms, floods, and landslides. Category 5 hurricanes like Dorian, Irma, and Maria have devastated many small island developing states (SIDS) in recent years, leaving widespread trails of loss and destruction across the region. SIDS bear substantial economic costs from climate-extreme events, with the highest degree of damages often sustained in the housing sector [19]. Hurricane Maria, for example, destroyed over 90% of Dominica’s housing stock, accumulating costs over 200% of the nation’s GDP [18]. As global temperatures continue to rise, extreme weather events will only grow in severity, putting many more vulnerable homes and shelters at risk.

In response to these challenges, governments and international organizations have developed ambitious climate resilience programs to reduce the adverse effects of extreme climatic hazards in the housing sector [19, 9]. Climate resilience initiatives generally require comprehensive housing stock information to inform better retrofitting, reconstruction, and relocation plans. However, the traditional house-to-house approach to identifying high-risk buildings can be extremely expensive and time-consuming to implement, prompting the need for more timely and cost-efficient alternatives.

Recent years have seen a growing interest in using drones for disaster risk reduction and recovery [9, 28]. Previous works have successfully applied deep learning (DL) techniques to very high-resolution (VHR) aerial imagery to extract baseline housing information such as building footprints and rooftop attributes [27, 10, 22, 14, 24]. However, despite the evident advantages DL and Earth observation (EO)-based technologies, the widespread adoption of these solutions is often hindered by gaps in local capacity to develop and maintain systems for generating baseline exposure datasets.

This work aims to bridge these gaps by providing government agencies with an end-to-end workflow for rapidly generating critical baseline housing information using VHR drone images. Specifically, we leverage the Segment Anything Model (SAM) for building footprint delineation and convolutional neural networks (CNNs) for roof type and roof material classification. We also evaluate the cross-country generalizability of roof classification models across SIDS to determine the extent to which models trained in one country can be adapted to another. This work is developed under the Digital Earth Project for Resilient Housing and Infrastructure in partnership with the Government of the Commonwealth of Dominica (GoCD) and the Government of Saint Lucia (GoSL) to enhance the climate resilience of the housing sector of SIDS in the Caribbean.

2 Data

To generate our ground truth datasets, we use (1) VHR aerial images in the form of aircraft- and drone-derived optical imagery and (2) building footprints in vector polygons, with Dominica and Saint Lucia as our primary regions of interest.

VHR Aerial Images. We acquired the following VHR aerial images from partner government agencies GoCD and GoSL, the World Bank Global Program for Resilient Housing (GPRH) [8], and the open data platform OpenAerialMap [5]: (1) aircraft-derived post-disaster orthophotos of Dominica; (2) aircraft-derived orthophotos of Saint Lucia; (3) pre- and post-disaster drone images of 10 villages and cities across Dominica; (4) drone images of 3 districts in Saint Lucia. The spatial resolutions of aircraft-derived images range from 10 to 20 cm/px, whereas the resolutions of drone-derived images range from 2 to 5 cm/px. For more information on the spatial resolution, coverage, and year of acquisition of the aerial images, see Table 3 in the Appendix.

Building Footprints Data. We obtained nationwide building footprints delineated from the aircraft-derived orthophotos of Dominica and Saint Lucia from the World Bank. For drone images with no corresponding building footprints, we initially looked to alternative data sources such as OpenStreetMap [6], Microsoft [4], and Google [2]. However, we found these publicly available building footprint datasets to be nonviable due to significant misalignment with the underlying drone images, as illustrated in Figure 2. To address this challenge, we used the Segment Anything Model (SAM) to delineate building instances directly from drone images [15]. Additional information on the SAM configuration used in this study is detailed in Section 3.

Rooftop Image Tiles. For roof classification, we selected approximately 15,000 building footprints across Dominica and Saint Lucia. For each building footprint, we extract the minimal bounding rectangle of the building polygon, scaled by a factor of 1.5, from the corresponding aerial image. We then proceeded to annotate a total of 9,150 buildings in Dominica and 5,175 buildings in Saint Lucia via visual interpretation of VHR aerial images. The buildings are classified based on two attributes: roof type and roof material, the class distributions of which are presented in Table 1. We note that Saint Lucia does not contain any samples for the "Blue Tarpaulin" category. Figure 1 illustrates examples of drone image tiles for each roof type and roof material category.

Table 1: The distribution of roof type and roof material labels across Dominica and Saint Lucia.

		Dominica			Saint Lucia
		Train	Test	Total	Train	Test	Total
Roof Type	Gable	2,669	653	3,322	2,347	585	2,932
	Hip	1,579	393	1,972	1,089	271	1,360
	Flat	1,894	475	2,369	456	106	562
	No Roof	1,190	297	1,487	269	52	321
Roof Material	Healthy metal	1,934	482	2,416	2,396	598	2,994
	Irregular metal	1,733	432	2,165	1,113	276	1,389
	Concrete/cement	1,240	312	1,552	328	75	403
	Blue tarpaulin	1,094	260	1,354	0	0	0
	Incomplete	1,331	332	1,663	324	65	389
Source	Aircraft	5,936	0	5,936	2,485	0	2,485
	Drone	1,396	1,818	3,214	1,676	1,014	2,690
	Total	7,332	1,818	9,150	4,161	1,014	5,175

3 Methods

This section outlines our workflow for generating critical housing stock information in the Caribbean using drone images, as summarized in Figure 3.

To extract building footprints from drone images, we use Segment-Geospatial for segmenting raster images based on the Segment Anything Model (SAM) [29, 20, 21, 15]. We also leverage Language Segment Anything [16] to combine instance segmentation with text prompts to generate masks of specific objects in the drone images. We set our text prompt to "house", the box threshold (i.e. the threshold value used for object detection in the image) to 0.30, and the text threshold (i.e. the threshold value used to associate the detected objects with the provided text prompt) to 0.30. As a post-processing step, we apply the Douglas-Peucker algorithm to simplify the generated building polygons with a tolerance of $5e^{-6}$ [12].

Given the VHR aerial images and corresponding building footprint polygons, we proceed with developing our roof classification models. We begin by fine-tuning CNN models pre-trained on the ImageNet dataset [11] with architectures ResNet50 [13], VGG-16 [23], Inceptionv3 [25], and EfficientNet-B0 [26] using cross-entropy loss for both roof type and roof material classification tasks. The input rooftop image tiles are zero-padded to a square of size 224 x 224 px for ResNet50, EfficientNet-B0, and VGG-16 and 299 x 299 px for InceptionV3. We set the batch size to 32 and the maximum number of epochs to 60, and we use an Adam optimizer with an initial learning rate of $1e^{-}5$ , which decays by a factor of 0.1 after every 7 epochs with no improvement. For data augmentation, we implement horizontal and vertical image flips with a probability of 0.50 and random rotations ranging from $-90^{\circ}$ to $90^{\circ}$ . Given that our data is imbalanced (see Table 1), we implement random oversampling for the minority classes. To prevent overconfident predictions, we apply label smoothing as a regularization technique, with smoothing set to 0.1 [17].

4 Results and Discussion

For each country in our dataset, we set aside 80% of the data for training and the remaining 20% for testing, using stratified random sampling for the test set to preserve the percentage of samples per class as shown in Table 1. We note that the test sets for both countries are comprised entirely of drone images. Additionally, to test whether geographically diverse training data improves the prediction, we combine the training sets across Dominica and Saint Lucia (henceforth referred to as the "combined" dataset). We report the standard performance metrics F1-score, precision, recall, and accuracy.

Our results indicate that for Dominica, the best performance is achieved by an EfficientNet-B0 model for roof type classification (F1-score: 87.1%) and a ResNet50 model for roof material classification (F1-score: 89.5%). Likewise, for Saint Lucia, the best F1-score is attained by a ResNet50 model for roof type classification (F1-score: 89.5%) and an EfficientNet-B0 model for roof material classification (F1-score: 91.71%). For models trained on the combined dataset (i.e., "combined model"), the best results are obtained by an EfficientNet-B0 model for roof type classification (F1-score: 90.0%) and an Inceptionv3 model for roof material classification (F1-score: 90.4%). For the complete results and sample outputs, see Table 4 and Figure 4, respectively.

Next, we investigate the cross-country generalizability of the best-performing models by evaluating their performance on the designated test sets of each country. As shown in Table 2, our results indicate that for roof type classification, the combined model performs marginally better than models trained using only local data (i.e. data from the same country); however, for roof material classification, we find that locally trained models consistently outperform the combined model. In general, we find that local models demonstrate high levels of variability in performance in out-of-distribution countries, indicating the importance of collecting localized training data and the need for further studies on domain adaptation to reduce performance degradation in the face of geographical distribution shifts.

Table 2: Cross-country generalization of roof type and roof material classification models.

Training Data	Test Data	F1 score	Precision	Recall	Accuracy
(a) Roof Type
Dominica	Dominica	87.08	87.06	87.37	86.41
Dominica	Saint Lucia	88.39	88.93	87.92	93.04
Saint Lucia	Dominica	82.59	84.72	81.39	82.31
Saint Lucia	Saint Lucia	89.49	94.11	85.97	94.28
Combined	Dominica	88.02	88.51	88.01	87.39
Combined	Saint Lucia	91.92	93.63	90.36	95.67

Training Data	Test Data	F1 score	Precision	Recall	Accuracy
(a) Roof Material
Dominica	Dominica	89.50	90.49	88.84	89.44
Dominica	Saint Lucia	90.83	92.04	89.93	92.94
Saint Lucia	Dominica	64.15	63.14	67.32	72.99
Saint Lucia	Saint Lucia	91.71	93.81	90.00	93.79
Combined	Dominica	88.17	89.66	87.16	87.78
Combined	Saint Lucia	90.37	93.10	88.58	93.55

5 Local Capacity Building

The overarching goal of this project is to strengthen local capacity in SIDS to leverage AI and EO-based solutions for resilient housing operations. To this end, our team is assisting government agencies in establishing Geographic Information Systems (GIS) units capable of generating, managing, and maintaining large-scale disaster risk datasets. We have partnered with the Humanitarian OpenStreetMap Team (HOT) [3] to design a training program on how to operate drones, coordinate pilots, process collected imagery, and manage the resulting geospatial datasets. Additionally, we have developed educational resources on geospatial data processing aimed at local government staff, community mappers, and disaster responders, including Google Colaboratory and Jupyter Notebook tutorials demonstrating how to run the SAM and CNN models to quickly generate baseline housing stock data from locally collected drone images [1, 7].

6 Conclusion

This work proposes a workflow for filling baseline exposure data gaps in the Caribbean using VHR drone images and DL. We demonstrate how computer vision and drone-based technologies can be used to rapidly generate housing stock information, including building footprints and roof classification maps, for disaster risk reduction and recovery. Based on our evaluation of the cross-country generalizability of DL models, we urge caution in applying locally-trained models off the shelf to new geographic regions and emphasize the importance of collecting local, highly contextualized training data. We also emphasize the importance of local capacity building, skills development, and co-creation of geospatial datasets in deploying sustainable AI-for-climate solutions, especially in Global South contexts. We conclude this study by urging governments in SIDS to invest in the digital infrastructure and local capabilities needed to sustainably generate and maintain EO-derived housing stock data for climate resilience.

Acknowledgments

This work is developed under Digital Earth for Resilient Housing and Infrastructure in the Caribbean, a World Bank project funded by the Global Facility for Disaster Reduction and Recovery (GFDRR), in partnership with the Government of the Commonwealth of Dominica (GoCD) and the Government of Saint Lucia (GoSL). This study builds on the initial work done by the World Bank’s Global Program for Resilient Housing (GPRH). We thank Mike Fedak and Christopher Williams for their assistance in providing data access.

References

[1] Building Footprint Delineation for Disaster Risk Reduction and Response (Part I). https://colab.research.google.com/github/GFDRR/caribbean-rooftop-classification/blob/master/tutorials/01_building_delineation.ipynb.
[2] Google Open Buildings. https://sites.research.google/open-buildings/. Accessed on 15.09.2023.
[3] Humanitarian OpenStreetMap Team (HOT). https://www.hotosm.org/.
[4] Microsoft Building Footprints. https://www.microsoft.com/en-us/maps/building-footprints. Accessed on 15.09.2023.
[5] OpenAerialMap. https://map.openaerialmap.org/. Accessed on 15.09.2023.
[6] OpenStreetMap Buildings. https://osmbuildings.org/. Accessed on 15.09.2023.
[7] Rooftop Classification from Drone Imagery for Disaster Risk Reduction and Reponse (Part II). https://colab.research.google.com/github/GFDRR/caribbean-rooftop-classification/blob/master/tutorials/02_building_classification.ipynb.
[8] Global Program for Resilient Housing (GPRH). https://www.worldbank.org/en/topic/disasterriskmanagement/brief/global-program-for-resilient-housing, January 2022.
[9] World Bank. Capturing Housing Data in Small Island Developing States, 2022. Washington, DC. Creative Commons Attribution CC BY 4.0.
[10] M Buyukdemircioglu, R Can, and S Kocaman. Deep learning based roof type classification using very high resolution aerial imagery. The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, 43:55–60, 2021.
[11] Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009.
[12] David H Douglas and Thomas K Peucker. Algorithms for the reduction of the number of points required to represent a digitized line or its caricature. Cartographica: the international journal for geographic information and geovisualization, 10(2):112–122, 1973.
[13] Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
[14] Xingliang Huang, Libo Ren, Chenglong Liu, Yixuan Wang, Hongfeng Yu, Michael Schmitt, Ronny Hänsch, Xian Sun, Hai Huang, and Helmut Mayer. Urban Building Classification (UBC)-A Dataset for Individual Building Detection and Classification From Satellite Imagery. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1413–1421, 2022.
[15] Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C Berg, Wan-Yen Lo, et al. Segment anything. arXiv preprint arXiv:2304.02643, 2023.
[16] Luca Medeiros. Language Segment-Anything. https://github.com/luca-medeiros/lang-segment-anything, 2023.
[17] Rafael Müller, Simon Kornblith, and Geoffrey E Hinton. When does label smoothing help? Advances in neural information processing systems, 32, 2019.
[18] Government of the Commonwealth of Dominica. Post-Disaster Needs Assessment Hurricane Maria September 18, 2017. https://www.gfdrr.org/sites/default/files/publication/Dominica_mp_012418_web.pdf, September 2017.
[19] Government of the Commonwealth of Dominica. Dominica Climate Resilience and Recovery Plan 2020–2030. https://odm.gov.dm/wp-content/uploads/2022/02/CRRP-Final-042020.pdf, 2020.
[20] Lucas Osco. AI-RemoteSensing: A collection of Jupyter and Google Colaboratory notebooks dedicated to leveraging Artificial Intelligence (AI) in Remote Sensing applications, June 2023.
[21] Lucas Prado Osco. AI-RemoteSensing. https://github.com/LucasOsco/AI-RemoteSensing, 2023.
[22] Tahmineh Partovi, Friedrich Fraundorfer, Seyedmajid Azimi, Dimitrios Marmanis, and Peter Reinartz. Roof Type Selection based on patch-based classification using deep learning for high-resolution Satellite Imagery. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences-ISPRS Archives, 42(W1):653–657, 2017.
[23] Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
[24] Roman A Solovyev. Roof material classification from aerial imagery. Optical Memory and Neural Networks, 29:198–208, 2020.
[25] Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jon Shlens, and Zbigniew Wojna. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2818–2826, 2016.
[26] Mingxing Tan and Quoc Le. Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning, pages 6105–6114. PMLR, 2019.
[27] Isabelle Tingzon, Nuala Margaret Cowan, and Pierre Chrzanowski. Fusing VHR Post-disaster Aerial Imagery and LiDAR Data for Roof Classification in the Caribbean using CNNs. arXiv preprint arXiv:2307.16177, 2023.
[28] Luis Triveno, Sarah Antos, Jan Koers, and Victor Endo. Coupling Imagery from Drones and Street-View with Proper Incentives To Promote Sustainable Urban Cadasters in Developing Countries. In Proceedings of the World Bank Conference on Land and Poverty, Washington, DC, USA, pages 25–29, 2019.
[29] Qiusheng Wu and Lucas Prado Osco. samgeo: A Python package for segmenting geospatial data with the Segment Anything Model (SAM). Journal of Open Source Software, 8(89):5663, 2023.

Appendix

Table 3: Aircraft- and drone-derived aerial images used in this study.

Coverage

Resolution

Year

Source

Data Provider

Building

Count

Dominica

(nationwide)

20.0 cm/px

2018-2019

Aircraft

GoCD

5,936

Colihaut

2.7 cm/px

2017

Drone

GoCD

373

Coulibistrie

2.3 cm/px

2017

Drone

GoCD

158

Delices

4.3 cm/px

2018

Drone

OpenAerialMap [5]

380

Dublanc

2.9 cm/px

2017

Drone

GoCD

126

Kalinago

3.3 cm/px

2018

Drone

OpenAerialMap [5]

102

Laplaine

4.9 cm/px

2018

Drone

OpenAerialMap [5]

456

Marigot

3.4 cm/px

2018

Drone

OpenAerialMap [5]

387

Pichelin

3.4 cm/px

2017

Drone

GoCD

149

Roseau

2.6 cm/px

2017

Drone

GoCD

348

Salisbury

3-5 cm/px

2018

Drone

OpenAerialMap [5]

280

Saint Lucia

(nationwide)

10.0 cm/px

2022

Aircraft

GoSL

2,485

Castries

4.5 cm/px

2019

Drone

GPRH [8]

1,084

Dennery

4.2 cm/px

2019

Drone

GPRH [8]

742

Gros Islet

3.6 cm/px

2019

Drone

GPRH [8]

864

Table 4: Test set (%) results of different CNN architectures for roof type and roof material classification trained using (a) only Dominica data, (b) only Saint Lucia data, and (c) using a combination of Dominica and Saint Lucia. Models trained on the combined dataset are evaluated using the combined test sets of Dominica and Saint Lucia.

		F1 score	Precision	Recall	Accuracy
(a) Dominica
Roof Type	VGG16	86.15	86.86	85.79	85.37
	ResNet50	86.16	85.80	86.70	85.70
	Inceptionv3	86.53	86.93	86.44	85.70
	EfficientNet-B0	87.08	87.06	87.37	86.41
Roof Material	VGG16	89.21	90.11	88.73	88.89
	ResNet50	89.50	90.49	88.84	89.44
	Inceptionv3	88.24	89.72	87.36	88.17
	EfficientNet-B0	89.00	90.20	88.23	88.94

		F1 score	Precision	Recall	Accuracy
(b) Saint Lucia
Roof Type	VGG16	88.02	91.29	85.36	93.10
	ResNet50	89.49	94.11	85.97	94.28
	Inceptionv3	87.50	94.32	83.02	93.45
	EfficientNet-B0	88.10	94.77	83.51	93.29
Roof Material	VGG16	88.27	92.38	85.41	91.52
	ResNet50	91.55	92.99	90.28	93.20
	Inceptionv3	91.42	93.28	89.91	93.15
	EfficientNet-B0	91.71	93.81	90.00	93.79

		F1 score	Precision	Recall	Accuracy
(c) Combined (Dominica + Saint Lucia)
Roof Type	VGG16	88.70	88.70	88.72	89.03
	ResNet50	88.71	88.63	88.87	89.24
	Inceptionv3	89.38	89.29	89.62	89.60
	EfficientNet-B0	90.03	90.07	90.25	90.45
Roof Material	VGG16	90.18	92.09	88.72	90.73
	ResNet50	89.82	91.76	88.36	90.80
	Inceptionv3	90.42	92.03	89.10	90.80
	EfficientNet-B0	90.09	92.37	88.45	91.09