News — 20 August, 2024
The Power of Volunteers: Remote Mapping in Gaza and Other Conflict Areas
As damage reaches record levels in the Gaza Strip, expert volunteers coordinated by HOT have recently completed a 5-month project to create the most comprehensive building footprints dataset available for Gaza using OpenStreetMap. Looking at pre-October 2023 imagery, this effort added a net total of 59,223 buildings to the dataset and improved thousands of existing features. In this post, we share how our work in Gaza illustrates our approach to open mapping in conflict zones, and examines the potential role of artificial intelligence and machine learning (AI/ML) in the sector.
As an open map of the world, OpenStreetMap (OSM) is a valuable tool for understanding the impacts of war and informing humanitarian response. For example, OSM can provide a baseline dataset of pre-conflict building footprints to be compared with damage analyses.
In Gaza, the escalation of conflict since October 2023 has resulted in a prolonged humanitarian crisis. Understanding the impact of the conflict on buildings has been challenging because pre-existing datasets from AI/ML models and OSM were not accurate enough to create a complete buildings baseline. The high density of buildings and the inability to safely gather on-the-ground information compounded these challenges. In such hard-to-reach areas, HOT’s remote and crowdsourced mapping methodology proved effective in collecting detailed information visible on aerial imagery.
In February 2024, after consultation with humanitarian and UN actors working in Gaza, HOT initiated the creation of pre-conflict dataset of all building footprints in the area in OSM. HOT’s community of OpenStreetMap volunteers completed all the data work, coordinating through HOT’s Tasking Manager. The volunteers meticulously added missing buildings and improved existing data. Due to concerns over data protection and quality, only expert volunteer teams were assigned to map and validate the area. As in other areas that are hard to reach due to conflict, HOT balanced the data needs with responsible data practices based on the context.
The updated data is live in OpenStreetMap and available via HDX or HOT’s Export Tool.
Comparing AI/ML with human-verified OSM building datasets in conflict zones
AI/ML is increasingly used to rapidly generate building footprints across large areas. Sources for automated building footprints range from worldwide datasets by Microsoft or Google to smaller-scale open community-managed tools such as HOT’s new application, fAIr.
Now that HOT volunteers have completely updated and validated all OSM buildings in visible imagery pre-conflict, OSM has 18% more individual buildings in the Gaza strip than Microsoft’s ML buildings dataset (an estimated 330,079 buildings vs 280,112 buildings). However, in regions where there has not been a coordinated update effort in OSM, the numbers may differ. For example, in Sudan where no large-scale editing campaign has occurred, OSM contains just under 1.5 million buildings, compared to over 5.82 million buildings in Microsoft’s ML data. It is important to note that the ML datasets have not been human-verified and their accuracy is sometimes uncertain. Google Open Buildings has over 26 million building features in Sudan, but visual inspection reveals that many of these features are noise in the data, with the model incorrectly identifying structures in the uninhabited desert.
If spatial coverage is relatively similar between OSM and ML datasets, automated datasets created by ML image detection work well for many use cases. For other use cases, the precision and verification of a human-checked dataset, such as OSM, is a better fit for the data requirements. For example, during our work in Gaza, we identified issues with ML building datasets that were avoided by working with experienced human volunteers (see image below comparing OSM to Microsoft’s ML building datasets). However, it’s important to highlight that ML building datasets differ so might not have the same issues.
What’s next?
When all buildings in conflict zones are reflected in one pre-conflict dataset, data analysts can track which buildings are functional and which are destroyed. Eventually, we will be able to track demolition and rebuilding when the context allows. HOT welcomes data requests from the humanitarian community and will aim to balance requests with anticipated usefulness, available resources, and protection issues to address humanitarian data gaps in conflict zones worldwide.
With increasingly complex emergencies displacing a record number of people globally, aid organizations are struggling to keep up. While automation is improving the speed and ability to remotely collect information in hard-to-reach areas, HOT’s approach to coordinating trained volunteers is more important than ever. Lessons learned from the data work in Gaza demonstrates that when higher accuracy data is needed, incorporating a human view into automated ML datasets remains crucial.
There are many ways to get involved with our efforts. Contact us at data@hotosm.org to partner, volunteer, or donate to ongoing efforts in conflict-affected areas worldwide.For information on recent damage in Gaza or Sudan, visit UNOSAT’s Gaza Interactive Data Platform and UNOSAT Analyses, or the Decentralized Damage Mapping Group.
Pdf version with the summary of Gaza Update 2024
Cover image: Gaza City in 2006. Public domain.