Vision-Language Models (VLMs) in GeoAI Systems: Enhancing Brownfield Change Detection through Semantic Reasoning
Loading...
Files
Date
Contributor
Advisor
Editor
Performer
Department
Instructor
Depositor
Speaker
Researcher
Consultant
Interviewer
Interviewee
Narrator
Transcriber
Annotator
Journal Title
Journal ISSN
Volume Title
Publisher
Journal Name
Volume
Number/Issue
Starting Page
1610
Ending Page
Alternative Title
Abstract
This paper presents a hybrid GeoAI methodology for semantic change detection in satellite imagery by integrating Vision–Language Models (VLMs) into an unsupervised clustering-based pipeline. Building on earlier work using K–Means clustering to validate pre-selected brownfield regions from SPOT (2021–2023) and aerial imagery, we address the persistent issue of semantic ambiguity, particularly in high-uncertainty zones. We introduce a zero-shot, reasoning-based verification layer that evaluates whether visual differences across time are structurally meaningful. This approach improves interpretability, traceability, and diagnostic robustness. Evaluation across 1,000 human-labeled samples and a temporally unseen test set of size 200 (SPOT23–SPOT24) demonstrates notable improvement in ambiguous zones in reasoning quality and error transparency, especially where prior methods faltered. Our framework maintains the speed and scalability of clustering while injecting semantic precision through natural language decision paths. Designed with the UN’s SDG 11 (Sustainable Cities and Communities) in mind particularly for brownfield redevelopment this work contributes to scalable, interpretable, and operationally viable GeoAI systems.
Description
Citation
Extent
10 pages
Format
Type
Conference Paper
Geographic Location
Time Period
Related To
Proceedings of the 59th Hawaii International Conference on System Sciences
Related To (URI)
Table of Contents
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International
Rights Holder
Catalog Record
Local Contexts
Email libraryada-l@lists.hawaii.edu if you need this content in ADA-compliant format.
