57

GeoAI Agency Primitives

Akram Zaytar
Rohan Sawahn
Caleb Robinson
Gilles Q. Hacheme
Girmaw A. Tadesse
Inbal Becker-Reshef
Rahul Dodhia
Juan Lavista Ferres
Main:4 Pages
3 Figures
Bibliography:2 Pages
2 Tables
Appendix:2 Pages
Abstract

We present ongoing research on agency primitives for GeoAI assistants -- core capabilities that connect Foundation models to the artifact-centric, human-in-the-loop workflows where GIS practitioners actually work. Despite advances in satellite image captioning, visual question answering, and promptable segmentation, these capabilities have not translated into productivity gains for practitioners who spend most of their time producing vector layers, raster maps, and cartographic products. The gap is not model capability alone but the absence of an agency layer that supports iterative collaboration. We propose a vocabulary of 99 primitives for such a layer -- including navigation, perception, geo-referenced memory, and dual modeling -- along with a benchmark that measures human productivity. Our goal is a vocabulary that makes agentic assistance in GIS implementable, testable, and comparable.

View on arXiv
Comments on this paper