GeoAI Agency Primitives

2 April 2026

Akram Zaytar

Rohan Sawahn

Caleb Robinson

Gilles Q. Hacheme

Girmaw A. Tadesse

Inbal Becker-Reshef

Rahul Dodhia

Juan Lavista Ferres

LM&Ro

VLM

AI4CE

ArXiv (abs)PDF HTML Github

Main:4 Pages

3 Figures

Bibliography:2 Pages

2 Tables

Appendix:2 Pages

Abstract

We present ongoing research on agency primitives for GeoAI assistants -- core capabilities that connect Foundation models to the artifact-centric, human-in-the-loop workflows where GIS practitioners actually work. Despite advances in satellite image captioning, visual question answering, and promptable segmentation, these capabilities have not translated into productivity gains for practitioners who spend most of their time producing vector layers, raster maps, and cartographic products. The gap is not model capability alone but the absence of an agency layer that supports iterative collaboration. We propose a vocabulary of $9$ primitives for such a layer -- including navigation, perception, geo-referenced memory, and dual modeling -- along with a benchmark that measures human productivity. Our goal is a vocabulary that makes agentic assistance in GIS implementable, testable, and comparable.

View on arXiv

Comments on this paper