Requisition Id 16118 Overview: As a U.S. Department of Energy (DOE) Office of Science national laboratory, Oak Ridge National Laboratory (ORNL) has an extraordinary history of solving some of the nation's most complex scientific and security challenges. ORNL's mission is carried out by a dedicated and creative staff working across disciplines to accelerate scientific discovery and translate research into impactful energy, environmental, and national security solutions. The Geospatial Data Modelling Group within the Human Dynamics Section, part of the Geospatial Science and Human Security Division at ORNL, is seeking a Geospatial Data Engineer to support research and operational workflows focused on scalable geospatial data science, applied machine learning, and production-grade engineering practices to deliver repeatable, defensible, and time-dynamic geospatial products in support of national security, humanitarian response, disaster assessment, and resilience planning. In this technical role, the candidate will collaborate with an interdisciplinary team of human geographers, population scientists, geospatial analysts, data scientists, and software engineers. They will contribute across the full lifecycle of geospatial modeling efforts: data acquisition and preparation, feature engineering, model development and evaluation, MLOps and codebase maintenance, automation, and quality assurance. A key component of this position is building agentic AI workflows that help discover, gather, validate, and standardize open-source data for downstream geospatial analytics and machine learning. The position offers a unique opportunity to work on applied spatial analytics and geospatial data modeling at scale, leveraging diverse geospatial, demographic, and remotely sensed data sources. While the role does not require independent development of novel AI algorithms, it does require strong implementation skills, sound statistical judgment, and an ability to translate methods into reliable, maintainable, and well-documented pipelines. Major Duties and Responsibilities: Develop, maintain, and operationalize geospatial data science pipelines across ingestion, feature engineering, training, inference, evaluation, and delivery, using reproducible MLOps practices (version control, testing, experiment tracking, containerization, and CI/CD). Support implementation of agentic AI workflows to discover, gather, and prepare data from open-source repositories (e.g., catalogs, APIs, and bulk downloads), including provenance tracking, metadata extraction, and licensing/usage notes. Build scalable geospatial data preparation and validation routines for raster and vector data (projection harmonization, spatial joins, tiling/chunking, and QA/QC). Develop geospatial validation frameworks for model outputs (e.g., comparisons to reference datasets, spatial cross-validation, summary dashboards, and automated report generation). Support documentation, metadata development, and version tracking for data products and model releases; contribute to technical summaries, figures, and reports/publications as appropriate. Participate in code reviews, model reviews, and data readiness reviews to ensure analytical defensibility, transparency, and fitness-for-use in operational and decision-support contexts. Collaborate with research staff to integrate new data sources, indicators, and modeling approaches into existing workflows; communicate clearly across technical and domain teams. Basic Qualifications Bachelor's degree and 3+ year's experience in Geography, GIScience, Computer Science, Data Science, Statistics, Engineering, or a related field with a strong quantitative and software development emphasis. * Demonstrated experience with geospatial analysis using Python in a production or research to production environment leveraging common geospatial libraries (e.g., geopandas, rasterio, shapely, pyproj) and/or enterprise GIS tooling (e