Geospatial Data Engineer

Oak Ridge National Laboratory

Oak Ridge, TN

JOB DETAILS
SKILLS
Algorithms, Analysis Skills, Application Programming Interface (API), Artificial Intelligence (AI), Code Reviews, Communication Skills, Computer Science, Continuous Deployment/Delivery, Continuous Integration, Data Collection, Data Management, Data Modeling, Data Quality, Data Science, Data Sets, Decision Support, Demographics, Environmental Impact, Fitness, Geographic Information Systems (GIS), Geography, Geospatial Analysis, Licensing, Machine Learning, Machine Tool, Metadata, Model Review, Model Validation, Open Source, Operational Support, Operations Research, Product Support, Production Machining, Production Systems, Publications, Python Programming/Scripting Language, Quality Assurance, Quality Control, Reporting Dashboards, Scalable System Development, Shallow Parsing, Software Development, Software Engineering, Source Code/Configuration Management (SCM), Spatial Data, Statistics, Support Documentation, Test Automation, Testing, United States Department of Energy (DOE)
LOCATION
Oak Ridge, TN
POSTED
30+ days ago

Requisition Id 16118

Overview:

As a U.S. Department of Energy (DOE) Office of Science national laboratory, Oak Ridge National Laboratory (ORNL) has an extraordinary history of solving some of the nation's most complex scientific and security challenges. ORNL's mission is carried out by a dedicated and creative staff working across disciplines to accelerate scientific discovery and translate research into impactful energy, environmental, and national security solutions.

The Geospatial Data Modelling Group within the Human Dynamics Section, part of the Geospatial Science and Human Security Division at ORNL, is seeking a Geospatial Data Engineer to support research and operational workflows focused on scalable geospatial data science, applied machine learning, and production-grade engineering practices. These workflows deliver repeatable, defensible, and time-dynamic geospatial products in support of national security, humanitarian response, disaster assessment, and resilience planning.

In this technical role, the candidate will collaborate with an interdisciplinary team of human geographers, population scientists, geospatial analysts, data scientists, and software engineers. They will contribute across the full lifecycle of geospatial modeling efforts: data acquisition and preparation, feature engineering, model development and evaluation, MLOps and codebase maintenance, automation, and quality assurance. A key component of this position is building agentic AI workflows that help discover, gather, validate, and standardize open-source data for downstream geospatial analytics and machine learning.

The position offers a unique opportunity to work on applied spatial analytics and geospatial data modeling at scale, leveraging diverse geospatial, demographic, and remotely sensed data sources. While the role does not require independent development of novel AI algorithms, it does require strong implementation skills, sound statistical judgment, and an ability to translate methods into reliable, maintainable, and well-documented pipelines.

Major Duties and Responsibilities:

* Develop, maintain, and operationalize geospatial data science pipelines across ingestion, feature engineering, training, inference, evaluation, and delivery, using reproducible MLOps practices (version control, testing, experiment tracking, containerization, and CI/CD).
* Support implementation of agentic AI workflows to discover, gather, and prepare data from open-source repositories (e.g., catalogs, APIs, and bulk downloads), including provenance tracking, metadata extraction, and licensing/usage notes.
* Build scalable geospatial data preparation and validation routines for raster and vector data (projection harmonization, spatial joins, tiling/chunking, and QA/QC).
* Develop geospatial validation frameworks for model outputs (e.g., comparisons to reference datasets, spatial cross-validation, summary dashboards, and automated report generation).
* Support documentation, metadata development, and version tracking for data products and model releases; contribute to technical summaries, figures, and reports/publications as appropriate.
* Participate in code reviews, model reviews, and data readiness reviews to ensure analytical defensibility, transparency, and fitness-for-use in operational and decision-support contexts.
* Collaborate with research staff to integrate new data sources, indicators, and modeling approaches into existing workflows; communicate clearly across technical and domain teams.

Basic Qualifications

* Bachelor's degree and 3+ years of experience in Geography, GIScience, Computer Science, Data Science, Statistics, Engineering, or a related field with a strong quantitative and software development emphasis.
* Demonstrated experience with geospatial analysis using Python in a production or research-to-production environment, leveraging common geospatial libraries (e.g., geopandas, rasterio, shapely, pyproj) and/or enterprise GIS tooling (e

About the Company

Oak Ridge National Laboratory