Albuquerque, NM30+ days ago
Participates in the configuration and tuning of batch queuing systems in high-performance, parallel computing production environments; collects and analyzes system utilization statistics and logs; identifies computer system anomalies and operational problems; and provides systems support for web applications, SSO and LDAP, name resolution, and cloud storage services as needed to support secure research computing, Kubernetes/CyVerse deployments, API-based microservices, AI/ML inference services, and large-scale research data workflows. Under general supervision and in close collaboration with a multidisciplinary data analyst team, the qualified candidate will help design, deploy, administer, and continuously improve the computing platforms, servers, containers, workflows, and security processes that support AI/ML, clinical and translational informatics, CFDE/OMOP-linked data resources, GPU/HPC systems, Kubernetes/CyVerse environments, and multi-institutional research collaboration.