b'nn n n n n n n n n n n n nn n n n n n n n n n n n n n n n n n Cloud Engineer - - 30740 - UCLA Healthn n nnnn nn n n n nn n
n n n Skip to content nn
n
n
n n
n UCLA Health Home Page
n n
nn nnn
n nn
n Main menu. Press enter or space keys to expands and escape key to collapse
nn
n n
n
n
n nn
nn
n n
nn n
n
n
n
n n n
n
n
n n
n Cloud Engineern
n
n
n
n n n n n
n
n n n n n n nn
n n
n
n nnnn n n
n n n n n n n
n
n
n General Informationn
n
nn
n Press space or enter keys to toggle section visibility
n
n n n
n
n
n n
n Work Location: Los Angeles, CA, USAn
n
n
n
n Onsite or Remoten
n n
n Flexible Hybridn
n
n
n
n Work Schedulen
n n
n Monday-Friday 8:00 am-5:00 pmn
n
n
n
n Posted Daten
n n
n 06/10/2026n
n
n
n n
n Salary Range: $128500 - 298100 Annuallyn
n
n
n
n Employment Typen
n n
n 2 - Staff: Careern
n
n
n
n Durationn
n n
n Indefiniten
n
n
n
n Job #n
n n
n 30654n
n
n
n
n n n n n n n n n
n
n
n Primary Duties and Responsibilitiesn
n
nn
n Press space or enter keys to toggle section visibility
n
n n n
n
n
n n
n
The Cloud Engineer will design, build, and operate infrastructure and applications supporting UCLA Healthxe2x80x99s Analytics Platform across both on-premises and multi-cloud environments (AWS, Azure, GCP).
This role focuses on enabling secure, scalable AI/ML and GenAI platforms, with an emphasis on automation, reliability, and compliance in a regulated healthcare setting.
Key Responsibilities
Design, implement, and manage cloud and hybrid infrastructure supporting analytics and AI/ML workloads
Build and operate MLOps capabilities, including:
Model training and inference platforms
Model and artifact management
CI/CD and deployment pipelines
Observability and monitoring solutions
Cost optimization controls
Develop and maintain automation and infrastructure-as-code (IaC) solutions for provisioning and configuration
Troubleshoot and resolve complex system and environment issues across cloud and on-prem platforms
Establish platform guardrails to ensure secure, reliable, and compliant operations
Collaborate with cross-functional teams to:
Gather requirements
Design and prototype solutions
Implement and test deployments
Support ongoing operations and enhancements
Apply security, privacy, and governance controls aligned with healthcare data regulations
Execute release, deployment, and configuration management processes
What Youxe2x80x99ll Bring
Strong background in cloud engineering and platform operations
Experience with multi-cloud environments (AWS, Azure, GCP)
Proficiency in:
Automation, scripting, and infrastructure-as-code
CI/CD pipeline development and optimization
Monitoring and observability tools
Experience supporting AI/ML or data platform workloads (preferred)
Ability to troubleshoot complex systems and drive solutions independently
Strong collaboration skills and the ability to translate business requirements into technical solutions
Salary Range: $128500 - $298100 annually.
n
n
n
n
n n n n n n n n n
n
n
n Job Qualificationsn
n
nn
n Press space or enter keys to toggle section visibility
n
n n n
n
n
n n
n
xe2x80xa2xe2x80x83xe2x80x83BS/MS in Computer Science (or equivalent)
xe2x80xa2xe2x80x83xe2x80x83AWS Certified Cloud Engineer, Architect, Administrator Certifications requiredxc2xa0
xe2x80xa2xe2x80x83xe2x80x837+ years of advanced knowledge and experience as an AWS Cloud Engineer in all core services and offerings. AWS experience a plus
xe2x80xa2xe2x80x83xe2x80x8315+ years of advanced knowledge and experience of Microsoft Technologies such as, Windows server and Linux based servers, enterprise system support experience and strong background in systems engineering and administration for both operating systems
xe2x80xa2xe2x80x83xe2x80x8315+ years of advanced knowledge and experience with enterprise scale Windows technologies such as Server platforms, Desktop platforms, Exchange Environments, Active Directory, IIS, Windows Clustering, Virtualization and Collaboration tools. AWS Certification or equivalent experience preferred
xe2x80xa2xe2x80x83xe2x80x83Working knowledge ofxc2xa0 DevOps-like work or experience in a real time operational role
xe2x80xa2xe2x80x83xe2x80x83Advanced knowledge of analytics and AI/ML platform services across AWS, Azure, and GCP (e.g., AWS SageMaker/Bedrock, Azure Machine Learning/Azure OpenAI, Google Vertex AI) and how to operate them securely at enterprise scale.
xe2x80xa2xe2x80x83xe2x80x83Experience enabling teams to build and deploy ML/AI solutions by providing reusable platform capabilities (reference architectures, templates, SDK/CLI standards, self-service onboarding, and guardrails) rather than only project-specific implementations.
xe2x80xa2xe2x80x83xe2x80x83Hands-on experience operationalizing ML/AI workloads on cloud platforms (AWS/Azure/GCP): managed training/inference, batch vs real-time serving, feature/metadata management, model registry, and cost/performance optimization.
xe2x80xa2xe2x80x83xe2x80x83Strong MLOps/platform engineering experience: CI/CD for ML and GenAI, automated validation gates, reproducible pipelines, environment promotion, artifact/version management, and production monitoring (drift, data quality, latency, cost) using cloud-native and/or enterprise tooling (e.g., Azure DevOps/GitHub Actions, SageMaker Pipelines, Vertex AI Pipelines, MLflow, Terraform).
xe2x80xa2xe2x80x83xe2x80x83GenAI platform experience (AWS/Azure/GCP): deploying and governing LLM applications using managed services (e.g., Bedrock/Azure OpenAI/Vertex AI), RAG architectures, embeddings and vector databases/search, prompt/version management, and evaluation/guardrails for safety and groundedness.
xe2x80xa2xe2x80x83xe2x80x83Responsible AI + governance experience for regulated environments: PHI/PII protections, access controls, encryption and key management, audit logging, model/endpoint risk assessments, bias/fairness considerations, and policy enforcement aligned to HIPAA and secure SDLC.
xe2x80xa2xe2x80x83xe2x80x83Strong data engineering foundations that support AI platforms: standardized data ingestion/ETL/ELT, data quality/lineage, dataset and feature pipeline design, schema/version management, and integration with lake/lakehouse platforms (e.g., S3/ADLS/GCS with Spark/Databricks/BigQuery/Synapse) for feature and training data readiness.
xe2x80xa2xe2x80x83xe2x80x83Experience operating scalable training/inference platforms (GPU/accelerated workloads): capacity planning/quotas, cluster or managed compute configuration, distributed training concepts, performance tuning, and chargeback/showback in cloud environments.
n
n
n
n n
n
As a condition of employment, the final candidate who accepts an offer of employment will be required to disclose if they have been subject to any final administrative or judicial decisions within the last seven years determining that they committed any misconduct; or have filed an appeal of a finding of substantiated misconduct with a previous employer.
Current/former UC employees are subject to a personnel file review.
n
n
n
n
n n n
nn
nn
n n nn n
n
n
n n Apply Now n
n
n
n
n
n
n n Apply as Internal/Current Employee n
n
n
n
n
n
n n Refer a Friend n
n
n
n
n n
n nn n n n nn nn
n
n n
n
n
n
n
n nn n
n
n
n
n n UCLA Healthn n
n
nn
n n Accessibility n |n n Privacy policy n
n n
n n
n nn n
n n
n nn n
n n
n nn n
n n
n nn n
n n
n n
n
n
n
n n
nn nn nn n n n nn nn n '