Software: Operations & Reliability Lead

Truenorth Corporation

Guaynabo, P.R.

JOB DETAILS
JOB TYPE
Full-time
LOCATION
Guaynabo, P.R.
POSTED
5 days ago
Role Overview
We’re looking for an experienced Operations & Reliability Lead to strengthen our monitoring, security, automation, and cloud operations. This role drives reliability, resilience, and a security‑first posture across all systems and environments.
What You’ll Do
  • Build and maintain application and infrastructure monitoring, dashboards, and automated alerts.
  • Implement cloud and On Premise resource provisioning and enforce standardized configuration baselines.
  • Manage backup, recovery, and resilience workflows with regular testing cycles.
  • Conduct AI‑assisted performance testing, security audits, and penetration testing.
  • Coordinate with NOC and SOC to support continuous monitoring and threat detection.
  • Lead incident response, root‑cause analysis, and operational readiness activities.
  • Implement cost optimization and resource governance across cloud environments.
  • Automate operational tasks and integrate AI‑Ops capabilities.
What You Bring
  • Strong experience with monitoring tools (New Relic, Datadog, Prometheus, Azure Monitor, etc.).
  • Hands‑on expertise with cloud platforms, IaC, CI/CD, and configuration management.
  • Solid understanding of security frameworks, threat detection, and compliance.
  • Experience with backup/DR strategies and resilience best practices.
  • Strong troubleshooting, documentation, and cross‑team collaboration skills.
Valuable Extras
  • Cloud or security certifications (Azure/AWS Architect, Security+, CISSP, ITIL, SRE).
  • Experience with AI‑Ops platforms or ML‑based operational tooling.
  • Background in regulated industries.
Education & Experience
  • Bachelor's degree in Computer Science or related field.
  • At least 2 years of experience working with systems.

Powered by JazzHR

About the Company

T

Truenorth Corporation