Key Responsibilities
Architect highly available, secure, scalable platforms
Define SRE standards, roadmap, and best practices
Own availability, resiliency, and DR strategies
Lead high-severity, cross-team incidents
Drive large-scale automation and platform improvements
Mentor senior engineers and influence leadership
Mandatory Skills (Skill → Experience)
Linux, networking & distributed systems design – 8–12 yrs
Cloud architecture (AWS / Azure / GCP at scale) – 7–10 yrs
Kubernetes, platform engineering & service mesh – 6–8 yrs
Infrastructure as Code & governance (Terraform) – 6–8 yrs
Automation & systems programming (Python / Go) – 6–8 yrs
Observability strategy (metrics, logs, tracing) – 6–8 yrs
SRE practices (SLOs, error budgets, toil reduction) – 6–8 yrs
Soft Skills
Strategic and systems-level thinking
Strong technical leadership and influence
Executive-level communication skills
Coaching and mentoring senior engineers
Ownership of reliability vision and outcomes