| Senior Site Reliability Engineer (SRE) - Release & Observability Focus | |
| Score out of 10 | Key Responsibilities |
| Solid hands-on experience in SRE or Release Engineering Roles | |
| Strong experience deploying and operating containerized applications on Kubernetes across on-Prem and AWS Cloud | |
| Strong of Linux and networking fundamentals | |
| Own release automation, deployment strategies, rollback mechanisms, and release validation | |
| Proven experience supporting REST API services in production environments | |
| Dr. Continuous improvements in release safety, reliability,monitoring,alerting and operational readiness | |
| Experience with monitoring and observability tools such as Splunk, Prometheus/Grafana | |
| Lead troubleshooting of complex production incidents and service degradations | |
| Participate in on call rotations and lead incident response and post incidence reviews | |
| Nice To Have | |
| Python scripting for automation and platform tooling | |
| Knowledge or experience with Honeycomb for observability |