Senior Site Reliability Engineer (SRE) - Release & Observability Focus

PeopleNTech LLC

Santa Clara, CA

JOB DETAILS
SKILLS
Application Programming Interface (API), Automation, Continuous Improvement, Identify Issues, Incident Response, Machine Tool, On Call, Production Systems, Python Programming/Scripting Language, REST (Representational State Transfer), Reliability Engineering, Scripting (Scripting Languages), Splunk
LOCATION
Santa Clara, CA
POSTED
30+ days ago
Role: Senior Site Reliability Engineer (SRE) - Release & Observability Focus
Location: Scottsdale, AZ (100% Onsite)
Rate: $65.
Key Responsibilities
Solid hands-on experience in SRE or release engineering roles
Strong experience deploying and operating containerized applications on Kubernetes across on-prem and AWS cloud environments
Strong knowledge of Linux and networking fundamentals
Own release automation, deployment strategies, rollback mechanisms, and release validation
Proven experience supporting REST API services in production environments
Drive continuous improvements in release safety, reliability, monitoring, alerting, and operational readiness
Experience with monitoring and observability tools such as Splunk and Prometheus/Grafana
Lead troubleshooting of complex production incidents and service degradations
Participate in on-call rotations and lead incident response and post-incident reviews
Nice To Have
Python scripting for automation and platform tooling
Knowledge or experience with Honeycomb for observability

About the Company

PeopleNTech LLC