Production Support Analyst / Site Reliability Engineer (SRE)
Location: South Jordan, UT (Hybrid)
12-Month Contract-to-Hire
Pay rate: $53/hr W2
Interview Process:
- 1st Round: Zoom
- 2nd Round: Onsite
Experience & Education:
- 2–5 years of relevant experience
- Bachelor's Degree required
Shifts:
- Morning: 8:00 AM – 5:00 PM
- Evening: 12:30 PM – 8:00 AM
- Weekend: On-call (Remote)
Key Responsibilities:
- Monitor and support production systems across OS, applications, and network
- Troubleshoot incidents, perform root cause analysis, and resolve live issues
- Collaborate with Dev teams to reduce recurring issues and improve system reliability
- Automate repetitive tasks using Python/scripting
- Maintain SOPs and support operational readiness activities
- Participate in on-call rotation and critical event support
Must-Have Skills:
- Strong hands-on experience with Linux/Unix (OS-level troubleshooting)
- Production support experience (incident management, debugging live systems)
- Python scripting (automation-focused, not development-heavy)
- SQL knowledge
- Experience with ServiceNow (ticketing)
- Understanding of ITIL principles
- Excellent communication skills
Nice to Have:
- Exposure to Java, Go, C++, Scala
- Monitoring tools (Grafana, Splunk, Dynatrace, etc.)
- Cloud experience
- Snowflake knowledge
- CI/CD, Kafka, Docker, or distributed systems exposure
- Awareness of SRE concepts or Agentic AI
Role Overview:
This role supports Production Support / SRE (Site Reliability Engineering) functions, with a focus on system stability, incident resolution, automation, and improving platform reliability in a large-scale Linux environment.