Site Reliability Engineer (Edge Services), Infrastructure Services

Apple Inc

Austin, TX

JOB DETAILS
SKILLS
Algorithms, Amazon Web Services (AWS), Ansible, Artificial Intelligence (AI), Automation, Budget Management, Cloud Computing, Construction, Consulting, Continuous Deployment/Delivery, Continuous Integration, Data Structures, Debugging Skills, Distributed Computing, Ecosystems, Establish Priorities, GCP (Google Cloud Platform), HTTP (Hypertext Transfer Protocol), HTTPS (Hypertext Transfer Protocol Secure), Identify Issues, Incident Management, Linux Operating System, Metrics, Microsoft Azure, Python Programming/Scripting Language, Release Management/Engineering, Reliability Engineering, SSL-TLS (Secure Sockets Layer - Transport Layer Security), Software Engineering, User Interface/Experience (UI/UX)
LOCATION
Austin, TX
POSTED
30+ days ago

We are seeking a proactive Site Reliability Engineer to champion the evolution of our production ecosystems. In this role, you will help drive the vision for our visibility, moving beyond simple uptime metrics to build a sophisticated, data-driven reliability framework. You will play a pivotal role in ensuring our services are resilient, scalable, and observable, bridging the gap between complex distributed systems and seamless user experiences.

As a key member of the SRE team, your mission is to treat operations as a software problem. You will focus on designing and implementing a next-generation observability and alerting strategy that prioritizes high-cardinality data and meaningful signals over noise. You will spend your time building "self-healing" systems, reducing toil through aggressive automation, and partnering with development teams to bake reliability into the CI/CD pipeline. Your goal is to move us toward a proactive stance where performance bottlenecks are identified and mitigated before they impact the customer.

Understanding of Linux internals and deep networking expertise, including HTTP/2, HTTP/3 (QUIC), and HTTPS/TLS. You should be comfortable debugging protocol-level issues and optimizing traffic flow.
Proven ability to automate repetitive tasks and complex workflows using Python or Go.
Experience configuring and managing modern monitoring suites (e.g., Prometheus, Grafana, ClickHouse) with a focus on creating actionable, high-signal alerting.
Grasp of Data Structures and Algorithms (DSA) to write efficient, performant code and troubleshoot complex system bottlenecks.
Practical knowledge of SLIs, SLOs, Error Budgets, Release Management, and Incident Management to drive engineering priorities.
Experience managing cloud environments (AWS, GCP, or Azure) using Terraform, Ansible, or Pulumi.
Orchestration: Hands-on experience scaling and securing containerized workloads via Kubernetes.
A track record of leading "blameless post-mortems" and using those insights to harden the system against future failures.
Ability to consult with product teams on service design to improve long-term maintainability.
A proactive engineering mindset focused on shifting from "fixing things when they break" to "designing things so they don't break" (or so they fail gracefully).
Practical fluency in applying Generative AI tools within SRE and software engineering workflows - from accelerating observability query construction and alert design to building AI-assisted debugging and triage capabilities that encode institutional knowledge into repeatable, context-aware workflows - with the engineering rigour to validate, own, and iterate on AI-assisted outputs in production-adjacent contexts.

About the Company

Apple Inc

We bring amazing people together to make amazing things happen.

We’re a diverse collection of thinkers and doers, continually reimagining what’s possible to help us all do what we love in new ways. The people who work here have reinvented entire industries with the Mac, iPhone, iPad, and Apple Watch, as well as with services, including iTunes, the App Store, Apple Music, and Apple Pay. And the same passion for innovation that goes into our products also applies to our practices — strengthening our commitment to leave the world better than we found it.

About Apple

There’s a place here for every kind of brilliant. Everyone here is an innovator, or an innovator-to-be, no matter what your team or your role. So bring your passion, courage, and original thinking and get ready to share it, because every new product, service, or feature we invent is the result of people working together to make each other’s ideas stronger. Innovation at this level depends on people who represent the variety of the human experience and inspire us with their own fresh perspectives. Together, we’ll do amazing work that can make a difference in people’s lives. Including your own. Learn more about working at Apple.

COMPANY SIZE
10,000 employees or more
INDUSTRY
Computer/IT Services
FOUNDED
1976
WEBSITE
https://www.apple.com/jobs