As a Sr Site Reliability Engineer you will be responsible for building and supporting the platform/application infrastructure of one of the largest eCommerce sites in the world. This will require you to maintain high site uptime/availability while embracing rapid change and growth using a strong devops mindset of continuous delivery and site automation. This role requires deep technical knowledge, adaptability, hands on execution, and a ruthless drive towards higher levels of availability and resiliency. In this role:
You will have a maniacal focus on site uptime
Engineer application infrastructure that is reliable, efficient, and maintainable
Partner closely with software engineering teams using a strong devops mindset
Constantly improve operational processes and efficiency
Automate, Automate, Automate!!!
NATURE AND SCOPE
Typically reports to the Software Engineer Manager or Sr. Manager.
ENVIRONMENTAL JOB REQUIREMENTS
Environment: Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable.
Travel: Typically requires overnight travel less than 10% of the time.
MAJOR TASKS, RESPONSIBILITIES AND KEY ACCOUNTABILITIES
70% – Delivery & Execution:
Collaborates and pairs with other product team members (UX, engineering, and product management) to create secure, reliable, scalable software solutions
Documents, reviews and ensures that all quality and change control standards are met
Works with Product Team to ensure user stories that are developer-ready, easy to understand, and testable
Writes custom code or scripts to automate infrastructure, monitoring services, and test cases
Writes custom code or scripts to do “destructive testing” to ensure adequate resiliency in production
Configures commercial off the shelf solutions to align with evolving business needs
Creates meaningful dashboards, logging, alerting, and responses to ensure that issues are captured and addressed proactively
20% – Support & Enablement:
Fields questions from other product teams or support teams
Monitors tools and participates in conversations to encourage collaboration across product teams
Provides application support for software running in production
Proactively monitors production Service Level Objectives for products
Proactively reviews the performance and capacity of all aspects of production: code, infrastructure, data, and message processing
10% – Learning:
Participates in learning activities around modern software design and development core practices (communities of practice)
Proactively views articles, tutorials, and videos to learn about new technologies and best practices being used within other technology organizations
Cloud Computing Security
Java (Programming Language)
Open Source Softwares
Sql (Programming Language)