Manager, Reliability Engineering (Remote)

Kohl's Corp

Menomonee Falls, WI(remote)

JOB DETAILS
SKILLS
Adoption, Amazon Web Services (AWS), Ansible, Artificial Intelligence (AI), Automation, Best Practices, Chef (Configuration Management), Cloud Computing, Computer Programming, Computer Science, Configuration Management, Continuous Deployment/Delivery, Continuous Integration, Distributed Computing, Ecosystems, Embedded Software, Follow Through, GCP (Good Clinical Practices), Go Programming Language (Golang), Hybrid Cloud, Identify Issues, Java, Leadership, Management of Information Systems/Technology (MIS), Mentoring, Microsoft Windows Azure, Network Security, Network Topology, Node.js, Operating Systems, Problem Solving Skills, Product Design, Product Engineering, Product Management, Puppet (Configuration Management), Python Programming/Scripting Language, Reliability Engineering, Risk Management, Root Cause Analysis, Software Development, Software Development Lifecycle (SDLC), Strategic Planning, System Architecture, Systems/Internals Programming, Technical Leadership, Test Driven Development (TDD), Training Program Development, Training/Teaching, Unix System Internals/Programming
LOCATION
Menomonee Falls, WI
POSTED
10 days ago

About the Role

In this role you will lead and mentor a team of reliability engineers to drive operational excellence across Kohl's distributed systems. You will develop and implement strategies, collaborate closely with engineering teams and ensure SRE best practices are embedded throughout the software development lifecycle.

What You'll Do

  • Conduct design reviews, implement robust monitoring and alerting and establish auto-healing practices

  • Provide leadership and guidance during critical incidents to triage, troubleshoot and resolve complex issues

  • Drive comprehensive root cause analysis and follow-through on preventative measures

  • Manage the software lifecycle, driving reliability, observability and efficiency in collaboration with peers across Design, Product Management, and Engineering

  • Lead major automation and toil reduction initiatives, simplifying the ecosystem and reducing risks

  • Set the vision and drive cultural transformation within the team

  • Lead technical initiatives within the team

  • Coach team through empathy and hands-on mentoring

  • Develop and deliver training programs to upskill the team and broaden SRE adoption across the organization

  • Hire, mentor, cultivate and lead a high-performing SRE team aligned with business priorities

  • Additional tasks may be assigned

What Skills You Have

Required

  • Bachelor's Degree or equivalent in MIS, Computer Science or related field

  • 6+ years of experience in software development and 2+ years of progressive leadership experience, mentoring diverse teams

  • Successful transformation of technical leadership into people leadership

  • Advanced in-depth knowledge of application design patterns, event-driven architecture, database schemas and testing strategies

  • Demonstrated knowledge of systems architecture, operating system internals and networking

  • Proven experience with multi-region application troubleshooting and performance tuning

  • Demonstrated experience working with (at least one) cloud platform (GCP, AWS, or Azure) and a hybrid cloud environments

  • Advanced in-depth knowledge and experience with continuous integration, continuous deployment and test-driven development

  • Strong programming skills in one or more languages (Java, Python, Go or Node.js)

  • Strong leadership skills

Preferred

  • In-depth experience with containerization and container orchestration (e.g., Docker, Kubernetes, Rancher).

  • Demonstrated experience with one or more configuration management systems (e.g., Chef, Ansible, Puppet)

  • Demonstrated experience with monitoring techniques and tools (e.g., CloudWatch, Grafana, Prometheus, OpenTelemetry, Tracing)

  • Strong understanding of systems architecture, UNIX internals, networking topologies, multi-cluster applications, multi-tenant platforms and systems/network security

  • Passion for and experience with AI and ML methodologies (MLOps) and how to leverage solutions such as LLMs to automate.

About the Company

K

Kohl's Corp

At Kohl's, our mission is to inspire and empower families to lead fulfilled lives. And there's no more rewarding job than that. Because it's not just about selling things. It's about letting customers know that the things that make their lives better are within their reach. We build great brands, launch new technologies to make shopping easier, contribute our time and dollars to improve the world we live in and dream up ways to empower our customers and Associates to create a life they love. Our Associates make a difference in the lives of our customers. Let us make a difference in yours. Welcome to Kohl's.
COMPANY SIZE
10,000 employees or more
INDUSTRY
Retail
FOUNDED
1962
WEBSITE
http://www.kohls.com