Job Summary: As a Medrio Senior Site Reliability Engineer, you will be a part of the ITOps group responsible for maintaining all environments supporting the SDLC for Medrio's platform. To see detailed information on the data we collect during the application process, and how Medrio complies with data privacy laws, visit our Careers page.. Mountain View, CA30+ days ago There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year. San Francisco, California30+ days ago p style="min-height:1.5em">As a Site Reliability Engineer (SRE) at Air Apps, you will be responsible for ensuring the reliability, availability, and scalability of our systems. Automate infrastructure provisioning, deployment, and incident response using Infrastructure as Code (IaC) tools like Terraform or CloudFormation. p style="min-height:1.5em">Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®. Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide “white glove” guidance on the path to production. p style="min-height:1.5em">As an Airflow Reliability Engineer on the Customer Reliability Engineering (CRE) team at Astronomer, you will have the opportunity to become an Apache Airflow expert, learning directly from leaders of the Airflow project. Spend up to 20% of your time on side projects that contribute to Astronomer’s overall success, such as contributing to the open-source Airflow repository or developing Astronomer’s internal monitoring and alerting systems built on Airflow. San Francisco, California30+ days ago li>Good programming skills and ability to apply sound coding principles to IaC and scripting code with languages such as Terraform, Ansible, Bash (shell scripting), and/or Python. We’re looking for an experienced Site Reliability Engineer (SRE) to help us scale our platform with reliability, observability, and operational excellence at the core. li>Drive the development of Key Performance Indicators (KPIs), such as Asset Utilization, Overall Equipment Effectiveness (OEE), Mean Time Between Failure (MTBF), On-Stream Time (OST), and benchmark performance against best-in-class, track progress, and measure improvements. This role will partner with and support Operations and Maintenance through identification and reduction and/or elimination of production losses and high maintenance cost assets to promote plant objectives in the areas of Health, Safety, and Environmental (HSE), asset capability, quality, and production. San Francisco, CA25 days ago p>The Role: Youll be the infrastructure and reliability engineer on the Data Replication team - a full-stack product team running over 3 million sync jobs a week powering thousands of data use cases across multiple regions and clouds. Maintain and enhance AI-augmented release and internal tooling: canary deployments, progressive rollouts, automated release qualification, and rollback automation - with an eye towards LLM automation. |