This role is ideal for someone who enjoys working directly in Azure, improving production systems, troubleshooting issues across infrastructure and application layers, and building practical monitoring and alerting solutions that help teams respond faster and operate more confidently. Wellfit is the dental industry’s fintech solution, breaking down financial barriers so patients, providers, employers, and payors can all access better care.
Toyota is proud to have 10+ different Business Partnering Groups across 100 different North American chapter locations that support team members' efforts to dream, do and grow without questioning that they belong. You must have the right to work in the United States and not require Toyota support or sponsorship for immigration-related employment (e.g., H-1B, O-1, E-3, H-1B1, TN, F-1 OPT, F-1 STEM OPT, F-1 CPT, 'job flexibility benefits' (also known as I-140 or Adjustment of Status portability), etc.).
As part of our journey from traditional operations toward a mature SRE model, the Senior SRE will partner with product engineering, platform teams, and the Command Center, including Service Desk and Major Incident Command (MIC), to deliver measurable improvements in service reliability.
Deep knowledge of:
Azure: AKS, App Services, Functions, VMSS, Storage, Front Door, API Management, Load Balancers, Monitor, Log Analytics, App Insights, Key Vault, Policy, Defender.
As a Lead Site Reliability Engineer at JPMorgan Chase within the Infrastructure Platforms, Web Hosting team, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Leads reuse-first adoption of AI-assisted reliability workflows across SDLC/toolchain practices (e.g., CI/CD quality checks, test/validation automation, and operational readiness), ensuring traceability/auditability, resiliency, and security controls.
As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the Chief Data & Analytics Office (CDAO) AI/ML & Data Platforms team, you work with your fellow stakeholders to define non-functional requirements (NFRs) and availability targets for services supporting large-scale data platforms and data lake ecosystems. You will ensure those NFRs are embedded into product design and testing phases, that service level indicators effectively measure customer and data platform performance, and that service level objectives are defined with stakeholders and implemented in production to support secure, scalable, and high-performing analytics and AI/ML workloads.
Determining compensation for this role (and others) at Vaco by Highspring depends upon a wide array of factors including but not limited to:
- the individual’s skill sets, experience and training;
- licensure and certification requirements;
- office location and other geographic considerations;
- other business and organizational needs.
NTT DATA recruiters will never ask for payment or banking information and will only use @nttdata.com, @nttdatafed.com and @talent.nttdataservices.com email addresses. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact us at https://us.nttdata.com/en/contact-us.
The AWS Site Reliability Engineer (SRE) will collaborate closely with cross-functional teams, including development, quality assurance, and operations, to ensure seamless software releases and continuous improvement of our release processes. What you will do:
Infrastructure Automation: Design, implement, and manage infrastructure as code (IaC) solutions using tools like AWS CloudFormation, Terraform or Helm Charts to automate continuous database deployment and scaling processes.
Support reliability test methods including thermal cycling, thermal shock, high-temperature exposure, humidity, corrosion, pressure cycling, leak testing, coolant compatibility, mechanical fatigue, and bond/interface reliability. Lead and manage reliability test planning and execution for semiconductor packaging, liquid cold plates, T800 Thermadite, CVD diamond, thermal spreaders, embedded cooling structures, and related thermal assemblies.
As a Lead Site Reliability Engineer at JPMorgan Chase within the Infrastructure & Production Management sector of Consumer & Community Banking, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Take the lead in conducting resiliency design reviews, break up complex problems into digestible work for other engineers, act as a technical lead for medium to large-sized products, and provide advice and mentoring to other engineers.
In this role, you will contribute to environmental and stress testing efforts, support failure analysis investigations, and help analyze test and field data to identify potential reliability risks. You'll work closely with design, manufacturing, and supplier teams to help implement design-for-reliability best practices and assist with reliability verification activities from concept through production.
Mansfield, TX (30+ days ago)
A key aspect of this position is to establish a real-time, 360-degree view of the customer's experience in order to proactively monitor and support our key customer accounts, resolve quality problems in a timely manner, drive continual improvement, improve quality scorecards, and minimize the cost of poor quality by fulfilling customer requirements, being responsive, and building customer relationships. TE Connectivity's Customer Quality Engineer (CQE) manages quality for assigned strategic customer accounts and is responsible for ensuring that TE provides an exceptional customer experience for the Data and Devices (DND) business unit within TE.
Arlington, TX (30+ days ago)
Work closely with data scientists, data architects, data engineers, ETL developers, cybersecurity, network, Linux, other IT counterparts, and business partners to design and set up the environments that manage the ingested and processed datasets from external sources, internal systems, and the data warehouse to extract features of interest. Solid experience in High Availability and distributed systems, Linux, Data and SAN Storage Networks, NAS and Networking, leveraging tools to instrument and automate proactive, and eventually predictive, availability solutions.
Protocol Expertise: Mastery of DNS-specific protocols including DNSSEC, DoT, and DoH, with a firm grasp of underlying transport layers (UDP/TCP) and dual-stack (IPv4/IPv6) networking. You will combine deep IP networking and DNS expertise with modern security protocols to ensure our platforms remain resilient against evolving threats and perform at the highest level for millions of users.
We are seeking a highly skilled Site Reliability Engineer (SRE) with strong expertise in Terraform, Cloud Infrastructure, DevOps, Automation, and Load Balancing. The ideal candidate will be responsible for ensuring the reliability, scalability, performance, and availability of critical enterprise applications across hybrid and multi-cloud environments.
- 5+ years of experience in Site Reliability Engineering, DevOps Engineering, Platform Engineering, or related disciplines (understanding reliability engineering principles, SLIs, SLOs, error budgets, and operational excellence).