Quality & Reliability Engineer, Trainium Manufacturing, Quality & Reliability

Amazon.com Inc

Cupertino, CA

JOB DETAILS
SKILLS
Amazon Web Services (AWS), Artificial Intelligence (AI), Cloud Computing, Computer Firmware, Computer Servers, Computer Systems Design, Hardware Design, Hardware Development, Machine Learning, Manufacturing, Manufacturing Systems, Manufacturing/Industrial Processes, Mentoring, Organizational Development/Management, Original Design Manufacturer (ODM), Problem Solving Skills, Process Improvement, Product Design, Product Planning, Quality Assurance Methodology, Quality Engineering, Reliability Engineering, Reliability Testing, Software Development, Statistics, Team Player, Technical Leadership, Testing, Validation Testing, Vehicle Fleets
LOCATION
Cupertino, CA
POSTED
30+ days ago

The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna Labs, focused on Machine Learning products, and designs cutting-edge AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer, you will engage with an experienced cross-disciplinary staff to conceive and design infrastructure technologies. You will work closely with an internal interdisciplinary team and outside partners to drive key aspects of product definition, execution, and test in manufacturing. A successful candidate will be responsive, flexible, and able to succeed within an open, collaborative peer environment. You will:

  • Be responsible for the test validation of future technologies.
  • Drive manufacturing process improvements to address reliability issues and concerns.
  • Have a fundamental understanding of reliability statistics and reliability tests, and/or a solid understanding of computer systems, to influence design for reliability.
  • Lead the identification and validation of product and component risks, work with design teams to mitigate them, and define the test methodology and test coverage needed to assure product reliability.
  • Deep-dive into technologies aligned with the product roadmap.
  • Provide technical leadership and mentor engineers.
  • Perform Reliability prediction of failure mechanisms, products under development and products in the field.
  • Work with multiple vendors and ODMs to standardize component manufacturing and reliability expectations.

Key job responsibilities

  • Define reliability tests to be implemented during manufacturing.
  • Drive manufacturing process improvements to address reliability issues and concerns.
  • Perform Reliability prediction of failure mechanisms, products under development and products in the field.
  • Work with multiple vendors and ODMs to standardize component manufacturing and reliability expectations.

About the team

Annapurna Labs is a wholly owned subsidiary of AWS, focused on developing custom silicon and servers including the Nitro, Graviton, Inferentia, and Trainium families of processors.

Machine Learning Annapurna (MLA) functions as a vertically integrated team including software, firmware, hardware, and silicon design in a single organization.

We are the Training Servers and Systems organization under MLA focused on Hardware Development, Software Development, Fleet Ops Systems, and Manufacturing, Quality, and Reliability.

This position is in the Manufacturing, Quality and Reliability team.

About the Company

Amazon.com Inc

At Amazon, we don’t wait for the next big idea to present itself. We envision the shape of impossible things and then we boldly make them reality. So far, this mindset has helped us achieve some incredible things. Let’s build new systems, challenge the status quo, and design the world we want to live in. We believe the work you do here will be the best work of your life.

Wherever you are in your career exploration, Amazon likely has an opportunity for you. Our research scientists and engineers shape the future of natural language understanding with Alexa. Fulfillment center associates around the globe send customer orders from our warehouses to doorsteps. Product managers set feature requirements, strategy, and marketing messages for brand new customer experiences. And as we grow, we’ll add jobs that haven’t been invented yet.

It’s Always Day 1
At Amazon, it’s always “Day 1.” Now, what does this mean and why does it matter? It means that our approach remains the same as it was on Amazon’s very first day – to make smart, fast decisions, stay nimble, invent, and stay focused on delighting our customers. In our 2016 shareholder letter, Amazon CEO Jeff Bezos shared his thoughts on how to keep up a Day 1 company mindset. “Staying in Day 1 requires you to experiment patiently, accept failures, plant seeds, protect saplings, and double down when you see customer delight,” he wrote. “A customer-obsessed culture best creates the conditions where all of that can happen.”

Our Leadership Principles
Our Leadership Principles help us keep a Day 1 mentality. They aren’t just a pretty inspirational wall hanging. Amazonians use them, every day, whether they’re discussing ideas for new projects, deciding on the best solution for a customer’s problem, or interviewing candidates. To read through our Leadership Principles from Customer Obsession to Bias for Action, visit https://www.amazon.jobs/principles
COMPANY SIZE
10,000 employees or more
INDUSTRY
Retail
FOUNDED
1994
WEBSITE
http://Amazon.com/militaryroles