Senior Engineering Manager AI Inference Platform, Distributed Cloud

Google LLC

Sunnyvale, CA

JOB DETAILS
SKILLS
Advertising, Artificial Intelligence (AI), Benchmarking, C++ Programming Language, Cloud Computing, Compiler Technology, Computer Science, Computer Systems, Construction, Distributed Computing, Energy Efficiency, Engineering Management, English Language, Equal Employment Opportunity (EEO), GPU (Graphics Processing Unit), Genetics, Identify Issues, Independent Software Vendor (ISV), Internet Security, Internet of Things, JAX (Java API for XML), Kernel Programming, Leading Edge Technology, Machine Learning, Machine Tool, Manufacturing Operations Management, Matrix Management, Mentoring, Network Operations Center, Operations Management, Operations Security (OPSEC), Organizational Development/Management, Organizational Skills, People Management, Performance Analysis, Performance Management, Product Management, Production Systems, Project/Program Management, Psychology, Python Programming/Scripting Language, Recruiting/Staffing Agency, Regulatory Requirements, Resource Utilization, Retail, Stock Purchase Plans, Supply Chain Operations, Systems Engineering, Team Lead/Manager, Technical Leadership, Technical Strategy, Technical Support, YouTube
LOCATION
Sunnyvale, CA
POSTED
13 days ago

Senior Engineering Manager AI Inference Platform, Distributed Cloud - Google Careers

Careers

Careers

Skip navigation links

homehome

Home

Home

work_outlinework_outline

Jobs

Jobs

noogler_hatnoogler_hat

Students

Students

googlegoogle

How we work

How we work

handymanhandyman

How we hire

How we hire

person_outlineperson_outline

Your career

Your career

help_outline

Help link

feedback

Send feedback

more_vert

  • Help
  • Send Feedback

Sign in

Careers

Careers

homeHome

work_outlineJobs

expand_more

noogler_hatStudents

expand_more

googleHow we work

expand_more

handymanHow we hire

expand_more

person_outlineYour career

expand_more

job details

arrow_back

Back to jobs search

Jobs search results

3,711 jobs matched

  • Staff Developer Experience Engineer, DeepMind

Bengaluru, Karnataka, India

  • YouTube Ads Action Demand Verticals Tech Lead

Mountain View, CA, USA

  • Forward Deployed Engineer IV, GenAI, Google Cloud

San Francisco, CA, USA; Atlanta, GA, USA; +24 more; +23 more

  • Google Cloud Dedicated Operations Product Management Lead

New York, NY, USA; Kirkland, WA, USA; +2 more; +1 more

  • Senior Software Engineer, Storage AI/ML

Seattle, WA, USA

  • Principal Growth Manager, AppDev, Google Customer Solutions

New York, NY, USA; Los Angeles, CA, USA; +3 more; +2 more

  • Account Representative, Search Ads 360, Retail, LCS

Chicago, IL, USA

  • Senior Staff Software Engineer, Cloud Billing

Seattle, WA, USA; Kirkland, WA, USA

  • Regional IoT Operations and Cyber security Specialist

New York, NY, USA

  • Senior Staff Software Engineer, Agentic Commerce, Business Agent

Mountain View, CA, USA

  • Senior Software Engineer

Mountain View, CA, USA; San Bruno, CA, USA; +4 more; +3 more

  • Electrical Engineer, Platform Realization, Hardware

Mountain View, CA, USA

  • Senior ASIC Power Delivery Engineer

Sunnyvale, CA, USA

  • Senior Program Manager, Data Center Construction

Monrovia, IN, USA

  • Staff Data Scientist, Product

Kirkland, WA, USA; Sunnyvale, CA, USA

  • AI SoC Design Verification Engineer, Google Cloud

Tel Aviv, Israel; Haifa, Israel

  • Rack Power Engineer, Platforms Infrastructure

Sunnyvale, CA, USA

  • Program Manager III, Manufacturing Operations, Cloud Supply Chain

Sunnyvale, CA, USA

  • Staff Software Engineer, YouTube Ads Machine Learning

Mountain View, CA, USA

  • Customer Engineer II, Platform, Strategic AI and Independent Software Vendor, Google Cloud

Cambridge, MA, USA

Showing 1 to 20 of 3711 rows

1‑20 of 3711

navigate_next

Follow Life at Google on

*

More about us

About usopen_in_newContact usopen_in_newPressopen_in_new

Related Information

Investor relationsopen_in_newBlogopen_in_new

Equal Opportunity

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google"s EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

More about us

expand_more

Related information

expand_more

Equal opportunity

expand_more

Privacyopen_in_newApplicant & Candidate Privacyopen_in_newTermsopen_in_newManage cookies

helpHelpopen_in_new

arrow_back

Back to jobs search

Senior Engineering Manager AI Inference Platform, Distributed Cloud

share

  • linkCopy link
  • emailEmail a friend

corporate_fareGoogleplaceSunnyvale, CA, USA

bar_chartAdvanced

Advanced

Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in domain.

Apply

share

  • linkCopy link
  • emailEmail a friend

Minimum qualifications:

  • Bachelor"s degree or equivalent practical experience.
  • 8 years of experience programming in C++ or Python.
  • 7 years of experience optimizing, profiling, and scaling production-grade systems on GPU accelerators or specialized AI hardware.
  • 5 years of experience directly managing and leading engineering teams focused on machine learning infrastructure, AI platforms, or high-performance distributed computing systems.
  • 5 years of experience in a people management or team leadership role.
  • 4 years of experience managing engineering organizations across multi-team infrastructure dependencies.

Preferred qualifications:

  • Master's degree or PhD in Engineering, Computer Science, or a related technical field.
  • 5 years of experience working in a complex, matrixed organization.
  • 5 years of experience implementing advanced LLM serving architectures and optimization techniques, such as disaggregated serving, continuous batching, or specialized compiler technologies (e.g., XLA).
  • 4 years of experience utilizing deep-dive ML profiling tools (e.g., Nsight, xprof) to troubleshoot and resolve low-level bottlenecks within major frameworks like JAX, PyTorch, or TensorFlow.

About the job

In this role, you will be pivotal in architecting and optimizing the serving stack for models like Gemini in an on-prem cloud environment, addressing exciting challenges to improve speed and efficiency. This is a unique opportunity to go deep, leading system-level design and performance profiling, ensuring Google"s LLMs run faster and more cost-effectively than ever before.

Google Cloud accelerates every organization's ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google's cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Individual pay is determined by factors including job-related skills, experience, and relevant education or training.

US: $262000 - $365000 (USD) + 25% bonus target + equity + benefits

Learn more about benefits at Google.

Responsibilities

  • Lead, mentor, and grow a high-performing team of systems and ML engineers. Drive a culture of excellence, psychological safety, and continuous learning while guiding career paths and OKRs.
  • Define the technical vision and strategy for enhancing the LLM serving stack, focusing on performance, scalability, and resource efficiency.
  • Oversee the infrastructure and tooling for in-depth performance analysis, profiling, and benchmarking of LLM models on GPU accelerators.
  • Partner closely with Research, SRE, Product, and core library teams to optimize and deploy LLMs globally.
  • Drive the design, implementation, and optimization of advanced serving architectures-including disaggregated serving-while collaborating with core library and kernel partners to eliminate low-level performance bottlenecks, maximize resource utilization, and minimize latency.

Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit is subject to Google"s Applicant and Candidate Privacy Policy.

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google"s EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.

To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.

Equity is granted exclusively and discretionarily by Alphabet Inc. on the basis of an agreement concluded between you and Alphabet Inc. Alphabet Inc. is your sole contractual partner with respect to equity grants. GSU grants are not guaranteed, are discretionary, are subject to approval by the Alphabet Inc. board of directors or its delegate, the terms of the relevant Alphabet Inc. stock plan, and your grant agreement. They have no impact on statutory payments. Current or past grants do not confer an acquired right.

Follow Life at Google on

*

More about us

About usopen_in_newContact usopen_in_newPressopen_in_new

Related Information

Investor relationsopen_in_newBlogopen_in_new

Equal Opportunity

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google"s EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

More about us

expand_more

Related information

expand_more

Equal opportunity

expand_more

Privacyopen_in_newApplicant & Candidate Privacyopen_in_newTermsopen_in_newManage cookies

helpHelpopen_in_new

Follow Life at Google on

*

More about us

About usopen_in_newContact usopen_in_newPressopen_in_new

Related Information

Investor relationsopen_in_newBlogopen_in_new

Equal Opportunity

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google"s EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

More about us

expand_more

Related information

expand_more

Equal opportunity

expand_more

Privacyopen_in_newApplicant & Candidate Privacyopen_in_newTermsopen_in_newManage cookies

helpHelpopen_in_new

Google apps

Main menu

About the Company

G

Google LLC