Senior Machine Learning Engineer

Cloudflare Inc

San Francisco, CA

JOB DETAILS
POSTED
30+ days ago

Location: Bengaluru, India

About the team

The Data Intelligence & Analytics organization builds the core data platform and internal products that power decision-making across the company. We design and operate large-scale data systems, own the company's data lake, ingestion infrastructure, and platform tooling, and develop end-to-end applications that transform complex datasets into fast, reliable, business-critical products used daily by go-to-market, product, and engineering teams. Our work sits at the intersection of data platforms, distributed systems, and product development, giving engineers the opportunity to own meaningful problems across the stack and build systems that truly run the business.

About the role

We are looking for an experienced Machine Learning Engineer to join our Bengaluru team and help scale the next generation of our AI/ML models. In this role, you will build and operate the pipelines behind the AI-driven applications, agents, and chatbots that power teams across the company, including go-to-market, engineering, and product. Because the products we build are initiated and owned by the team, you will be involved end-to-end, from shaping requirements and designing systems to implementation, deployment, and long-term ownership.

You will focus on building scalable, reliable services and application backends, partnering closely with data and full-stack engineers to deliver new features and operate the pipelines and platforms behind our products. Our stack spans multiple languages and technologies, including Python, React, and TypeScript, and continues to evolve. Many of our applications also incorporate strong AI components, giving you opportunities to work with modern AI infrastructure and tools such as vector databases, MCP-based services, and Workers AI.

Responsibilities

Deploy, manage, and support ML applications and services on Kubernetes.

Understand the MLOps landscape (tooling, tech stack, source systems, etc.) and introduce new tools and solutions for ML and AI initiatives.

Partner and align with Data Scientists, Data Engineers and internal teams to deliver ML solutions in a globally distributed environment.

Lead efforts to shorten the lead time from model training to deployment.

Understand the business and product strategy and the high-level roadmap, and align analysis efforts to support them with data insights and help achieve strategic goals.

Leverage Cloudflare products and services for AI and ML initiatives and applications.

Use software engineering best practices to publish model scores, insights, and learnings at scale within the company.

Requirements

M.S. or Ph.D. in Computer Science, Statistics, Mathematics, or another quantitative field.

5+ years of ML engineering experience, including proven industry experience in a large-scale environment (petabyte-scale data and globally distributed teams).

Strong experience in scientific computing using Python with scikit-learn and PyTorch or TensorFlow.

Strong experience working with Docker and Kubernetes to build and deploy applications and systems.

Experience working with ML platform tools (Airflow, Argo Workflows, Argo CD) preferred.

Experience working with Data Scientists to deploy machine learning applications and systems for training, inference, and observability.

Experience with full-stack web technologies and languages (FastAPI, Streamlit, JavaScript/TypeScript, Cloudflare Workers, etc.) preferred, with the ability to quickly learn and contribute across a multi-language stack.

Experience with Terraform and Google Cloud Platform (or an equivalent public cloud).

Experience working with CI/CD systems, version control (Git, Bitbucket, etc.) and DevOps tools.

Experience with databases such as BigQuery, Postgres, and SQLite, and with ETL/ELT practices.

Strong cross-functional collaboration experience with data engineering and data analyst teams within the function.

Proficiency in large language models and the frameworks necessary for implementing GenAI applications, such as chatbots and related use cases.

Strong communication and presentation skills, tailored to different audiences within the company.

Demonstrated ability to design scalable, reliable, and observable systems, with experience influencing architecture and improving platform foundations.

Strong problem-solving and communication skills, with a track record of working effectively with cross-functional partners and driving projects through ambiguous problem spaces.

Experience mentoring engineers, leading by example in code and design, and contributing to a culture of technical excellence.
