Job Requirements of Software Engineer, Systems ML - HPC Specialist:
- Employment Type: Full-Time
- Location: Bellevue, WA (Onsite)
Software Engineer, Systems ML - HPC Specialist
Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI infrastructure topics. The position involves applying these skills to solve some of the most crucial and exciting problems that exist on the web. As an HPC specialist, aspects of this role may include authoring components such as cuBLAS, cuDNN, AITemplate, and FlashAttention, and developing runtimes such as an LLM disaggregated runtime. HPC specialists spend time optimizing programs to reduce accelerator idle time. They also develop debugging tools (such as cuda-gdb) and profilers for the accelerated computing hardware (such as the PEs/SFUs in MTIA or the Transformer Engine in H100). They are systems experts who can design, debug, and accelerate AI workloads from single-node scale-up to multi-node scale-out distributed systems. They can also influence the next generation of silicon architectures (such as Tensor Cores in V100 or the Transformer Engine in H100) based on evolving AI workload needs. We are hiring in multiple locations.
Responsibilities
- Apply relevant AI and machine learning techniques to build & optimize our intelligent systems that improve Meta's products and experiences.
- Develop custom/novel architectures, define use cases, and develop methodology & benchmarks to evaluate different approaches.
- Apply in-depth knowledge of how the machine learning system interacts with the other systems around it.
- Assist in goal setting related to project impact, AI system design, and ML excellence.
Minimum Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
- 2+ years of experience in HPC and parallel computing.
- Proficiency in GPU programming using CUDA and familiarity with CUDA libraries (cuBLAS, cuDNN, etc.).
- Proven track record of leading successful HPC projects.
- Proven technical expertise in HPC architectures and technologies.
Preferred Qualifications
- PhD in Computer Science, Computer Engineering, or relevant technical field.
- Experience developing AI algorithms or AI-System infrastructure in C/C++ or Python.
- Experience developing AI compilers (such as TorchInductor in PyTorch 2.0).
Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to
Compensation: $70.67/hour to $208,000/year + bonus + equity + benefits. Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics.
Recommended Skills
- Algorithms
- Artificial Intelligence
- C++ (Programming Language)
- Computer Engineering
- Debugging
- Distributed Architectures
Job ID: kty9av3