technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization) Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens) Publications in deep learning theory Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR Optimization (Training & Inference) PhD focused on topics related to optimizing training of very large deep learning models Multiple years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression Experience optimizing training for a 10B+ model Deep knowledge of deep learning algorithmic and/or optimizer design Experience with compiler design. Basic Qualifications: Currently has, or is in the process of obtaining, a PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields, with an exception that required degree will be obtained on or before the scheduled start date or M.S. in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields plus 2 years of experience in Applied Research.