technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization) Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens) Publications in deep learning theory Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR Optimization (Training & Inference) PhD focused on topics related to optimizing training of very large deep learning models Multiple years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression Experience optimizing training for a 10B+ model Deep knowledge of deep learning algorithmic and/or optimizer design Experience with compiler design. Cambridge, MA: $218,700 - $249,600 for Applied Researcher I McLean, VA: $218,700 - $249,600 for Applied Researcher I New York, NY: $238,600 - $272,300 for Applied Researcher I San Francisco, CA: $238,600 - $272,300 for Applied Researcher I San Jose, CA: $238,600 - $272,300 for Applied Researcher I.