San Francisco, CA30+ days ago
technical reports of pre-trained LLMs, SSL techniques, model pre-training optimization) Member of team that has trained a large language model from scratch (10B + parameters, 500B+ tokens) Publications in deep learning theory Publications at ACL, NAACL and EMNLP, Neurips, ICML or ICLR Optimization (Training & Inference) PhD focused on topics related to optimizing training of very large deep learning models Multiple years of experience and/or publications on one of the following topics: Model Sparsification, Quantization, Training Parallelism/Partitioning Design, Gradient Checkpointing, Model Compression Experience optimizing training for a 10B+ model Deep knowledge of deep learning algorithmic and/or optimizer design Experience with compiler design. Cambridge, MA: $262,500 - $299,600 for Applied Researcher II McLean, VA: $262,500 - $299,600 for Applied Researcher II New York, NY: $286,400 - $326,800 for Applied Researcher II San Francisco, CA: $286,400 - $326,800 for Applied Researcher II San Jose, CA: $286,400 - $326,800 for Applied Researcher II.