London, GBR
22 hours ago
Software/Research Engineer
**Summary:** Meta is seeking an AI Research Engineer/Software Engineer to join our team. The ideal candidate will have experience working on maximizing performance of AI models. This role involves applying these skills to solve some of the most crucial and exciting problems that exist on the web.The AI Applications Engineering team is dedicated to maximizing training and inference performance of Generative AI (GenAI) and Recommendation models on Meta's Training and Inference Accelerator (MTIA). We employ innovative optimization and data parallelization strategies to maximize training throughput for the next generations of GenAI and recommendation models. Additionally, we work cross-functionally with many partner teams to ensure end-to-end performance of large-scale pre-training and inference, enabling us to deliver the next generation of AI experiences more quickly to our users. **Required Skills:** Software/Research Engineer Responsibilities: 1. Applying state-of-the-art optimization techniques to our latest large-scale AI workloads running on Meta’s fleet of accelerators 2. Profiling, analyzing, debugging, and optimizing large-scale workloads on our next-generation training superclusters 3. Work tightly with our customers to co-design models to maximize pre-training and inference efficiency 4. Set direction and goals for the team related to project impact, capacity, and developer efficiency 5. Collaborating cross-functionally with the compiler, framework, communication and firmware teams to capture performance bottlenecks 6. Implement custom kernels to maximize model performance 7. Lead large and complex technical efforts across many engineers and teams **Minimum Qualifications:** Minimum Qualifications: 8. Bachelor’s degree in computer science or a related STEM field 9. Experience programming AI accelerators (e.g. GPUs, custom silicon etc.) using AI frameworks such as PyTorch or similar 10. Experience developing custom kernels and compiler infrastructure to improve performance using low-level programming models such as CUDA, OpenCL or similar 11. Minimum 5 years of experience developing and optimizing performance in modern C/C++ 12. Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment **Preferred Qualifications:** Preferred Qualifications: 13. Master’s/PhD in computer science or related STEM field 14. A proven track record of impactful contributions to pre-training of AI models at scale using GPUs/custom ASIC or similar (publications, relevant work experience, shipped products, patents etc) 15. Experience with neural network training using ML frameworks such as PyTorch etc. 16. Experience with distributed AI systems and communication protocols such as MPI or collective libraries such as NCCL etc. 17. Experience or knowledge in one or more of LLMs and recommender systems. **Industry:** Internet
Confirmar seu email: Enviar Email
Todos os Empregos de Meta