Seattle, WA, US
23 hours ago
Senior Manager, GenAI/ML Infrastructure Planning, S&OP Planning
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.

You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.

AWS Infrastructure Services (AIS) is looking for a Senior Manager to lead GenAI/ML Infrastructure Demand & Supportability Planning. Our team is responsible for AWS's global GenAI and Machine Learning infrastructure supply and demand strategy, spanning 105 availability zones, 33 geographical regions, and serving over 245 countries and territories. The team drives compute and datacenter supply planning, ML infrastructure capacity delivery, and GenAI supportability to ensure scalable cloud services for millions of AWS customers. In other words, we're the people who keep the cloud running for the next generation of AI workloads.

As the Senior Manager of GenAI/ML Infrastructure Demand Planning, you will be responsible for leading teams that drive the S&OP (Sales and Operations Planning) process specifically for AWS's Machine Learning and GenAI infrastructure. You will work closely with EC2 product/forecasting teams, Hardware/Network/DC Engineering, Material Planning, Data Center planning, and Capacity Delivery teams to build S&OP plans that drive short-term and long-term ML/GenAI Supply and Demand strategy. You will be responsible for the demand signal provided to AIS for Machine Learning racks and GenAI infrastructure, ensuring alignment with financial plans, sales and growth projections, agreed-upon demand levers, transitions, NPI roadmaps, and prior commits and plans of record.

You will outline supply chain automation roadmaps to scale capacity delivery of Machine Learning infrastructure and guide automation teams on implementing the systems required for various elements of the demand planning process. The GenAI/ML Demand and S&OP Team is responsible for the 13-103 week Demand Plan of Record (POR) for ML Server material planning and the 0-10 year Demand POR for ML Data Center planning.

You are an experienced leader with a demonstrated record of leading large cross-functional and cross-organizational projects in the ML/AI infrastructure space. The ability to take large, technically complex projects, break them down into manageable pieces, develop actionable plans, and successfully deliver them is expected. This role is inherently cross-functional and requires the ability to think big and collaborate with others as you work closely with teams across AWS Infrastructure and AWS Service teams. Shifting power and permitting constraints, combined with the rapid evolution of GenAI workloads and the critical nature of the role, require someone who maintains momentum, clarity of vision, and adaptability while communicating effectively across a diverse set of customers, partners, and leadership.

The Sr. Manager must effectively distinguish between one-way and two-way door decisions. Decisions driven by the Sr. Manager have a significant impact on the AWS Infrastructure organization and all of our customers, so excellent judgement (based on domain and technical expertise) is required in managing complex problems, tough trade-offs, proposals, and escalations.

Communication with executive audiences is a regular occurrence. Success in this role therefore requires high judgment, negotiation skills, the ability to influence without authority, analytical talent, technical aptitude, and the leadership to collaborate with a diverse set of stakeholders across multiple time zones, manage capital budgets, eliminate non-value-add activity, design solutions, remove roadblocks, and find creative ways to accelerate ML infrastructure delivery.


Key job responsibilities
- Drive the S&OP process for AWS GenAI/ML Infrastructure and influence AWS's ML Supply and Demand strategy
- Partner with EC2 product/forecasting, Hardware/Network/DC Engineering, and DC ops teams to capture system and operational requirements for ML infrastructure
- Outline supply chain automation roadmap to scale capacity delivery of Machine Learning and GenAI infrastructure
- Guide automation teams on implementing the systems required for the various elements of the ML demand planning process, while also directly managing and dialing in new programs until it makes sense to automate them
- Partner with SDMs/PEs/Ops teams to evaluate the pros and cons of alternate solutions in order to finalize high-level system solutions and operational processes for ML capacity delivery
- Manage end-to-end implementation of new supply chain automation capabilities by partnering with software development teams
- Create product strategy and feature roadmap via Amazon Working Backwards documents
- Provide thought-leadership to Amazon leadership and business partners, delivering solutions to strategic problems related to ML/GenAI infrastructure scaling
- Communicate performance against goals and objectives through narratives and business reviews, including bi-weekly leadership updates on capacity delivery status of Machine Learning racks
- Lead broader initiatives, as assigned, to further advance the effectiveness of the organization
- Drive operational initiatives to scale capacity delivery of Machine Learning racks through collaboration with EC2 capacity planning, DC Infra planning, EC2 product, and operations teams


About the team
*Why AWS*
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

*Diverse Experiences*
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

*Work/Life Balance*
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

*Inclusive Team Culture*
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.

*Mentorship and Career Growth*
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
