Seattle, WA, US
15 hours ago
Software Development Manager, AI Inference Technology, Neuron SDK
DESCRIPTION
AWS Utility Computing (UC) provides product innovations, from foundational services such as
Amazon Elastic Compute Cloud (EC2) to new offerings that continue to set AWS’s
services and features apart in the industry.

We develop AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale
machine learning accelerators. Come optimize LLMs such as Llama and GPT OSS to run
really fast on Trainium.

As the SDM for the Neuron Inference Technology building blocks team, you will guide your
expert AI engineers to build fundamental inference technology building blocks and libraries to
enable AI developers to optimize models for inference on Trainium and Inferentia devices. We’re
currently focusing on MoE models such as GPT OSS for Trainium 2 and the upcoming
Trainium 3. You will develop and optimize blocks such as attention kernels and
deliver them in the Neuronx_Distributed Inference Libraries, enabling customers to optimize
LLMs, multimodal, and generative models.

The ideal candidate will have an established background in optimizing LLMs, such as delivering
high-performance models using distributed inference libraries. You should be capable of
managing demanding, fast-changing priorities. You should have a strong technical ability to
understand and deliver as part of a vertically integrated system stack consisting of the PyTorch
inference library, Neuron compiler, runtime and collectives.

A day in the life

You will work with your senior management and technical leaders to define the building blocks
for the latest LLMs, then build and deliver them to customers. As new models and technologies
emerge, you will manage changing priorities and adapt your team’s work accordingly.
You will dive deep to help your team solve technical challenges.

About the team
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud
platform. We pioneered cloud computing and never stopped innovating — that’s why
customers from the most successful startups to Global 500 companies trust our robust suite of
products and services to power their businesses.

Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed
in the job description, we encourage candidates to apply. If your career is just starting, hasn’t
followed a traditional path, or includes alternative experiences, don’t let it stop you from
applying.

Work/Life Balance
We value work-life harmony. Achieving success at work should never require
sacrifices at home, which is why we strive for flexibility as part of our working culture. When we
feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer.
That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing
resources here to help you develop into a better-rounded professional.