RTP, North Carolina, US
9 hours ago
XDR Sr. Site Reliability Engineering

The application window is expected to close on: 8/10/2025.  

Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received.  

 

Meet the Team 

The Cisco XDR SRE Team is dynamic, decisive, and fast-moving due to the critical nature of the products we support. Joining our team means working on meaningful projects that directly enhance the security of our customers. We take pride in seeing the tangible impact of our work, knowing it helps safeguard organizations and individuals against evolving threats. 


Your Impact 

As a Senior Site Reliability Engineer, you will play a leading role in improving the efficiency, scalability, and reliability of the XDR Incident Generation team. Your work will focus on implementing sophisticated automation, encouraging a culture of operational perfection, and driving the adoption of Infrastructure as Code (IaC) and CI/CD best-practices. Additionally, you'll mentor team members, design robust platforms and services, and ensure their flawless lifecycle management. 

Key Responsibilities 

Manage the full lifecycle of platform services, from design and implementation to maintenance.Promote and enforce Infrastructure-as-Code (IaC) practices to enable scalable, version-controlled, and auditable infrastructure.Lead the automation of build, deploy, and release processes to boost team efficiency and innovation.Design, develop, and maintain modern CI/CD - pipelines aligned with industry best-practices.



Minimum Qualifications 

Extensive experience with AWS services (including VPC, S3, Lambda, SQS, Network Firewall, ECS/EKS, IAM, DynamoDB or CloudWatch) along with expertise in AWS security and/or cost optimization.Proficiency in Infrastructure-as-Code tools such as Terraform, and scripting/programming languages including Python and/or Bash.Experience in building and maintaining CI/CD pipelines using tools like GitHub Actions or TeamCity, combined with robust knowledge of incident management, postmortem analysis, and/or supervising SLOs/SLAs.Ability to participate in on-call rotation.



Preferred Qualifications 

Bachelors + 7 years, or Masters + 4 years of related experience.Collaborate across teams, effectively communicating technical concepts with transparency and precision.Mentor junior engineers, foster skill development, and uphold SRE best-practices within the team.Expertise in crafting AI driven workflows for incident response, forecasting potential issues (e.g., resource exhaustion, outages), and enabling auto-scaling or remediation.Proficient in integrating AI/ML tools for anomaly detection, threat response, and serverless architecture optimization on AWS.




Why Cisco? 

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint. Simply put – we power the future. 

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere. 

We are Cisco, and our power starts with you. 


Confirmar seu email: Enviar Email