Kuala Lumpur, Malaysia
41 days ago
Senior Site Reliability Engineer (SRE)

Summary

We are searching for a Senior Site Reliability Engineer hungry for a rare chance to transform insurance with the industry's leading cloud platform

Job Description

Senior Site Reliability Engineer (SRE) - Guidewire Cloud Platform (Application)

The Opportunity

As a member of the SRE-Application team, you'll be responsible for building and evolving our SRE practice for the application running on our Guidewire Cloud Platform. This is an opportunity to apply your expertise in automation, software engineering, and operational discipline to ensure our cloud-based solutions' reliability, performance, and scalability.

What You'll Do

Collaborate with development teams to troubleshoot and solve problems, reducing customer impact.

Develop automated runbooks and implement measures to handle issues proactively.

Apply sound engineering principles and mature automation to our operating environments.

Monitor, maintain, and enhance the reliability and performance of applications on our Guidewire Cloud Platform.

Leverage your automation and software engineering expertise to optimize systems and eliminate toil.

Document and examine incidents to improve processes and prevent future occurrences continuously

Stay up-to-date with the latest industry trends, tools, and best practices in site reliability engineering.

Contribute to a culture of innovation, learning, and continuous improvement.

Participate in on-call rotations to ensure the availability and reliability of our services.

What You'll Bring

Proven experience as a Senior SRE or similar role, with a track record of improving system reliability

Strong problem-solving skills and the ability to analyze complex systems and devise effective solutions

Excellent collaboration and communication abilities to work cross-functionally and clearly document processes

Experience with automation, monitoring, and performance optimization tools and techniques

Dedication to maximizing uptime, scalability, and delivering an exceptional end-user experience

A passion for technology and a strong desire to continuously learn and grow your skills

Alignment with Guidewire's mission to leverage technology to help protect and support others

Required Skills

Strong software engineering background with experience in Python, Go, or Java, following best practices (SOLID, DRY, KISS) and writing clean, testable code

Proven experience designing and deploying SLI’s, SLO’s, and Error Budgets

Proven experience leveraging application performance monitoring (APM) and telemetry tools to ensure we maintain expected service levels for our applications.

Proven experience triaging and debugging distributed systems on cloud infrastructure. 

Proven experience in designing and engineering CICD pipelines within K8S and legacy ecosystems

Proven experience in designing and engineering monitors, dashboards, and synthetic transactions in Datadog

Proven experience in building, deploying, and running scalable infrastructure within AWS and Kubernetes ecosystems using Terraform and other cloud-native approaches

Proven experience in managing infrastructure config at scale using multiple approaches and/or tools such as GitOps, Puppet, or Ansible

Strong understanding of cloud networking, security, and vulnerability management, with the ability to programmatically remediate infrastructure issues at scale

Preferred Skills

SRE Certified in multiple categories

AWS Certified in multiple categories

Experience writing production-quality code in languages like Python, Go, or Java, with a focus on performance, reliability, and maintainability

Proficiency in designing software architectures for distributed systems, including microservices and event-driven architectures

Proficiency with SQL, database administration, data pipelines, performance tuning, and schema design

Proficiency with multiple pipelining tools such as Team City, Bitbucket Pipelines, Jenkins, and GitHub Actions

Familiarity with open-source distributed data processing frameworks such as Hadoop, Apache Spark, AWS RedShift, etc

About Guidewire

Guidewire is the platform P&C insurers trust to engage, innovate, and grow efficiently. We combine digital, core, analytics, and AI to deliver our platform as a cloud service. More than 540+ insurers in 40 countries, from new ventures to the largest and most complex in the world, run on Guidewire.

As a partner to our customers, we continually evolve to enable their success. We are proud of our unparalleled implementation track record with 1600+ successful projects, supported by the largest R&D team and partner ecosystem in the industry. Our Marketplace provides hundreds of applications that accelerate integration, localization, and innovation.

For more information, please visit www.guidewire.com and follow us on Twitter: @Guidewire_PandC.

Guidewire Software, Inc. is proud to be an equal opportunity and affirmative action employer. We are committed to an inclusive workplace, and believe that a diversity of perspectives, abilities, and cultures is a key to our success. Qualified applicants will receive consideration without regard to race, color, ancestry, religion, sex, national origin, citizenship, marital status, age, sexual orientation, gender identity, gender expression, veteran status, or disability. All offers are contingent upon passing a criminal history and other background checks where it's applicable to the position.

Confirmar seu email: Enviar Email
Todos os Empregos de Guidewire Software, Inc.