Pune, Maharashtra, India
1 day ago
Principal DevOps Engineer

Do you enjoy analyzing complex problems? Do you thrive on the challenge of experimenting with new research or technologies to find creative solutions? Are you looking for career and learning opportunities within a dynamic work environment with an innovative company experiencing high growth? Then joining our Fortinet team is the right move for you!


Here at Fortinet, we are looking for a highly motivated individual who can thrive in a fast-paced environment and successfully contribute to the team. The ideal candidate will have a can-do attitude, passion for technology, extensive development experience, and will be able to learn quickly.


FortiSOAR is looking for passionate and talented problem solvers to join our DevOps team to design, implement, and maintain a comprehensive observability framework for our SaaS platform. This role focuses on ensuring that our systems and applications are well instrumented and that we have the insights we need into our infrastructure, application performance, and user experience. You will work closely with development, DevOps, and operations teams to ensure robust monitoring, alerting, and incident response capabilities.

Responsibilities

Design and develop the DevOps infrastructure for business-critical systems.
Maintain and improve container-based Kubernetes environments.
Develop and integrate observability solutions across the stack (infrastructure, application, network, and user experience) to monitor and provide actionable insights.
Work with developers and engineers to ensure that all relevant services, applications, and infrastructure components are instrumented using the latest observability best practices (e.g., logging, tracing, and metrics collection).
Set up automated alerting systems for real-time detection of performance bottlenecks, failures, or anomalies, and integrate them with incident management workflows.
Build pipelines for data collection, storage, and visualisation to help teams gain insights from monitoring data.
Use observability data to improve system reliability, availability, and performance by driving root cause analysis and continuous improvement initiatives.
Implement automated monitoring and alerting solutions that scale with platform growth and reduce manual intervention.
Develop and maintain comprehensive documentation on monitoring, alerting, and incident response processes.
Provide training and support to engineering teams so they can use observability tools effectively.