Hyderabad
Senior Observability Engineer with Python
Role Responsibilities

Design and implement observability solutions using Dynatrace, Datadog, and OpenTelemetry, ensuring the resulting platforms are scalable, automated, and developer-friendly.

Develop reusable patterns, templates, and automation scripts to drive consistency across observability practices and minimize manual onboarding efforts.

Build and maintain insightful dashboards that provide actionable visibility into system performance, reliability, and user experience.

Integrate observability into CI/CD pipelines (e.g., Jenkins) to enable continuous feedback, faster incident detection, and proactive issue resolution.

Automate infrastructure provisioning and deployment using Terraform and Ansible to support observability at scale.

Lead Proof of Value (POV) initiatives to evaluate emerging observability tools and ensure alignment with organizational roadmaps and future-state architecture.

Implement and manage OpenTelemetry pipelines for standardized trace, metric, and log collection to enable vendor-agnostic ingestion strategies.

Collaborate with Business Units (BUs), Developers, and Platform Engineers to embed observability within the software delivery lifecycle, enhancing developer experience and operational efficiency.

Provide training, documentation, and reusable assets to accelerate adoption and promote observability best practices across teams.

Define and implement SLIs/SLOs and error budgets in partnership with BUs to improve reliability engineering practices and service health visibility.

Drive operational excellence by enabling proactive monitoring, reducing customer impact, and optimizing incident management workflows.

Amplify AIOps outcomes by integrating observability data into intelligent automation and decision-making processes across business and technology functions.

Mentor junior team members, fostering a culture of learning, innovation, and continuous improvement.

What Your Background Looks Like

Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience).

7–10 years of experience in Observability, Monitoring Engineering, or DevOps roles.

Strong hands-on experience with one or more observability platforms (Dynatrace, Datadog, OpenTelemetry), along with Python scripting.

Proven experience building dashboards and writing automation scripts for monitoring and reporting.

Familiarity with CI/CD tools (e.g., Jenkins) and Infrastructure as Code (IaC) tools such as Terraform and Ansible.

Strong understanding of distributed systems, cloud-native architectures, and microservices, including Kubernetes and containerized environments.

Comprehensive knowledge of the software development lifecycle (SDLC) and experience working in Agile environments with cross-functional teams.

Excellent communication, collaboration, and stakeholder management skills.

Strong organizational and multitasking abilities, with a track record of thriving in fast-paced, dynamic environments.
