Job Location
HYDERABAD OFFICE INDIA PSC PGHJob Description
Your team
Based out of P&G's strategic technology center in Hyderabad, you will be a key member of our global Site Reliability Engineering (SRE) team. You will work in a highly collaborative environment, partnering with local and international teams of application developers, infrastructure engineers, and data scientists. Your work will involve staying current with emerging technologies in data solutions, AI, and machine learning to drive innovation.
How success looks like
Over time, you will become a subject matter expert in system reliability, automation, and observability. Successful SREs can grow into senior engineering roles, architectural positions, or expand their experience into other specialized areas within P&G's global IT organization. You will be a key part of our digital transformation, leveraging your skills in Python, cloud computing (Azure), and observability tools (like Grafana and Prometheus) to ensure our systems are resilient and scalable. Your impact will be critical to the stability of the technology that drives our business.
Key Responsibilities
Maintain and improve the availability, reliability, and performance of all production systems.
Implement and manage application monitoring and observability solutions, using tools like Grafana and Prometheus to proactively identify issues.
Develop and maintain automation to streamline operations, reduce manual intervention, and improve system efficiency.
Participate in the software development lifecycle, leveraging SRE/DevOps practices and CI/CD pipelines to ensure new services meet production standards.
Troubleshoot, debug, and resolve software and infrastructure issues across our technology stack, performing bug fixes as needed.
Collaborate with development teams to design scalable and reliable systems from the ground up.
Continuously learn and apply emerging technologies in data solutions, AI, and machine learning to enhance our SRE capabilities.
Job Qualifications
Minimum of 3+ years of relevant professional experience
A fundamental understanding of cloud computing concepts and services, particularly within Microsoft Azure.
Basic proficiency in Python programming.
Understanding of AI principles and an interest in leveraging Generative AI (GenAI) to implement innovative operational solutions and enhance automation.
Experience in implementing Application Monitoring and Observability. Knowledge of Grafana and Prometheus is a strong advantage.
Strong analytical and problem-solving skills to effectively identify and resolve software issues.
Familiarity with SRE or DevOps practices and tools, including CI/CD pipelines, is an advantage.
Excellent written and verbal communication skills in English.
Strong ownership & self-leadership skills to understand objectives and deliver required outcomes in a fast-paced environment.
Job Schedule
Full timeJob Number
R000142470Job Segmentation
Entry Level