Monitoring Engineer – Cloud Operations
We are looking for a skilled and proactive Monitoring Engineer to join our Cloud Operations team. The ideal candidate will have hands-on experience in AWS, Linux/Windows systems monitoring, and modern observability tools. You will be responsible for implementing and maintaining monitoring solutions to ensure the availability, performance, and reliability of cloud-hosted infrastructure and applications.
Key Responsibilities
Design, deploy, and manage monitoring solutions for AWS-based infrastructure using tools such as CloudWatch, Prometheus, Grafana, and the ELK Stack.
Configure and maintain monitoring for Linux and Windows servers, ensuring system health and optimal performance.
Develop and maintain alerting systems and dashboards for real-time visibility into system metrics and logs.
Collaborate with DevOps, Cloud, and Security teams to improve observability and enhance incident response.
Automate monitoring tasks and integrate them with CI/CD pipelines for proactive issue detection.
Create and maintain monitoring standards, SOPs, and troubleshooting guides.
Technical Skills & Experience
Skill Area: Required Experience
AWS Infrastructure: 3+ years
Linux / Windows Admin: 3+ years
Monitoring Tools: CloudWatch, Prometheus, Grafana, ELK
Scripting & Automation: Python, Bash, PowerShell (preferred)
Incident Management: Familiarity with ITIL or similar frameworks
Qualifications
Bachelor’s degree in Computer Science, Information Technology, or a related field.
AWS certifications (e.g., Cloud Practitioner, SysOps Administrator) are a plus.
Strong analytical and problem-solving skills.
Excellent communication and documentation abilities.
Preferred Attributes
Experience working in multi-account AWS environments.
Familiarity with Infrastructure as Code (Terraform, CloudFormation).
Exposure to containerized environments (Docker, Kubernetes).