Bucharest, Romania
1 day ago
Site Reliability Engineer (SRE)
Company description

Tremend is the newest global software engineering hub for Publicis Sapient. For over 18 years, the company has been infusing its advanced technical expertise into complex and innovative solutions that meet today's digital transformation needs and pave the way for a better and smarter future. By joining forces with Publicis Sapient we're accelerating the impact, providing a good mix of talented engineers, technology, continuous improvement, innovation, and R&D. Here, you'll have the opportunity to unleash your potential, powering up advanced software solutions for some of the world's most iconic brands. Embrace your passion for technology, creativity, and continuous improvement, and join us in making a difference through engineering.

Overview

We’re looking for a Site Reliability Engineer (SRE) to ensure the availability, performance, and scalability of our production systems. You’ll work closely with engineering, infrastructure, and security teams to build reliable systems, automate operations, and improve monitoring and incident response.

Responsibilities

Key Responsibilities
· Monitor system performance and ensure high availability.
· Improve observability through dashboards, logging, and alerting.
· Automate deployments, configuration, and operational tasks.
· Collaborate with developers to ensure production readiness.
· Lead incident response, root cause analysis, and postmortems.
· Contribute to capacity planning, DR/BCP, and performance tuning.
· Maintain documentation, runbooks, and SOPs.

Qualifications

What You Bring
· Solid knowledge of Linux/UNIX systems and networking fundamentals.
· Experience with CI/CD tools, Git, and infrastructure automation (e.g., Ansible, Puppet, Azure DevOps).
· Hands-on with monitoring tools like Prometheus, Grafana, Zabbix, or ELK.
· Strong troubleshooting and communication skills.
· Basic understanding of databases and scripting (PowerShell, Bash).
Nice to Have
· Experience with containers (Docker, Kubernetes).
· Familiarity with ITIL/Agile practices.
· Knowledge of SQL Server or Elasticsearch.

Additional information

Benefits of Working Here:
Besides an exciting job in a tremendous team, here's what you can expect:
· A fast-paced tech environment
· Continuous growth & learning
· Open feedback culture
· Room for own initiative & ideas
· Transparency about results & strategy
· Recognition & reward for hard work
· Working with a flexible schedule
· Medical subscription
· Meal tickets
· Extra vacation days - starting with 25 vacation days
· Many other perks