Palo Alto, CA, 94301, USA
3 days ago
Remote Platform Engineer
Job Description

The Staff Platform Engineer is responsible for designing, building, and operating the company’s core platform infrastructure. This role focuses on enabling engineering teams through reliable, secure, and scalable platforms, while driving improvements across cloud infrastructure, identity, networking, and developer tooling.

Responsibilities:
- Design, implement, and maintain core platform services and shared infrastructure
- Define and evolve platform architecture, standards, and best practices
- Lead and participate in technical design reviews for platform initiatives
- Build and operate cloud infrastructure primarily in AWS
- Operate and improve Kubernetes-based platforms and supporting services
- Manage and troubleshoot Linux-based systems at scale
- Design and maintain identity and access integrations (e.g., Okta, Active Directory, SSSD, Samba)
- Implement access control models for servers, clusters, and platform services
- Build internal tooling and automation to improve developer productivity
- Improve CI/CD workflows, environment provisioning, and service onboarding
- Act as a primary escalation point for complex platform and infrastructure issues
- Improve observability, monitoring, and alerting for platform services
- Lead incident response and root cause analysis for platform-related outages
- Collaborate with security and application teams to ensure platform reliability and security

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters.
Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.

To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.

Skills and Requirements
- 8–10 years of professional experience in software engineering, platform engineering, DevOps, or SRE roles, with a strong foundation in building and operating production systems
- Extensive experience designing, deploying, and managing cloud infrastructure on AWS, following security, scalability, and reliability best practices
- Strong Linux systems engineering experience, including system internals, performance tuning, security hardening, and low-level troubleshooting
- Hands-on experience operating Kubernetes in production environments, including cluster lifecycle management, scaling, upgrades, and observability
- Proven experience with infrastructure-as-code (IaC) and automation, particularly using Terraform (or similar tools such as CloudFormation or Pulumi)
- Strong software engineering background, with hands-on development experience in Python and the ability to write maintainable, testable, and production-quality code
- Deep understanding of the software development lifecycle (SDLC), including design, development, testing, deployment, and maintenance
- Experience building and maintaining CI/CD pipelines, integrating automated testing, security checks, and deployment workflows
- Strong troubleshooting and debugging skills across complex, distributed systems, with the ability to identify root causes and drive long-term improvements
- Excellent collaboration and communication skills, with experience working cross-functionally with engineering, product, and security teams
- Experience with identity and access management systems
- Experience building internal developer platforms or shared infrastructure
- Experience operating large-scale or mission-critical systems