Staff DevOps Engineer
Full-time, Mid-Senior Level
Job Overview
We are hiring a Staff DevOps Engineer for the AI Factory product team, responsible for architecting, scaling, and standardizing the cloud and platform foundations that power AI-driven products and internal platforms.
This is a senior individual contributor role with org-level technical influence. The Staff DevOps Engineer operates beyond a single product or squad, shaping platform strategy, reliability standards, CI/CD foundations, and cloud cost governance across the AI Factory. The role is hands-on where it matters most, while also acting as a technical authority and escalation point for complex infrastructure, deployment, and reliability challenges.
Key Responsibilities
Architect and evolve cloud-native platform foundations supporting AI Factory products and internal platforms
Define and enforce DevOps standards across environments: CI/CD, deployment patterns, observability, security, and cost controls
Design scalable infrastructure patterns for multi-service, multi-environment systems
Act as the final escalation point for complex infrastructure and reliability issues
Own architecture and best practices across AWS, GCP, and Azure (multi-cloud exposure)
Design highly available, secure, and cost-efficient infrastructure
Drive reliability engineering practices: SLOs, SLIs, incident response, post-mortems, and resilience planning
Identify systemic risks early (capacity, cost, security, scaling) and address them proactively
Architect and standardize CI/CD pipelines (GitLab CI, GitHub Actions, Azure DevOps)
Enable safe, repeatable deployments across products and teams
Improve developer experience by reducing friction in build, test, and deployment workflows
Support release strategies such as blue-green, canary, and rollback mechanisms
Stay hands-on with critical infrastructure code, platform refactors, and high-impact improvements
Review infrastructure designs and pipeline changes for correctness, simplicity, and long-term sustainability
Build foundational tooling that enables multiple teams to move faster and more safely
Partner closely with Engineering Managers, Tech Leads, and Product teams to align platform capabilities with delivery needs
Mentor senior DevOps and Platform engineers through reviews, architecture discussions, and coaching
Act as a multiplier, improving outcomes across teams without direct people management