DevOps Engineer
Full-time ExecutiveJob Overview
Responsibilities:
Design and build high-availability architecture on Kubernetes clusters, including handling upgrades, capacity planning, and cost optimization.
Develop, maintain and manage tools to automate operational activities and enhance engineering productivity.
Maintain our monitoring stack (VictoriaMetrics, Jaeger, Grafana), build alerts to detect anomalies before reaching users, and handle incident response and post-mortem analysis to improve system availability.
Ensuring the database reliability including handling replication, failover, and performance tuning on various datastore (PostgreSQL, MySQL, MongoDB, Elasticsearch, Kafka, Redis, and Memcache).
Manage all infrastructure using IaC (Opentofu, Ansible, ArgoCD, Helm). Develop a modular and reusable IaC for the entire stack.
Develop, optimize and maintain GitHub Actions, GitLab CI, and CircleCI workflows to ensure fast and reliable delivery.
Ensure a secure production environment, including managing secrets, firewalls, and access controls.
Analyze infrastructure utilization and ensure high availability of the infrastructure while minimizing cost.
Update, track and resolve technical issues in a timely manner.
Explore and integrate AI-driven insights into operational processes to improve reliability, reduce noise, and empower engineering teams with intelligent decision-making.
Make Your Resume Now