DevOps / AI Infrastructure Engineer
MT Employee FTJob Overview
About the role
The DevOps / AI Infrastructure Engineer owns the reliability, performance, and scalability of Paradise Media’s production infrastructure and AI platform. This is not traditional DevOps, you will build and maintain the infrastructure that powers autonomous AI agents, LLM pipelines, and machine learning workloads alongside 10+ WordPress content sites. The company operates across iGaming, esports, lifestyle, news, and legal verticals. Your job is to solve that permanently using AI-powered infrastructure: self-healing systems driven by AI monitoring agents, AI-assisted CI/CD that catches issues before deployment, and autonomous infrastructure management that runs 24/7. You will use AI coding tools daily to build infrastructure at 10x velocity.
This is a Malta-office based position on a hybrid model. Only candidates residing and eligible to work in Malta (EU), are to apply.
Roles & Responsibilities:
AI-Powered CI/CD & Deployment
- Design and implement CI/CD pipelines using GitHub Actions with AI-assisted code review, automated test generation, and intelligent deployment gates
- Build AI-powered deployment validation agents that verify deployments by testing site functionality, checking for regressions, and auto-rolling back on failure
- Implement AI-assisted pull request review that catches security vulnerabilities, performance issues, and breaking changes before merge
- Standardise deployment processes across 10+ WordPress sites with AI-enforced quality gates
AI Infrastructure & Platform Engineering
- Build and maintain the AI platform infrastructure: model serving endpoints, vector database clusters, embedding pipeline compute, and agent execution environments
- Own and optimise Cloud infrastructure (VMs, IAP, VPC, Secret Manager) for both traditional and AI workloads
- Manage compute resources for LLM API calls, ML model inference, and batch AI processing jobs
- Build scalable agent execution infrastructure that can run hundreds of autonomous AI agents concurrently
AI-Driven Monitoring, Alerting & Self-Healing
- Build AI-powered monitoring that goes beyond thresholds agents that understand context, correlate events, and diagnose root causes autonomously
- Design self-healing infrastructure systems where AI agents detect issues, determine fixes, and execute remediation without human intervention
- Implement AI-driven capacity planning that predicts resource needs based on traffic patterns and AI workload growth
Requirements:
- 3+ years of DevOps, SRE, or infrastructure engineering experience
- Experience building CI/CD pipelines with GitHub Actions, GitLab CI, or Jenkins
- Daily use of AI coding tools - strongly preferred, willingness to adopt immediately required
- Scripting proficiency in Python and/or Bash
- Experience with containerisation (Docker) and container orchestration
- Experience with monitoring and observability tools (Prometheus, Grafana, Datadog, or GCP native)
- Understanding of networking fundamentals (DNS, SSL/TLS, CDN, load balancing)
- Familiarity with AI/ML infrastructure concepts (model serving, vector databases, API gateway management)
Preferred Qualifications
- Experience managing infrastructure for AI/ML workloads (model serving, GPU compute, embedding pipelines)
- Experience with vector database operations (Pinecone, Weaviate, pgvector)
- Experience managing WordPress hosting at scale (Kinsta, WP Engine)
- Experience in digital media, affiliate marketing, or iGaming
- Experience building self-healing automation and AI-driven remediation systems
- Knowledge of AI safety and LLM security (prompt injection, output filtering, cost controls)
- Experience with Cloudflare or similar CDN/WAF platforms
- Security certifications or experience with compliance frameworks (GDPR, SOC2
- Previous startup or scale-up environment experience
Our Benefits:
We offer a competitive salary, and the opportunity to work with a talented and passionate team in a fast-paced, dynamic environment.
Make Your Resume Now