
Site Reliability Engineer

Posted January 05, 2026
Full-time Mid-Senior Level

Job Overview

The Role

We are seeking a Site Reliability Engineer (SRE) to join our growing SRE team as a permanent member of staff. This is a hands-on technical role focused on the reliability, performance, and operational excellence of our digital platforms. It is the first technical hire in our Singapore office and will play a key role in establishing operational coverage and an engineering presence in the Asia time zone.

As part of a globally distributed team, you will help maintain 24/7 availability of our systems by supporting platform health, reducing operational toil, and enhancing observability, resilience, and redundancy. You will work under the direction of the Lead and Senior SREs, contributing to incident response, automation, and continuous improvement initiatives. This role is ideal for engineers with a solid foundation in cloud-native operations and a passion for improving system reliability through engineering excellence.

Principal Accountabilities

Platform Reliability and Operations

  • Monitor and maintain the health, availability, and performance of production and non-production platforms.
  • Respond to incidents and alerts, performing triage, resolution, and escalation as needed.
  • Contribute to the development and maintenance of observability tooling (e.g. Grafana, Prometheus) to ensure robust monitoring, alerting, and telemetry.
  • Identify and reduce operational toil through automation and process improvement.
  • Support the development and maintenance of runbooks and operational documentation.
  • Participate in the Major Incident Management as a Service (MIMAS) process, providing technical input and follow-up actions.
  • Collaborate with Engineering, Cloud, and Data teams to support platform resilience and redundancy initiatives.
  • Contribute to proactive issue detection and reliability improvements across the platform estate.

Collaboration and Continuous Improvement

  • Work closely with Senior SREs and other technical teams to implement reliability-focused enhancements.
  • Participate in daily stand-ups, retrospectives, and planning sessions to align on priorities and share operational insights.
  • Communicate clearly with stakeholders across Engineering, Data, Apps Support, and InfoSec.
  • Contribute to a culture of blameless post-incident reviews and continuous improvement.
  • Support the onboarding and mentoring of junior team members and seconded staff.
