Senior Site Reliability Engineer (SRE)
Full-Time
Job Overview
About The Role
We’re looking for a hands-on Senior SRE to ensure our production services run reliably, securely, and predictably at scale.
You’ll focus on real-world system behaviour: detecting failures, leading incident response, improving resilience, and using SLIs/SLOs and automation to make reliability, risk, cost, and delivery trade-offs explicit.
What You'll Do
• Own reliability and resilience for live production services
• Define and evolve SLIs/SLOs aligned to customer impact
• Improve observability, alert quality, and operational signals
• Lead during incidents and contribute to blameless post-incident reviews
• Strengthen disaster recovery and recoverability (RTO/RPO)
• Reduce operational toil through smart automation
• Support cost-awareness and reliability trade-offs (FinOps mindset)
• Mentor engineers and contribute to SRE best practices
What We're Looking For
• Strong experience running live production systems in an SRE/reliability role
• Deep understanding of SLI/SLO-driven reliability models
• Proven incident response and on-call experience
• Solid observability and automation skills
• Cloud experience (AWS preferred)
• Calm, structured approach under pressure
• Fluent English
Our Values
- We work together
- We believe in people
- We won’t accept the ‘way it has always been done’
- We listen to learn
- We’re trying to do the right thing
Equal Employment Opportunity Statement
Individuals seeking employment at Camlin are considered without regard to race, colour, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, gender identity or sexual orientation.