Site Reliability Engineer

Posted on 20/03/2026

Swinton

Competitive + benefits

Permanent

About the role

The Site Reliability Engineer plays a critical role in ensuring that our AI-driven, cloud-native platform is reliable, observable, secure, and able to scale with the organisation’s growth. As we adopt intelligent agents, autonomous workflows, and increasingly complex distributed systems, the SRE ensures that resilience, performance, and operational excellence are built into everything we deliver. By partnering closely with Engineers, Architects, and the Engineering Manager, the SRE defines the patterns, tooling, and automation that enable fast, safe, and repeatable deployments.

This role safeguards our production environment, drives continuous improvement across CI/CD and observability, and establishes the reliability practices that empower autonomous squads to move quickly without compromising stability. The SRE is essential to maintaining customer trust, supporting AI-first innovation, and ensuring our platform remains robust, secure, and highly available at scale.

In this position, you will ensure the reliability, scalability, and security of our engineering systems. Working closely with the Engineering Manager and Head of Engineering, you will identify priorities that remove friction for engineering teams, streamline processes, and enhance operational excellence. The role combines software engineering principles with systems administration to deliver robust, automated, cost-effective, and secure-by-design solutions.

Key Responsibilities

Reliability, Performance & Security:

Design and implement strategies to improve system reliability, availability, and security.
Ensure all solutions follow secure-by-design principles, incorporating cybersecurity best practices from inception through deployment.
Conduct regular security reviews and collaborate with security teams to address vulnerabilities.

CI/CD Management:

Own and optimise Continuous Integration and Continuous Deployment pipelines.
Embed security checks (e.g., static analysis, dependency scanning) into CI/CD workflows.
Ensure secure, efficient, and automated deployment processes across environments.

Monitoring & Observability:

Implement and maintain monitoring solutions for infrastructure and applications.
Develop dashboards and alerting systems to ensure proactive incident and security event management.
Evaluate and integrate new observability tools as needed.

Automation & Tooling:

Automate repetitive tasks to improve efficiency and reduce human error.
Build and maintain internal tools that support engineering productivity and security compliance.
Champion Infrastructure as Code (IaC) practices using tools like Terraform or ARM templates.

Cloud Infrastructure Management:

Manage and optimise services across AWS and Azure environments.
Ensure scalability, resilience, and security of service-based architectures.
Implement cost management strategies to optimise cloud spend without compromising performance or security.

Incident Response & Root Cause Analysis:

Lead incident response efforts, including security incidents, and conduct post-mortem reviews.
Drive continuous improvement through lessons learned and preventive measures.

Skills & experience

Proven experience in AWS and Azure cloud environments.
Strong background in CI/CD tools (e.g., Azure DevOps Pipelines, GitHub Actions, Jenkins).
Expertise in monitoring and observability platforms (e.g., Prometheus, Grafana, Datadog).
Proficiency in scripting and automation (Python, Bash, PowerShell).
Familiarity with containerisation and orchestration (Docker, Kubernetes).
Solid understanding of networking, security, and cost optimisation in cloud environments.
Knowledge of cybersecurity principles, secure coding practices, and compliance frameworks.
A problem-solver with a proactive mindset.
Comfortable working in fast-paced, evolving environments.
Strong communicator who can bridge gaps between operations, development, and security teams.
Passionate about automation, scalability, cost efficiency, and security.

Benefits & culture

Part of the Zellis Group, Moorepay is a team of over 500 friendly professionals across four offices in Swinton (Manchester), Sheffield, Birmingham and Kochi (India). We’re passionate about making Moorepay a fantastic place to work for every single one of our colleagues. The average length of service at Moorepay is 12 years, which speaks for itself.

To help make Moorepay such a great place to work, we focus on three things in our company culture: mental health support, maintaining a healthy work/life balance, and equal opportunities and inclusion for all.

Here’s what you’ll gain if you join our team:

A career packed with opportunity, in a stable and growing company.
A comprehensive programme of learning and development.
Competitive base salary.
25 days’ annual leave, with the opportunity to buy more. You’ll even get your birthday off!
Private medical insurance.
Life assurance at 4x salary.
Enhanced pension with up to 8.5% employer contributions.
A huge range of additional flexible benefits across financial & personal wellbeing, lifestyle & leisure.
