Job Description

Join us as a Site Reliability Engineer

In this key role, you’ll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to deliver change in a safe and secure way
This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
We're offering this role at senior analyst

What you'll do

As a Site Reliability Engineer, you’ll be supporting colleagues and feature team members to meet defined service level objectives and continually improve systems and environments. You’ll also be proactively contributing new ideas and innovations to meet short term and longer term goals while balancing and managing risk.

You’ll also be accountable for the day-to-day health of both production and non-production environments, including responding to incidents.

A typical day will involve:

Ensuring service availability
Proactively monitoring the production environment
Completing root cause analysis of issues

The skills you'll need

We’re looking for someone with at least four years of experience in incident, problem and change management experience, paired with production support experience. You’ll need a Cloud environment skillset, as well as experience of monitoring, and Splunk or DX-APM dash-board creation.

Additionally, you'll need experience of:

Working with AWS services such as EC2, EKS or ECS, Lambda, RDS, S3, Python automation, FastAPI, and MongoDB.
Managing highly available production systems, participate in on-call rotations, troubleshoot incidents, and perform root cause analysis.
Building automated operational workflows, support CI/CD pipelines, monitoring, alerting, and incident management.
Working closely with engineering teams to improve system reliability and performance; exposure to Kubernetes, Terraform, observability tools, and AI-driven automation is a plus

Hours

Job Posting Closing Date:

28/05/2026

About NatWest Group

We’re a business that understands when our customers and people succeed, our communities succeed, and our economy thrives. As part of our purpose, we’re looking at how we can drive change for our communities in enterprise, learning and climate.

As one of the leading supporters of UK business, we’re prioritising enterprise as a force of change. We’re focusing on the people and communities who have traditionally faced the highest barriers to entry and figuring out ways to remove these.

Learning is also key to our continued growth as a company in an ever changing and increasingly digital world. By setting a dynamic and leading learning culture, our people prosper, and our customers are given the tools to continue to improve their financial capability and confidence.

One of the biggest challenges we all face in our future is climate change. That’s why we’ve put it right at the core of our purpose. We want to champion climate solutions with financing and entrepreneurial support, fully embed climate into our culture and decision making, and be climate positive by 2025.

We’re committed to using our purpose to break down barriers, drive change and ultimately create a great place to work.

Industry

Finance & Insurance

Company Size

10,000+ employees

Headquarters

Edinburgh, GB

Year Founded

Unknown

Website

natwestgroup.com

Social Media