Senior SRE Engineer
New Yesterday
Job Description
Senior SRE Engineer | Azure, Observability & Reliability Engineering | Platform Transformation in Financial Services
- Location: London (Hybrid, typically 3 days onsite)
- Permanent, Full-time
- Salary: £80k–£90k + bonus + benefits
- Visa sponsorship: Not available
The Role
You’ll join as the first dedicated SRE hire, with responsibility for establishing SRE practices across a live Azure-based platform and a new strategic platform being brought into service.
The role is focused on reliability, observability, incident management, resilience, and automation. You’ll help define how services are measured and operated, introducing practical improvements around SLIs, SLOs, error budgets, monitoring, and service ownership.
This is a hands-on role for someone who has done this before and can bring structure, prioritise well, and build an SRE capability in a pragmatic way.
Non-Negotiables
- Site Reliability Engineering in production environments
- Azure cloud environments in enterprise-scale businesses
- SLO / SLI / error budget design and implementation
- Observability tooling (Prometheus, Grafana, OpenTelemetry or similar)
- Incident leadership across Sev1 / Sev2 environments
- Disaster recovery, resilience testing, RTO / RPO
- Terraform infrastructure as code
- CI/CD pipelines and engineering enablement
- Strong scripting with PowerShell, Bash or Python
- Experience improving reliability in hybrid estates (cloud + IaaS)
- Ability to introduce new ways of working and build an SRE practice from scratch
They are looking for someone with a strong Azure background, but the priority is proven SRE capability and the ability to apply it effectively.
What You’ll Work With
- Azure platform engineering
- Azure Container Apps / cloud-native services
- Terraform infrastructure as code
- Prometheus monitoring
- Grafana dashboards
- OpenTelemetry tracing
- Azure DevOps pipelines
- GitHub Actions CI/CD
- Windows Server and Linux estates
- Service Bus, Event Hubs and Kafka
- Incident management, runbooks, failover and resilience testing
Nice to Haves
- Financial services or regulated environment experience
- FCA / PRA operational resilience exposure
- Payments or FX platform experience
- Chaos engineering
- FinOps or cloud cost awareness
- Kubernetes exposure
Kubernetes knowledge is useful, but not essential.
Why Join / Projects
- Establish the SRE capability from the ground up
- Define and implement SLIs, SLOs and error budgets
- Improve observability across platforms and services
- Lead incident response and post-incident improvements
- Drive resilience, failover and automation initiatives
- Support the move toward a modern, reliability-first platform
You’ll play a key role in shaping how reliability is engineered across both the current platform and a new strategic platform being brought into production.
Employee Benefits
- Pension
- Private healthcare
- Training and certification support
Senior SRE Engineer | Azure, Observability & Reliability Engineering | Platform Transformation in Financial Services
- Location:
- London
- Job Type:
- FullTime
- Category:
- Technology
We found some similar jobs based on your search
-
New Yesterday
Senior SRE Engineer
-
London
- Technology
Job Description Senior SRE Engineer | Azure, Observability & Reliability Engineering | Platform Transformation in Financial Services Location: London (Hybrid, typically 3 days onsite) Permanent, Full-time Salary: £80k–£90k + bonus + benefits Vis...
More Details -
-
New Yesterday
Senior SRE Engineer
-
City Of London
- Technology
Job Description Senior SRE Engineer | Azure, Observability & Reliability Engineering | Platform Transformation in Financial Services Location: London (Hybrid, typically 3 days onsite) Permanent, Full-time Salary: £80k–£90k + bonus + benefits Vis...
More Details -
-
1 Days Old
Senior Software Engineer & SRE — BQL Reliability
-
Greater London, England, United Kingdom
A leading financial data provider in London is seeking a Senior Software Engineer/SRE to focus on the reliability and resilience of its BQL platform. You will monitor infrastructure health, formulate service level indicators, and collaborate with eng...
More Details -
-
1 Days Old
Senior Software Engineer/SRE - BQL Reliability Engineering
-
Greater London, England, United Kingdom
Senior Software Engineer/SRE - BQL Reliability Engineering Location London Business Area Engineering and CTO Ref # 10049751 Description & Requirements Bloomberg runs on data. It’s our business and our product. BQL is the single API for all client-fac...
More Details -
-
5 Days Old
Senior Site Reliability Engineer (SRE)
-
London
- IT
Senior Site Reliability Engineer (SRE) Remote 12-month contract (high chance of extension) Job Description Join a global pioneer in the video game industry and own the reliability of high-traffic, revenue-critical platforms used by millions worldwi...
More Details -
-
9 Days Old
Senior Software Engineer, Site Reliability Engineering, Core SRE
-
Greater London, England, United Kingdom
Senior Software Engineer, Site Reliability Engineering, Core SRE corporate_fare Google place London, UK Apply Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 5 years of experience with software development...
More Details -