Lead Site Reliability Engineer

Role: Tech Lead, Site Reliability Engineer
Location: London (Hybrid – 1 day per week in office)

We are working with a mission-led technology organisation that continues to invest heavily in its cloud platform and infrastructure capabilities. As part of this growth, they are looking for a Tech Lead SRE to help shape the platform's reliability, scalability, and operational maturity.

This is a technical leadership role rather than a people management role. You will serve as the senior technical voice within the SRE function, guiding engineering standards and reliability practices across the platform while remaining hands-on.

You will work closely with platform engineers, cloud engineers and product teams to ensure services are reliable, scalable and observable as the platform continues to grow.

Key Responsibilities
Acting as the technical lead within the SRE function
Improving platform reliability, monitoring and observability
Working closely with engineering teams to design resilient systems
Driving automation, CI/CD and infrastructure improvements
Supporting incident response and root cause analysis
Helping define SRE best practices and reliability standards

Tech Environment
Cloud platforms (AWS, GCP or Azure)
Kubernetes and containerised workloads
Terraform or Infrastructure as Code
Prometheus, Grafana or modern observability tooling
CI/CD pipelines and automation
Python, Go or similar scripting languages

About You
Strong background in SRE, DevOps or Cloud Infrastructure
Experience running production systems at scale
Strong understanding of monitoring, reliability and automation
Comfortable acting as a technical leader within engineering teams

Location:
Greater London
Job Type:
Full-time
