Senior Site Reliability Engineer

New Yesterday

Job Description

Who is our client?

Our client is a leading payment orchestration platform resolving numerous costly issues for merchants who navigate desperate, fast-moving and fragmented payment service providers. Series A funded and now focused on Series B an the expansion and creation of the Manchester hub the team is growing and looking for a Senior Site Reliability Engineer to join the team.

What is our client looking for?

The guardian of our production environment, responsible for its health, performance, and scalability. You will apply software engineering principles to solve operational problems, automate everything, and ensure our platform exceeds the reliability expectations of our customers.

Responsibilities:

Architect & Automate: Design, build, and maintain our core infrastructure using Infrastructure as Code (IaC) principles. You will be instrumental in evolving our CI/CD pipelines to ensure safe, rapid, and reliable releases.
Enhance Reliability & Scalability: Proactively identify and address performance bottlenecks, single points of failure, and scalability limits. You will define and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to maintain and improve platform health.
Champion Observability: Implement and manage comprehensive monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, ELK Stack) to provide deep insights into system behavior and ensure rapid incident detection.
Lead Incident Management: Participate in our on-call rotation, acting as a key player in incident response and resolution. You will lead blameless post-mortems to identify root causes and implement preventative measures.
Collaborate & Empower: Work closely with software engineering teams to foster a culture of reliability. You will provide guidance on building resilient services, implementing best practices for observability, and improving the developer experience.
Secure the Foundation: Implement and maintain security best practices across our cloud infrastructure, ensuring our platform is robust and compliant.

If this sounds like you, get in touch. We would love to hear from you.

***Please note, we are recruiting on behalf of our client. We will host a screening call with you and should you be successful you will then enter into a 3 stage interview process with our client****

Apply

Location:: Manchester
Job Type:: FullTime
Category:: Technology

Start a New Search

Senior Site Reliability Engineer

Job Description

We found some similar jobs based on your search

Senior Site Reliability Engineer

Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)