Senior Site Reliability Engineer
New Yesterday
Job Description
Who is our client?
Our client is a leading payment orchestration platform that resolves numerous costly issues for merchants who navigate disparate, fast-moving and fragmented payment service providers. Series A funded and now focused on Series B and on the creation and expansion of its Manchester hub, the team is growing and looking for a Senior Site Reliability Engineer to join.
What is our client looking for?
You will be the guardian of our production environment, responsible for its health, performance, and scalability. You will apply software engineering principles to solve operational problems, automate everything, and ensure our platform exceeds the reliability expectations of our customers.
Responsibilities:
- Architect & Automate: Design, build, and maintain our core infrastructure using Infrastructure as Code (IaC) principles. You will be instrumental in evolving our CI/CD pipelines to ensure safe, rapid, and reliable releases.
- Enhance Reliability & Scalability: Proactively identify and address performance bottlenecks, single points of failure, and scalability limits. You will define and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to maintain and improve platform health.
- Champion Observability: Implement and manage comprehensive monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, ELK Stack) to provide deep insights into system behavior and ensure rapid incident detection.
- Lead Incident Management: Participate in our on-call rotation, acting as a key player in incident response and resolution. You will lead blameless post-mortems to identify root causes and implement preventative measures.
- Collaborate & Empower: Work closely with software engineering teams to foster a culture of reliability. You will provide guidance on building resilient services, implementing best practices for observability, and improving the developer experience.
- Secure the Foundation: Implement and maintain security best practices across our cloud infrastructure, ensuring our platform is robust and compliant.
If this sounds like you, get in touch. We would love to hear from you.
***Please note: we are recruiting on behalf of our client. We will host a screening call with you and, should you be successful, you will then enter a three-stage interview process with our client.***
- Location: Manchester
- Job Type: Full Time
- Category: Technology