Observability Engineer, Financial Trading Infrastructure
Observability Engineer, Financial Trading Infrastructure (Hybrid Belfast or Remote UK)

Do you enjoy finding clarity in chaos? Are you the kind of engineer who can quickly diagnose complex issues in live trading systems, and then automate away the root cause?

My client is a leading provider of financial trading infrastructure and data services, with a global footprint and a long-standing presence in the capital markets. They connect institutional clients to markets, liquidity, and critical data, and they're investing heavily in next-generation tooling and automation to support their trading platforms.

They're now hiring an Observability Engineer to help drive operational resilience and real-time visibility across production systems spanning both on-prem environments and a growing estate in the cloud. This role sits within a wider production support group and works closely with DevOps, infrastructure, and offshore monitoring teams.

What you'll be doing
- Managing and enhancing observability across a range of business-critical trading platforms.
- Working closely with an offshore NOC team to ensure alerts are triaged and escalated correctly using clear, up-to-date runbooks.
- Leading the response on incident bridges, surfacing real-time telemetry and helping drive fast root cause analysis.
- Participating in post-mortems to identify gaps in monitoring and ensure continuous improvement.
- Onboarding new systems and workflows into the monitoring stack, including developing custom solutions where vendor tools fall short.
- Supporting a long-term transformation to the cloud by ensuring observability is aligned across hybrid architectures.
- Collaborating with DevOps teams to ensure monitoring and telemetry are part of every deployment.
- Contributing to automation efforts, particularly around post-deployment smoke checks and operational readiness validation.
- Occasional weekend or evening work during major upgrades or testing windows.
What they're looking for

Must-haves:
- 5+ years in SysOps or NOC roles within complex, high-availability environments, ideally in financial services, capital markets, or trading platforms.
- Experience monitoring both on-prem and cloud-hosted systems (AWS preferred).
- Hands-on scripting and automation skills (e.g. Python, Bash, PowerShell, Perl).
- Strong understanding of Unix/Linux and Windows Server.
- Practical experience troubleshooting network-level issues (e.g. firewalls, routing).
- Proven ability to improve monitoring coverage, reduce alert noise, and build robust runbooks.
- Confidence participating in live incident calls and structured post-mortems.

Nice-to-haves:
- AWS certifications (SysOps Admin or Cloud Practitioner).
- Familiarity with OTel, StatsD, SNMP or iLO protocols.
- Experience with monitoring stacks like Prometheus, ITRS, or similar tools.
- Understanding of DevOps practices and CI/CD pipelines.
- Familiarity with messaging systems (e.g. MQ, Solace, TIBCO) and protocols like FIX.
- Knowledge of databases such as MSSQL, Oracle, or Sybase.
- ITIL Foundation certification or equivalent practical experience.

If you're motivated by the idea of keeping mission-critical platforms resilient, and want to automate the boring stuff while driving real change, this role offers a great opportunity within a global trading technology organisation.

If you have any further questions on the role, please feel free to reach out to Brendan McCrory directly.
- Location: Belfast
- Category: IT | Infrastructure