Software Reliability Engineer
Job Description
This role is designed for engineers who view software through a systems lens, not through isolated services. You will write production grade application code daily, not just scripts or tooling, that ships and runs at scale. You will be embedded directly within teams, bringing deep knowledge of using software to solve problems.
ResponsibilitiesThe work centers on designing, building, and maintaining cloud native systems where low latency, security, and scalability are non negotiable. Reliability is woven into the architecture and code from the start. You will think in terms of SLAs and SLOs to shape system design, and engineer solutions that prevent incidents rather than just responding to them. Monitoring, observability, and metrics are critical, using tools such as Open Telemetry, Grafana, Splunk, and PagerDuty.
The hybrid setup based in the North West balances collaborative work with uninterrupted focus time for deep engineering. You will spend time working closely with teams to think through system design, then break away to deliver high impact code, and elevate the teams you join with a reliability mindset while contributing directly to application codebases.
Qualifications- Strong software development background
- Proficient in modern programming languages such as Python, JavaScript, or Go
- Experience with distributed or cloud native systems
- Proactive approach to performance and system health
- Comfortable picking up new tools as needed
- Ability to work with SLAs and SLOs as design constraints
If you build, ship, and genuinely care about reliability at a systems level, this role gives you the chance to work on meaningful, high impact infrastructure within a quality focused engineering culture.
About This Role
Career insights for Software Developers positions