Engineer, Reliability
Job Description
Business Segment: Personal & Private Banking
Location: ZA, GP, Johannesburg, Simmonds Street
We are seeking a Site Reliability Engineer (SRE) in Johannesburg to drive system reliability, scalability, and performance. This role requires a strong software engineering mindset to automate CI/CD pipelines, use infrastructure as code, and support both legacy and containerised platforms. Strong SRE skills are essential, with some familiarity with DevOps practices to help streamline deployments and improve collaboration. Your focus will be on monitoring, proactive issue resolution, and leading reliability improvements across teams.
Responsibilities
- Drive system reliability, scalability, and performance.
- Automate CI/CD pipelines and use infrastructure as code to support both legacy and containerised platforms.
- Apply a strong software engineering mindset and DevOps practices to streamline deployments and improve collaboration.
- Focus on monitoring, proactive issue resolution, and leading reliability improvements across teams.
- Bachelor's or Master's degree in Computer Engineering or Software Engineering
- Site Reliability Engineer Certification
- 5-7 years of hands-on experience in Site Reliability Engineering or a closely related field, with a strong focus on system reliability and performance.
- Expertise in cloud computing, containerisation (Docker, Kubernetes), and DevOps practices, with a solid understanding of Linux/Unix environments and incident management.
- Strong analytical and troubleshooting abilities, with a proactive approach to identifying and resolving system bottlenecks and reliability issues.
- Proficiency in reliability-centered maintenance principles, infrastructure as code, and monitoring tools, with the ability to manage multiple projects and drive continuous improvement.
- Strong attention to detail and ability to manage multiple projects efficiently
- Adopting Practical Approaches
- Articulating Information
- Checking Things
- Developing Expertise
- Documenting Facts
- Application Knowledge for Support
- Business Continuity and Disaster Recovery Planning