Site Reliability Engineering Manager - Data

Company:  Apollo Solutions
Location: London
Closing Date: 02/11/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description
Site Reliability Platform Engineering Manager London Hybrid - 2 days per week onsite Salary: Up to £120k Excellent Benefits + 30% Bonus + Stock Options My client Global Financial Services Client is looking for a Site Reliability Platform Engineering Manager to lead their team to focus on keeping their services running, while simultaneously supporting programme timescales and business outcomes. This will be a Hybrid working model. Lead Cloud Site Reliability Platform Engineer Responsibilities: Leading the L1/L2 team to continually improve the cycle time and efficiency of incident & service request resolution, blameless post-mortems, and problem records. Leading the team to ensure service tickets and incidents are resolved within SLA and effectively passed on to product teams, where L3/L4 support is required. Driving several cloud compliance framework controls such as Annual DR and recovery testing, capacity management, etc. Continually improve the percentage of service tickets and incidents resolved by the team and not escalated to another team. Identifying top reasons for service requests and incidents and addressing the root cause thereby reducing the number of tickets quarter by quarter. Provide thought leadership in operational areas such as change and release management, capacity management, backup and recovery etc. Ensuring the team is correctly skilled for the roles and identifying candidates to transition from Ops roles to SRE Must-Haves: Solid understanding of the SRE role and principles Experience working with a wide range of products in Azure and GCP, Kubernetes, container registries, networking, etc. Experience working with several CI/CD and infrastructure as code-related tools such as Terraform, GitHub, Azure DevOps, Jenkins, Chef, etc. Experience leading an SRE or Operations team Negotiating skills to influence technical and leadership decisions to achieve the right consumer outcomes and operational needs A good understanding of public cloud security Experience leading teams in a large, complex, highly regulated industry Previous experience leading a team responsible for the public cloud estate Azure or GCP Certifications is desirable Experience in handling risks and controls across technical platforms Desire to learn and cross-skill Benefits: ~ Up to 15% pension contribution ~30% bonus ~ Hybrid working pattern ~ Private Healthcare ~ Access to Share Schemes If you are passionate about Platform Engineering/Site Reliability and want to be part of a dynamic team shaping the future of Technology, please send your CV, for a confidential discussion. Please note: No Sponsorship is offered
Apply Now
Share this job
Apollo Solutions
An error has occurred. This application may no longer respond until reloaded. Reload 🗙