Jump to main content

Site Reliability Engineer

  1. Greene King Corporate - BRA_001
  1. Full time

Sunrise House

Competitive Salary

Great news! We will let you know when a new job like this has been added!

Job description

At Greene King, we believe digital will enable us to deliver a step change in the pub experience. We obsess over building best-in-class user and customer experiences across our digital products. We will live up to being the pride of British hospitality by making sure we deliver digital products that fit into our physical experiences seamlessly and ensure our customers come back time and time again.
We are looking for a skilled Site Reliability Engineer (SRE) to join our team and ensure the reliability, scalability, and performance of our production systems. As an SRE, you will collaborate with software engineers and support teams to build robust, automated, and highly available infrastructure. You will be responsible for designing and maintaining monitoring solutions, incident response strategies, and performance optimisation.

 

  • Maintain and improve system reliability, availability, and performance
  • Develop and implement monitoring, alerting, and logging solutions
  • Automate repetitive operational tasks using scripting and configuration management tools
  • Troubleshoot production issues, conduct root cause analysis, and drive resolutions
  • Collaborate with software development teams to integrate reliability best practices into the development lifecycle
  • Design and maintain CI/CD pipelines to enable smooth deployments and rollbacks
  • Ensure security and compliance best practices across infrastructure
  • Optimize system resources for cost efficiency while maintaining performance
  • Participate in on-call rotations and incident response processes
  • Example KPIs: Monitoring Coverage, Change Failure Rate, Application Response Time, Error Rate, Incident Count, MTTR, RCA Completion Rate

Additional Information

We’re all about rewarding our team’s hard work, that’s why…

You’ll receive a competitive salary, pension contribution as well as:

  • The chance to further your career across our well-known brands – as one of the industry's top apprenticeship providers, we can provide training and development at each level of your career. 
  • Discount of 33% for you and 15% for your loved ones on all of our brands – so you enjoy your favourite food and drink at a discount.
  • Free employee assistance program – mental health, well-being, financial, and legal support because you matter!
  • Discount of 50% for you and 25% for your loved ones at our Greene King Inns and hotels. – so you can enjoy a weekend away without breaking the bank.
  • Refer a friend – who do you know who could be interested in a new role? When they are placed, you could earn up to £1,500 for referring them!
  • Wage Stream – access your wage before payday for when life happens.
  • Retail discounts – Receive up to 30% off at Superdrug, exclusive discounts with three mobile along with many more…

Qualifications

  • Lead thinking and development of digital site reliability, performance and availability
  • Proven problem solver, takes a proactive approach to finding the right solution to a meet customer’s needs
  • Able to coach and mentor team members to share knowledge and support the growth of the team
  • Demonstrates customer focussed ownership and accountability
  • Is able to build good user relationships
  • Is a highly motivated team player who demonstrates team centred behaviours
  • Highly effective communication skills
  • Able to delivery in a fast moving, changing business environment
  • Provide technical leadership and mentoring for others
  • Has a proven track record of delivering results with enthusiasm and a positive "can do" attitude, able to work with multiple priorities and progress activities with urgency and pace
  • Welcomes and can support business change in terms of new and evolving processes that will support Greene King Digital delivery and development
  • Proactive and able to enhance and evolve all aspects of themselves and the team in terms of new process, software and techniques
  • 3+ years of experience in Site Reliability Engineering, DevOps, or related roles
  • Strong experience of observability tools such as Datadog or similar
  • Strong problem-solving and debugging skills in distributed systems
  • Strong experience with IaC
  • Hands on experience of cloud-based environment builds using Microsoft Azure or AWS in a structured way using infrastructure as code
  • Hands on experience of building and maintaining software toolsets and CI/CD pipelines to support automated and robust software development practises
  • Experience of security solutions and controls that must be used in Digital
  • Experience of defining, governing and evolving engineering principles
  • Strong knowledge of software development standards, best practises such as TDD, BDD
  • Experience of creating technical documentation to support solution design, delivery, and release activities
  • Experience of working with 3rd party engineering teams to collaborate and delivery solutions
  • Experience with using Observability tooling to monitor platform health.
  • Experience of Event driven Architecture patterns
  • Familiar with building out HA/ DR compliant solutions that can handle scaled public Digital solutions
  • Computer Science degree
  • Interest in AI
  • Certifications in Cloud platforms such as Azure, AWS and integration platforms

Great news! We will let you know when a new job like this has been added!

FIND A JOB

Quick search