Share this Job

Site Reliability Engineer

Date: 29-Jan-2021

Location: Oxford, GB, OX4 4DQ

Company: Nominet



Maybe you know us, maybe you don't. Even though you probably use our services every day. So

we'd like to share more about what we do at Nominet, and why we do it, so that you can help us to

build the right team.


We're proudly at the heart of the UK's critical internet infrastructure. Leading the charge in cyber

security, as we explore and pioneer new tech.

We do this to enable connectivity, inclusivity and security for our world - and create a vibrant digital



That's what drives us - and the kind of people we want to work with.




Competitive salary depending on experience, plus excellent benefits including a 10% bonus, 30 days holidays, Healthcare, Pension Scheme, Life Assurance, Wellbeing allowance, Flex benefits.



We are establishing a new, dedicated DevOps team in our Internet Engineering & Operations function, working closely with software developers and infrastructure engineers - all operating at the heart of the internet. 

Working within this DevOps team, you will advance our observability analytics and metrics capabilities for mission critical applications and infrastructure, helping to drive continuous improvement across the stack. You will use your experience and insights to transform our monitoring, ensuring we can continuously assess the health of our services and applications.


Do you have experience of pioneering observability through metrics and log analysis (preferably using the ELK stack) and would be excited to apply that experience in an environment that's serving and securing the millions of business and individuals using the .UK namespace every day?



  • Ensure consistent and thorough observability and monitoring across all environments.
  • Work closely with development and QA teams to capture meaningful and detailed heuristics to measure the health of each application during releases and regular operation.
  • Engineer new tools and applications that improve production and deployment capabilities for all teams.
  • Drive evolution of our logging platform including integrating with Logstash.
  • Engineer automated software releases (including new applications) across the environments including production.
  • Champion the testability of the monitoring system.
  • Be an ambassador for DevOps across the business, influencing others to embrace automation and DevOps principles.  
  • Identify and fix bugs and streamline development workflows.



  • Proven ability to pioneer observability through metrics and log analysis, preferably using the ELK stack.
  • Experience deploying, managing and troubleshooting of software applications (including Web Apps and B2B).
  • Proficient with containerisation technologies and approaches.
  • Experience working with Kubernetes clusters and pods.
  • Good understanding of operating microservice architectures.
  • Solid grasp of CI/CD tooling and delivery best practices.
  • Good understanding of operational security.
  • Demonstrable knowledge of DevOps principles and best practices.
  • Confident with Git and general Linux administration skills.
  • Experience with provisioning tools such as Terraform or Ansible an advantage.
  • Experience with AWS desirable.





Please note: As part of the job application, you will be asked to complete a brief online application form and profile.

Job Segment: Quality Assurance, Linux, Technology