Share this Job

Site Reliability Engineer

Date: 22-Oct-2020

Location: Oxford, GB, OX4 4DQ

Company: Nominet



Maybe you know us, maybe you don't. Even though you probably use our services every day. So

we'd like to share more about what we do at Nominet, and why we do it, so that you can help us to

build the right team.


We're proudly at the heart of the UK's critical internet infrastructure. Leading the charge in cyber

security, as we explore and pioneer new tech.

We do this to enable connectivity, inclusivity and security for our world - and create a vibrant digital



That's what drives us - and the kind of people we want to work with.




Competitive salary depending on experience, plus excellent benefits including a 10% bonus, 28 days holidays, Healthcare, Pension Scheme, Life Assurance, Wellbeing allowance, Flex benefits and onsite fitness classes and studio.



We are establishing a new, dedicated DevOps team in our Internet Engineering & Operations function, working closely with software developers and infrastructure engineers - all operating at the heart of the internet. Working within thIS DevOps team, you will be responsible for our application monitoring across the technical estate. Building on our existing monitoring framework, you use your infrastructure and scripting experience to add bespoke and robust monitoring checks to continuous assess the health of our services and applications.



  • Take ownership over the monitoring of applications, services and infrastructure.
  • Write and maintain software and scripts that capture detailed heuristics about the health of applications and alert accordingly.
  • Design and implement monitoring checks for new services prior to launch.
  • Ensure consistent and thorough monitoring across all environments (development, beta, production, etc). 
  • Capture improvements to the logging platform including integrating with LogStash.
  • Expand the existing monitoring within Zabbix and investigate and prototype monitoring checks using alternative frameworks.
  • Integrate with 3rd-party APIs and services to export application log data for auditing purposes.
  • Work with DevOps Engineers, Sys Admins and Software Developers during software releases.
  • Write automated monitoring tests and integrate within the CI/CD framework.
  • Be an ambassador for DevOps across the business, influencing others to embrace automation and DevOps principles.  
  • Work with the Release Manager to ensure successful and streamlined production deployments.



  • background in software engineering (using languages such as Java, Python, etc).
  • knowledge of JMX and Java-based application monitoring.
  • experience with Linux.
  • system design knowledge.
  • Experience monitoring Kubernetes clusters and pods.
  • Confident monitoring the health of servers (cloud-based and on-prem) including CPU, Memory, Storage.
  • Confident with Ansible, Terraform, GIT.
  • Experience with AWS.
  • Network and security knowledge.
  • Experience deploying, managing and troubleshooting of software applications (including Web Apps and B2B).
  • Happy working using Agile practices, and JIRA.
  • Knowledge of Zabbix and LogStash/ELK highly desirable.





Please note: As part of the job application, you will be asked to complete a brief online application form and profile.

Job Segment: Developer, Java, Linux, Cloud, Technology