I am hiring for Site Reliability Engineer (OpenShift / Kubernetes / DevOps)
Location:
Reading, UK (Hybrid)
Job Description
We are seeking a skilled
Site Reliability Engineer (SRE)
with strong hands-on experience in
OpenShift
to join our team. The ideal candidate will focus on platform reliability, automation, and observability while driving continuous improvement across cloud-native systems.
Key Responsibilities:
Design, deploy, and maintain highly reliable OpenShift and Kubernetes environments.
Implement and manage observability tools (Prometheus, Grafana, Loki, Tempo) for proactive monitoring.
Develop Infrastructure as Code (IaC) using Helm or Kustomize for scalable deployments.
Build and maintain CI/CD pipelines with Tekton and ArgoCD.
Manage and support OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ).
Conduct security reviews and enforce compliance with infrastructure standards.
Collaborate with cross-functional teams and mentor junior engineers to adopt SRE best practices.
Key Skills:
OpenShift, Kubernetes, RedHat Linux, Bash, Python, Helm, Kustomize, Tekton, ArgoCD, Prometheus, Grafana, Loki, Tempo, ServiceMesh, ODF, ACS, ACM, AMQ, VMware, vSphere, CI/CD, IaC, DevOps, Security