Kaluza

Site Reliability Engineer (SRE)

15 days ago
Edinburgh, United Kingdom or London, United Kingdom
Mid Level / Senior

Responsibilities

  • Engineer a scalable, reliable, and developer-friendly platform.
  • Collaborate closely with product teams to enhance platform reliability.
  • Develop tools to improve developer experience and operational efficiency.
  • Manage infrastructure as code using Terraform and Helm.
  • Establish robust monitoring and logging foundations.
  • Incorporate security best practices and ensure compliance.
  • Participate in on-call incident management and troubleshooting.
  • Foster strong relationships with stakeholders for clear communication.

Requirements

  • Experience building and running production systems focused on reliability.
  • Strong communication and collaboration skills.
  • Hands-on experience with cloud platforms, preferably AWS.
  • Practical experience with Kubernetes in production.
  • Solid foundational knowledge of distributed systems.
  • Familiarity with CI/CD, GitOps, and Infrastructure as Code.
  • Appreciation for Site Reliability Engineering practices.
  • Curiosity and motivation to continuously learn and improve.

Benefits

  • Pension Scheme
  • Discretionary Bonus Scheme
  • Private Medical Insurance + Virtual GP
  • Life Assurance
  • Access to a Climate Action app
  • Free Mortgage Advice and Eye Tests
  • Access to thousands of retail discounts
  • 5% Flex Fund for personalized benefits
  • 26 days holiday plus flexible bank holidays
  • Progressive leave policies including 26 weeks full pay for new parents
  • Dedicated personal learning and home office budgets
  • Flexible working arrangements

Tech Stack

Apache Kafka, AWS, Cloudflare, Datadog, Go, Helm, Kubernetes, Python, Terraform

Categories

DevOps, Security