PhysicsX

Senior Software Engineer - SRE Core Infrastructure

1 day ago
London, United Kingdom
Senior
H1B Sponsor

Responsibilities

  • Design and deliver core infrastructure across multi-cloud providers and on-premises environments using Terraform and Crossplane.
  • Architect and operate Kubernetes clusters for single- and multi-tenant workloads with a focus on performance and reliability.
  • Implement infrastructure provisioning patterns ensuring reproducibility and auditability.
  • Operate secrets management solutions, including dynamic provisioning and access control.
  • Maintain GPU driver configurations for AI and simulation workloads.
  • Design cluster networking, including CNI selection and service mesh integration.
  • Implement vCluster-based multi-tenancy for workload isolation.
  • Develop Kubernetes Operators or controllers for automating infrastructure tasks.
  • Establish SLOs and lead responses to production incidents.
  • Collaborate with security and platform teams to enforce governance and compliance.

Requirements

  • 5+ years of experience operating Kubernetes in production environments.
  • Significant hands-on experience with Crossplane compositions and managed resources.
  • Strong proficiency in Terraform, including state management and CI integration.
  • Practical experience with multi-cloud and on-premises infrastructure.
  • Experience designing single- and multi-tenant Kubernetes architectures.
  • Familiarity with secrets management tools like Vault or External Secrets Operator.
  • Solid knowledge of Kubernetes networking and CNI plugins.
  • Experience with vCluster or similar tools for isolated Kubernetes environments.
  • Familiarity with GPU driver management and accelerated workloads.
  • Experience writing or extending Kubernetes Operators, ideally in Golang or Python.
  • Strong understanding of distributed systems concepts.

Benefits

  • Equity options to share in the company's success.
  • 10% employer pension contribution.
  • Free office lunches.
  • Enhanced parental leave with full pay.
  • YellowNest nursery scheme for childcare support.
  • 25 days of annual leave plus public holidays.
  • Private medical insurance with 100% employee cover.
  • Wellhub subscription for access to gyms and wellness apps.
  • Eye tests for employee health.
  • Support for personal development and learning.
  • Employee Assistance Programme for confidential wellbeing support.
  • Bike2Work scheme and season ticket loan for commuting.
  • Octopus EV salary sacrifice for sustainable driving.

Tech Stack

Argo CD, AWS, Azure, Elixir, Erlang, Go, Google Cloud Platform, Istio, Kubernetes, Python, Terraform, Vault

Categories

AI & ML, Data Engineering, DevOps