Site Reliability Engineer (SRE)
Kaluza
15 days ago
Edinburgh, United Kingdom or London, United Kingdom
Mid Level / Senior
Responsibilities
- Engineer a scalable, reliable, and developer-friendly platform.
- Collaborate closely with product teams to enhance platform reliability.
- Develop tools to improve developer experience and operational efficiency.
- Manage infrastructure as code using Terraform and Helm.
- Establish robust monitoring and logging foundations.
- Incorporate security best practices and ensure compliance.
- Participate in on-call incident management and troubleshooting.
- Foster strong relationships with stakeholders for clear communication.
Requirements
- Experience building and running production systems focused on reliability.
- Strong communication and collaboration skills.
- Hands-on experience with cloud platforms, preferably AWS.
- Practical experience with Kubernetes in production.
- Solid foundational knowledge of distributed systems.
- Familiarity with CI/CD, GitOps, and Infrastructure as Code.
- Appreciation for Site Reliability Engineering practices.
- Curiosity and motivation to continuously learn and improve.
Benefits
- Pension Scheme
- Discretionary Bonus Scheme
- Private Medical Insurance + Virtual GP
- Life Assurance
- Access to a Climate Action app
- Free Mortgage Advice and Eye Tests
- Access to thousands of retail discounts
- 5% Flex Fund for personalized benefits
- 26 days holiday plus flexible bank holidays
- Progressive leave policies including 26 weeks full pay for new parents
- Dedicated personal learning and home office budgets
- Flexible working arrangements
Tech Stack
Apache KafkaAWSCloudflareDatadogGoHelmKubernetesPythonTerraform
Categories
DevOpsSecurity