
Senior Site Reliability Engineer- Remote
ClickHouseabout 1 month ago
Remote, SingaporeSenior / Staff+
Responsibilities
- Collaborate with various engineering teams to design and implement scalable, secure, and highly available systems.
- Establish and manage service level objectives (SLOs) and service level agreements (SLAs) for ClickHouse Cloud.
- Ensure all infrastructure components have monitoring and alerting in place for timely incident detection and resolution.
- Enhance incident response processes and conduct post-mortem analysis for outages.
- Continuously improve the reliability and performance of ClickHouse services.
- Plan and drive Chaos initiatives across Engineering teams.
- Manage on-call processes to respond to performance and reliability issues.
Requirements
- Bachelor’s or Master’s degree in Computer Science or a related field.
- At least 8 years of experience in Site Reliability Engineering or a related field.
- Hands-on experience with Go and/or Python.
- Strong knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform.
- Excellent understanding of distributed databases and SQL, particularly ClickHouse is a major plus.
- Hands-on experience with container orchestration tools such as Kubernetes or Docker Swarm.
- Strong experience with automation and configuration management tools such as Ansible, Terraform, or Puppet.
- Strong problem-solving skills and solid production debugging skills.
- Passionate about efficiency, availability, scalability, and data governance.
- Thrives in a fast-paced environment and sees themselves as a partner with the business.
- High level of responsibility, ownership, and accountability.
- Excellent communication and interpersonal skills.
Benefits
- Flexible work environment - ClickHouse is a globally distributed company and remote-friendly.
- Employer contributions towards your healthcare.
- Every new team member receives stock options.
- Flexible time off in the US, generous entitlement in other countries.
- A $500 Home office setup for remote employees.
- Opportunities for in-person connection at company-wide offsites.