GrepJob
ClickHouse

Database Reliability Engineer - Core Team

ClickHouse
Apply
about 1 month ago
Remote, GermanyMid Level / Senior

Responsibilities

  • Continuously improve the reliability and performance of ClickHouse core.
  • Create metrics and alerts to identify and prevent production issues.
  • Investigate common customer problems to identify root causes and suggest improvements.
  • Enhance incident response processes and conduct post-mortem analyses.
  • Plan and drive chaos engineering initiatives across engineering teams.
  • Manage on-call processes to address performance and reliability issues.

Requirements

  • Bachelor’s or Master’s degree in Computer Science or a related field.
  • At least 5 years of experience in Reliability Engineering, QA, or customer-facing engineering.
  • Experience operating ClickHouse or other SQL databases in production.
  • Strong understanding of distributed database internals and SQL.
  • Scripting experience with Shell or Python, and ability to read C++ code.
  • Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform.
  • Strong problem-solving skills and production debugging capabilities.
  • Ability to thrive in a fast-paced, global team environment.
  • High level of responsibility, ownership, and accountability.
  • Excellent communication skills.

Benefits

  • Flexible work environment with remote-friendly options.
  • Employer contributions towards healthcare.
  • Equity in the company with stock options for new team members.
  • Flexible time off in the US and generous entitlement in other countries.
  • A $500 home office setup for remote employees.
  • Opportunities for global gatherings and company-wide offsites.

Tech Stack

AWSAzureC++ClickHouseGoogle Cloud PlatformPythonSQL

Categories

BackendData EngineeringDevOps