GrepJob
YugabyteDB

Staff Site Reliability Engineer

YugabyteDB
Apply
about 2 months ago
Remote, United StatesStaff+
H1B Sponsor

Base Salary

$220k - $250k/yr

Responsibilities

  • Define and drive the technical vision, architecture, and strategy for YugabyteDB’s DBaaS.
  • Lead, design, develop, test, debug, troubleshoot, and maintain components of the DBaaS cloud infrastructure.
  • Manage operational priorities of the DBaaS infrastructure.
  • Establish processes for handling and leading response to incidents on databases or infrastructure.
  • Automate and manage regular maintenance operations such as upgrades.
  • Design and build DBaaS processes for encryption, security key/password management, and storage management.
  • Utilize SRE golden signals to analyze and optimize the DBaaS system's performance and reliability strategies.

Requirements

  • Strong software design and implementation skills in building infrastructure frameworks.
  • 15+ years of experience as a SRE and 5+ years of technical leadership experience.
  • Experience in building and managing large-scale distributed systems.
  • Experience building and operating data systems for production applications, including fault tolerant designs.
  • Strong track record of Incident Response and Management in a managed service.
  • Experience with relational database systems, preferably PostgreSQL.
  • Familiarity with public cloud infrastructure (AWS, GCP, Azure).
  • Knowledge of containerization tooling, theory, and design (Docker, Kubernetes).
  • Experience with Infrastructure as Code (Terraform preferred).
  • Proficiency in automation scripting (Python and Bash preferred).
  • Solid understanding of Linux systems operations and troubleshooting.
  • Willingness and ability to learn new languages and concepts.

Benefits

  • Market competitive cash compensation ranging from USD 220,000-USD 250,000.
  • Equity options when applicable.
  • Health plans and retirement plans.
  • Unlimited paid time off (PTO).

Categories

Data EngineeringDevOps