GrepJob
January

Senior SRE, Software Engineering

January
Apply
about 1 year ago

Base Salary

$205k - $225k/yr

Responsibilities

  • Lead incident response and establish sustainable on-call practices.
  • Develop and maintain self-service observability solutions using modern monitoring tools.
  • Create and maintain infrastructure as code for scalable and secure cloud environments.
  • Partner with feature teams to architect resilient infrastructure for critical components.
  • Design and implement robust CI/CD pipelines with advanced deployment strategies.
  • Advocate for best practices in feature design to ensure reliability.

Requirements

  • Expertise in leading incident response for high-availability production systems.
  • Experience designing highly available deployment architectures across multiple targets.
  • Track record of implementing effective monitoring and observability solutions.
  • Strong knowledge of AWS cloud services and infrastructure-as-code practices.
  • Experience with CI/CD pipelines and automation for reliable deployments.
  • Excellent communication skills and experience documenting processes.

Tech Stack

AWSDatadogPrometheusTerraform

Categories