Bretton AI

Software Engineer, Infrastructure

3 months ago
San Francisco, CA, USA
Senior / Staff+

Base Salary

$168k - $213k/yr

Responsibilities

  • Own and evolve Kubernetes infrastructure, including cluster management and container security policies.
  • Design and implement progressive delivery pipelines with automated rollbacks and deployment health validation.
  • Build and maintain observability infrastructure in Datadog, including dashboards and distributed tracing.
  • Drive incident response for high-severity outages and model capacity needs for AI inference.
  • Architect and automate secure infrastructure using Infrastructure-as-Code for cloud deployments.
  • Maintain and improve infrastructure controls for SOC 2 compliance.
  • Lead customer engagements for enterprise rollouts and mentor mid-level engineers.

Requirements

  • 8+ years in infrastructure engineering or DevOps at high-growth companies.
  • Experience with Docker and Kubernetes, including production cluster management.
  • Proven track record of architecting and operating AWS, GCP, or Azure environments at enterprise scale.
  • Experience with observability platforms, preferably Datadog.
  • Strong background in Infrastructure-as-Code and safe deployment practices.
  • Strong programming skills in Python.

Benefits

  • $168k - $213k/yr base salary + equity.
  • Comprehensive healthcare and 401k matching.
  • 15 days PTO + holidays and unlimited sick days.
  • Flexible leave options.
  • DoorDash meals and Uber rides home covered when working late.

Tech Stack

AWS, Azure, Datadog, Docker, Google Cloud Platform, Helm, Kubernetes, Python, Terraform, TypeScript
