Hebbia

Software Engineer, Site Reliability

2 months ago
San Francisco, CA, USA or New York, NY, USA
Mid Level / Senior
H1B Sponsor

Base Salary

$160k - $300k/yr

Responsibilities

  • Own critical production services end-to-end, from design to incident response.
  • Profile, benchmark, and rewrite hot paths to eliminate bottlenecks.
  • Lead incident response and foster a post-mortem culture that drives architectural improvements.
  • Design and build observability frameworks and custom instrumentation.
  • Define and enforce SLOs across platform services.
  • Own capacity planning and cost efficiency to prevent resource exhaustion.
  • Build robust internal platforms and deployment tooling.
  • Continuously improve CI/CD systems so engineering can release safely and quickly.
  • Embed with product engineering teams as a peer software engineer.
  • Partner on infrastructure security through threat modeling and compliance tooling.

Requirements

  • 5+ years of software development experience with a focus on production services.
  • Proficiency in at least one systems or backend language: Go, Python, C++, or Rust.
  • Experience as a Production Engineer, SRE, or software engineer with a focus on infrastructure.
  • Deep understanding of distributed systems.
  • Expertise in container orchestration and debugging complex distributed failures.
  • Working knowledge of OS-level concepts.
  • Fluency in cloud platforms, preferably AWS.
  • Experience in building and maintaining observability stacks.
  • Strong CI/CD pipeline expertise with a focus on improving developer velocity.
  • Background in a company with a Production Engineering or software-focused SRE culture is a plus.

Benefits

  • Unlimited PTO.
  • Medical, dental, and vision insurance, plus a 401(k).
  • Catered lunch daily and DoorDash dinner credit for late work.
  • Parental leave policy: 3 months for non-birthing parents, 4 months for birthing parents.
  • Fertility benefits with a $15k lifetime benefit.
  • Competitive equity package with unmatched upside potential.

Tech Stack

AWS, C++, Go, Python, Rust