2 days ago
Denver, CO, USA or San Francisco, CA, USA
Mid Level / Senior
H1B Sponsor
Base Salary
$127k - $176k/yr
Responsibilities
- Design, build, ship, and maintain core observability libraries and tools.
- Troubleshoot complex production issues related to performance and availability.
- Participate in a cross-organization incident response team.
- Contribute to architectural discussions within the SRE team.
- Influence cross-team projects and the reliability roadmap.
- Provide consultation and feedback to ensure reliable and scalable systems.
Requirements
- Bachelor’s degree in Computer Science or related field, or equivalent experience.
- 2+ years of software engineering experience, with 1+ years focused on reliability.
- Proficiency in Python, Go, or Ruby in Linux environments.
- Experience with production systems in AWS or Azure using Kubernetes and Docker.
- Skilled in observability and incident response practices using tools like Datadog and Grafana.
- Strong collaboration and communication skills, with experience leading small projects.
- An A-player mindset with a strong bias for action.
Benefits
- Fast-paced and collaborative work environment.
- Learning and development allowance.
- Competitive cash and equity compensation.
- 100% medical, dental, and vision coverage.
- Up to $25K reimbursement for fertility, adoption, and parental planning services.
- Flexible PTO policy.
- Monthly wellness stipend.
Tech Stack
AWSAzureDatadogDockerGoGrafanaKubernetesLinuxPrometheusPythonRubySplunkTerraform
Categories
BackendDevOps