19 days ago
San Francisco, CA, USA
Mid Level / Senior
Base Salary
$170k - $265k/yr
Responsibilities
- Own impactful runtime problems end-to-end, from architecture to production launch.
- Build and evolve core services for session lifecycle and streaming responses.
- Design for performance, correctness, and cost optimization.
- Integrate with leading LLM providers to improve quality and predictability.
- Harden the platform with fault isolation and graceful degradation.
- Instrument deep observability and create playbooks for high availability.
- Collaborate closely with product and application teams.
Requirements
- 3+ years of software engineering experience in production distributed systems.
- BS/BA in Computer Science or related field, or equivalent experience.
- Strong coding skills in Python, Go, Java, or C++.
- Product-minded with a focus on customer impact and clear SLAs.
- Ownership-driven with a proactive attitude.
- Experience operating services on Kubernetes and major cloud platforms.
- Familiarity with event/streaming systems and caching for low-latency paths.
- Practical understanding of LLM and agent building blocks.
- Strong observability and debugging skills.
Benefits
- Comprehensive benefits package including medical, vision, and dental coverage.
- Generous time-off policy and 401(k) plan contributions.
- Home office improvement stipend and annual education and wellness stipends.
- Vibrant company culture with regular events and healthy daily lunches.
