Responsibilities
- Design and operate high-throughput, data-intensive ingestion and trace-query systems.
- Build monitoring, alerting, and automated recovery for resilient pipelines.
- Define and enforce API, SDK, and CLI standards across multiple programming languages.
- Build and maintain integrations that let LangSmith be used with any framework.
- Debug performance bottlenecks and optimize database queries.
- Participate in an on-call rotation focused on post-incident learning and automation.
Requirements
- Hands-on experience designing and running data-intensive systems at scale.
- Track record of building high-quality, widely adopted CLIs, SDKs, or API standards.
- Production experience with OSS datastores like PostgreSQL and Redis.
- Strong backend software engineering skills in Go, Python, or TypeScript.
- Solid knowledge of cloud object storage, Kubernetes, and cloud platforms like GCP or AWS.
- Hands-on experience with observability stacks such as Datadog or Prometheus.
Benefits
- Medical, dental, and vision coverage.
- Flexible vacation policy.
- 401(k) plan.
- Meals on in-office days in the US.
