4 days ago
Base Salary
$265k - $285k/yr
Responsibilities
- Design and develop production-grade distributed services in Rust using async/Tokio.
- Own the full service lifecycle from system design and implementation through deployment and operations.
- Build and optimize data-processing and transformation pipelines.
- Create and maintain integration tests with real service dependencies in containerized environments.
- Improve test determinism, stability, and reliability across distributed systems.
- Deploy and operate services using infrastructure-as-code practices.
- Implement safe rollout and rollback procedures using GitOps and CI/CD workflows.
- Develop and evolve observability systems including logs, metrics, and distributed tracing.
- Define service-level objectives (SLOs) and lead incident response.
- Design and maintain distributed cluster coordination systems.
- Plan and execute performance benchmarking and load testing.
- Drive performance optimization initiatives across distributed services.
- Apply fuzz testing techniques to critical components.
- Practice chaos engineering in lower environments.
- Participate in architecture and code reviews.
- Contribute to technical design documents and RFCs.
- Mentor peers and collaborate cross-functionally.
- Full-time telecommuting permitted from anywhere in the United States.
Requirements
- Bachelor’s degree in Computer Science, Management, or a closely related field.
- 5 years of progressively responsible experience in the job offered or a related occupation.
- 3 years of experience with Rust, including Tokio and asynchronous programming.
- 2 years of experience with Apache DataFusion and Apache Arrow.
- 3 years of experience creating integration tests with real dependencies using Docker.
- 2 years of experience with behavior-driven testing for distributed services.
- 2 years of experience with performance benchmarking and capacity planning.
- 2 years of experience with load testing using Locust and wrk.
- 1 year of experience with chaos engineering and fault injection.
- 2 years of experience designing and scaling distributed backend services.
- 3 years of experience with Kubernetes for production deployments.
- 3 years of experience with Terraform and infrastructure-as-code practices.
- 3 years of experience with advanced Redis patterns.
- 2 years of experience with PostgreSQL and OLAP databases.
- 2 years of experience with Ruby for backend and service tooling.
- 2 years of experience with Java or Kotlin for backend services.
- 3 years of experience implementing observability and CI/CD systems.