11 months ago
Base Salary
$160k - $230k/yr
Responsibilities
- Design and implement the Models API for accessing training, evaluation, and deployment data.
- Ensure backwards compatibility and feature versioning across evolving schemas.
- Build scalable pipelines for ingesting, transforming, and serving large volumes of data.
- Create CI/CD workflows that align with changes to the underlying data schema.
- Enable fine-grained querying of historical and real-time data for networks.
- Define and enforce principles for data management to simplify downstream code.
- Collaborate with modelers to design reproducible and efficient training and evaluation pipelines.
- Own performance across key endpoints to meet real-time serving constraints.
Requirements
- Experience designing large-scale data infrastructure across batch and streaming modes.
- Strong understanding of schema design, versioning, and data quality.
- Ability to create user-friendly systems that minimize misuse.
- Experience collaborating closely with research or modeling teams.
- Comfortable in early-stage environments and building from scratch.
Benefits
- Medical, dental & vision coverage for you and your dependents.
- Annual memberships to One Medical, Headspace, and Wellhub.
- 401(k) options, including traditional and Roth.
- Flexible time off.
- Commuter benefits.
- Parental leave.
- Onsite meals at the San Francisco office.
Categories
Backend, Data Engineering
