6 months ago
San Francisco, CA, USAMid Level / Senior
Responsibilities
- Architect GPU inference workflows for large-scale video models.
- Build cloud upload infrastructure for large-scale data ingestion.
- Design distributed processing pipelines using Kubernetes and event-driven systems.
- Develop and maintain the NomadicML Python SDK and APIs for video analysis.
- Implement observability solutions for monitoring GPU utilization and job performance.
- Support frontend integrations through backend endpoints and SDK bindings.
Requirements
- Deep proficiency in Python, Go, or TypeScript for backend systems.
- Experience with AWS, GCP, or Azure cloud services.
- Strong understanding of GPU inference scaling and Kubernetes.
- Prior experience designing REST/gRPC APIs and developer-facing infrastructure.
- Familiarity with asynchronous job orchestration tools like Ray or Airflow.
- A practical mindset for making research-grade systems reliable and usable.
