11 months ago
San Francisco, CA, USA
Mid Level / Senior
Responsibilities
- Design and operate cloud infrastructure for AI agents.
- Create infrastructure as code and automated deployment pipelines.
- Implement autoscaling systems that handle usage spikes.
- Build monitoring, logging, and dashboards for system health.
- Implement security best practices including IAM and audit trails.
Requirements
- 4+ years of experience running distributed systems at scale on major cloud platforms.
- Proven record of owning infrastructure-as-code and CI/CD pipelines.
- Experience optimizing systems and databases for latency and cost.
- Fluency with modern monitoring and tracing tools.
- Understanding of enterprise security requirements and compliance needs.
- Product mindset with the ability to prioritize and iterate on ambiguous requirements.
