about 5 hours ago
Boston, MA, USASenior / Staff+
H1B Sponsor
Base Salary
$119k - $221k/yr
Responsibilities
- Design and operate cloud infrastructure on AWS for core SaaS and AI services.
- Build and maintain AI/ML infrastructure and monitoring for LLM-powered services.
- Establish and enforce infrastructure-as-code standards using Terraform.
- Implement observability frameworks for data integrity and automated regression detection.
- Build deployment automation to eliminate human-memory dependencies.
- Support big data infrastructure for analytics and AI training workflows.
- Implement security and compliance controls for AI workloads in regulated environments.
- Drive environment parity with automated drift detection and remediation.
- Improve disaster recovery capabilities with documented procedures and testing.
- Lead architecture reviews for new services and integrations.
Requirements
- 6+ years of DevOps/SRE/Platform Engineering experience.
- 2+ years of experience with AI/ML infrastructure.
- Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
- Experience building infrastructure for traditional and AI/ML workloads at a SaaS company.
- Deep experience with AWS cloud services like ECS, Lambda, and Redshift.
- Strong infrastructure-as-code skills with Terraform.
- Understanding of data infrastructure and analytics at scale.
- Experience in compliance-sensitive environments is a plus.
- Strong communication skills with technical and non-technical stakeholders.
- Proficiency in at least one programming language such as Python, Go, or TypeScript.
Benefits
- Equity grants for all employees.
- A 4% matching 401(k) program.
- Medical, dental, vision, disability, and life insurance for employees working 30+ hours per week.
- Monthly wellness stipend.
- Paid parental leave.
- Flexible vacation policy.
