Responsibilities
- Lead high-impact platform projects to enhance developer experience, reliability, and security.
- Design tooling and workflows for AI-assisted development.
- Own Infrastructure-as-Code for Kubernetes, AWS, and GCP using Terraform.
- Evolve CI/CD processes to improve deployment speed and safety.
- Drive observability initiatives and manage dashboards and SLOs.
- Participate in on-call duties and lead incident response efforts.
- Reduce technical debt with pragmatic remediation plans.
- Translate product needs into actionable platform features.
- Mentor junior engineers and support their professional growth.
- Shape the platform roadmap by identifying high-leverage opportunities.
Requirements
- 5+ years of experience in software engineering, DevOps, or Site Reliability Engineering.
- B.S. in Computer Science or a related technical field.
- Production experience with Kubernetes and containerized applications.
- Proficiency in at least one programming language, preferably Golang or Python.
- Working knowledge of AWS core services and networking/security fundamentals.
- Familiarity with GitOps workflows and the CNCF ecosystem.
- A track record of delivering projects that improve reliability and performance.
- Curiosity about AI's role in infrastructure work.
- Strong communication skills, including the ability to explain complex topics clearly.
- Ability to navigate ambiguity and make data-driven decisions.