about 15 hours ago
Responsibilities
- Design and operate cloud infrastructure for petabyte-exabyte-scale systems.
- Manage and evolve production Kubernetes clusters for high availability.
- Build and optimize CI/CD pipelines for faster and safer deployments.
- Improve developer experience by automating workflows.
- Contribute to end-to-end testing and release-confidence systems.
- Enhance observability across logging, metrics, and alerting.
- Collaborate on architecture, reliability, and scaling challenges.
- Guide customer-facing technical integrations and troubleshoot issues.
- Drive operational excellence through automation and best practices.
Requirements
- 5+ years in infrastructure, platform, or distributed systems engineering.
- Strong coding skills in Go, Java, Python, or similar languages.
- Production-grade Kubernetes experience with cloud and containerization skills.
- Deep experience with CI/CD systems and cloud infrastructure.
- Proficiency with infrastructure-as-code tools like Terraform.
- Ability to debug complex cross-layer issues.
- Track record of designing scalable and reliable systems.
- Thrives in fast-paced startup environments with strong communication skills.
Benefits
- Competitive salary, meaningful equity, and substantial bonuses for top performers.
- Flexible time off and comprehensive health coverage for you and your family.
- Support for research, publication, and deep technical exploration.
