
Staff Site Reliability Engineer
FloSports, Inc.
6 months ago
Responsibilities
- Lead the migration from a legacy GCP environment to AWS EKS.
- Architect and design core infrastructure patterns for Terraform and GitOps.
- Champion a culture of SLOs by defining and implementing SLO strategies for critical user journeys.
- Develop critical tooling and automation in Node.js and Go.
- Oversee the evolution of the k6-based load testing platform.
- Act as a subject matter expert for the Istio service mesh.
- Spearhead high-priority initiatives in SRE domains.
- Participate in a blameless on-call rotation and mentor engineers through incidents.
Requirements
- 8-10+ years of experience in SRE, DevOps, or Software Engineering.
- Proven technical leadership and mentoring experience.
- Expertise in Node.js or Go with a history of building automation.
- Expert-level understanding of Kubernetes, especially EKS.
- Terraform expertise in designing large-scale IaC frameworks.
- Experience in designing observability strategies using platforms like Datadog.
- Proficient in building and scaling CI/CD systems, ideally with GitHub Actions.
- Strong systems thinking skills to solve complex problems.
Benefits
- Recognized as a Top Workplace by the Austin American-Statesman for three consecutive years.
- Flexible work schedule to balance professional and personal life.
- Annual equity awards for top performers.
- Comprehensive medical, dental, and vision plans.
- Company-paid short-term and long-term disability insurance.
- Generous 401(k) company match, vested immediately.
- Progressive parental leave policies and flexible paid time off.
- Hackathons and team-building events.
- Stocked snack bar and weekly catered meals.