Site Reliability Engineer II
Coalition
about 3 hours ago
Remote, United States
Mid Level / Senior
H1B Sponsor
Base Salary
$111k - $163k/yr
Responsibilities
- Design, build, and scale production environments using AWS and Terraform.
- Improve the resilience and operability of our platform through failure-based testing and automated recovery strategies.
- Design and implement reusable platform components and self-service tools to streamline the developer experience.
- Implement and maintain robust observability practices, including system metrics, distributed tracing, and SLO management.
- Guide junior engineers, uphold high infrastructure quality, and contribute to the team’s evolving best practices.
- Participate in technical design discussions, sharing feedback and adapting strategies based on team input and evolving requirements.
Requirements
- 4+ years in SRE, DevOps, Cloud Engineering, or Software Development roles.
- Hands-on experience operating and scaling production environments within AWS.
- Strong expertise with Terraform for managing complex cloud infrastructure.
- Proficiency in Go or Python, with experience building production-grade automation, tooling, or libraries.
- Experience with ECS or Kubernetes.
- Familiarity with modern deployment tools, specifically GitHub Actions.
- Strong written and verbal communication skills.
Benefits
- 100% medical, dental and vision coverage.
- Flexible PTO policy.
- Annual home office stipend and WeWork access.
- Mental & physical health wellness programs.
- Competitive compensation and opportunity for advancement.
Tech Stack
Apache KafkaAWSGitHub ActionsGoKubernetesPythonTerraform
Categories
DevOpsSecurity