GrepJob
Coalition

Senior Site Reliability Engineer

Coalition
Apply
1 day ago
Remote, Canada
Senior / Mid Level
H1B Sponsor

Base Salary

$136k - $215k/yr

Responsibilities

  • Design, build, and scale production environments using AWS and Terraform.
  • Lead efforts to improve platform resilience through failure-based testing and automated recovery strategies.
  • Own the design and delivery of reusable platform components and self-service tools.
  • Define and evolve observability standards across the platform.
  • Manage projects end to end from scoping to successful rollout.
  • Mentor engineers and uphold high infrastructure quality.
  • Engage in technical design discussions and adapt strategies based on team input.

Requirements

  • 6+ years of experience in SRE, DevOps, Cloud Engineering, or Software Development roles.
  • Hands-on experience operating production environments in AWS.
  • Proficiency in Go or Python with experience in building production-grade automation.
  • Strong experience with Terraform.
  • Experience with container orchestration platforms like ECS or Kubernetes.
  • Familiarity with CI/CD tools such as GitHub Actions.
  • Experience designing and implementing reusable platform components.
  • Solid understanding of observability practices including system metrics and SLOs.
  • Exposure to failure-based testing approaches and automated recovery strategies.
  • Strong leadership and communication skills.

Benefits

  • 100% medical, dental, and vision coverage.
  • Flexible PTO.
  • Annual home office stipend and WeWork access.
  • Mental and physical health wellness programs.
  • Competitive compensation and opportunity for advancement.

Tech Stack

Apache KafkaAWSGitHub ActionsGoKubernetesPythonTerraform

Categories

DevOpsSecurity