GrepJob
AlayaCare

Senior Site Reliability Engineer

AlayaCare
Apply
29 days ago
Melbourne, AustraliaSenior

Responsibilities

  • Design, build, and maintain infrastructure and platform services, including Kubernetes and observability tooling.
  • Implement infrastructure as code, configuration management, and automated testing.
  • Contribute to code and configuration reviews to improve scalability and maintainability.
  • Monitor production systems and troubleshoot issues.
  • Participate in on-call rotations and post-incident reviews.
  • Partner with product and engineering teams to translate requirements into infrastructure solutions.
  • Identify risks related to operability, security, performance, and cost.
  • Contribute to operational quality through runbooks and process improvements.

Requirements

  • 5+ years of experience in SRE/DevOps or a similar role.
  • Hands-on experience with AWS and Terraform.
  • Practical experience running workloads on Docker and Kubernetes.
  • Proficiency in at least one development or scripting language (e.g., Python, Go, Bash).
  • Knowledge of infrastructure as code (CloudFormation or Terraform).
  • Familiarity with APM, logging, and metrics systems (e.g., New Relic, Prometheus, ELK).
  • Background knowledge of system and network security fundamentals.
  • Experience participating in incident management.

Benefits

  • Hybrid working model: 2 days in office, 3 days WFH.
  • Competitive salary plus company stock (RSUs).
  • 5 wellness days per year.
  • $1,000/year flexible benefits package.
  • 22 weeks company-paid parental leave.
  • 2 days company-paid volunteer leave.
  • Team lunches, events, and wellness activities.
  • Inclusive and collaborative culture.

Tech Stack

AWSAzureBashDockerGoKubernetesMySQLPostgreSQLPrometheusPythonTerraform

Categories