Appspace

Senior DevOps & Site Reliability Engineer - Americas

about 4 hours ago
Toronto, Canada
Senior

Responsibilities

  • Identify and automate manual 'toil' in large-scale VM environments.
  • Lead the integration of AI tools for enhanced operational efficiency.
  • Design and maintain self-service deployment frameworks and CI/CD pipelines.
  • Evaluate platform components for cost-effective automation or migration.
  • Design a comprehensive observability stack across Azure and GCP.
  • Collaborate with engineering, security, and operations teams for reliable feature delivery.
  • Investigate complex performance defects across various system tiers.
  • Ensure platforms meet security standards through automated policy enforcement.

Requirements

  • 6+ years in DevOps or SRE roles with a focus on cloud environments.
  • Extensive experience with Microsoft Azure and/or Google Cloud Platform.
  • Expert-level skills in PowerShell and Python, with hands-on experience in Bicep or Terraform.
  • Strong background in Windows and Linux server operating systems and Kubernetes.
  • Familiarity with middleware and PaaS technologies.
  • Expert-level troubleshooting skills in large-scale platform environments.

Benefits

  • Generous PTO.
  • 5 additional days off for training.
  • Flexible work schedules.
  • Remote work opportunities.
  • Appspace Quiet Fridays with no non-essential meetings.
  • Paid company holidays.

Tech Stack

Azure, Bamboo, GitHub Actions, Google Cloud Platform, Helm, Kubernetes, Linux, Microsoft SQL Server, MongoDB, MySQL, PowerShell, Python, RabbitMQ, Terraform