GitLab

Intermediate Site Reliability Engineer, Environment Automation

GitLab

Apply
4 months ago
Remote, Worldwide
Mid Level

Responsibilities

  • Automate the provisioning, configuration, and management of GitLab environments using Terraform, Ansible, and Kubernetes.
  • Investigate and troubleshoot issues in Kubernetes clusters and GitLab services.
  • Write and maintain Terraform modules and scripts for routine operations.
  • Monitor environment health using tools like Prometheus, ELK, and Grafana.
  • Participate in the incident response process and support resolution efforts.
  • Collaborate with Infrastructure and Development teams to enhance platform reliability.

Requirements

  • Familiarity with Terraform and Ansible for managing cloud infrastructure.
  • Experience using kubectl, Helm, or Kustomize with Kubernetes clusters.
  • Basic programming skills to read and modify infrastructure tooling in Go or Ruby.
  • Experience working with multiple environments or customer setups.
  • Familiarity with observability tools and logs for troubleshooting.
  • Ability to work well in cross-functional teams and a willingness to learn.
  • Experience participating in on-call rotations for production systems.

Benefits

  • Benefits to support your health, finances, and well-being.
  • Flexible Paid Time Off.
  • Team Member Resource Groups.
  • Equity Compensation & Employee Stock Purchase Plan.
  • Growth and Development Fund.
  • Parental leave.
  • Home office support.

Tech Stack

AnsibleAWSGoGoogle Cloud PlatformGrafanaHelmKubernetesPrometheusRubyTerraform

Categories

DevOpsSecurity