Intermediate Site Reliability Engineer, Environment Automation
GitLab
4 months ago
Remote, Worldwide
Mid Level
Responsibilities
- Automate the provisioning, configuration, and management of GitLab environments using Terraform, Ansible, and Kubernetes.
- Investigate and troubleshoot issues in Kubernetes clusters and GitLab services.
- Write and maintain Terraform modules and scripts for routine operations.
- Monitor environment health using tools like Prometheus, ELK, and Grafana.
- Participate in the incident response process and support resolution efforts.
- Collaborate with Infrastructure and Development teams to enhance platform reliability.
Requirements
- Familiarity with Terraform and Ansible for managing cloud infrastructure.
- Experience using kubectl, Helm, or Kustomize with Kubernetes clusters.
- Basic programming skills to read and modify infrastructure tooling in Go or Ruby.
- Experience working with multiple environments or customer setups.
- Familiarity with observability tools and logs for troubleshooting.
- Ability to work well in cross-functional teams and a willingness to learn.
- Experience participating in on-call rotations for production systems.
Benefits
- Benefits to support your health, finances, and well-being.
- Flexible Paid Time Off.
- Team Member Resource Groups.
- Equity Compensation & Employee Stock Purchase Plan.
- Growth and Development Fund.
- Parental leave.
- Home office support.
Tech Stack
AnsibleAWSGoGoogle Cloud PlatformGrafanaHelmKubernetesPrometheusRubyTerraform
Categories
DevOpsSecurity