Site Reliability Engineer

about 1 month ago

Gothenburg, SwedenMid Level / Senior

H1B Sponsor

Responsibilities

Ensure performance, capacity, scalability, reliability, and security of the platform.
Make systemic improvements for recurring issues.
Perform Root Cause Analysis for outages.
Design and maintain scalable infrastructure on AWS.
Develop observability solutions using tools like Grafana and ELK.
Automate infrastructure provisioning using Terraform and Chef.
Participate in a 24/7 on-call rotation for production incidents.
Collaborate with engineering teams for high availability applications.
Identify and address performance bottlenecks.
Drive continuous improvement through automation and process optimization.

Apache Kafka AWSChefElasticsearchGrafanaKibanaKubernetesLogstashMongoDBPrometheusRabbitMQTerraform