GrepJob
Solvd

Infrastructure / Site Reliability Engineer (SRE)

Solvd
Apply
about 19 hours ago
Buenos Aires, Argentina +4 moreMid Level / Senior
H1B Sponsor

Responsibilities

  • Design, provision, and maintain secure and scalable cloud infrastructure.
  • Write and maintain Terraform or OpenTofu scripts for immutable infrastructure.
  • Manage and optimize containerized environments using Docker and Kubernetes.
  • Build and maintain CI/CD pipelines for zero-downtime deployments.
  • Implement GitOps workflows for automated application delivery.
  • Develop custom tools and scripts to automate repetitive tasks.
  • Design and implement observability stacks for monitoring and alerting.
  • Conduct chaos engineering and load testing for system resilience.
  • Participate in on-call rotation and drive root-cause analysis.

Requirements

  • 3+ years of experience in an SRE, DevOps, or Cloud Infrastructure role.
  • Deep production experience with at least one major cloud provider.
  • Strong proficiency with Terraform and managing Kubernetes clusters.
  • Solid understanding of Linux networking, internals, and security fundamentals.
  • Strong coding skills in Go or Python are preferred.
  • Good grasp of VPC architecture, DNS, and load balancers is beneficial.
  • Familiarity with cloud-native databases and caching layers is a plus.

Benefits

  • Opportunity to work on real-world AI-driven projects across key industries.
  • Collaborate with a global team across continents and cultures.
  • Thrive in an inclusive environment prioritizing continuous learning and innovation.

Tech Stack

AWSAzureBashDatadogDockerGitHub ActionsGitLab CI/CDGoGoogle Cloud PlatformGrafanaJenkinsKubernetesPostgreSQLPrometheusPythonRedisTerraform

Categories