Alpaca

Site Reliability Engineer

about 2 hours ago
Remote, Worldwide
Mid Level / Senior
H1B Sponsor

Responsibilities

  • Operate production day-to-day, including on-call duties and incident response.
  • Define and refine SLIs, SLOs, and error budgets to guide reliability practices.
  • Enhance observability across metrics, logs, traces, and alerting.
  • Implement infrastructure as code using a GitOps workflow.
  • Manage PostgreSQL performance tuning, schema reviews, and online migrations.
  • Mentor engineers on reliability and database fundamentals.

Requirements

  • 4+ years of experience in SRE, DevOps, or backend engineering with production operations ownership.
  • Hands-on experience with Kubernetes and GitOps workflows.
  • Solid knowledge of PostgreSQL in production environments.
  • Understanding of cloud networking fundamentals.
  • Proficient with Linux at the operator level.
  • Experience in incident response and structured debugging.
  • Working proficiency in Go or Python, with strong communication skills.
  • Genuine interest in databases and PostgreSQL expertise.

Benefits

  • Competitive Salary & Stock Options
  • Health Benefits
  • One-time USD $500 for new hire home-office setup
  • Monthly stipend of USD $150 via a Brex Card

Tech Stack

Apache Kafka, Go, Kubernetes, PostgreSQL, Python, RabbitMQ

Categories