about 3 hours ago
Atlanta, GA, USA
Senior / Staff+
H1B Sponsor
Base Salary
$116k - $155k/yr
Responsibilities
- Define and own the enterprise-wide observability architecture.
- Evaluate, select, and standardize observability tooling to optimize costs.
- Design scalable data pipelines and storage strategies for telemetry data.
- Create Terraform modules and Helm charts for observability infrastructure.
- Establish instrumentation standards using the OpenTelemetry framework.
- Define and champion SLO/SLI/error-budget frameworks across teams.
- Serve as a senior escalation point during critical incidents.
- Provide architectural mentorship to Observability Engineers and SRE team members.
Requirements
- 5-8 years of experience in Observability Architecture, SRE, or Platform/Infrastructure Engineering.
- Post-secondary Diploma/Degree in Engineering, Computer Science, or a related field.
- Mastery of the OpenTelemetry ecosystem and knowledge of Prometheus-compatible metrics systems.
- Advanced experience with tracing systems and log aggregation platforms.
- Expert-level proficiency in cloud infrastructure, preferably GCP, and Kubernetes architecture.
- Strong software engineering skills in Go, Python, or similar languages.
- Excellent communication skills to influence technical direction.
- Preferred certifications include Google Cloud Professional Cloud Architect or Certified Kubernetes Administrator.
Benefits
- Flex working arrangements.
- Home office reimbursement program.
- Baby bonus and parental leave top-up program.
- Online learning and networking opportunities.
- Electric vehicle purchase incentive program.
- Competitive medical and dental benefits.
- Retirement savings program.
Tech Stack
GoGoogle BigQueryGrafanaHelmKubernetesPrometheusPythonTerraform
Categories
AI & MLData EngineeringDevOps
