5 days ago
Atlanta, GA, USAStaff+
Base Salary
$170k - $213k/yr
Responsibilities
- Define and drive the observability strategy and roadmap across multiple teams.
- Design and improve scalable observability capabilities for actionable insights.
- Establish best practices for monitoring, alerting, incident management, and postmortems.
- Drive operational excellence by evolving incident management and on-call practices.
- Lead initiatives to improve end-to-end reliability and resolve systemic risks.
- Leverage automation and AI to accelerate root cause analysis.
- Partner with leadership to translate observability insights into strategic decisions.
- Identify trends to proactively detect and mitigate large-scale issues.
- Optimize observability platforms for cost and scalability.
- Mentor engineers to enhance reliability and observability maturity.
Requirements
- Significant hands-on experience in observability engineering or related roles.
- Strong expertise in monitoring and observability, particularly with Datadog.
- Experience defining and driving observability or reliability strategy.
- Proficiency with Kubernetes, AWS, and infrastructure-as-code tools like Terraform.
- Proven ability to influence technical direction across teams.
- Deep understanding of distributed systems principles and trade-offs.
- Experience implementing SLOs, SLIs, and alerting strategies.
- Strong software engineering fundamentals in at least one modern programming language.
- Experience driving improvements through automation and reducing organizational toil.
- Strong analytical skills to translate technical signals into business impact.
- Excellent communication and stakeholder management skills.
- A mindset of ownership focused on long-term impact and continuous improvement.
Benefits
- Array of health plans including mental health support and fitness benefits.
- Generous paid time off and sick leave.
- Annual bonus and long-term incentive opportunities based on performance.
- 401k with up to a 5% match.
- Commuter benefits and pet insurance.
