FanDuel

Staff Observability Engineer

Posted 5 days ago
Atlanta, GA, USA
Staff+

Base Salary

$170k - $213k/yr

Responsibilities

  • Define and drive the observability strategy and roadmap across multiple teams.
  • Design and improve scalable observability capabilities that deliver actionable insights.
  • Establish best practices for monitoring, alerting, incident management, and postmortems.
  • Drive operational excellence by evolving incident management and on-call practices.
  • Lead initiatives to improve end-to-end reliability and resolve systemic risks.
  • Leverage automation and AI to accelerate root cause analysis.
  • Partner with leadership to translate observability insights into strategic decisions.
  • Identify trends to proactively detect and mitigate large-scale issues.
  • Optimize observability platforms for cost and scalability.
  • Mentor engineers to enhance reliability and observability maturity.

Requirements

  • Significant hands-on experience in observability engineering or related roles.
  • Strong expertise in monitoring and observability, particularly with Datadog.
  • Experience defining and driving observability or reliability strategy.
  • Proficiency with Kubernetes, AWS, and infrastructure-as-code tools like Terraform.
  • Proven ability to influence technical direction across teams.
  • Deep understanding of distributed systems principles and trade-offs.
  • Experience implementing SLOs, SLIs, and alerting strategies.
  • Strong software engineering fundamentals in at least one modern programming language.
  • Experience driving improvements through automation and reducing organizational toil.
  • Strong analytical skills to translate technical signals into business impact.
  • Excellent communication and stakeholder management skills.
  • A mindset of ownership focused on long-term impact and continuous improvement.

Benefits

  • A range of health plans, including mental health support and fitness benefits.
  • Generous paid time off and sick leave.
  • Annual bonus and long-term incentive opportunities based on performance.
  • 401(k) with up to a 5% match.
  • Commuter benefits and pet insurance.

Tech Stack

Ansible, AWS, Datadog, Go, Helm, Java, Kubernetes, Python, Terraform, TypeScript, Vault

Categories