about 3 hours ago
Remote, United States
Staff+
H1B Sponsor
Base Salary
$204k - $255k/yr
Responsibilities
- Drive platform reliability and operational excellence through deployment pipelines and observability tooling.
- Contribute to runtime resiliency initiatives for the multi-tenant GraphQL platform.
- Architect AI-powered operational tooling for incident triage and debugging.
- Shape the future of Viaduct Modern by improving developer experience.
- Investigate and resolve complex production issues.
- Design observability features and develop tooling for incident response automation.
- Lead technical design discussions and partner with tenant teams.
Requirements
- 9+ years of software engineering experience with a focus on backend systems.
- Deep expertise in observability and monitoring, including SLO frameworks.
- Proven track record in reliability engineering and incident response.
- Strong experience with performance tuning in JVM-based systems.
- Experience operating high-traffic systems with a focus on deployment safety.
- Familiarity with GraphQL or similar API technologies.
- Experience building developer tooling with a focus on self-service capabilities.
- Strong leadership and communication skills.
Benefits
- Eligible for bonus, equity, benefits, and Employee Travel Credits.
Tech Stack
GraphQLKotlin
Categories
AI & MLBackendDevOps