Staff Software Engineer
Nubank
Toronto, Canada
Staff+
Responsibilities
- Lead the long-term roadmap for reliability and resilience.
- Execute Chaos Engineering experiments and Disaster Recovery simulations.
- Implement robust SLOs and SLIs across the organization.
- Provide training and architectural patterns to product squads.
Requirements
- Expertise in architecting and maintaining high-availability systems in public cloud environments, preferably AWS.
- Deep experience with advanced root cause analysis and with building feedback loops that prevent incident recurrence.
- Hands-on experience defining and implementing SLOs, SLIs, and error budgets in distributed microservices architectures.
- Real-world experience implementing Chaos Engineering and Disaster Recovery planning in production-scale environments.
Benefits
- Health Insurance
- Life Insurance
- Pension Plan
- Extended maternity and paternity leave
- Nucleo - Our course-based learning platform
- NuLanguage - Our language learning program
- NuCare - Our mental health and wellness assistance program
- Vacation
Tech Stack
AWS
Categories
DevOps, Security