Staff SRE Engineer
Nubank
17 days ago
Toronto, Canada
Staff+
Responsibilities
- Lead initiatives to refine the strategic direction of the SRE team.
- Provide expert guidance for the design and maintenance of reliable data systems.
- Champion the adoption of advanced automation frameworks.
- Develop anomaly detection and predictive analytics mechanisms.
- Refine incident response protocols and conduct post-incident analysis.
- Mentor engineers and foster a culture of reliability engineering excellence.
Requirements
- Proven experience in SRE or Systems Engineering at a staff level.
- Solid experience with Clojure, Datomic, Scala, and Spark.
- Deep knowledge of managing workloads on AWS with Kubernetes and other services.
- Demonstrated ability to innovate and build automation frameworks.
- Experience defining Service Level Objectives and managing system observability.
- Ability to translate complex architectural challenges into scalable solutions.
Benefits
- Opportunity to earn equity at Nu.
- Medical, dental, and vision insurance.
- Life insurance and AD&D coverage.
- Extended maternity and paternity leave.
- Access to learning platforms and language programs.
- Mental health and wellness assistance program.
- 401(k) and savings plans.
- Work-from-home allowance.
- Relocation assistance package, if applicable.
Tech Stack
Apache Spark, AWS, Clojure, Kubernetes, Scala
Categories
AI & ML, Data Engineering, DevOps, Security