Responsibilities
- Design and build SRE platform systems and capabilities with cutting-edge AI.
- Participate in livesite monitoring rotations and handle escalations.
- Drive availability, scalability, and performance improvements based on livesite learnings.
- Ensure technical deliverables meet or exceed expectations on reliability and performance.
- Onboard other teams onto your platforms through hands-on work, owning the outcome yourself.
- Ship early, seek feedback, and iterate fast.
- Drive task planning, estimation, scheduling, and staffing.
- Mentor Software Engineers through hands-on coaching and training.
- Participate in and influence process improvements across the engineering organization.
Requirements
- 10+ years of experience architecting and engineering large-scale, distributed applications.
- Experience building complex internal platforms adopted by multiple teams.
- Demonstrated ability to drive adoption of systems through hands-on work.
- Experience with AI-powered applications in production.
- Proficiency in object-oriented languages such as C#, C++, Java, or Python.
- Deep understanding of data structures, algorithms, and cloud programming.
- Experience with service-oriented and microservice-based architectures.
- Familiarity with agile development, CI/CD, and DevOps practices.
- Experience with production Kubernetes infrastructure is a plus.
- Experience with cloud providers and database backends is a plus.