
Senior Service Reliability Engineer
PlayStation Global2 days ago
Berlin, GermanySenior / Staff+
Responsibilities
- Take a leadership role in ongoing improvements in reliability and scalability.
- Work closely with SRE Management to define KPIs and drive continuous improvement.
- Influence the architecture and implementation of solutions within the division.
- Mentor junior SRE staff and enable their success.
- Act as a voice to represent SRE in the wider organization.
- Lead small-scale projects from inception to implementation.
- Design platform-wide solutions and provide technical leadership during their implementation.
- Demonstrate a high level of organizational skills and initiative.
Requirements
- Minimum of 7+ years working experience in Software Development and/or Linux Systems Administration role.
- Strong interpersonal, written and verbal communication skills.
- Available to be scheduled in on-call rotation.
- Proficient as a Linux Production Systems Engineer, with experience managing large scale Web Services infrastructure.
- Development experience in one or more programming languages, preferably Python, Bash, Go, Java, C++, or Rust.
- Experience with distributed data storage at scale, NoSQL at scale, data aggregation technologies, and traditional RDBMS with High Availability.
- Familiarity with monitoring and alerting tools, Kubernetes, AWS, software distribution, and configuration management.