
Site Reliability Engineer
PlayStation Global2 days ago
Berlin, GermanySenior
Responsibilities
- Lead team technical discussions focused on reliability and scalability improvements.
- Create High Level Designs (HLDs) for new products and platforms.
- Mentor junior SRE staff to enable their success.
- Lead incident response and post-mortem activities within your service team.
- Collaborate with engineers to prioritize reliability improvements.
- Contribute code to enhance system reliability.
- Implement automation to reduce ongoing operational toil.
Requirements
- Minimum of 5+ years of experience in Software Development and/or Linux Systems Administration.
- Strong interpersonal, written, and verbal communication skills.
- Availability for on-call rotation.
- Proficient as a Linux Production Systems Engineer managing large scale Web Services infrastructure.
- Development experience in Python, Bash, Go, Java, C++, or Rust.
- Experience with distributed data storage, NoSQL, data aggregation technologies, and traditional RDBMS.
- Familiarity with monitoring and alerting tools, Kubernetes, AWS, and configuration management.