GrepJob
PlayStation Global

Senior Service Reliability Engineer

PlayStation Global
Apply
2 days ago
Berlin, GermanySenior / Staff+

Responsibilities

  • Take a leadership role in ongoing improvements in reliability and scalability.
  • Work closely with SRE Management to define KPIs and drive continuous improvement.
  • Influence the architecture and implementation of solutions within the division.
  • Mentor junior SRE staff and enable their success.
  • Act as a voice to represent SRE in the wider organization.
  • Lead small-scale projects from inception to implementation.
  • Design platform-wide solutions and provide technical leadership during their implementation.
  • Demonstrate a high level of organizational skills and initiative.

Requirements

  • Minimum of 7+ years working experience in Software Development and/or Linux Systems Administration role.
  • Strong interpersonal, written and verbal communication skills.
  • Available to be scheduled in on-call rotation.
  • Proficient as a Linux Production Systems Engineer, with experience managing large scale Web Services infrastructure.
  • Development experience in one or more programming languages, preferably Python, Bash, Go, Java, C++, or Rust.
  • Experience with distributed data storage at scale, NoSQL at scale, data aggregation technologies, and traditional RDBMS with High Availability.
  • Familiarity with monitoring and alerting tools, Kubernetes, AWS, software distribution, and configuration management.

Tech Stack

AnsibleApache CassandraApache HadoopApache KafkaAWSBashC++ChefElasticsearchGoGrafanaJavaKubernetesLinuxMongoDBMySQLPostgreSQLPrometheusPuppetPythonRedisRust

Categories

DevOpsGaming