Staff Site Reliability Engineer
Sumo Logic
Bengaluru, India
Staff+
H1B Sponsor
Responsibilities
- Support engineering teams by maintaining and executing a reliability roadmap.
- Collaborate with developer infrastructure and SRE teams to refine the reliability roadmap.
- Define and manage SLOs for teams within your product area.
- Participate in on-call rotations to improve the on-call experience.
- Complete projects to optimize the on-call experience for engineering teams.
- Improve the lifecycle of microservices from inception to refinement.
- Write code and automation to reduce operational workload and improve efficiency.
- Work with developer infrastructure teams to advance the reliability roadmap.
- Scale systems sustainably through automation and push for reliability improvements.
- Facilitate blame-free root cause analysis meetings for incidents.
- Drive root cause identification and issue resolution with teams.
- Work in a fast-paced iterative environment.
- Hire and mentor new team members.
Requirements
- Cloud-native application development experience leveraging best practices.
- Strong debugging and troubleshooting skills across the technology stack.
- Deep understanding of AWS Networking, Compute, Storage, and managed services.
- Competency with modern CI/CD and infrastructure tooling such as Kubernetes, Terraform, Ansible, and Jenkins.
- Experience with full-lifecycle support of services, from creation to production support.
- Versed in Infrastructure as Code practices using Terraform or CloudFormation.
- Ability to author production-ready code in Java, Scala, or Go.
- Experience with Linux systems and command line proficiency.
- Understanding of modern approaches to cloud-native software security.
- Experience with agile frameworks like Scrum and Kanban.
- Flexibility to step into new roles and responsibilities.
- Willingness to learn and use Sumo Logic products for reliability and security.
- Bachelor’s or Master’s Degree in Computer Science, Electrical Engineering, or a related field.
- 8+ years of professional experience in an applied software security role.
Tech Stack
Ansible, Apache Kafka, AWS, Go, Java, Jenkins, Kubernetes, Linux, Python, Scala, Sumo Logic, Terraform
Categories
DevOps, Security