Sr. Site Reliability Engineer
Netskope
16 days ago
San José, Costa Rica
Senior / Staff+
H-1B Sponsor
Responsibilities
- Partner closely with development teams to architect and build highly available and secure features.
- Develop innovative methods to measure, monitor, and report application and infrastructure health.
- Gain deep knowledge of the application stack.
- Improve the performance of micro-services and address scaling issues.
- Manage capacity planning and ensure efficient resource utilization.
- Participate in 24/7 on-call rotations with development teams.
- Debug and optimize code while automating routine tasks.
- Drive efficiencies in systems and processes through performance tuning and root cause analysis.
Requirements
- 8+ years of experience troubleshooting Unix/Linux systems.
- Experience in managing large-scale web operations.
- Proficiency in programming languages such as C, C++, Java, Python, Go, Perl, or Ruby.
- Knowledge of algorithms, data structures, and software design.
- Hands-on experience with cloud services in a scalable production environment.
- Familiarity with continuous integration and deployment tools like Jenkins and Ansible.
- Knowledge of distributed systems is a plus.
- Strong interpersonal communication skills and ability to work in a diverse team environment.
- Experience leading teams and collaborating cross-functionally.
- BSCS or equivalent required; MSCS or equivalent preferred.
Benefits
- Opportunity to work with cutting-edge cloud tools and infrastructure management.
- Engage in solving complex challenges that enhance technical and analytical skills.
- Contribute to a market-leading product that supports a global customer base.
Tech Stack
Ansible, C, C++, Docker, Go, Google Cloud, Java, Jenkins, Kubernetes, Linux, Perl, Python, Ruby, Spinnaker
Categories
Backend, DevOps