Sr. Site Reliability Engineer

Netskope

16 days ago

San José, Costa Rica

Senior / Staff+

H1B Sponsor

Responsibilities

Partner closely with development teams to architect and build highly available and secure features.
Develop innovative methods to measure, monitor, and report application and infrastructure health.
Gain deep knowledge of the application stack.
Improve the performance of micro-services and address scaling issues.
Manage capacity planning and ensure efficient resource utilization.
Participate in 24X7 on-call rotations with development teams.
Debug and optimize code while automating routine tasks.
Drive efficiencies in systems and processes through performance tuning and root cause analysis.

8+ years of experience troubleshooting Unix/Linux systems.
Experience in managing large-scale web operations.
Proficiency in programming languages such as C, C++, Java, Python, Go, Perl, or Ruby.
Knowledge of algorithms, data structures, and software design.
Hands-on experience with cloud services in a scalable production environment.
Familiarity with continuous integration and deployment tools like Jenkins and Ansible.
Knowledge of distributed systems is a plus.
Strong interpersonal communication skills and ability to work in a diverse team environment.
Experience leading teams and collaborating cross-functionally.
BSCS or equivalent required; MSCS or equivalent preferred.

Opportunity to work with cutting-edge cloud tools and infrastructure management.
Engage in solving complex challenges that enhance technical and analytical skills.
Contribute to a market-leading product that supports a global customer base.

AnsibleCC++DockerGoGoogle CloudJavaJenkinsKubernetesLinuxPerlPythonRubySpinnaker