Site Reliability Engineer, GNC (Falcon)
SpaceX
4 months ago
Hawthorne, CA, USA
Mid Level / Senior
Base Salary
$120k - $145k/yr
Responsibilities
- Deploy, upgrade, operate/maintain, and scale GNC products and services.
- Provision and maintain virtual and physical servers.
- Monitor and maintain a 4000+ thread HPC cluster.
- Collaborate with GNC software engineers for product operability.
- Add monitoring for web apps and respond to outages.
- Manage GNC computational infrastructure with IT collaboration.
- Engage in the full lifecycle of services from design to refinement.
- Recommend future hardware purchases.
- Practice sustainable incident response and postmortems.
- Provide end-user support for GNC engineering products.
- Configure automated deployment pipelines for web apps.
- Develop or improve GNC web apps for usability and robustness.
- Document new software changes and improvements.
- Focus on performance bottlenecks and improvement techniques.
Requirements
- Bachelor’s degree in computer science, IT, engineering, math, or a related field, or equivalent professional experience.
- 2+ years of software development experience or 4+ years in site reliability or DevOps.
- Experience with Linux operating systems.
- Proficiency in Python and Python-based development frameworks.
- 2+ years of systems administration or site reliability engineering experience.
- Expertise with Docker, Vagrant, and Kubernetes.
- Experience with configuration management tools like Ansible or Terraform.
- Strong understanding of virtualization and hypervisor technologies.
- Knowledge of databases and data modeling.
- Experience managing multiple servers automatically.
- Strong networking knowledge of TCP/IP.
- Experience scaling web applications for performance.
- Familiarity with front-end technologies like HTML, CSS, and JavaScript.
- Solid understanding of UI/UX design principles.
- Experience with high-performance computing systems.
Benefits
- Comprehensive medical, vision, and dental coverage.
- 401(k) retirement plan with company matching.
- Short and long-term disability insurance.
- Life insurance and paid parental leave.
- 3 weeks of paid vacation and 10 or more paid holidays per year.
- Paid sick leave in accordance with company policy.
Tech Stack
AnsibleBackbone.jsBazelCSSDockerGradleHTMLJavaScriptKubernetesLinuxMakenpmPolymerPuppetPythonReactTerraformVagrant
Categories
BackendDevOpsTesting