SpaceX

Site Reliability Engineer, GNC (Falcon)

4 months ago
Hawthorne, CA, USA
Mid Level / Senior

Base Salary

$120k - $145k/yr

Responsibilities

  • Deploy, upgrade, operate/maintain, and scale GNC products and services.
  • Provision and maintain virtual and physical servers.
  • Monitor and maintain a 4000+ thread HPC cluster.
  • Collaborate with GNC software engineers for product operability.
  • Add monitoring for web apps and respond to outages.
  • Manage GNC computational infrastructure with IT collaboration.
  • Engage in the full lifecycle of services from design to refinement.
  • Recommend future hardware purchases.
  • Practice sustainable incident response and postmortems.
  • Provide end-user support for GNC engineering products.
  • Configure automated deployment pipelines for web apps.
  • Develop or improve GNC web apps for usability and robustness.
  • Document new software changes and improvements.
  • Focus on performance bottlenecks and improvement techniques.

Requirements

  • Bachelor’s degree in computer science, IT, engineering, math, or a related field, or equivalent professional experience.
  • 2+ years of software development experience or 4+ years in site reliability or DevOps.
  • Experience with Linux operating systems.
  • Proficiency in Python and Python-based development frameworks.
  • 2+ years of systems administration or site reliability engineering experience.
  • Expertise with Docker, Vagrant, and Kubernetes.
  • Experience with configuration management and infrastructure-as-code tools like Ansible or Terraform.
  • Strong understanding of virtualization and hypervisor technologies.
  • Knowledge of databases and data modeling.
  • Experience automating the management of multiple servers.
  • Strong networking knowledge of TCP/IP.
  • Experience scaling web applications for performance.
  • Familiarity with front-end technologies like HTML, CSS, and JavaScript.
  • Solid understanding of UI/UX design principles.
  • Experience with high-performance computing systems.

Benefits

  • Comprehensive medical, vision, and dental coverage.
  • 401(k) retirement plan with company matching.
  • Short- and long-term disability insurance.
  • Life insurance and paid parental leave.
  • 3 weeks of paid vacation and 10 or more paid holidays per year.
  • Paid sick leave in accordance with company policy.

Tech Stack

Ansible, Backbone.js, Bazel, CSS, Docker, Gradle, HTML, JavaScript, Kubernetes, Linux, Make, npm, Polymer, Puppet, Python, React, Terraform, Vagrant

Categories

Backend, DevOps, Testing