about 3 hours ago
Base Salary
$170k - $235k/yr
Responsibilities
- Lead the design, architecture, and implementation of secure, scalable AI platforms and proxy systems used company-wide.
- Own complex platform initiatives end-to-end, including technical strategy, implementation, testing, deployment, and long-term evolution.
- Drive onboarding of new frontier models, expansion of compute resources, and optimization of proxy systems for performance, security, logging, and control.
- Champion adoption across SpaceX by mentoring engineers, communicating value, and providing hands-on support.
- Identify high-value opportunities and deliver solutions that allow users to convert problems into tested software and deployed applications.
- Develop and scale tools that mitigate business risk, such as advanced AI-powered code review and PR automation.
- Define best practices for lean, effective, and secure applied AI.
- Collaborate with and influence cross-functional stakeholders, including ML researchers and security teams.
Requirements
- Bachelor’s degree in computer science, computer engineering, or other engineering discipline and 5+ years of professional experience building production software; OR 7+ years of professional experience in lieu of a degree.
- Experience developing and operating production platforms (AI/ML infrastructure, developer platforms, or large-scale backend systems).
- Proven success building and scaling internal AI gateways, secure proxy systems, or MLOps platforms in a fast-paced environment.
- Deep hands-on experience with Docker, Kubernetes, security architecture, and integrations across multiple cloud and frontier model providers.
- Strong background in model onboarding, compute orchestration, observability, and infrastructure for high-volume AI usage.
- Demonstrated ability to enable and mentor others through tools, documentation, training, and direct partnership.
- Entrepreneurial track record of identifying opportunities and delivering outsized impact.
- Proficiency in Python and strong experience with additional languages (Go, Java, TypeScript, Rust, etc.).
- Experience with infrastructure-as-code, CI/CD, and building highly reliable distributed systems.
- Passion for AI risk mitigation, developer productivity, and turning complex engineering challenges into simple, scalable AI-powered outcomes.
Benefits
- Eligible for long-term incentives, including company stock or long-term cash awards.
- Potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan.
- Access to comprehensive medical, vision, and dental coverage.
- 401(k) retirement plan with company matching.
- Short and long-term disability insurance and life insurance.
- Paid parental leave and various other discounts and perks.
- Accrue 3 weeks of paid vacation and eligible for 10 or more paid holidays per year.
- Paid sick leave in accordance with company policy.