Software Engineer, Model Performance Systems

6 months ago

San Francisco, CA, USA or New York, NY, USAEntry Level

H1B Sponsor

Base Salary

$160k - $200k/yr

Responsibilities

Run and automate standard LLM quality benchmarks and custom performance suites.
Create automated acceptance tests for new GPU clusters across x86 and ARM systems.
Develop and maintain internal GPU-enabled development environments.
Build and contribute to tools for model evaluation and optimization.
Collect performance profiles and identify bottlenecks using profiling tools.
Develop real-time dashboards and alerts for system monitoring.
Automate performance testing via CI/CD pipelines.
Build tools to identify optimal configurations for models and workloads.

A strong interest in systems and hardware, particularly GPU memory subsystems.
An automation mindset with a passion for scripting repetitive tasks.
Mathematical curiosity regarding the underlying math of Transformers.
Interest in optimization techniques like quantization and kernel-level optimizations.
Familiarity with Python and eagerness to master the NVIDIA software stack.
C++ familiarity is a plus.

Competitive compensation with meaningful equity.
100% coverage of medical, dental, and vision insurance for employees and dependents.
Flexible PTO policy including a company-wide Winter Break.
Paid parental leave and fertility/family-building stipend.
Company-facilitated 401(k) plan.
Exposure to various ML startups for learning and networking opportunities.