4 months ago
San Francisco, CA, USA or New York, NY, USAEntry Level
Base Salary
$160k - $200k/yr
Responsibilities
- Run and automate standard LLM quality benchmarks and custom performance suites.
- Create automated acceptance tests for new GPU clusters across x86 and ARM systems.
- Develop and maintain internal GPU-enabled development environments.
- Build and contribute to tools for model evaluation and optimization.
- Collect performance profiles and identify bottlenecks using profiling tools.
- Develop real-time dashboards and alerts for system monitoring.
- Automate performance testing via CI/CD pipelines.
- Build tools to identify optimal configurations for models and workloads.
Requirements
- A strong interest in systems and hardware, particularly GPU memory subsystems.
- An automation mindset with a passion for scripting repetitive tasks.
- Mathematical curiosity regarding the underlying math of Transformers.
- Interest in optimization techniques like quantization and kernel-level optimizations.
- Familiarity with Python and eagerness to master the NVIDIA software stack.
- C++ familiarity is a plus.
Benefits
- Competitive compensation with meaningful equity.
- 100% coverage of medical, dental, and vision insurance for employees and dependents.
- Flexible PTO policy including a company-wide Winter Break.
- Paid parental leave and fertility/family-building stipend.
- Company-facilitated 401(k) plan.
- Exposure to various ML startups for learning and networking opportunities.
