GrepJob
Baseten

Software Engineer, Model Performance Systems

Baseten
Apply
4 months ago
San Francisco, CA, USA or New York, NY, USAEntry Level

Base Salary

$160k - $200k/yr

Responsibilities

  • Run and automate standard LLM quality benchmarks and custom performance suites.
  • Create automated acceptance tests for new GPU clusters across x86 and ARM systems.
  • Develop and maintain internal GPU-enabled development environments.
  • Build and contribute to tools for model evaluation and optimization.
  • Collect performance profiles and identify bottlenecks using profiling tools.
  • Develop real-time dashboards and alerts for system monitoring.
  • Automate performance testing via CI/CD pipelines.
  • Build tools to identify optimal configurations for models and workloads.

Requirements

  • A strong interest in systems and hardware, particularly GPU memory subsystems.
  • An automation mindset with a passion for scripting repetitive tasks.
  • Mathematical curiosity regarding the underlying math of Transformers.
  • Interest in optimization techniques like quantization and kernel-level optimizations.
  • Familiarity with Python and eagerness to master the NVIDIA software stack.
  • C++ familiarity is a plus.

Benefits

  • Competitive compensation with meaningful equity.
  • 100% coverage of medical, dental, and vision insurance for employees and dependents.
  • Flexible PTO policy including a company-wide Winter Break.
  • Paid parental leave and fertility/family-building stipend.
  • Company-facilitated 401(k) plan.
  • Exposure to various ML startups for learning and networking opportunities.

Tech Stack

C++PythonPyTorch

Categories

AI & MLData EngineeringDevOps