18 days ago
New York, NY, USAMid Level / Senior
Base Salary
$175k - $275k/yr
Responsibilities
- Build and maintain world-class on-device inference engines for LLMs and other models.
- Integrate emerging AI/ML technologies as production-ready features in LM Studio.
- Develop with and contribute to OSS engines like llama.cpp, MLX, and more.
- Collaborate closely with model authors to ship day-0 support for new models.
- Profile, debug, and improve process memory, CPU usage, and GPU usage.
- Be an excellent communicator, contributor, and collaborator.
Requirements
- 3+ years of experience with C++ and Python; TypeScript experience is a plus.
- 2+ years of experience with machine learning frameworks and model inference.
- Excellent problem-solving and communication skills.
- Strong understanding of operating systems.
- Strong understanding of software system design.
- Interest in local LLMs and experience tinkering with them in LM Studio.
- Passionate about creating a great user and developer experience.
