
Software Engineer, Infrastructure
Scaled Cognition3 months ago
Boston, MA, USA +2 moreMid Level
Responsibilities
- Design and improve inference infrastructure for AI models.
- Benchmark, profile, monitor, and analyze latency and throughput.
- Drive improvements throughout the stack based on analysis.
- Collaborate with research scientists and product engineers for model deployment.
Requirements
- Experience deploying systems on major cloud platforms (AWS, GCP, Azure).
- Prior experience designing and implementing GPU infrastructure/tooling.
- Strong sense for scalability and developing secure, highly reliable environments.