GrepJob
Scaled Cognition

Software Engineer, Infrastructure

Scaled Cognition
Apply
3 months ago
Boston, MA, USA +2 moreMid Level

Responsibilities

  • Design and improve inference infrastructure for AI models.
  • Benchmark, profile, monitor, and analyze latency and throughput.
  • Drive improvements throughout the stack based on analysis.
  • Collaborate with research scientists and product engineers for model deployment.

Requirements

  • Experience deploying systems on major cloud platforms (AWS, GCP, Azure).
  • Prior experience designing and implementing GPU infrastructure/tooling.
  • Strong sense for scalability and developing secure, highly reliable environments.

Tech Stack

AWSAzureGoogle Cloud Platform