BrightAI Corporation

Staff MLOps Engineer – ML Platform

2 days ago
Palo Alto, CA, USA
Staff+

Responsibilities

  • Design, build, and operate the ML/AI development platform on AWS.
  • Establish project templates and internal libraries for standardized workflows.
  • Implement Infrastructure-as-Code and workflow orchestration.
  • Build automated data pipelines and ensure data quality.
  • Set up experiment tracking and model registry with versioning.
  • Implement CI/CD for ML with testing and deployment strategies.
  • Ship real-time endpoints and batch jobs while optimizing performance.
  • Build monitoring and observability for production models.
  • Enforce security and governance measures.
  • Collaborate with backend engineers to productionize notebooks.

Requirements

  • B.S. or M.S. in Computer Science, Electrical/Computer Engineering, or related field.
  • 8+ years in software/ML engineering, with 4+ years in MLOps.
  • Proficient in Python and experienced with Docker and Terraform.
  • Hands-on experience with AWS services such as SageMaker, S3, and IAM.
  • Experience in building and operating CI/CD for ML workloads.
  • Familiarity with experiment tracking and model registry tools.
  • Knowledge of monitoring and quality assurance for ML services.
  • Understanding of security and compliance in cloud ML environments.

Tech Stack

Amazon Redshift, Apache Airflow, AWS, Docker, FastAPI, Grafana, MLflow, Prometheus, Python, Terraform

Categories

AI & ML, Data Engineering, DevOps