
Staff MLOps Engineer – ML Platform
BrightAI Corporation2 days ago
Palo Alto, CA, USAStaff+
Responsibilities
- Design, build, and operate the ML/AI development platform on AWS.
- Establish project templates and internal libraries for standardized workflows.
- Implement Infrastructure-as-Code and workflow orchestration.
- Build automated data pipelines and ensure data quality.
- Set up experiment tracking and model registry with versioning.
- Implement CI/CD for ML with testing and deployment strategies.
- Ship real-time endpoints and batch jobs while optimizing performance.
- Build monitoring and observability for production models.
- Enforce security and governance measures.
- Collaborate with backend engineers to productionize notebooks.
Requirements
- B.S. or M.S. in Computer Science, Electrical/Computer Engineering, or related field.
- 8+ years in software/ML engineering, with 4+ years in MLOps.
- Proficient in Python and experienced with Docker and Terraform.
- Hands-on experience with AWS services like SageMaker, S3, and IAM.
- Experience in building and operating CI/CD for ML workloads.
- Familiarity with experiment tracking and model registry tools.
- Knowledge of monitoring and quality assurance for ML services.
- Understanding of security and compliance in cloud ML environments.