3 days ago
Bengaluru, IndiaStaff+
Responsibilities
- Design and lead the evolution of a unified MLOps platform.
- Champion best practices for CI/CD for ML and infrastructure-as-code.
- Lead collaborative efforts across Data Engineering, DevOps, and Product teams.
- Partner with leadership to define the technical vision for AI infrastructure.
- Set standards for observability and incident response for ML systems.
- Mentor P2 and P3 engineers to promote technical rigor.
Requirements
- Bachelor’s or Master’s degree in Computer Science, AI, or a related field.
- 10-12+ years of professional software engineering experience.
- 6-7 years focused on productionizing and scaling ML systems.
- Expert-level proficiency in Python, Scala, or Java/Kotlin.
- Extensive experience with PySpark and high-performance computing.
- Proven track record with LLM applications and related technologies.
- Superior ability to design distributed systems for high request volumes.
- Experience with microservice architecture and AWS tooling.
- Knowledge of software engineering best practices and tools.
- Exceptional problem-solving and analytical skills.
- Outstanding communication and interpersonal skills.