about 2 hours ago
Bengaluru, India
Senior / Staff+
H1B Sponsor
Responsibilities
- Build and run the LLM control plane/gateway with smart routing and cost tracking.
- Ship a unified API and SDKs with normalized schemas and full observability.
- Enforce safety and privacy by default through content filtering and PII redaction.
- Enable multi-model, multi-vendor LLMs with automated canarying and versioning.
- Own the agent runtime including tool registry and function calling.
- Design orchestration patterns and manage agent state and workflows.
- Create components for monitoring model and data drift.
- Add human-in-the-loop review before agents interact with dealer systems.
- Evolve the domain graph and build reliable data ingestion pipelines.
- Serve real-time context to agents with access controls.
- Power retrieval with hybrid search and smart caching.
- Run continuous evaluations for quality and safety of the platform.
- Define SLOs for latency and uptime, enabling autoscaling.
- Maintain a model/agent registry and support compliance.
- Provide templates and documentation to facilitate fast product development.
Requirements
- 12–15+ years of experience in building large-scale data/ML or platform systems.
- Strong software engineering fundamentals including API design and distributed systems.
- Production experience with Python and one of Java/Scala/Go.
- Experience with MLOps at scale including pipelines and CI/CD for models.
- Knowledge of cloud and container technologies, preferably AWS.
- Practical ML knowledge including feature engineering and model deployment.
- Experience building or operating an LLM gateway/control plane.
- Familiarity with agentic systems and orchestration frameworks.
- Experience with knowledge graphs and hybrid retrieval patterns.
Tech Stack
Apache AirflowApache FlinkApache KafkaApache SparkAWSDockerGoGraphQLJavaKubernetesMLflowNeo4jPythonScala
Categories
AI & MLBackendData Science