about 3 hours ago
Remote, United States
Senior / Staff+
H1B Sponsor
Base Salary
$125k - $156k/yr
Responsibilities
- Design and implement foundational GenAI services such as vector search and prompt tuning.
- Build infrastructure for autonomous AI agents with memory persistence.
- Create standardized APIs/SDKs for deploying Generative AI workloads.
- Ensure platform components meet enterprise-grade requirements for scalability and efficiency.
- Stand up LLM runtimes with governance and caching.
- Implement RAG at scale with ingestion pipelines and feedback loops.
- Build agent orchestration for multi-agent communication.
- Integrate tooling for secure data interaction by agents.
- Automate data and model pipelines for RAG and LLM fine-tuning.
- Integrate observability tools for monitoring AI outputs.
- Partner with security and governance teams to productize platform capabilities.
- Embed compliance measures for data handling and security.
- Establish technical standards and mentor junior engineers.
Requirements
- 8+ years in software/ML engineering, with 5+ years in ML engineering at scale.
- Expertise in building production-grade ML/LLM systems on the AWS tech stack.
- Proven track record with GenAI/LLMs and safety guardrails.
- Hands-on experience with RAG systems and LLM runtime operations.
- Experience building agentic AI platforms.
- Deep knowledge of data-intensive systems and distributed architectures.
- Strong grounding in compliance-first engineering; healthcare experience preferred.
- Track record of building secure, compliant data/AI systems.
- Excellent ability to influence across teams and mentor engineers.
Benefits
- Comprehensive medical, dental, vision, life, and disability plans.
- Free testing for employees and their immediate families.
- Fertility care benefits and pregnancy/baby bonding leave.
- 401(k) and commuter benefits.
- Generous employee referral program.
Categories
AI & ML, Data Engineering
