20 days ago
Remote, Canada
Senior / Staff+
H1B Sponsor
Responsibilities
- Lead the design and development of Cresta’s next-generation AI Agents and Agentic Assist systems.
- Architect intelligent, multi-step agent workflows that integrate real-time guidance and automated actions.
- Design, deploy, and optimize LLM-powered systems, including Retrieval-Augmented Generation (RAG) pipelines.
- Improve reasoning, planning, and tool-use capabilities in real-world AI applications.
- Develop evaluation strategies for complex, non-deterministic systems.
- Diagnose and mitigate real-world failure modes in AI systems.
- Define and measure quality metrics to enhance system reliability and performance.
- Optimize AI systems for scalability, latency, security, and cost efficiency.
- Collaborate cross-functionally with product, frontend, and backend teams.
- Mentor engineers and contribute to the technical strategy and roadmap.
Requirements
- Bachelor’s degree in Computer Science, Mathematics, or a related field; Master’s or Ph.D. preferred.
- 5–8+ years of industry experience building and deploying machine learning systems in production.
- Strong expertise in NLP, Generative AI, transformer architectures, embeddings, and retrieval systems.
- Proven experience designing and deploying Retrieval-Augmented Generation (RAG) systems.
- Experience building and evaluating complex agentic or multi-step LLM workflows.
- Strong knowledge of modern ML frameworks and tools such as PyTorch and TensorFlow.
- Demonstrated ability to optimize real-time ML systems for performance and reliability.
- Strong technical leadership skills to influence cross-functional decisions.
Benefits
- Variety of medical, dental, and vision plans for employees and their families.
- Paid parental leave to support employees and their families.
- Monthly Health & Wellness allowance.
- Work-from-home office stipend.
- Lunch reimbursement for in-office employees.
- 3 weeks of PTO in Canada.
Tech Stack
PyTorch, TensorFlow
Categories
AI & ML, Data Science
