5 days ago
Remote, Canada
Staff+
H1B Sponsor
Responsibilities
- Architect and scale the infrastructure for AI quality and reliability.
- Design and implement an end-to-end AI evaluation framework.
- Define performance metrics and build datasets for automated scoring.
- Architect reusable agent infrastructure using frameworks like LangGraph.
- Build and scale reliable, production-grade AI infrastructure.
- Make informed build-vs-buy decisions for AI systems.
- Own projects from scope to delivery and set technical direction.
- Lead discussions on AI system design and evaluation methodology.
- Mentor team members and enhance technical communication.
Requirements
- 8+ years of software engineering experience, with 5+ years in AI/ML systems.
- Deep production experience with LLM and agentic systems.
- Strong command of AI evaluation methodology and statistical experimentation.
- Proficiency in production-grade Python and experience with LangGraph.
- Experience with vector databases and cloud environments like AWS.
- Familiarity with TypeScript is a plus.
- Active engagement with AI research and industry trends.
Benefits
- Medical and dental insurance.
- Life, AD&D, and disability insurance.
- Wellness apps and natural disaster support.
- Paid parental leave and time off, including holidays.
- Remote work stipend and WFH office setup stipend.
- Retirement plan and financial planning support.
- Learning and development budget.
Tech Stack
AWS, Datadog, MLflow, Python, TypeScript
Categories
AI & ML, Data Science
