11 months ago
San Francisco, CA, USA or New York, NY, USASenior / Mid Level
Responsibilities
- Develop and integrate LLM-powered features such as AI-assisted workflow automation and content strategy.
- Optimize AI performance through prompt engineering and evaluation frameworks.
- Build and scale AI infrastructure for low-latency responses and cost-efficient model usage.
- Implement AI observability and safeguards for quality and compliance.
- Collaborate with product and engineering teams to deliver intuitive AI-driven user experiences.
- Stay updated on AI advancements to continuously improve capabilities.
- Work with early adopters to optimize model performance and usability.
Requirements
- 3+ years of experience in machine learning engineering or applied NLP.
- Proven track record of building LLM-powered applications.
- Strong experience with foundation models and advanced prompt engineering.
- Experience with embedding models and local vector stores.
- Deep understanding of retrieval-augmented generation and contextual AI response optimization.
- Familiarity with frameworks for orchestrating LLM-powered applications.
- Strong programming skills in Python and experience with AI frameworks.
- Experience in evaluating LLM performance and implementing feedback loops.
- Solid understanding of caching, rate limiting, and cost optimization strategies.
- Ability to work cross-functionally with engineers and product managers.
Benefits
- Equity in a fast-growing startup.
- Competitive benefits package tailored to your location.
- Flexible time off policy.
- Parental Leave.
- A fun-loving and fast-moving team.
