about 1 month ago
Beijing, ChinaMid Level / Senior
Responsibilities
- Build and operate backend systems for AI-powered features in production.
- Design inference pipelines and orchestration layers around models.
- Manage production concerns including monitoring, logging, and incident response.
- Optimize latency and throughput through caching, batching, and streaming.
Requirements
- Strong backend engineering fundamentals in production environments.
- Experience with high-throughput, low-latency services.
- Familiarity with AI inference patterns such as LLMs and embeddings.
- Ability to debug distributed systems under load.
- A bias toward shipping and learning from production behavior.
