1 day ago
Amsterdam, Netherlands or London, United Kingdom
Senior
Responsibilities
- Design, implement, and operate core runtime services for serving search queries.
- Build and optimize request flows including query processing and response assembly.
- Develop systems that meet strict latency budgets under high load.
- Optimize CPU, memory, and data access patterns in performance-critical paths.
- Ensure reliability, observability, and predictability in production.
- Build well-tested systems with clear boundaries while allowing architectural evolution.
- Define observability primitives such as logs, metrics, and traces.
- Monitor and improve latency, throughput, and cost efficiency.
- Collaborate with indexing and ML teams to integrate retrieval and ranking components.
- Support experimentation through controlled rollouts and benchmarking.
Requirements
- 5+ years of experience building production backend systems.
- Strong expertise in C++ or Rust.
- Experience with high-load, low-latency user-facing systems.
- Experience working on systems that handle thousands of requests per second under strict latency constraints.
- Understanding of performance at a systems level including CPU, memory, and networking.
- Experience operating services in production, including incident handling and debugging.
- Understanding of distributed systems fundamentals and tradeoffs.
- Ability to think end-to-end about request flows rather than isolated components.
- Ability to balance correctness, latency, and development speed.
- Effective collaboration across engineering, ML, and product teams.
Benefits
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Flexible working arrangements.
- A dynamic and collaborative work environment that values initiative and innovation.
Tech Stack
C++, Rust
Categories
AI & ML, Backend