Remote, United States
Senior / Mid Level
Responsibilities
- Act as the main technical interface for customers running workloads on Nebius GPU infrastructure.
- Support customers in deploying, configuring, and tuning GPU-based environments for performance and reliability.
- Investigate and resolve complex issues spanning hardware, networking, operating systems, and cluster-level behavior.
- Partner with internal teams to coordinate and drive resolution of customer-impacting issues.
- Convert customer requirements into practical architectures, configurations, and execution plans.
- Identify opportunities to improve system performance, stability, and overall customer experience.
- Develop and maintain technical documentation, including solution patterns and troubleshooting guides.
- Contribute to continuous improvement by surfacing recurring issues and optimization opportunities.
Requirements
- Experience in a customer-facing technical role such as solutions engineer or support engineer.
- Strong understanding of GPU infrastructure, including NVIDIA-based systems and performance considerations.
- Hands-on experience with Linux systems and system-level troubleshooting.
- Familiarity with large-scale compute environments such as GPU clusters or supercomputing systems.
- Ability to diagnose issues across hardware, networking, and software layers.
- Strong analytical and problem-solving skills, especially in high-pressure situations.
- Excellent communication skills to explain complex technical concepts clearly.
- A proactive, ownership-driven approach with a focus on customer success.
Benefits
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Flexible working arrangements.
- A dynamic and collaborative work environment that values initiative and innovation.
Tech Stack
Linux
Categories
AI & ML
Backend
Data Engineering